BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1705
(309 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|189233776|ref|XP_001814509.1| PREDICTED: similar to CG5367 CG5367-PA [Tribolium castaneum]
gi|270015148|gb|EFA11596.1| cathepsin K precursor [Tribolium castaneum]
Length = 330
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 167/303 (55%), Positives = 223/303 (73%), Gaps = 11/303 (3%)
Query: 15 KKYKKDYRKKA---TDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
+K+K+ Y K D+ +KL WQSN +KI HN+EA +GLH Y ++EN LSD+ + Y+
Sbjct: 31 EKFKQSYNKTYYGYLDTSRKLAWQSNLEKIKKHNEEADKGLHSYYIKENDLSDMSTQSYL 90
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFS 126
++M +LT S R+ PE + IP+ ++W EKGF TP +NQ+DCG+CYAFS
Sbjct: 91 QKMVKLTKSTHRKV---DPEVVGDLFELLHHIPEEVNWVEKGFETPSYNQKDCGSCYAFS 147
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
IAS +Q QIFK T ++ LS QQ+VDCS+ GN GC GGSLRNTL Y++ AGGLM DY
Sbjct: 148 IASVLQAQIFKQTEKLVPLSEQQIVDCSVSMGNYGCGGGSLRNTLRYLEKAGGLMTYSDY 207
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY +Q C+F + +V++++W+VLP +DE AL++ +A +GP+A SINASPHTFQLY S
Sbjct: 208 PYLARQQRCRFDKHRAIVNLTTWAVLPARDERALELAVAKIGPVAASINASPHTFQLYHS 267
Query: 247 GIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
G+YDD AC+S++VNHAML+VGYT+N+WILKNWW HWG+ GYM L+RG NRCGIANYA Y
Sbjct: 268 GVYDDVACSSNHVNHAMLIVGYTKNAWILKNWWGKHWGEKGYMRLRRGKNRCGIANYAAY 327
Query: 307 ALI 309
AL+
Sbjct: 328 ALV 330
>gi|383852029|ref|XP_003701533.1| PREDICTED: cathepsin J-like [Megachile rotundata]
Length = 341
Score = 343 bits (881), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 158/299 (52%), Positives = 210/299 (70%), Gaps = 4/299 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +Y K Y + ++ ++ W+ N I+ HN A G H YTLR+NH++DL R Y++E
Sbjct: 44 KARYNKSY-SGSEETDRRASWEENLVTIYKHNMMAAAGHHSYTLRDNHIADLGTRQYVRE 102
Query: 74 MTRLTHSRIRRTLVRS---PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M +L SR RR S ++ L+P +DWRE GF+TP NQ +CG+CYA+SIA +
Sbjct: 103 MVKLIPSRKRRVSTDSVVGAALSDPGLVPSRIDWRELGFVTPAENQRNCGSCYAYSIAGS 162
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
IQGQIFK T + LS QQ++DCS +GNLGC+GGSLRNTL Y++ A GLM + YPYK
Sbjct: 163 IQGQIFKRTGALIPLSEQQLIDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMSQAYYPYKA 222
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
KQ C+F+ VV+++SW+VLP +DE AL+ +AT+GPIA S+NASP TFQLY +G+YD
Sbjct: 223 KQGRCRFQEDLSVVNVTSWAVLPARDEKALEAAVATIGPIAASVNASPRTFQLYHNGVYD 282
Query: 251 DEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
DE C+SD VNHA+L+VGYT WILKNWW WG+NGYM L + NRCG+ANYA YA +
Sbjct: 283 DELCSSDMVNHAVLIVGYTPTEWILKNWWGDGWGENGYMRLAKMKNRCGVANYAAYAKV 341
>gi|380026639|ref|XP_003697053.1| PREDICTED: cathepsin J-like [Apis florea]
Length = 346
Score = 342 bits (878), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 166/300 (55%), Positives = 207/300 (69%), Gaps = 7/300 (2%)
Query: 17 YKKDYRKKAT---DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
YK Y K + +S +++ W+ N I+ HN A G H YTL++NH++DL R YI++
Sbjct: 47 YKARYNKSYSGDLESSRRMTWEENLITIYKHNMMAAAGHHSYTLKDNHIADLGTRQYIRD 106
Query: 74 MTRLTHSRIRR----TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
M +L SR RR TLV S+ IP LDWRE GF T NQ DCG+CYA+SIA
Sbjct: 107 MVKLIPSRKRRISKETLVGVSLSDHQRDIPRELDWRESGFKTRAENQRDCGSCYAYSIAG 166
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+I+GQIFK T + LS QQ+VDCS +GNLGC+GGSLRNTL Y++ A GLM ++ YPYK
Sbjct: 167 SIEGQIFKKTGMLLPLSEQQLVDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMAKKYYPYK 226
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
KQ C+F VV+I+SW+VLP +DE L+ +AT+GPIA SINASP TFQLY GIY
Sbjct: 227 AKQGPCRFNEDLSVVNITSWAVLPARDEKVLEAAVATIGPIAASINASPKTFQLYHKGIY 286
Query: 250 DDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
DDE C+SD VNHAML+VGYT WILKNWW WG+NGYM L + NRCG+ANYA YA +
Sbjct: 287 DDEVCSSDMVNHAMLIVGYTPTEWILKNWWGDGWGENGYMRLAKNKNRCGVANYAAYAKV 346
>gi|340710175|ref|XP_003393670.1| PREDICTED: cathepsin K-like [Bombus terrestris]
Length = 343
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 164/305 (53%), Positives = 209/305 (68%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +Y K Y ++ +++ W+ N I+ HN A G H YTLR+NH++DL R YI+E
Sbjct: 46 KTRYNKSYSGN-LETNRRIAWEENLITIYKHNMMAAAGHHSYTLRDNHIADLGTRQYIRE 104
Query: 74 MTRLTHSRIRRTLVRSPESNESVL---------IPDHLDWREKGFITPDWNQEDCGACYA 124
M +L SR RR SNE ++ IP LDWRE GF TP NQ DCG+CYA
Sbjct: 105 MVKLIPSRKRRV------SNEPIVGAVLHDPRRIPPQLDWREMGFKTPPENQRDCGSCYA 158
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
+SIA++IQGQIFK T + LS QQ++DCS +GNLGC+GGSLRNTL Y++ A GLM +
Sbjct: 159 YSIATSIQGQIFKKTGMLIPLSEQQLIDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMPQS 218
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
Y Y KQ C+F VV+I+SW+VLP +DE AL+ +AT+GPIA SINASP TFQLY
Sbjct: 219 RYSYTAKQGPCRFVEDLSVVNITSWAVLPARDEKALEAAVATIGPIAASINASPKTFQLY 278
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
+G+YDDE C+SD VNHAM++VGYT WILKNWW + WG+NGYM L + NRCGIANYA
Sbjct: 279 HTGVYDDEVCSSDTVNHAMVIVGYTPTEWILKNWWGNGWGENGYMRLAKNKNRCGIANYA 338
Query: 305 VYALI 309
YA +
Sbjct: 339 AYAKV 343
>gi|328789446|ref|XP_394277.3| PREDICTED: cathepsin J-like [Apis mellifera]
Length = 344
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 168/300 (56%), Positives = 207/300 (69%), Gaps = 8/300 (2%)
Query: 17 YKKDYRKKAT---DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
YK Y K + +S ++ W+ N I+ HN A G H YTL++NH++DL R YI+E
Sbjct: 46 YKARYNKSYSGDLESNRRTTWEENLITIYKHNMMAAAGHHSYTLKDNHIADLGTRQYIRE 105
Query: 74 MTRLTHSRIRR----TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
M +L SR RR TLV S+ IP LDWRE GF T NQ DCG+CYA+SIA
Sbjct: 106 MVKLIPSRKRRISKDTLVGVSLSDHQ-RIPRELDWREVGFKTQPENQRDCGSCYAYSIAG 164
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+I+GQIFK T + LS QQ+VDCS +GNLGC+GGSLRNTL Y++ A GLM ++ YPYK
Sbjct: 165 SIEGQIFKKTGMLLPLSEQQLVDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMAKKYYPYK 224
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
KQ C+FK VV+I+SW+VLP +DE L+ +AT+GPIA SINASP TFQLY G+Y
Sbjct: 225 AKQGQCRFKEDLSVVNITSWAVLPARDEKVLEAAVATIGPIAASINASPKTFQLYHKGVY 284
Query: 250 DDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
DDE C+SD VNHAML+VGYT WILKNWW WG+NGYM L + NRCGIANYA YA +
Sbjct: 285 DDEVCSSDMVNHAMLIVGYTPTEWILKNWWGDGWGENGYMRLAKNKNRCGIANYAAYAKV 344
>gi|350413611|ref|XP_003490051.1| PREDICTED: cathepsin J-like [Bombus impatiens]
Length = 343
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 162/305 (53%), Positives = 208/305 (68%), Gaps = 18/305 (5%)
Query: 17 YKKDYRKKAT---DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
YK Y K + ++ +++ W+ N I+ HN A G H YTLR+NH++DL R YI+E
Sbjct: 45 YKTRYNKSYSGNLETNRRIAWEENLITIYKHNMMAAAGHHSYTLRDNHIADLGTRQYIRE 104
Query: 74 MTRLTHSRIRRTLVRSPESNESVL---------IPDHLDWREKGFITPDWNQEDCGACYA 124
M +L SR RR SNE ++ IP LDWRE GF TP NQ DCG+CYA
Sbjct: 105 MVKLIPSRKRRV------SNEPIVGAVLHDPRRIPPQLDWREMGFKTPPENQRDCGSCYA 158
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
+S+A++IQGQIFK T + LS QQ++DCS +GNLGC+GGSLRNTL Y++ A GLM +
Sbjct: 159 YSVATSIQGQIFKKTGMLIPLSEQQLIDCSTSTGNLGCSGGSLRNTLRYLEKAKGLMPQS 218
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
Y Y KQ C+F VV+I+SW+VLP +DE AL+ +A +GPIA S+NASP TFQLY
Sbjct: 219 RYSYTAKQGPCRFVEDLSVVNITSWAVLPARDEKALEAAVAIIGPIAASVNASPKTFQLY 278
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
+G+YDDE C+SD VNHAM++VGYT WILKNWW + WG+NGYM L + NRCGIANYA
Sbjct: 279 HTGVYDDEVCSSDMVNHAMVIVGYTPTEWILKNWWGNGWGENGYMRLAKNKNRCGIANYA 338
Query: 305 VYALI 309
YA +
Sbjct: 339 AYAKV 343
>gi|345486539|ref|XP_001604490.2| PREDICTED: cathepsin L2-like [Nasonia vitripennis]
Length = 332
Score = 336 bits (861), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 162/301 (53%), Positives = 209/301 (69%), Gaps = 10/301 (3%)
Query: 17 YKKDYRKKAT---DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
YK + K T ++ ++ W+ N KI+ HN A G H Y LR+NH++DL Y++E
Sbjct: 34 YKMRHNKTYTGTLEAVRREAWEDNLLKIYEHNLLAAAGHHEYILRDNHIADLSTSSYMRE 93
Query: 74 MTRLTHSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ +L SR RR + E +VL IP LDWREKGF+T NQ DCG+CYA+SIA
Sbjct: 94 LVKLVPSRRRR--LDDDEMVAAVLHDPRRIPKSLDWREKGFVTKPENQRDCGSCYAYSIA 151
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+I GQIF+ T + LS QQ+VDCS +GNLGC+GGSLRNTL Y++ + GLM + YPY
Sbjct: 152 GSIAGQIFRQTGIVVPLSEQQLVDCSTQTGNLGCSGGSLRNTLRYLERSKGLMTDATYPY 211
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
Q +CKF+R VV+++SW++LP +DE AL+ +AT+GPIA SINA P TFQLY SGI
Sbjct: 212 TAHQGVCKFQRKLSVVNVTSWAILPARDERALEAAVATIGPIAASINAGPRTFQLYHSGI 271
Query: 249 YDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
YDD C+SD VNHAML+VGYT N WILKNWW WG+NGYM L++G NRCG+ANYA YA
Sbjct: 272 YDDPTCSSDLVNHAMLIVGYTPNYWILKNWWGASWGENGYMRLRKGKNRCGVANYAAYAK 331
Query: 309 I 309
+
Sbjct: 332 V 332
>gi|322783986|gb|EFZ11138.1| hypothetical protein SINV_08365 [Solenopsis invicta]
Length = 353
Score = 334 bits (857), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 159/310 (51%), Positives = 210/310 (67%), Gaps = 15/310 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y K ++++ + W+ N +I+ HN A G H YTLR+NH++DL+ R Y++E
Sbjct: 45 KTQFNKTYTKNLENARRAV-WEENLVEIYKHNLLAAAGHHSYTLRDNHIADLNTRQYMRE 103
Query: 74 MTRLTHSRIRRTL---VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M RL SR RR + S +S IP LDWRE GF+TP NQ++CG+CYA+SIA +
Sbjct: 104 MVRLMPSRRRRLPTEPIVSTALKDSRKIPASLDWRECGFVTPPVNQQNCGSCYAYSIAES 163
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
I+GQIFK T + +S QQ+VDCS GNLGC GGSLR TL Y++ + GLM YPY G
Sbjct: 164 IEGQIFKQTGMLLSVSAQQLVDCSTAIGNLGCTGGSLRTTLKYLEKSKGLMATSMYPYNG 223
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY------ 244
+Q CKF+R VV+I+SW++LP +DE AL++ +AT+GPIA SINA P TFQLY
Sbjct: 224 EQGECKFQRDQSVVNITSWAILPARDEKALEIAVATIGPIAASINAGPKTFQLYQYQSTR 283
Query: 245 -----ASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCG 299
+ G+YDD C+SD VNHAML+VGYT WILKNWW +WG+NGYM L R NRCG
Sbjct: 284 IFPLRSKGVYDDHHCSSDMVNHAMLVVGYTPTEWILKNWWGSNWGENGYMRLARNKNRCG 343
Query: 300 IANYAVYALI 309
+ANYA Y +
Sbjct: 344 VANYAAYVRV 353
>gi|307178052|gb|EFN66897.1| Cathepsin K [Camponotus floridanus]
Length = 295
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 154/285 (54%), Positives = 200/285 (70%), Gaps = 3/285 (1%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
++ ++ W+ N +I+ HN A G H YTLR+NH++DL R YI+ M +L S+ RR L
Sbjct: 12 ENTRRTAWEQNLVEIYKHNLMAAAGHHSYTLRDNHIADLSTRQYIRNMVKLIPSQ-RRRL 70
Query: 87 VRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
P + + IP HLDWRE GFIT NQ++CG+CYA+SI +IQGQIFK T +
Sbjct: 71 PTEPMVSAIHNPHIPKHLDWREYGFITLPVNQQNCGSCYAYSIVESIQGQIFKQTGMLLP 130
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
LS QQ+VDCS ++GN GC GGSLRNTL Y++ + GLM YPY +Q +C+F+R VV
Sbjct: 131 LSAQQLVDCSTVTGNRGCIGGSLRNTLKYLEKSKGLMAGYLYPYNAEQGVCRFQRDLSVV 190
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+I+SW++LP +DE AL+ +AT+GPIAVSINASP TFQLY G+YDD C SD VNHAML
Sbjct: 191 NITSWAILPARDEKALEAAVATIGPIAVSINASPKTFQLYHKGVYDDHRCDSDSVNHAML 250
Query: 265 LVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+VGYT WILKNWW +WG++GYM + R N CGIANYA Y +
Sbjct: 251 IVGYTPTEWILKNWWGDNWGEDGYMRVARNKNLCGIANYAAYVKV 295
>gi|307195722|gb|EFN77562.1| Cathepsin K [Harpegnathos saltator]
Length = 345
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 156/300 (52%), Positives = 205/300 (68%), Gaps = 5/300 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y K ++ +++ W+ N +I+ HN A G H YTLR+NH++DL Y+++
Sbjct: 47 KTQFNKTY-KGNLENARRIAWEQNLVEIYKHNLMAAAGHHSYTLRDNHIADLSSPQYMRK 105
Query: 74 MTRLTHSRIRRTLVRSPESNESVL----IPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
M +L SR RR P + ++ IP LDWRE GF T NQ +CG+CYA+SI
Sbjct: 106 MVKLVPSRRRRLSSSDPMLSATLQQPHNIPARLDWRELGFNTRPVNQRECGSCYAYSIVE 165
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+IQGQIFK T + LS QQ+VDCS +GN GCAGGSLRNTL Y++ + GLM + +YPYK
Sbjct: 166 SIQGQIFKQTGMLIPLSAQQLVDCSTATGNRGCAGGSLRNTLRYLERSKGLMAKTEYPYK 225
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ CKF R VV+I+SW++LP +DE AL+ +A+VGPIAVSINA P TFQLY G+Y
Sbjct: 226 AQDGQCKFHRDLSVVNITSWAILPARDETALEAAVASVGPIAVSINAMPKTFQLYHKGVY 285
Query: 250 DDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
DD C+SD VNHAML+VGYT WILKNWW +WG+NGYM L + NRCGIANYA Y +
Sbjct: 286 DDHLCSSDTVNHAMLIVGYTPTEWILKNWWGENWGENGYMRLAKNKNRCGIANYAAYVKV 345
>gi|195146732|ref|XP_002014338.1| GL19003 [Drosophila persimilis]
gi|194106291|gb|EDW28334.1| GL19003 [Drosophila persimilis]
Length = 335
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 145/299 (48%), Positives = 199/299 (66%), Gaps = 3/299 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K K Y + +++ ++ N+K I HN+ Q G + L N ++D+ Y+K
Sbjct: 37 KKINNKTYSRNFDETRSLKAFEVNYKIIKDHNKNYQDGQTTFRLATNIMADMSTEGYLKN 96
Query: 74 MTRLTHSR---IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
RL S+ + S++ IP+ LDWR KGF TP NQ+ CG+CYAFSIA +
Sbjct: 97 FLRLLKSQSNVADDNIAEIVGSSQMTNIPESLDWRRKGFTTPSQNQQSCGSCYAFSIAES 156
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
I+GQIFK T +I LS QQ+VDCS+ GN GC GGSLRNTL Y+Q GG+M+ +DY Y
Sbjct: 157 IEGQIFKRTGKILSLSEQQIVDCSVSHGNQGCTGGSLRNTLKYLQSTGGIMRSDDYKYVS 216
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K+ C+F R VV+I+SW++LP +E A++ +A +GPIAVSINA+P TFQLY+ GIYD
Sbjct: 217 KKGKCQFVRDLSVVNITSWAILPVNNEQAIQAAVAHIGPIAVSINATPRTFQLYSDGIYD 276
Query: 251 DEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
D +C S VNHAML++G+ ++ WILKNWW WG++GYM LK+G N CGIANYA YA++
Sbjct: 277 DASCVSTSVNHAMLVIGFGKDFWILKNWWGDRWGESGYMRLKKGINLCGIANYAAYAIV 335
>gi|195134024|ref|XP_002011438.1| GI14103 [Drosophila mojavensis]
gi|193912061|gb|EDW10928.1| GI14103 [Drosophila mojavensis]
Length = 334
Score = 310 bits (794), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 139/283 (49%), Positives = 193/283 (68%), Gaps = 11/283 (3%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
++ NH+ + HN++ G + L N L+D+ Y+K RL +R + + ++
Sbjct: 56 YEENHQIVEAHNKDYDSGRSSFRLAANTLADMSTDSYLKGFLRL----LRSPPISASDNM 111
Query: 94 ESVL-------IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
++ +PD LDWR+KGF TP +NQ+ CG+CYAFSIA +I+GQ+FK T I LS
Sbjct: 112 VDIVGATLMDNVPDSLDWRKKGFATPSYNQQSCGSCYAFSIAQSIEGQVFKRTGRILSLS 171
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
QQ+VDCSI GN GC GGSLRNTL Y+Q GGLM+ DY Y K+ C+F VV++
Sbjct: 172 EQQIVDCSISHGNQGCTGGSLRNTLRYLQATGGLMRSVDYKYASKKGACQFVSELAVVNV 231
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+SW++LP DE+A++ +A +GP+AVSINA+P TFQLY+ GIYDD C+S VNHAMLL+
Sbjct: 232 TSWAILPANDENAIQAAVAHIGPVAVSINATPKTFQLYSDGIYDDVTCSSTSVNHAMLLI 291
Query: 267 GYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
GY ++ WILKNWW WG++GYM +++G N CGIANYA YA++
Sbjct: 292 GYDKDYWILKNWWGEKWGESGYMRMRKGINLCGIANYAAYAIV 334
>gi|194761772|ref|XP_001963099.1| GF14107 [Drosophila ananassae]
gi|190616796|gb|EDV32320.1| GF14107 [Drosophila ananassae]
Length = 338
Score = 310 bits (793), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 205/312 (65%), Gaps = 3/312 (0%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+T E+ F + + + Y + + + ++ N+K I HN+ Q+G + L+ N
Sbjct: 27 VTESEYKNEFEIFKNENNRKYLRNDDEFRSFKAFEENYKIIEKHNKNFQEGQTSFRLKPN 86
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQE 117
+ +D++ Y+K RL S E S L +P+ LDWR KGF+TP NQ+
Sbjct: 87 NFADMNTDGYLKGYLRLIKSHTEEGADNIAEIVGSPLMTNVPESLDWRNKGFVTPPHNQQ 146
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+CYAFSIA +I GQ+FK T +I LS QQ+VDCS+ GN GC GGSLRNTLNY+Q
Sbjct: 147 TCGSCYAFSIAESISGQVFKRTGKILNLSEQQIVDCSVSHGNQGCVGGSLRNTLNYLQST 206
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GGLM+ +DY Y ++ C+F VV+++SW++LP DE A++ + +GP+A+SINA+
Sbjct: 207 GGLMRADDYKYVSRKGKCQFVSDLSVVNVTSWAILPAHDEQAIQAAVTHIGPVAISINAT 266
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNR 297
P TFQLY+ GIYDD C+S VNHAML++G+ ++ WILKNWW HHWG++GYM +++G N
Sbjct: 267 PKTFQLYSDGIYDDPMCSSASVNHAMLIIGFGKDFWILKNWWGHHWGESGYMRIRKGVNM 326
Query: 298 CGIANYAVYALI 309
CG+ANYA YA++
Sbjct: 327 CGVANYAAYAIV 338
>gi|195473621|ref|XP_002089091.1| GE26053 [Drosophila yakuba]
gi|194175192|gb|EDW88803.1| GE26053 [Drosophila yakuba]
Length = 338
Score = 310 bits (793), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 140/279 (50%), Positives = 194/279 (69%), Gaps = 3/279 (1%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
++ N+K I HNQ + G + L+ N +D+ Y+K RL S I + E
Sbjct: 60 FEENYKVIEEHNQNFKDGQTSFRLKPNIFADMSTDGYLKGYLRLLKSNIEDSADNMAEIV 119
Query: 94 ESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
S L +P+ LDWR KGFITP +NQ CG+CYAFSIA +I GQ+FK T +I LS QQ+
Sbjct: 120 GSPLMSNVPESLDWRSKGFITPPYNQLSCGSCYAFSIAESIVGQVFKRTGKILSLSKQQI 179
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWS 210
VDCS+ GN GC GGSLRNTL Y+Q GG+M+EEDYPY ++ C+F VV+++SW+
Sbjct: 180 VDCSVSHGNQGCVGGSLRNTLRYLQSTGGIMREEDYPYAARKGKCQFVPDLSVVNVTSWA 239
Query: 211 VLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTR 270
+LP +DE A++ +A +GP+A+SINASP TFQLY+ GIYDD C+S VNHAM+++G+ +
Sbjct: 240 ILPVRDEQAIQAAVAHIGPVAISINASPKTFQLYSDGIYDDPLCSSASVNHAMVVIGFGK 299
Query: 271 NSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ WILKNWW +WG+NGY+ +++G N CG+ANYA YA++
Sbjct: 300 DYWILKNWWGQNWGENGYIRIRKGVNMCGMANYAAYAIV 338
>gi|194859829|ref|XP_001969459.1| GG23942 [Drosophila erecta]
gi|190661326|gb|EDV58518.1| GG23942 [Drosophila erecta]
Length = 338
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 140/294 (47%), Positives = 200/294 (68%), Gaps = 3/294 (1%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
+ Y + + + ++ N+K I HNQ + G + L+ N +D+ Y+K RL
Sbjct: 45 RKYLRTYDEMRSYKAFEENYKVIEEHNQNYKDGHSSFRLKPNIFADMSTSGYLKGYLRLL 104
Query: 79 HSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
S I + E S L +P+ LDWR KGFITP +NQ CG+CYAFSIA +I GQ+
Sbjct: 105 KSNIEDSADNMAEIVGSPLMSNVPESLDWRSKGFITPAYNQLTCGSCYAFSIAESIVGQV 164
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
FK T ++ LS QQ+VDCS+ GN GC GGSLRNTL+Y+Q GG+M+EEDYPY ++ C
Sbjct: 165 FKRTGKVLSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMREEDYPYVARKGKC 224
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+F VV+++SW++LP +DE A++ +A +GP+A+SINASP TFQLY+ GIYDD C+
Sbjct: 225 QFVHDLSVVNVTSWAILPVRDEQAIQAAVAHIGPVAISINASPKTFQLYSDGIYDDPLCS 284
Query: 256 SDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
S VNHAM+++G+ ++ WILKNWW +WG+NGY+ +++G N CG+ANYA YA++
Sbjct: 285 SASVNHAMVVIGFGKDYWILKNWWGPNWGENGYIRIRKGVNMCGMANYAAYAIV 338
>gi|24583376|ref|NP_609387.1| CG5367 [Drosophila melanogaster]
gi|22946140|gb|AAF52922.2| CG5367 [Drosophila melanogaster]
Length = 338
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 139/294 (47%), Positives = 199/294 (67%), Gaps = 3/294 (1%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
+ Y + + + ++ N K I HNQ ++G + L+ N +D+ Y+K RL
Sbjct: 45 RKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYLKGFLRLL 104
Query: 79 HSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
S I + E S L +P+ LDWR KGFITP +NQ CG+CYAFSIA +I GQ+
Sbjct: 105 KSNIEDSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGSCYAFSIAESIMGQV 164
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
FK T +I LS QQ+VDCS+ GN GC GGSLRNTL+Y+Q GG+M+++DYPY ++ C
Sbjct: 165 FKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDYPYVARKGKC 224
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+F VV+++SW++LP +DE A++ + +GP+A+SINASP TFQLY+ GIYDD C+
Sbjct: 225 QFVPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDGIYDDPLCS 284
Query: 256 SDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
S VNHAM+++G+ ++ WILKNWW +WG+NGY+ +++G N CGIANYA YA++
Sbjct: 285 SASVNHAMVVIGFGKDYWILKNWWGQNWGENGYIRIRKGVNMCGIANYAAYAIV 338
>gi|195578153|ref|XP_002078930.1| GD22268 [Drosophila simulans]
gi|194190939|gb|EDX04515.1| GD22268 [Drosophila simulans]
Length = 338
Score = 306 bits (785), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 142/302 (47%), Positives = 202/302 (66%), Gaps = 7/302 (2%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
+K+K + +K + ++ ++ N K I HNQ + G + L+ N +D+ Y
Sbjct: 37 EKFKNNNNRKYLRTHDEMRSYKAFEENFKVIEEHNQNYKDGQTSFRLKPNIFADMSTDGY 96
Query: 71 IKEMTRLTHSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
+K RL S I + E S L +PD LDWR KGFITP +NQ CG+CYAFSI
Sbjct: 97 LKGYLRLLKSNIEDSADNMAEIVGSPLMTNVPDSLDWRSKGFITPPYNQLTCGSCYAFSI 156
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A +I GQ+FK T +I LS QQ+VDCS+ GN GC GGSLRNTL Y+Q GG+M+++DYP
Sbjct: 157 AESIVGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLTYLQSTGGIMRDQDYP 216
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y ++ C+F VV++SSW++LP +DE A++ + +GP+A+SINASP TFQLY+ G
Sbjct: 217 YVARKGKCQFVPDLSVVNVSSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDG 276
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYA 307
IYDD C+S VNHAM+++G+ ++ WILKNWW +WG+NGY+ +++G N CG+ANYA YA
Sbjct: 277 IYDDPLCSSASVNHAMVVIGFAKDYWILKNWWGQNWGENGYIRVRKGVNMCGLANYAAYA 336
Query: 308 LI 309
++
Sbjct: 337 IV 338
>gi|195339771|ref|XP_002036490.1| GM11735 [Drosophila sechellia]
gi|194130370|gb|EDW52413.1| GM11735 [Drosophila sechellia]
Length = 338
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 141/302 (46%), Positives = 202/302 (66%), Gaps = 7/302 (2%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
+K+K + +K + ++ ++ N K I HNQ + G + L+ N +D+ Y
Sbjct: 37 EKFKNNNNRKYIRTYDEMRSYKAFEENFKVIEEHNQNYKDGQTSFRLKPNIFADMSTDGY 96
Query: 71 IKEMTRLTHSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
+K RL S I + E S L +P+ LDWR KGFITP +NQ CG+CYAFSI
Sbjct: 97 LKGYLRLLKSNIEDSADNMAEIVGSPLMTNVPESLDWRSKGFITPPYNQLTCGSCYAFSI 156
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A +I GQ+FK T +I LS QQ+VDCS+ GN GC GGSLRNTL Y+Q GG+M+++DYP
Sbjct: 157 AESIVGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLTYLQSTGGIMRDQDYP 216
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y ++ C+F VV++SSW++LP +DE A++ + +GP+A+SINASP TFQLY+ G
Sbjct: 217 YVARKGKCQFVADLSVVNVSSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDG 276
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYA 307
IYDD C+S VNHAM+++G+ ++ WILKNWW +WG+NGY+ +++G N CG+ANYA YA
Sbjct: 277 IYDDPLCSSASVNHAMVVIGFAKDYWILKNWWGQNWGENGYIRVRKGVNMCGLANYAAYA 336
Query: 308 LI 309
++
Sbjct: 337 IV 338
>gi|263359699|gb|ACY70535.1| hypothetical protein DVIR88_6g0072 [Drosophila virilis]
Length = 336
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 205/312 (65%), Gaps = 12/312 (3%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KE IF +K + Y + + + ++ N ++ HN + G + L N ++
Sbjct: 31 KERFEIF---KKINNRSYARSHDEMRSYEAYEENQIIVNEHNTYYETGKSSFRLATNTMA 87
Query: 64 DLHPRHYIKEMTRLTHS-RIRRT-----LVRSPESNESVLIPDHLDWREKGFITPDWNQE 117
D++ Y+K RL S I + +V SP N +P+ DWR+KGFITP +NQ+
Sbjct: 88 DMNTDSYLKGYLRLLRSPEISDSDNIADIVGSPLMNN---VPESFDWRKKGFITPLYNQQ 144
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+CYAFSIA +I+GQ+FK T +I LS QQ+VDCS+ GN GC GGSLRNTL Y+Q
Sbjct: 145 SCGSCYAFSIAQSIEGQVFKRTGKIVALSEQQIVDCSVSHGNQGCIGGSLRNTLRYLQAT 204
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GGLM+ DY Y K+ C+F VV+++SW++LP +DE+A++ +A +GP+AVSINAS
Sbjct: 205 GGLMRSLDYKYASKKGECQFVSELAVVNVTSWAILPAKDENAIQAAVAHIGPVAVSINAS 264
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNR 297
P TFQLY+ GIYDD +CTS VNHAMLL+G+ +N WILKNWW WG+ G+M +++G N
Sbjct: 265 PKTFQLYSEGIYDDVSCTSTSVNHAMLLIGFDKNFWILKNWWGELWGEAGFMRMRKGINL 324
Query: 298 CGIANYAVYALI 309
CGIANYA YA++
Sbjct: 325 CGIANYAAYAIV 336
>gi|195402187|ref|XP_002059688.1| GJ20892 [Drosophila virilis]
gi|194155902|gb|EDW71086.1| GJ20892 [Drosophila virilis]
Length = 308
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 205/312 (65%), Gaps = 12/312 (3%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KE IF +K + Y + + + ++ N ++ HN + G + L N ++
Sbjct: 3 KERFEIF---KKINNRSYARSHDEMRSYEAYEENQIIVNEHNTYYETGKSSFRLATNTMA 59
Query: 64 DLHPRHYIKEMTRLTHS-RIRRT-----LVRSPESNESVLIPDHLDWREKGFITPDWNQE 117
D++ Y+K RL S I + +V SP N +P+ DWR+KGFITP +NQ+
Sbjct: 60 DMNTDSYLKGYLRLLRSPEISDSDNIADIVGSPLMNN---VPESFDWRKKGFITPLYNQQ 116
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+CYAFSIA +I+GQ+FK T +I LS QQ+VDCS+ GN GC GGSLRNTL Y+Q
Sbjct: 117 SCGSCYAFSIAQSIEGQVFKRTGKIVALSEQQIVDCSVSHGNQGCIGGSLRNTLRYLQAT 176
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GGLM+ DY Y K+ C+F VV+++SW++LP +DE+A++ +A +GP+AVSINAS
Sbjct: 177 GGLMRSLDYKYASKKGECQFVSELAVVNVTSWAILPAKDENAIQAAVAHIGPVAVSINAS 236
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNR 297
P TFQLY+ GIYDD +CTS VNHAMLL+G+ +N WILKNWW WG+ G+M +++G N
Sbjct: 237 PKTFQLYSEGIYDDVSCTSTSVNHAMLLIGFDKNFWILKNWWGELWGEAGFMRMRKGINL 296
Query: 298 CGIANYAVYALI 309
CGIANYA YA++
Sbjct: 297 CGIANYAAYAIV 308
>gi|195064100|ref|XP_001996497.1| GH23974 [Drosophila grimshawi]
gi|193892043|gb|EDV90909.1| GH23974 [Drosophila grimshawi]
Length = 337
Score = 299 bits (766), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 139/305 (45%), Positives = 204/305 (66%), Gaps = 17/305 (5%)
Query: 19 KDYRKKATDSKKKLH--------WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
++++K S +LH ++ N + ++ HN+ + G + L N ++D++ Y
Sbjct: 36 ENFKKINNKSYMRLHDEIRSYKSFEENIRIVNEHNKFYETGKSSFRLSTNTMADMNTDSY 95
Query: 71 IKEMTRLTHSRIRRT------LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
++ RL S T +V SP N +P+ DWR+KGF TP +NQ+ CG+CYA
Sbjct: 96 LQGFLRLLRSPPNSTTDNIADIVGSPLMNN---VPESFDWRKKGFNTPPYNQQSCGSCYA 152
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
FS+A +I+GQ+FK T ++ LS QQ+VDCS+ GN GC GGSLRNTL Y+Q GGLM+
Sbjct: 153 FSVAQSIEGQVFKRTGKLLALSEQQIVDCSVSHGNHGCIGGSLRNTLTYLQATGGLMRSL 212
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
DY Y K+ C+F + VV+++SW++LP DE+A++ + VGP+AVSINA+P TFQLY
Sbjct: 213 DYKYAAKKGDCQFVKELAVVNVTSWAILPANDENAIQAAVVHVGPVAVSINATPKTFQLY 272
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
++GIYDD AC+S VNHAMLL+G+ ++ WILKNWW WG++G+M +++G N CGIANYA
Sbjct: 273 SAGIYDDVACSSTSVNHAMLLIGFDKDFWILKNWWGELWGESGFMRIRKGINLCGIANYA 332
Query: 305 VYALI 309
YA++
Sbjct: 333 AYAIV 337
>gi|193617639|ref|XP_001952206.1| PREDICTED: cathepsin L-like [Acyrthosiphon pisum]
Length = 226
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 128/208 (61%), Positives = 166/208 (79%)
Query: 102 LDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
+DWR++GF TP WNQ DCGACYAFSIAS ++GQ+FK+T ++ LS QQ++DCSI GNLG
Sbjct: 4 VDWRKRGFNTPGWNQLDCGACYAFSIASMLEGQLFKATGKLHTLSSQQIIDCSIAYGNLG 63
Query: 162 CAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALK 221
C+GGSL+NTL Y++ GG+M+ +Y YK ++++C FK+ V I S+LP DEHALK
Sbjct: 64 CSGGSLKNTLQYLKRVGGIMQGIEYSYKARKTLCHFKKFRAVTQIEKISILPQSDEHALK 123
Query: 222 VTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSH 281
V +A +GPI+VS+NASP TFQLY+SG+YDD AC+S VNHAMLLVGYT+++WILKNWWS
Sbjct: 124 VAVALIGPISVSVNASPKTFQLYSSGVYDDPACSSSTVNHAMLLVGYTKDAWILKNWWSS 183
Query: 282 HWGDNGYMYLKRGNNRCGIANYAVYALI 309
WGD+GYMYL RG N+C ++ YA YA I
Sbjct: 184 KWGDDGYMYLARGKNQCAVSTYAAYATI 211
>gi|357611722|gb|EHJ67628.1| cathepsin L-like proteinase [Danaus plexippus]
Length = 354
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 139/301 (46%), Positives = 181/301 (60%), Gaps = 12/301 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY K Y+ + + W+SN K++ HNQE G YTL NH D YIK+
Sbjct: 55 KTKYNKKYQSRFHERSALSTWRSNMKRVAGHNQEYLAGKQAYTLHLNHFGDWSIFSYIKQ 114
Query: 74 MTRLTHSRI--------RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
+ +L + RRT R+ +P+ +DWREKGF Q CGACYAF
Sbjct: 115 LLKLIRTLPLFDPAEDRRRTTYRNTFDTR---LPERVDWREKGFRPRLEEQFHCGACYAF 171
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
+I A+Q Q++K + ELS QQ+VDCS GN GC GGSL+ L YV GL++E
Sbjct: 172 AITHAVQAQVYKRHGDWRELSPQQIVDCSFKDGNFGCDGGSLQAALRYVA-RDGLIRETY 230
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY G++ C + ++ + WS LPP DE A++ LAT+GP+AV++NA+P TFQLY
Sbjct: 231 YPYIGQRGACHYNSESVSARVRRWSSLPPGDEAAMERALATLGPLAVAVNAAPFTFQLYR 290
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
SGIYDD C S ++NHAMLLVGYT WIL NWW WG+NGYM +KRG N CG+AN A
Sbjct: 291 SGIYDDPFCVSWHLNHAMLLVGYTPEYWILLNWWGEQWGENGYMRIKRGLNICGVANMAT 350
Query: 306 Y 306
Y
Sbjct: 351 Y 351
>gi|242003816|ref|XP_002422872.1| Cathepsin J precursor, putative [Pediculus humanus corporis]
gi|212505754|gb|EEB10134.1| Cathepsin J precursor, putative [Pediculus humanus corporis]
Length = 257
Score = 265 bits (676), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 115/212 (54%), Positives = 159/212 (75%), Gaps = 5/212 (2%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
IP+ +W E+GF TP W+Q++CG+CYAFSIAS+ QGQ F+ T ++ +LS+QQ+VDCS+ +
Sbjct: 34 IPEEWNWLEQGFFTPSWDQQNCGSCYAFSIASSAQGQFFRKTGKLRDLSVQQIVDCSVTN 93
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GNLGC GGSLRN+L Y GGLM +YPY +Q ICK++ N V+++S+ +LP DE
Sbjct: 94 GNLGCYGGSLRNSLKYCMKVGGLMSANEYPYSARQKICKYRPWNRSVNVTSYVILPEYDE 153
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKN 277
A++ +ATVGP+A S++ASP+T GIYDD C+ VNHAML+VGYT+++WILKN
Sbjct: 154 EAIQEAVATVGPVACSVDASPYT-----RGIYDDPNCSHTKVNHAMLIVGYTKDAWILKN 208
Query: 278 WWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
WW HWG +GYM+LK+G N+C IA YA Y ++
Sbjct: 209 WWGEHWGIDGYMFLKKGVNQCAIAKYAGYPVV 240
>gi|148298724|ref|NP_001091806.1| cathepsin L-like proteinase precursor [Bombyx mori]
gi|116272515|gb|ABJ97193.1| cathepsin L-like proteinase [Bombyx mori]
Length = 402
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/285 (44%), Positives = 174/285 (61%), Gaps = 15/285 (5%)
Query: 32 LHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPE 91
+ W+ N +++ HN+E G+ Y+L NH D+H Y ++ +L I+ + P
Sbjct: 122 MKWRQNLRRVARHNREYLAGIQSYSLHLNHFGDMHVTEYFGKVLKL----IKAFPLFDPA 177
Query: 92 S---------NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
N +P +DWR++GF Q CGACYAF++ A+Q Q++K E
Sbjct: 178 EDHHKTAYRHNRRCKVPKRIDWRDQGFKPRREEQWQCGACYAFAVTHALQAQLYKRHGEW 237
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
ELS QQ+VDCSI GN+GC GGSLR L Y GL+ E YPY GK+ C++ +
Sbjct: 238 NELSPQQIVDCSIKDGNMGCDGGSLRGALRYAA-REGLVMESHYPYVGKKGYCRYDSNLV 296
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
W+ LP DE A++ LATVGP+AV++NA+P TFQLY SG+YDD C S ++NHA
Sbjct: 297 RARPRRWATLPSGDEEAMEKALATVGPLAVAVNAAPFTFQLY-SGVYDDPFCVSWHLNHA 355
Query: 263 MLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYA 307
MLLVGYT++ WIL NWW +WG++GYM ++RG NRCG+AN A Y
Sbjct: 356 MLLVGYTQDYWILLNWWGRNWGEDGYMRIRRGLNRCGVANMATYV 400
>gi|401758210|gb|AFQ01140.1| cathepsin L4-like protease, partial [Chilo suppressalis]
Length = 325
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 132/304 (43%), Positives = 172/304 (56%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
Q + K Y + W+ N K I HN+ G YTL N +D HP Y K+
Sbjct: 26 QTIFDKAYDSHRDEMAALSQWRRNLKLIAEHNKRFLAGEISYTLHLNQFADWHPEEYFKK 85
Query: 74 MTRLTHS--------RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
+ +L + I R+ R + IPD +DW KGF Q CGACYAF
Sbjct: 86 ILKLFDTIPLLDPALDIHRSHYRHLANKA---IPDRVDWSAKGFKPKLEEQWQCGACYAF 142
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
++A A+Q Q++K ELS QQ+VDCS GN GC GGSLR Y +G L+ E+
Sbjct: 143 AVAHAVQAQLYKKHGLWGELSPQQIVDCSAADGNEGCDGGSLRGAFRYAARSG-LVSEQY 201
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY GK+ CK +W++LP DE A++ LAT+GP+AV +NASP TFQLY
Sbjct: 202 YPYTGKKGHCKSSGLLARTKPKNWAMLPFGDEDAMEKALATIGPLAVGVNASPFTFQLYR 261
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
SG+YDD C +NHAMLLVGYT + WIL NWW WG++GYM ++RG NRCG+AN A
Sbjct: 262 SGVYDDPFCVPWALNHAMLLVGYTPDYWILLNWWGKKWGEDGYMRIRRGYNRCGVANMAA 321
Query: 306 YALI 309
Y ++
Sbjct: 322 YVVL 325
>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
Length = 330
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 122/302 (40%), Positives = 174/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 31 KKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 90
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 91 MTGLKVPPSHSRSNDTLYIPDWEGRT---PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVG 147
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 148 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 205
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 206 GQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 265
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C SD +NHA+L VGY + WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 325
Query: 305 VY 306
+
Sbjct: 326 SF 327
>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
Length = 330
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 174/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 31 KKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 90
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 91 MTGLKVPPSHSRSNDTLYIPDWESRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 147
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 148 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 205
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+ G+Y
Sbjct: 206 GQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 265
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 325
Query: 305 VY 306
+
Sbjct: 326 SF 327
>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
Length = 329
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 124/301 (41%), Positives = 172/301 (57%), Gaps = 12/301 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y K + ++L W+ N KKI HN EA G H Y L NHL D+ +++
Sbjct: 30 KKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MT--RLTHSR-IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
MT R+ SR + +PE V PD +D+R+KG++TP NQ CG+C+AFS A A
Sbjct: 90 MTGLRVPPSRSFSNDTLYTPEWEGRV--PDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K T ++ LS Q +VDC +S N GC GG + YVQ GG+ E+ YPY G
Sbjct: 148 LEGQLKKKTGKLLALSPQNLVDC--VSENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVG 205
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C + + +P +E ALK +A VGP++VSI+AS +FQ Y+ G+Y
Sbjct: 206 QDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVYY 265
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
DE C D VNHA+L+VGY WI+KN W WG+ GY+ L R NN CGI N A
Sbjct: 266 DENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLARNKNNACGITNLAS 325
Query: 306 Y 306
+
Sbjct: 326 F 326
>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
Length = 329
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 176/302 (58%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL PE PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPLSHSRSNDTLY-IPEWEGRA--PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
Precursor
gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
Length = 329
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 176/302 (58%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL PE PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPLSHSRSNDTLY-IPEWEGRA--PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
Length = 330
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 176/302 (58%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL PE PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPLSHSRSNDTLY-IPEWEGRA--PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
Length = 343
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 44 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 103
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 104 MTGLKVPLSHSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 160
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 161 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 218
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 219 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 278
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 279 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 338
Query: 305 VY 306
+
Sbjct: 339 SF 340
>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
Length = 343
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 44 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 103
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 104 MTGLKVPLSHSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 160
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 161 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 218
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 219 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 278
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 279 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 338
Query: 305 VY 306
+
Sbjct: 339 SF 340
>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
Length = 314
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 176/302 (58%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 15 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 74
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL PE PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 75 MTGLKVPLSHSRSNDTLY-IPEWEGRA--PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 131
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 132 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 189
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 190 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 249
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 250 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 309
Query: 305 VY 306
+
Sbjct: 310 SF 311
>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
Length = 329
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPPSHSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
Length = 329
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPLSHSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|31982433|ref|NP_031828.2| cathepsin K precursor [Mus musculus]
gi|12644320|sp|P55097.2|CATK_MOUSE RecName: Full=Cathepsin K; Flags: Precursor
gi|3550487|emb|CAA06825.1| cathepsin K [Mus musculus]
gi|12834090|dbj|BAB22783.1| unnamed protein product [Mus musculus]
gi|28277388|gb|AAH46320.1| Cathepsin K [Mus musculus]
gi|74209960|dbj|BAE21279.1| unnamed protein product [Mus musculus]
gi|148706870|gb|EDL38817.1| cathepsin K, isoform CRA_a [Mus musculus]
Length = 329
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/301 (40%), Positives = 173/301 (57%), Gaps = 12/301 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K+I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MT--RLTHSR-IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
MT R+ SR + +PE V PD +D+R+KG++TP NQ CG+C+AFS A A
Sbjct: 90 MTGLRIPPSRSYSNDTLYTPEWEGRV--PDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K T ++ LS Q +VDC ++ N GC GG + YVQ GG+ E+ YPY G
Sbjct: 148 LEGQLKKKTGKLLALSPQNLVDC--VTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVG 205
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C + + +P +E ALK +A VGPI+VSI+AS +FQ Y+ G+Y
Sbjct: 206 QDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYY 265
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
DE C D VNHA+L+VGY WI+KN W WG+ GY L R NN CGI N A
Sbjct: 266 DENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMAS 325
Query: 306 Y 306
+
Sbjct: 326 F 326
>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
Length = 330
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 173/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 31 KKTYGKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 90
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 91 MTGLKVPPSHSRNNDTLYIPDWESRA---PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVG 147
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 148 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 205
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+ G+Y
Sbjct: 206 GQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 265
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 266 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 325
Query: 305 VY 306
+
Sbjct: 326 SF 327
>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
Length = 329
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPLSHSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFEYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSRGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 FDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
Length = 330
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/305 (40%), Positives = 173/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y K ++ ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 31 KKSYGKQYDSKVDETSRRLIWEKNLKHISIHNLEAALGVHTYELAMNHLGDMTSEEVVQK 90
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L R SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 91 MTGLKVPPSRS------RSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 144
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 145 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 202
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y
Sbjct: 203 PYVGQDESCMYNPTGKAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRK 262
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C SD +NHA+L VGY R WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 263 GVYYDENCNSDNLNHAVLAVGYGIQKGRKHWIIKNSWGENWGNKGYVLMARNKNNACGIA 322
Query: 302 NYAVY 306
N A +
Sbjct: 323 NLASF 327
>gi|332024535|gb|EGI64733.1| Cathepsin J [Acromyrmex echinatior]
Length = 169
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 100/165 (60%), Positives = 125/165 (75%)
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
+S QQ+VDCS I+GNLGC GGSLRNTL Y++ + GLM YPY +Q C+F++ VV
Sbjct: 5 VSAQQLVDCSTITGNLGCTGGSLRNTLKYLEKSKGLMARSTYPYDAEQGECRFEKDQSVV 64
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+I+SW++LP +DE AL++ +AT+GP+A SINASP TFQLY G+YDD C+SD VNHAML
Sbjct: 65 NITSWAILPARDEKALQIAVATIGPVAASINASPKTFQLYHKGVYDDHRCSSDMVNHAML 124
Query: 265 LVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+VGYT WILKNWW WG+NGYM L R NRCGIANYA Y +
Sbjct: 125 IVGYTPTEWILKNWWGDSWGENGYMRLARNKNRCGIANYAAYVKV 169
>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
Length = 338
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 125/304 (41%), Positives = 176/304 (57%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K YR D ++ W+ N I HN EA GLH Y L NH+ DL P ++
Sbjct: 38 KKMHGKTYRNYVEDESRRELWEKNLVLITMHNLEASMGLHTYKLSMNHMGDLTPEEIMQS 97
Query: 74 MTRLTH-SRIRRTLVRSPESNES-VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
LT + I+R SP + S +PD +DWREKG +T Q CG+C+AFS A A+
Sbjct: 98 FATLTPPTDIQR--APSPFAGTSGAAVPDTMDWREKGCVTSVKMQGACGSCWAFSAAGAL 155
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K+T ++ +LS Q +VDCS GN GC GG + YV G+ + YPY G+
Sbjct: 156 EGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNHGIDSDAAYPYTGR 215
Query: 192 QSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
QS C + + S +S LP DE ALK LAT+GPI+V+I+A F Y+SG+YD
Sbjct: 216 QSQECHYSPKFRAANCSQYSFLPEGDEGALKQALATIGPISVAIDARRPRFAFYSSGVYD 275
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D +C+ D VNH +L VGY ++ W++KN W +GDNGY+ + R N++CGIA Y
Sbjct: 276 DPSCSQD-VNHGVLAVGYGTLNGQDYWLVKNSWGQTFGDNGYIRMARNKNDQCGIARYGC 334
Query: 306 YALI 309
Y ++
Sbjct: 335 YPIM 338
>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
Length = 329
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTNEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPASHSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|354472953|ref|XP_003498701.1| PREDICTED: cathepsin K [Cricetulus griseus]
Length = 329
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 125/302 (41%), Positives = 172/302 (56%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HS TL PE PD +D+R+KG++TP NQ +CG+C+AFS A
Sbjct: 90 MTGLKLPPSHSHSNDTLY-IPEWEGRA--PDAIDYRKKGYVTPVKNQGECGSCWAFSSAG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + YVQ GG+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTTAFRYVQTNGGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P E ALK +A VGPI+VSI+AS +FQ Y+ G+Y
Sbjct: 205 GQDQSCMYNPTAKAAKCRGYREIPVGSEKALKRAVARVGPISVSIDASLTSFQFYSRGVY 264
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C D VNHA+L+VGY WI+KN W WG+ GY+ L R NN CGI N A
Sbjct: 265 YDENCDGDNVNHAVLVVGYGAQKGNKHWIIKNSWGESWGNKGYVLLARNRNNACGITNLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
Length = 329
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 173/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +H+R TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPPSHTRSNDTLYIPDWEGRA---PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQDESCMYNPTGKAAKCRGYREIPQGNEKALKRAVARVGPVSVAIDASLTSFQFYSRGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANMA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
Length = 329
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 125/309 (40%), Positives = 175/309 (56%), Gaps = 28/309 (9%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 31 KKTYGKQYNNKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 90
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGAC 122
MT L +HSR SN+S+ IPD +D+R+KG++TP NQ CG+C
Sbjct: 91 MTGLKVPPSHSR----------SNDSLYIPDWESRAPDSIDYRKKGYVTPVKNQGQCGSC 140
Query: 123 YAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMK 182
+AFS A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+
Sbjct: 141 WAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDS 198
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E+ YPY G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ
Sbjct: 199 EDAYPYVGQDESCMYNPTGKAAKCKGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQ 258
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNR 297
Y+ G+Y DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN
Sbjct: 259 FYSKGVYYDENCNSDNLNHAVLAVGYGVQKGNKHWIIKNSWGENWGNKGYILMARNKNNA 318
Query: 298 CGIANYAVY 306
CGIAN A +
Sbjct: 319 CGIANLASF 327
>gi|186688053|gb|ACC86112.1| cathepsin K [Paralichthys olivaceus]
Length = 330
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 114/299 (38%), Positives = 168/299 (56%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++++Y + ++ W+ N + I HN+EA G+H Y L NHL D+ ++MT
Sbjct: 34 HRREYNGLGEEGIRRAVWEKNMRMIVAHNEEAALGMHSYELGMNHLGDMTSEEVAEKMTG 93
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L R+ +P +D+R++G +TP NQ CG+C+AFS A A++GQ+
Sbjct: 94 LQVPLERQRSFTMALDERVSKLPKFVDYRKEGMVTPVKNQGSCGSCWAFSSAGALEGQLA 153
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K T ++ +LS Q VDC ++ N GC GG + N YVQ GG+ EE YPY G+ C+
Sbjct: 154 KKTGQLMDLSPQNPVDC--VTENNGCGGGYMTNAFQYVQENGGIDSEEAYPYVGEDQSCR 211
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + + +P DEHAL V L VGP++V I+AS +FQ Y G+Y D C
Sbjct: 212 YNSSGMAAQCKGYKEVPVGDEHALAVALFKVGPVSVGIDASQSSFQFYQRGVYYDRNCNK 271
Query: 257 DYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
D +NHA+L VGY +S WI+KN WS +WG GY+ + R +N CGIAN A Y ++
Sbjct: 272 DDINHAVLAVGYGISSKGKKYWIIKNSWSENWGKKGYILMARNRDNLCGIANLASYPIM 330
>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
Length = 348
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 49 KKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTNEEVVQK 108
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 109 MTGLKVPASHSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 165
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 166 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 223
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 224 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 283
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 284 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 343
Query: 305 VY 306
+
Sbjct: 344 SF 345
>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
Precursor
gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
Length = 329
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 172/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L R SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPPSRS------HSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFS 143
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 144 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRGIDSEDAY 201
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+
Sbjct: 202 PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSK 261
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C+SD VNHA+L VGY WI+KN W WG+ GY+ + R NN CGIA
Sbjct: 262 GVYYDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARNKNNACGIA 321
Query: 302 NYAVY 306
N A +
Sbjct: 322 NLASF 326
>gi|836934|gb|AAA95998.1| cathepsin X [Homo sapiens]
Length = 329
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + +L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNNKVDEISPRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR TL PE PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPLSHSRSNDTLY-IPEWEGRA--PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
Length = 331
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 174/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 32 KKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 91
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HSR T + +PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 92 MTGLKVPPSHSRSNDTRYVPDWEGK---VPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVG 148
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N +YVQ G+ E+ YPY
Sbjct: 149 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFHYVQKNQGIDSEDAYPYV 206
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+ G+Y
Sbjct: 207 GQDESCMYNPTGKAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 266
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
D+ C SD +NHA+L VGY + WI+KN W WG+ GY+ + R NN CGIAN A
Sbjct: 267 YDKNCNSDNLNHAVLAVGYGIQKRKKHWIIKNSWGESWGNKGYILMARNKNNACGIANLA 326
Query: 305 VY 306
+
Sbjct: 327 SF 328
>gi|1149525|emb|CAA64218.1| preprocathepsin K [Mus musculus]
Length = 329
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 122/301 (40%), Positives = 173/301 (57%), Gaps = 12/301 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K+I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MT--RLTHSR-IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
MT R+ SR + +PE V PD +D+R+KG++TP NQ CG+C+AFS A A
Sbjct: 90 MTGLRIPPSRSYSNDTLYTPEWEGRV--PDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K T ++ LS Q +VDC ++ N GC GG + YVQ GG+ E+ +PY G
Sbjct: 148 LEGQLKKKTGKLLALSPQNLVDC--VTENYGCGGGYMTTAFQYVQQNGGIDSEDAFPYVG 205
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C + + +P +E ALK +A VGPI+VSI+AS +FQ Y+ G+Y
Sbjct: 206 QDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYY 265
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
DE C D VNHA+L+VGY WI+KN W WG+ GY L R NN CGI N A
Sbjct: 266 DENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMAS 325
Query: 306 Y 306
+
Sbjct: 326 F 326
>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
Length = 329
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 174/305 (57%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKEYDSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L R SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPPSRS------HSNDTLYIPDWEGRAPDSIDYRKKGYVTPVKNQGQCGSCWAFS 143
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 144 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSDNDGCGGGYMTNAFQYVQKNRGIDSEDAY 201
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGPI+V I+AS +FQ Y+
Sbjct: 202 PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVGIDASLTSFQFYSK 261
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE+C SD VNHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 262 GVYYDESCNSDNVNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 321
Query: 302 NYAVY 306
N A +
Sbjct: 322 NLASF 326
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 118/294 (40%), Positives = 172/294 (58%), Gaps = 16/294 (5%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM----TRLTHSRI 82
DS ++ W+ N K I HNQE G H + LR N D+ + + M + + R
Sbjct: 45 DSLRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYKSNGSQRRT 104
Query: 83 RRTLVRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
+ +L R ES+L +P+ +DWREKG++TP Q DCGAC++FS AI+GQ F+ T
Sbjct: 105 KGSLYR-----ESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTG 159
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
++ LSIQ ++DC+I GN GC GG + N YVQ GG+ EE YPY + + CK+K
Sbjct: 160 KLVSLSIQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECYPYVAQDTECKYKPE 219
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
+I+ + +P DE AL +ATVGPI+V I+++ +F+ Y SG+Y + C+S ++
Sbjct: 220 CSGANITGFVDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQSGVYYEPDCSSSQLD 279
Query: 261 HAMLLVGYTR----NSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
H +L+VGY WI+KN W WGDNGY+ + K +N CGIA A Y +
Sbjct: 280 HGVLVVGYGSIGKDEYWIVKNSWGEAWGDNGYILMAKDKDNHCGIATEASYPKV 333
>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
Length = 329
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 175/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L ++SR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPTSYSRSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGILKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
Length = 383
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 178/305 (58%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H + L NHL D+ +++
Sbjct: 84 KKTHRKQYTSKVDEISRRLIWEKNLKYISIHNLEASLGVHTFELAMNHLGDMTSEEVVQK 143
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L ++ + RS N+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 144 MTGL---KVPTSFSRS---NDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 197
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 198 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 255
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G++ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+
Sbjct: 256 PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSK 315
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 316 GVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 375
Query: 302 NYAVY 306
N A +
Sbjct: 376 NLASF 380
>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
Length = 338
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 172/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 39 KKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 98
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L + SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 99 MTGL------KVPASRSRSNDTLYIPDWEGRAPDSIDYRKKGYVTPVKNQGQCGSCWAFS 152
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 153 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 210
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y
Sbjct: 211 PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRK 270
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 271 GVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 330
Query: 302 NYAVY 306
N A +
Sbjct: 331 NLASF 335
>gi|392333757|ref|XP_003752991.1| PREDICTED: cathepsin M-like [Rattus norvegicus]
Length = 333
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 177/303 (58%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KYEKTYSLE-EEGQKRAVWEQNMKKIKLHNGENGLGKHGFTMEMNAFGDMTIEEFRKLMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ S + ++V +P+ ++WR++G++TP Q C AC+AFS+A AI+GQ+
Sbjct: 94 EIPIPTVKKE--NSVQKRQAVNVPNFINWRKRGYVTPVRRQGRCNACWAFSVAGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GNLGC G+ L YV+ GGL E YPY+GK+ C
Sbjct: 152 FRKTGQLIPLSVQNLVDCSRTQGNLGCYLGNTYFALQYVKENGGLESEATYPYEGKEGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + P++EHAL +AT+GPI+V+I+A +F Y +GIY + C
Sbjct: 212 RYHPDNSTASIAGIEFV-PKNEHALMNAVATLGPISVAIDARHESFLFYRNGIYHEPNCN 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
S V H+MLLVGY R WI+KN + WG+ GYM + K N CGIA YA+Y
Sbjct: 271 SSVVTHSMLLVGYGFVGEESDGRKYWIVKNSMGNKWGNRGYMKIAKXQGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 115/302 (38%), Positives = 175/302 (57%), Gaps = 7/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ + K Y K+ D+++K+ W+ N +K+ HN E GLH YTL N +DL +++
Sbjct: 32 KQTHSKQYTKEEEDNRRKI-WEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEEFVQM 90
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M L R S PD +DWR++G++TP +Q CG+C+AFS +++G
Sbjct: 91 MNGLKFDASRERQGIKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSCWAFSTTGSLEG 150
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q F+ST + LS Q +VDCSI GN GC GG + Y++ G+ E+ YPY+ +
Sbjct: 151 QHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTEDKYPYEAEDD 210
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+F N+ S + + DE ALK A GPI+V+I+AS +FQLY SG+YD+E+
Sbjct: 211 TCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESFQLYESGVYDEES 270
Query: 254 CTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+S ++H +L+VGY +S WI+KN W WG GY+++ R +N+CGIA A Y
Sbjct: 271 CSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRNKDNQCGIATSASYP 330
Query: 308 LI 309
+
Sbjct: 331 TV 332
>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
Length = 329
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 172/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L + SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGL------KVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 143
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 144 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 201
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y
Sbjct: 202 PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRK 261
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 262 GVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 321
Query: 302 NYAVY 306
N A +
Sbjct: 322 NLASF 326
>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
Length = 334
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 172/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 35 KKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 94
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L + SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 95 MTGL------KVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 148
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 149 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 206
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y
Sbjct: 207 PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRK 266
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 267 GVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 326
Query: 302 NYAVY 306
N A +
Sbjct: 327 NLASF 331
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 120/304 (39%), Positives = 179/304 (58%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y + + ++ W+ N + I HN EA G+H Y L NH+ D+ R I +
Sbjct: 30 KKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDM-TREEILQ 88
Query: 74 MTRLTHSRIRRTLVR--SP-ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M +R+R L R SP ++ + +PD +DWREKG++T NQ CG+C+AFS A A
Sbjct: 89 M--FAGTRVRPNLTRRSSPFVASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAFSAAGA 146
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ ++T +++ LS Q +VDCS GN GC GG + YV GG+ +E YPY
Sbjct: 147 LEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDDGGIDSDEAYPYTA 206
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C++ + + SS++ + DE ALK +AT+GPI+V+I+A+ F LY SG+Y
Sbjct: 207 MDGQCRYDQSQRAANCSSYNYVSEGDEEALKQAVATIGPISVAIDATRPMFILYHSGVYS 266
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D CT + VNH +L+VGY + W++KN W +GD GY+ + R N CGIANYA
Sbjct: 267 DPTCTQN-VNHGVLVVGYGSLNGEDYWLVKNSWGTRFGDGGYIRIARNKGNMCGIANYAC 325
Query: 306 YALI 309
Y L+
Sbjct: 326 YPLM 329
>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
Length = 330
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 172/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 31 KKTYGKQYNNKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 90
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L R SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 91 MTGLKVPPSRS------RSNDTLYIPDWESRAPDSIDYRKKGYVTPVKNQGQCGSCWAFS 144
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 145 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 202
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+
Sbjct: 203 PYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSK 262
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 263 GVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 322
Query: 302 NYAVY 306
N A +
Sbjct: 323 NLASF 327
>gi|392354126|ref|XP_573974.4| PREDICTED: cathepsin M-like [Rattus norvegicus]
Length = 333
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 177/303 (58%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KYEKTYSLE-EEGQKRAVWEQNMKKIKLHNGENGLGKHGFTMEMNAFGDMTIEEFRKLMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ S + ++V +P+ ++WR++G++TP Q C AC+AFS+A AI+GQ+
Sbjct: 94 EIPIPTVKKE--NSVQKRQAVNVPNFINWRKRGYVTPVRRQGRCNACWAFSVAGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GNLGC G+ L YV+ GGL E YPY+GK+ C
Sbjct: 152 FRKTGQLIPLSVQNLVDCSRTQGNLGCYLGNTYFALQYVKENGGLESEATYPYEGKEGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + P++EHAL +AT+GPI+V+I+A +F Y +GIY + C
Sbjct: 212 RYHPDNSTASIAGIEFV-PKNEHALMNAVATLGPISVAIDARHESFLFYRNGIYHEPNCN 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
S V H+MLLVGY R WI+KN + WG+ GYM + K N CGIA YA+Y
Sbjct: 271 SSVVTHSMLLVGYGFVGEESDGRKYWIVKNSMGNKWGNRGYMKIAKDQGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|118140100|gb|ABK63481.1| cathepsin S [Channa argus]
Length = 335
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 176/302 (58%), Gaps = 10/302 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y+ + D+ ++ W+ N K I HN EA G+H Y L N + DL +K
Sbjct: 38 KKTHNKMYQNEVEDAHRRELWEKNLKFISMHNLEASMGIHTYELGMNQMGDLTQEEILKT 97
Query: 74 MTRLTHSRIRRTLVRSPESNES-VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
L R + R+P + +S V P +DWR+ G +T NQ CG+C+AFS A++
Sbjct: 98 YATL---RPPTDVHRTPFTRKSGVAAPGAMDWRDLGCVTSVKNQGSCGSCWAFSAVGALE 154
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ K+T ++ +LS Q +VDCS GN GC GG + N YV G+ E YPY G +
Sbjct: 155 GQLAKTTGKLVDLSPQNLVDCSGKYGNHGCDGGFMTNAFQYVIENQGIESEASYPYIGLE 214
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + S + LP +DE ALK +AT+GPI+V+I+AS TF Y+SG+YDD
Sbjct: 215 QQCHYNPEESAANCSQYHFLPEKDEEALKEAIATIGPISVAIDASKPTFTFYSSGVYDDP 274
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C S+ +NH +L VGY T++SW++KN W ++GD+GY+ + R N+CGIA Y Y
Sbjct: 275 TC-SEVINHGVLAVGYGTQSTQDSWLVKNSWGTYFGDSGYIRMSRNKGNQCGIALYGCYP 333
Query: 308 LI 309
LI
Sbjct: 334 LI 335
>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
Length = 330
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 172/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 31 KKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 90
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L + SN+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 91 MTGL------KVPASRSRSNDTLYIPDWEGRTPDSVDYRKKGYVTPVKNQGQCGSCWAFS 144
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 145 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 202
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y
Sbjct: 203 PYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRK 262
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 263 GVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 322
Query: 302 NYAVY 306
N A +
Sbjct: 323 NLASF 327
>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
Length = 329
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 178/305 (58%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H + L NHL D+ +++
Sbjct: 30 KKTHRKQYTSKVDEISRRLIWEKNLKYISIHNLEASLGVHTFELAMNHLGDMTSEEVVQK 89
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L ++ + RS N+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGL---KVPTSFSRS---NDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 143
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 144 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAY 201
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G++ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+
Sbjct: 202 PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSK 261
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIA
Sbjct: 262 GVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 321
Query: 302 NYAVY 306
N A +
Sbjct: 322 NLASF 326
>gi|348531519|ref|XP_003453256.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 177/306 (57%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K++K Y ++ ++ +K W +N K + HN A QGL Y L H +D+ Y + ++
Sbjct: 32 KFEKSYDSESDEAHRKQVWLNNRKFVLMHNILADQGLKSYRLGMTHFADMDNEEYKQLVS 91
Query: 76 RLTHSRIRRTLVRSPESNESVL-------IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ +L PE + L +PD +DWR+KG++T +Q+ CG+C+AFS
Sbjct: 92 QGCLHTFNASL---PERGSAFLGLPEGTALPDTVDWRDKGYVTEVKDQKQCGSCWAFSTT 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
++GQ F+ T ++ LS QQ++DCS GN GC GGS++ L Y+Q GG+ E YPY
Sbjct: 149 GVLEGQHFRKTGKLVSLSEQQLMDCSHSFGNNGCNGGSVKRALQYIQANGGIDTETSYPY 208
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K K C++K I + + + P +E LK +AT+GPI+V I+AS H+FQ Y SG+
Sbjct: 209 KAKGQRCRYKPDGIGAKCTGYVHVKPSNEETLKKAVATLGPISVGIDASRHSFQFYQSGV 268
Query: 249 YDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
YDD C+ ++H L VGY T N W++KN W WGD GY+ + R +N+CGIA+
Sbjct: 269 YDDPDCSKTVLDHGALAVGYGTENGHDYWLIKNSWGLRWGDKGYIKMSRNKSNQCGIASE 328
Query: 304 AVYALI 309
A Y L+
Sbjct: 329 ASYPLV 334
>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
Length = 329
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/302 (39%), Positives = 173/302 (57%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + +++ W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTYRKQYNGKVDEISRRIIWEKNLKYISIHNLEASLGVHTYELSMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HS TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPPSHSHSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQENRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G++ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQEESCMYNPTGKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLSSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C + +NHA+L VGY WILKN W +WG+ GY+ L R NN CGIAN A
Sbjct: 265 YDESCNGEDLNHALLAVGYGMQRGNKHWILKNSWGENWGNKGYVLLARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|441593109|ref|XP_003260582.2| PREDICTED: cathepsin L2 isoform 1 [Nomascus leucogenys]
Length = 334
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 169/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMGKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+PP E AL +ATVGPI+V+++A +FQ Y GIY + C+S+ ++H +L+VGY
Sbjct: 225 FTVVPPGKEKALMKAVATVGPISVAMDAGHSSFQFYNQGIYFEPDCSSENLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 118/296 (39%), Positives = 172/296 (58%), Gaps = 18/296 (6%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM----TRLTHSRI 82
DS ++ W+ N K I HNQE G H + LR N D+ + + M + + R
Sbjct: 45 DSWRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYKSNGSQKRT 104
Query: 83 RRTLVRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
+ +L R ES+L +P+ +DWREKG++TP Q C +C+AFS A AI+GQ F+ T
Sbjct: 105 KGSLYR-----ESLLAQLPESVDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTG 159
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
++ LS+Q +VDCSI GN GC GG + N YVQ GG+ EE YPY + + CK++
Sbjct: 160 KLVSLSVQNLVDCSIPEGNNGCDGGLMGNAFQYVQDNGGIDTEECYPYVAQDNECKYQPE 219
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
+++ + +P DE AL +A VGPI+V+I+A +F+ Y SG+Y D C+S +N
Sbjct: 220 CSGANVTGFVKIPSTDERALMKAVANVGPISVAIDAGNPSFKFYQSGVYYDPQCSSSQLN 279
Query: 261 HAMLLVGY------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
H +L+VGY R WI+KN W +WGDNGY+ + K +N CGI A Y ++
Sbjct: 280 HGVLVVGYGSEGKNGRKYWIVKNSWGENWGDNGYVLMAKDEDNHCGIITDASYPIV 335
>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
Length = 332
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 173/302 (57%), Gaps = 20/302 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y+K+Y K + ++L W+ N K I THN E GLH + L NHL D+ +++MT
Sbjct: 36 YRKEYNSKVDEISRRLIWEKNLKYISTHNLEFSLGLHTFELAMNHLGDMTSEEVVQKMTG 95
Query: 77 LTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFSIAS 129
L + L RS ++N+++ PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 96 L-----KVPLSRS-QNNDTLYFPDWETKTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVG 149
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 150 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYI 207
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P E ALK +A VGP+AV+I+AS +FQ Y+ G+Y
Sbjct: 208 GEDESCMYNPTGKAAKCRGYREIPEGSEKALKRAVARVGPVAVAIDASLSSFQFYSKGVY 267
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C SD +NHA+L VGY WI+KN W WG+ GY+ + R NN CGIAN A
Sbjct: 268 YDENCNSDNLNHAVLAVGYGIQRGTKHWIIKNSWGEQWGNKGYILMARNKNNACGIANLA 327
Query: 305 VY 306
+
Sbjct: 328 SF 329
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 171/302 (56%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y+K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 30 KKTYQKQYNGKVDELSRRLIWEKNLKYISIHNLEASLGVHTYELSMNHLGDMTNEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L HS TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPPAHSHSNDTLYIPDWEGRA---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQQNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGPI+V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQDESCMYNPTGKAAKCRGYREVPVGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C D +NHA+L VGY WILKN W +WG+ GY+ L R NN CGIAN A
Sbjct: 265 YDESCDGDNLNHAVLAVGYGIQRGHKHWILKNSWGENWGNKGYVLLARNKNNTCGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
Length = 329
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 121/302 (40%), Positives = 171/302 (56%), Gaps = 14/302 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y K + ++L W+ N K I HN EA G H Y L NHL D+ +++
Sbjct: 30 KKTYGKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGAHTYELAMNHLGDMTSEEVVQK 89
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L + SR TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 90 MTGLKVPPSDSRNNDTLYIPDWEGRA---PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVG 146
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 147 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 204
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 205 GQDESCMYNPTGKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 264
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A
Sbjct: 265 YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 324
Query: 305 VY 306
+
Sbjct: 325 SF 326
>gi|348531515|ref|XP_003453254.1| PREDICTED: cathepsin L2-like [Oreochromis niloticus]
Length = 333
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 121/306 (39%), Positives = 179/306 (58%), Gaps = 16/306 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+KK Y + ++ +K W +N K + HN A QGL + L + +D+ + Y K ++
Sbjct: 32 KFKKSYDSPSEETHRKQVWLNNRKLVLIHNALADQGLKSFHLGMTYFADMENQEYKKLIS 91
Query: 76 R-----LTHSRIRR--TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ S RR T R P+ + +P +DWR++G++T +Q++CG+C+AFS
Sbjct: 92 QGCLGSFNASLHRRGSTFNRLPKGTK---LPKTVDWRKQGYVTKVKHQKECGSCWAFSAT 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F+ T ++ LS QQ+VDCS GN GC GG + Y+++ GGL E+ YPY
Sbjct: 149 GALEGQHFRKTRKLVSLSEQQLVDCSRSFGNHGCNGGWMNPAFQYIRYNGGLDTEDSYPY 208
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K K IC + PN V I S V DE ALK +AT+GPI+++++AS +FQLY SG+
Sbjct: 209 KAKDGICHY-NPNSVGAICSGHVDVSPDEAALKQAVATIGPISIAVDASHESFQLYQSGV 267
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
YD+ C +V HAML+VGY W++KN W WGD GY+ + R N+CGIA
Sbjct: 268 YDEHRCNKKHVTHAMLVVGYGTEGGHDYWLIKNSWGLQWGDKGYIKMTRNKGNQCGIATA 327
Query: 304 AVYALI 309
A Y L+
Sbjct: 328 ASYPLV 333
>gi|426362423|ref|XP_004048364.1| PREDICTED: cathepsin L2 isoform 1 [Gorilla gorilla gorilla]
gi|426362425|ref|XP_004048365.1| PREDICTED: cathepsin L2 isoform 2 [Gorilla gorilla gorilla]
Length = 334
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 168/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FTVVAPGKEKALMKAVATVGPISVAVDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|114625736|ref|XP_001153919.1| PREDICTED: cathepsin L2 isoform 2 [Pan troglodytes]
gi|114625742|ref|XP_520130.2| PREDICTED: cathepsin L2 isoform 5 [Pan troglodytes]
Length = 334
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 168/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FTVVTPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 178/310 (57%), Gaps = 12/310 (3%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + +Y+KDY + + +K ++ N K + HN+ ++G Y + NHL+D+HPR
Sbjct: 23 FTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNHLADMHPRE 82
Query: 70 YIKEMTRLTHSRIRRTLVRSPES-----NESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
++ T L +R R + PE N+ +I +DWR+KG I+P +Q CG+C+A
Sbjct: 83 FM--ATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGHCGSCWA 140
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
FS A++ F LS Q ++DCS+ GN GC GG + YV+ G+ EE
Sbjct: 141 FSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYGNNGCEGGLMEQAFQYVRDNDGIDTEE 200
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY+G+ S C+FK+ N+ + + +P DE AL +AT GP++++I+AS +FQ Y
Sbjct: 201 AYPYEGEDSECRFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPLSIAIDASNPSFQFY 260
Query: 245 ASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCG 299
+ G+Y + C+S ++H +LLVGY + W++KN WS WG+NGY+ + R +N CG
Sbjct: 261 SEGVYYEPECSSAQLDHGVLLVGYGVEKDQKYWLVKNSWSEQWGENGYIKMARNKDNNCG 320
Query: 300 IANYAVYALI 309
IA A + ++
Sbjct: 321 IATQASFPIV 330
>gi|23110960|ref|NP_001324.2| cathepsin L2 preproprotein [Homo sapiens]
gi|320118898|ref|NP_001188504.1| cathepsin L2 preproprotein [Homo sapiens]
gi|12644075|sp|O60911.2|CATL2_HUMAN RecName: Full=Cathepsin L2; AltName: Full=Cathepsin U; AltName:
Full=Cathepsin V; Flags: Precursor
gi|3107915|dbj|BAA25909.1| cathepsin V [Homo sapiens]
gi|3228672|gb|AAC23598.1| cathepsin U [Homo sapiens]
gi|3869129|dbj|BAA34365.1| cathepsin L2 [Homo sapiens]
gi|23958123|gb|AAH23504.1| CTSL2 protein [Homo sapiens]
gi|37182404|gb|AAQ89004.1| cathepsin L2 [Homo sapiens]
gi|83405150|gb|AAI10513.1| Cathepsin L2 [Homo sapiens]
gi|119579235|gb|EAW58831.1| cathepsin L2, isoform CRA_a [Homo sapiens]
gi|119579236|gb|EAW58832.1| cathepsin L2, isoform CRA_a [Homo sapiens]
Length = 334
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 168/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
Length = 334
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 112/290 (38%), Positives = 167/290 (57%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + ++R+ L R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V+P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FEVVPAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG NGY+ + K +N CGIA A Y +
Sbjct: 285 GFEGANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|348545637|ref|XP_003460286.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 179/303 (59%), Gaps = 9/303 (2%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K++K Y + ++ +K W SN K + HN QGL Y L + +++ Y + ++
Sbjct: 32 KFEKSYDSPSEEAHRKQIWLSNRKLVLMHNILTDQGLKSYRLGMTYFANMENEEYKQLVS 91
Query: 76 RLTHSRIRRTLVRSPES----NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ +L R + E +P+ +DWR+KG++T +Q+ CG+C+AFS A+
Sbjct: 92 QGCLGSFNGSLSRRGSTFAQLPEGTALPNTVDWRDKGYVTEVKDQKQCGSCWAFSATGAL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T + LS QQ+VDCS GN GC GG + Y+++ G+ EE YPY+ K
Sbjct: 152 EGQHFRKTGTLVSLSEQQLVDCSSNFGNSGCMGGWMDFAFKYIKYNRGIDTEEFYPYEAK 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C++KR +I S + ++ +E ALK +ATVGPI+V+I+AS +FQLY SG+Y D
Sbjct: 212 NGLCRYKRDSIGATCSGYIIVKRFEEQALKEAVATVGPISVTIDASRPSFQLYESGVYYD 271
Query: 252 EACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+ C S ++NHA+L VGY T N W++KN W WG+ GY+ + R N+CGIA+ A Y
Sbjct: 272 DGCGSIFLNHAVLAVGYGTENGHDYWLVKNSWGLGWGEKGYIRMSRNKKNQCGIASVARY 331
Query: 307 ALI 309
L+
Sbjct: 332 PLV 334
>gi|109112413|ref|XP_001106814.1| PREDICTED: cathepsin L2 isoform 3 [Macaca mulatta]
gi|297271422|ref|XP_002800251.1| PREDICTED: cathepsin L2 [Macaca mulatta]
Length = 334
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 112/290 (38%), Positives = 167/290 (57%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + ++R+ L R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSHPQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V+P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FKVVPAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG NGY+ + K +N CGIA A Y +
Sbjct: 285 GFEGANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|229366214|gb|ACQ58087.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 178/303 (58%), Gaps = 9/303 (2%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ + Y A ++++K W SN + + HN A QG+ Y L + +D+ Y ++++
Sbjct: 32 QFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEEYKRQIS 91
Query: 76 RLTHSRIRRTLVRSPES----NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ +L R + E +P+ +DWREKG++T +Q+ CG+C+AFS ++
Sbjct: 92 QGCLGSFNASLPRRGSAYLRLPEGADLPNSVDWREKGYVTDVKDQKQCGSCWAFSTTGSL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS QQ+VDCS GN GC GG + + Y+Q GG+ E+ YPY+ +
Sbjct: 152 EGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTEDSYPYEAE 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ NI + + + DE ALK LAT+GP++V+I+AS +FQLY SG+YD+
Sbjct: 212 DGQCRYNSANIGATCTGYVDVKQGDEDALKEALATIGPVSVAIDASHSSFQLYESGVYDE 271
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C+S ++H +L VGY ++ W++KN W WG+ GY+ + R +N+CGIA + Y
Sbjct: 272 PECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHNQCGIATASSY 331
Query: 307 ALI 309
L+
Sbjct: 332 PLV 334
>gi|3087790|emb|CAA75029.1| cathepsin L2 [Homo sapiens]
Length = 334
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 168/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFPDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 167/293 (56%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR---TL 86
+++ W+ N KKI HN E G+H Y L NH D+ + + M H + RR +L
Sbjct: 48 RRVIWEKNLKKIEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFRGSL 107
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E +P+ LDWREKG++TP +Q +CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 108 FMEPNFIE---VPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLS 164
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + YV+ GL EE YPY G C F N +
Sbjct: 165 EQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAAN 224
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L
Sbjct: 225 DTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLA 284
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS +WGD GY+Y+ K +N CGIA A Y L+
Sbjct: 285 VGYGFEGEDVDGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPLV 337
>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
Length = 334
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 174/303 (57%), Gaps = 9/303 (2%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K YR +S ++L W +N K + HN A QGL Y L + +D+ Y + +
Sbjct: 32 KFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEEYRQLVF 91
Query: 76 RLTHSRIRRTLVRSPES----NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
R + T R + ++ ++PD +DWR+KG++T +Q+ CG+C+AFS ++
Sbjct: 92 RGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS QQ+VDCS GN GC GG + Y++ GL E+ YPY+ +
Sbjct: 152 EGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDSYPYEAQ 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C+F + + + + DE AL+ +AT+GPI+V+I+A +FQLY+SG+Y++
Sbjct: 212 DGECRFNPSTVGASCTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLYSSGVYNE 271
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C+S ++H +L VGY ++ WI+KN W WG GY+ + R +N+CGIA A Y
Sbjct: 272 PDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQCGIATAASY 331
Query: 307 ALI 309
L+
Sbjct: 332 PLV 334
>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
Length = 332
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 173/305 (56%), Gaps = 20/305 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K+Y K + ++L W+ N K I THN E GLH + L NHL D+ +++
Sbjct: 33 KQSYGKEYNSKVDEISRRLIWEKNLKYISTHNLEFSLGLHTFELAMNHLGDMTSEEVVQK 92
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFS 126
MT L + L RS ++N+++ IPD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 93 MTGL-----KMPLSRS-QNNDTLYIPDWEGRTPESVDYRKKGYVTPVKNQGQCGSCWAFS 146
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ Y
Sbjct: 147 SVGALEGQLKKKTGKLLNLSPQNLVDC--VSKNDGCGGGYMTNAFQYVQENRGIDSEDAY 204
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ C + + +P E ALK +A VGP+AV+I+AS +FQ Y+
Sbjct: 205 PYIGQDESCMYNPTGKAAKCRGYREIPEGSEKALKRAVARVGPVAVAIDASLSSFQFYSK 264
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
G+Y DE C D +NHA+L VGY WI+KN W WG+ GY+ + R N CGIA
Sbjct: 265 GVYYDENCNGDNLNHAVLAVGYGIQRGTKHWIIKNSWGEEWGNKGYILMARNKKNACGIA 324
Query: 302 NYAVY 306
N A +
Sbjct: 325 NLASF 329
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 110/303 (36%), Positives = 176/303 (58%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +Y K YR DS ++ ++ N + I++HN++ + GL +TL N D+
Sbjct: 26 KARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEINAA 85
Query: 74 MTRLTHS--RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M + ++ R + P +E +PD +DWR+KG +TP +Q+ CG+C+AFS ++
Sbjct: 86 MNGFLSAGKKVPRGTMYQPLVDE---LPDTVDWRDKGAVTPVKDQKACGSCWAFSATGSL 142
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F ST ++ LS Q +VDCS GN GC GG + N Y++ G+ EE YPY+ K
Sbjct: 143 EGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEESYPYEAK 202
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C+F N+ +SS+ + E L+ +A GP++V+I+AS TF Y+ GIY D
Sbjct: 203 NGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFYSRGIYYD 262
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
E C+S +++H +L VGY + + W++KN W+ WGD+GY+ + R NN CGIA+ A Y
Sbjct: 263 EKCSSSFLDHGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSRNRNNNCGIASQASY 322
Query: 307 ALI 309
++
Sbjct: 323 PVV 325
>gi|444522624|gb|ELV13407.1| Cathepsin L1 [Tupaia chinensis]
Length = 307
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 173/307 (56%), Gaps = 20/307 (6%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
F Q K K +R+ W+ N K IH HN E Q H +T+ N D+ +
Sbjct: 12 FNQDKNKDVWRRSV--------WEKNLKMIHQHNLEHSQQKHSFTMEMNAFGDMTNEEFR 63
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
K M + +R + + +P+ +DWREKG++TP NQ DCG+C+AFS A+
Sbjct: 64 KVMNGF--RKQKRKTGNLFQEFMHLDVPESVDWREKGYVTPVKNQGDCGSCWAFSSTGAL 121
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+F+ T ++ LS Q +VDCSI GN GC GG + N YV+ GGL EE YPY+
Sbjct: 122 EGQMFRKTGKLVSLSEQNLVDCSISEGNFGCNGGIMDNAFLYVKDNGGLDSEESYPYEAV 181
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
CK+ N + + + LP + E AL+ +ATVGPI+V I+AS +FQ Y GIY +
Sbjct: 182 DDSCKYNPKNSAANDTGFVHLPVE-EKALEKAVATVGPISVGIDASADSFQFYKEGIYFE 240
Query: 252 EACTSDYVNHAMLLVGY-------TRNS-WILKNWWSHHWGDNGY-MYLKRGNNRCGIAN 302
C+S ++HA+L+VGY T N W++KN W +WG +GY M K NN CGIA+
Sbjct: 241 PNCSSVELDHAVLVVGYGVMEEASTNNKFWLVKNSWGKNWGMDGYIMMAKDRNNNCGIAS 300
Query: 303 YAVYALI 309
YA+Y +
Sbjct: 301 YAMYPTV 307
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 177/303 (58%), Gaps = 9/303 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
++++ K Y +K D ++ +++N KKI+ HN G Y L N +D+ P + K
Sbjct: 30 KRQHNKTYLQK-QDVGRRAIFEANIKKINAHNLLYDLGRSSYRLGLNGFADMTPDEFEKY 88
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
TR + R + ++ + N S+ +PD +DWR +G++TP NQ CG+C+AFS A++
Sbjct: 89 RGTRFEANEARVSKLQHRD-NRSMHVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALE 147
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ + ++ LS Q +VDCS + GN GC GG + N +++ AGGL E+ YPY GK
Sbjct: 148 GQHFRRSGDLVSLSEQMLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLETEKSYPYTGKD 207
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C F I ++ + +P +DE ALK VGP++V+I+AS FQ Y G+YD+
Sbjct: 208 GTCHFDARGIGAKLTGFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEI 267
Query: 253 ACTSDYVNHAMLLVGY--TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C+S ++H +L+VGY TR+ W++KN W WG +GY+ + R N+CGIA A Y
Sbjct: 268 TCSSTSLDHGVLVVGYGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRNKENQCGIATMASY 327
Query: 307 ALI 309
+
Sbjct: 328 PTV 330
>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 166/290 (57%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HN E QG HG+ + N D+ + + M + ++R+ L R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V+P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FEVVPAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG NGY+ + K +N CGIA A Y +
Sbjct: 285 GFEGANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|297684914|ref|XP_002820054.1| PREDICTED: cathepsin L2 isoform 2 [Pongo abelii]
Length = 334
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 168/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSHPQGNQGCNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FTVILPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 285 GFEGANSDNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
Length = 334
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 178/303 (58%), Gaps = 9/303 (2%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ + Y A ++++K W SN + + HN A QG+ Y L + +D+ Y ++++
Sbjct: 32 QFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEEYKRQIS 91
Query: 76 RLTHSRIRRTLVRSPES----NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ +L R + E +P+ +DWREKG++T +Q+ CG+C+AFS ++
Sbjct: 92 QGCLGSFNASLPRRGSAYLRLPEGADLPNSVDWREKGYVTEVKDQKQCGSCWAFSTTGSL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS QQ+VDCS GN GC GG + + Y+Q GG+ E+ YPY+ +
Sbjct: 152 EGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYIQANGGIDTEDSYPYEAE 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ NI + + + DE ALK +AT+GP++V+I+AS +FQLY SG+YD+
Sbjct: 212 DGQCRYNSANIGATCTGYVDVKQGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDE 271
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C+S ++H +L VGY ++ W++KN W WG+ GY+ + R +N+CGIA + Y
Sbjct: 272 PECSSSELDHGVLAVGYGSDNGHDYWLVKNSWGLGWGNKGYIMMTRNKHNQCGIATASSY 331
Query: 307 ALI 309
L+
Sbjct: 332 PLV 334
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 113/290 (38%), Positives = 170/290 (58%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIR-RTLVR 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + + R + R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMVCFRNQKHKNRKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 GPLL---LNLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + N YV+ GGL E YPY K CK+K N V + +
Sbjct: 165 NLVDCSHPQGNQGCNGGFMNNAFQYVKENGGLDSEASYPYVAKDGSCKYKPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V+P ++ +K +ATVGPI+V+++AS +FQ Y SGIY ++ C+S ++H +L+VGY
Sbjct: 225 FVVIPAHEKELMKA-VATVGPISVAVDASHSSFQFYKSGIYFEQDCSSKNLDHGVLVVGY 283
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
N W++KN W WG NGY+ + K NN CGIA A Y ++
Sbjct: 284 GFEGTNSNNNNYWLIKNSWGPEWGSNGYIKIAKDRNNHCGIATAASYPIV 333
>gi|397499865|ref|XP_003820654.1| PREDICTED: cathepsin L2 isoform 1 [Pan paniscus]
gi|397499867|ref|XP_003820655.1| PREDICTED: cathepsin L2 isoform 2 [Pan paniscus]
Length = 334
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 167/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FTVVTPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K N CGIA A Y
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKKNHCGIATAASY 331
>gi|149039728|gb|EDL93844.1| rCG24133 [Rattus norvegicus]
Length = 333
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 176/303 (58%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KYEKTYSLE-EEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTIEEFRKLMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ S + ++V +P+ ++WR++G++TP Q C C+AFS+A AI+GQ+
Sbjct: 94 EIPIPTVKKE--NSVQKRQAVNVPNFINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GNLGC G+ L YV+ GGL E YPY+ K+ C
Sbjct: 152 FQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGGLESEATYPYEEKEGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + + P++E AL +AT+GPI+V+I+A +F Y +GIY + C+
Sbjct: 212 RYHPDNSTASITDFEFV-PKNEDALMNAVATLGPISVAIDARHESFLFYRNGIYHEPNCS 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
S V HAMLLVGY R WILKN + WG+ GYM + K N CGIA YA+Y
Sbjct: 271 SSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWGNRGYMKIAKDQGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|297684916|ref|XP_002820055.1| PREDICTED: cathepsin L2 isoform 3 [Pongo abelii]
Length = 345
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 168/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 59 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 118
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 119 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 175
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 176 NLVDCSHPQGNQGCNGGFMDKAFQYVKENGGLDSEESYPYVAMDEICKYRPENSVANDTG 235
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 236 FTVILPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 295
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 296 GFEGANSDNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 342
>gi|213514640|ref|NP_001134963.1| Cathepsin S precursor [Salmo salar]
gi|209155506|gb|ACI33985.1| Cathepsin S precursor [Salmo salar]
gi|209737594|gb|ACI69666.1| Cathepsin S precursor [Salmo salar]
gi|223647278|gb|ACN10397.1| Cathepsin S precursor [Salmo salar]
gi|223673157|gb|ACN12760.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 172/304 (56%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K+Y+ + + ++ W+ N + I+ HN EA +H Y L NH+ D+ +
Sbjct: 31 KKTHGKNYQTEVEELGRREVWERNLQLINLHNLEASMDMHTYDLGMNHMGDMTQEEIAQS 90
Query: 74 MTRLTHSRIRRTLVRSPES---NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
L R+ L R P + + IPD DWREKG++T Q CG+C+AFS A
Sbjct: 91 FASL---RVPADLKREPSAFVGSSGAPIPDTFDWREKGYVTEVKMQGSCGSCWAFSAVGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K+T ++ ++S Q +VDCS GN GC GG + YV G+ ++ YPYKG
Sbjct: 148 LEGQLMKTTGKLIDISSQNLVDCSSKYGNKGCNGGFMSQAFQYVIDNQGIDSDQSYPYKG 207
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
Q C + + S +S LP DE LK LAT+GPI+V+I+A+ F Y SG+Y+
Sbjct: 208 VQQQCSYNPAQRAANCSKYSFLPEGDEGVLKEALATIGPISVAIDATRPLFTFYRSGVYN 267
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D CT +NHA+L VGY ++ W++KN WS WGD GY+ + R +N+CGIA Y
Sbjct: 268 DPTCTKK-INHAVLAVGYGTLGGQDYWLVKNSWSLSWGDQGYIRMSRNKDNQCGIALYGC 326
Query: 306 YALI 309
Y ++
Sbjct: 327 YPVM 330
>gi|37905511|gb|AAO64477.1| cathepsin S precursor [Fundulus heteroclitus]
Length = 337
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 119/304 (39%), Positives = 172/304 (56%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K+Y+ + + ++ W+ N I THN EA G H Y L N + DL ++
Sbjct: 38 KKTHGKEYQNEEENVHRRDLWEKNLMLITTHNLEASMGFHTYDLSMNFMGDLSQEEILQF 97
Query: 74 MTRLTHSRIRRTLVRSPES---NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
T LT L R+P S +PD LD REKG +T Q CG+C+AFS A A
Sbjct: 98 YTTLT---TPTDLQRAPSSFVGASGADVPDTLDLREKGLVTAVRMQGACGSCWAFSAAGA 154
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K T +++ LS Q +VDCS GN GC GG + YV G+ E+ YPY+G
Sbjct: 155 LEGQLAKKTGKLQNLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNQGIDSEDSYPYRG 214
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C++ + S + LP DE ALK +AT+GPI+V+I+A F Y SG+YD
Sbjct: 215 RDQQCQYNPATRAANCSRYDFLPEGDEQALKEAIATIGPISVAIDARRPRFAFYRSGVYD 274
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D +CT + VNHA+L VGY ++ W++KN W +GD GY+ + R N++CGIA YA
Sbjct: 275 DSSCTQN-VNHAVLAVGYGSLGGQDYWLVKNSWGTSFGDQGYIRMARNKNDQCGIALYAC 333
Query: 306 YALI 309
Y ++
Sbjct: 334 YPIM 337
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 109/292 (37%), Positives = 177/292 (60%), Gaps = 9/292 (3%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR---LTHSRIR 83
+S++ +++N K I+ HN EA G+H Y L N + + ++ + L + +
Sbjct: 16 ESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDRNASK 75
Query: 84 RTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
T R + + +++ +PD +DWR KG++TP NQE CG+C+AFS +++GQ FK T ++
Sbjct: 76 STADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTGKL 135
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS Q +VDCS GN GC GG + + Y++ GG+ E+ YPY+ + C+FK ++
Sbjct: 136 VSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPADV 195
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
++ ++ + DE AL +ATVGPI+V+I+AS HTFQ+Y+ G+Y + C+S ++H
Sbjct: 196 GATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTELDHG 255
Query: 263 MLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+L VGY ++ W++KN W WG NGY+ + R NN+CGIA A Y L+
Sbjct: 256 VLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQCGIATSASYPLV 307
>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
Length = 330
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 176/303 (58%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KK+ K Y + + ++ W+ N + I HN EA G+H Y L NH++D+ ++
Sbjct: 31 KKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEEILQT 90
Query: 74 M--TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ TR+ R T S+ ++PD LDWR+KG++T NQ CG+C+AFS A+
Sbjct: 91 LAVTRVPPGFKRPT--AEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCWAFSSVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K+T ++ +LS Q +VDCS GNLGC GG + YV GG+ E YPY+G
Sbjct: 149 EGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSESSYPYQGT 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
Q C++ + +S+ + DE ALK LA +GP++V+I+A+ F Y SG+YDD
Sbjct: 209 QGSCRYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFIFYRSGVYDD 268
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT VNH +L VGY ++ W++KN W +GD GY+ + R NN CGIA+ A Y
Sbjct: 269 PSCTQK-VNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYIRIARNKNNMCGIASEACY 327
Query: 307 ALI 309
++
Sbjct: 328 PIV 330
>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
Length = 330
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 176/303 (58%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KK+ K Y + + ++ W+ N + I HN EA G+H Y L NH++D+ ++
Sbjct: 31 KKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEEILQT 90
Query: 74 M--TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ TR+ R T S+ ++PD LDWR+KG++T NQ CG+C+AFS A+
Sbjct: 91 LAVTRVPPGFKRPT--AEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCWAFSSVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K+T ++ +LS Q +VDCS GNLGC GG + YV GG+ E YPY+G
Sbjct: 149 EGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSESSYPYQGT 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
Q C++ + +S+ + DE ALK LA +GP++V+I+A+ F Y SG+YDD
Sbjct: 209 QGSCRYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFIFYRSGVYDD 268
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT VNH +L VGY ++ W++KN W +GD GY+ + R NN CGIA+ A Y
Sbjct: 269 PSCTQK-VNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYIRIARNKNNMCGIASEACY 327
Query: 307 ALI 309
++
Sbjct: 328 PIV 330
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 173/303 (57%), Gaps = 8/303 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY--I 71
+ KY K Y + + +K W+SN + + HN A QG Y L N +DL+ + +
Sbjct: 23 KGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEFMAL 82
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
K L ++ + + ++ + V +P +DWR +G++TP +Q CG+C+ FS ++
Sbjct: 83 KGSGGLLQAKDKSS-TQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFSATGSL 141
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F T + LS QQ+VDC+ GN GC GG + + +Y++ GG+ E YPY +
Sbjct: 142 EGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELESAYPYTAR 201
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
CKF R +V + V+P DE AL + T+GP+AVSI+AS ++FQLY SG+YD
Sbjct: 202 DGRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQLYESGVYDF 261
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
C+S ++H +L VGY +N W++KN W WGD GY+ + K NN+CGIA + Y
Sbjct: 262 RRCSSTNLDHGVLAVGYGTEGGQNYWLVKNSWGPGWGDQGYIKMSKDKNNQCGIATDSCY 321
Query: 307 ALI 309
L+
Sbjct: 322 PLV 324
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 166/290 (57%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HN E QG HG+ + N D+ + + M + ++R+ L R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSHPQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V+P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FEVVPAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG NGY+ + K +N CGIA A Y +
Sbjct: 285 GFEGANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPTV 334
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 110/306 (35%), Positives = 179/306 (58%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ + Y +++++ W +N K + HN A QG+ Y L + +D+ Y + ++
Sbjct: 32 KFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEEYKRLIS 91
Query: 76 R-----LTHSRIRR--TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ S RR T R PE+ + +P +DWR+KG++T +Q+ CG+C+AFS
Sbjct: 92 QGCLGSFNASLPRRGSTFFRLPENKD---LPAAVDWRDKGYVTDVKDQKQCGSCWAFSAT 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++GQ F+ T ++ LS QQ+VDCS GN+GC GG + + Y+Q GG+ EE YPY
Sbjct: 149 GSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDTEESYPY 208
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C++K + + + + DE AL+ +AT+GPI+V I+AS +FQLY SG+
Sbjct: 209 EAEDGECRYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISFQLYESGL 268
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
YD+ C+S ++H +L VGY ++ W++KN W WGD GY+ + K +N+CGIA
Sbjct: 269 YDEPQCSSSELDHGVLAVGYGSENGQDYWLVKNSWGLTWGDQGYIKMSKNKSNQCGIATA 328
Query: 304 AVYALI 309
A Y L+
Sbjct: 329 ASYPLV 334
>gi|342305192|dbj|BAK55650.1| cathepsin S [Oplegnathus fasciatus]
Length = 337
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 176/303 (58%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ ++K Y+ + D +++ W+ N I HN EA GLH Y L NH+ DL ++
Sbjct: 38 KRTHEKKYQNEGEDVRRRELWEKNLMLITMHNLEASMGLHTYELSMNHMGDLTQEEILQS 97
Query: 74 MTRLTH-SRIRRTLVRSPESNES-VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
L+ + I+R SP + S +PD +DWREKG +T Q CG+C+AFS A A+
Sbjct: 98 FATLSPPTDIQR--APSPFAGTSGAAVPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGAL 155
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K+T ++ +LS Q +VDCS GN GC GG + YV G+ + YPY G+
Sbjct: 156 EGQLAKTTGKLLDLSPQNLVDCSSKYGNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQ 215
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + S +S LP DE ALK LAT+GPI+V+I+A+ +F Y SG+YDD
Sbjct: 216 SQQCHYNPAYRAANCSRYSFLPEGDEGALKEALATIGPISVAIDATRPSFTFYRSGVYDD 275
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+ CT + VNH +L VGY ++ W++KN W +GD G++ + R N++CGIA Y Y
Sbjct: 276 QTCTRN-VNHGVLAVGYGTLNGKDYWLVKNSWGSTFGDKGFIRMARNKNDQCGIALYGCY 334
Query: 307 ALI 309
++
Sbjct: 335 PIM 337
>gi|31077116|ref|NP_852043.1| cathepsin M precursor [Rattus norvegicus]
gi|27960485|gb|AAO27846.1|AF456462_1 cathepsin M [Rattus norvegicus]
Length = 333
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 175/303 (57%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KYEKTYSLE-EEGQKRAVWEQNMKKIKLHNGENGLGKHGFTMEMNAFGDMTIEEFRKLMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ S + ++V +P+ ++WR++G++TP Q C C+AFS+A AI+GQ+
Sbjct: 94 EIPIPTVKKE--NSVQKRQAVNVPNFINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GNLGC G+ L YV+ GGL E YPY+ K+ C
Sbjct: 152 FQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGGLESEATYPYEEKEGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + + P++E AL +AT+GPI V+I+A +F Y +GIY + C+
Sbjct: 212 RYHPDNSTASITDFEFV-PKNEDALMNAVATLGPIFVAIDARHESFLFYRNGIYHEPNCS 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
S V HAMLLVGY R WILKN + WG+ GYM + K N CGIA YA+Y
Sbjct: 271 SSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWGNRGYMKIAKDQGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 112/306 (36%), Positives = 177/306 (57%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+++ Y + ++ ++ W +N K + HN A QGL Y L + +D+ Y + ++
Sbjct: 32 KFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEEYKRVIS 91
Query: 76 R-----LTHSRIRR--TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ S RR T R PE + +PD +DWR+KG++T +Q+ CG+C+AFS
Sbjct: 92 QGCLHSFNASLPRRGSTFFRLPEGTD---LPDAVDWRDKGYVTDVKDQKQCGSCWAFSAT 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++GQ F+ T + LS QQ+VDCS GN+GC GG + Y+Q GG+ EE YPY
Sbjct: 149 GSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDTEESYPY 208
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C++ NI + ++ + DE ALK +AT+GPI+V I+AS +FQ Y SG+
Sbjct: 209 EAENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSFQFYESGV 268
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y++ C+S ++H +L VGY + W++KN W WGD GY+ + R +N+CGIA
Sbjct: 269 YNEPDCSSLELDHGVLAVGYGTEDGNDYWLVKNSWGLEWGDKGYIKMSRNKSNQCGIATA 328
Query: 304 AVYALI 309
A Y L+
Sbjct: 329 ASYPLV 334
>gi|301612003|ref|XP_002935514.1| PREDICTED: cathepsin K-like [Xenopus (Silurana) tropicalis]
Length = 331
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 115/299 (38%), Positives = 165/299 (55%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y K Y K + ++L W+ N I +HN E QGLH Y L N D+ ++ MT
Sbjct: 35 YHKHYDNKIHELMRRLIWEKNLNIIRSHNLEFTQGLHTYELGMNKFGDMTSEEVVRMMTG 94
Query: 77 L-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
L H+ + T + S E S IP+ +D+R+KG++TP +Q +CG+C+AFS A++GQ+
Sbjct: 95 LKVHTGMGPTNLTSDEDEASQRIPNSIDYRKKGYVTPIRDQGECGSCWAFSTVGALEGQL 154
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
K T ++ +S Q +VDC + N GC GG + YV+ G+ EE YPY G C
Sbjct: 155 MKKTGKLVGISPQNLVDC--VKDNFGCGGGYMTTAFKYVKKNKGIDSEEAYPYVGMDQKC 212
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
K+ +I + + E ALK + VGPI+V I+A TF LY GIY D++C
Sbjct: 213 KYNVSGRAAEIKGFKEVKKGSETALKKAVGLVGPISVGIDAGLDTFFLYKKGIYYDKSCD 272
Query: 256 SDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
D +NHA+L VGY + WI+KN W WG+ GY+ + R N CGIAN A Y ++
Sbjct: 273 GDSINHAVLAVGYGKQKKGKYWIIKNSWGEDWGNKGYILMAREKGNACGIANLASYPVM 331
>gi|410904751|ref|XP_003965855.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
Length = 331
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 116/299 (38%), Positives = 168/299 (56%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++++Y + + ++ W+ N I HNQEA G+H Y L NHL D+ +++MT
Sbjct: 35 HRREYATQGEEEIRRAVWEKNMNVIDAHNQEAALGMHSYELGMNHLGDMTSEEVLEKMTG 94
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L + V SN +P HLD+R+KG +T +Q CG+C+AFS A A++G
Sbjct: 95 LLVPLNDQRNVTMALSNSIERLPKHLDYRKKGIVTAVKDQGQCGSCWAFSSAGALEGMQA 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K T ++ +LS Q +VDC + N GC GG + N YV G+ E YPY ++ C+
Sbjct: 155 KKTGKLVDLSPQNLVDC--VKENDGCGGGYMTNAFRYVATNRGIDSEASYPYVAQEQSCQ 212
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+K + SS+ +P +E L L GPIAV I+A+ TFQLY+ G+Y D C
Sbjct: 213 YKESGKAAECSSYEEVPQGNEKQLAYALFKHGPIAVGIDATLSTFQLYSKGVYYDPNCNP 272
Query: 257 DYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ +NHA+LLVGY NS WI+KN WS +WG+ GY+ + R N CGIAN A Y L+
Sbjct: 273 ENINHAVLLVGYGVNSRGQHYWIVKNSWSTNWGNGGYVLMARNRGNLCGIANLASYPLV 331
>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
Length = 331
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 115/298 (38%), Positives = 169/298 (56%), Gaps = 10/298 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
++Y + ++ W+ N + I HN+EA G+H Y L NHL D+ +++T L
Sbjct: 37 REYNGLGEEVIRRTIWEKNMRLIEAHNEEAALGIHSYELGMNHLGDMTSEEIAEKLTGLQ 96
Query: 79 HSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
R R+ P++N V IP +D+R+KG +TP NQ CG+C+AFS A A++GQ+ K
Sbjct: 97 VPMNRDRSNTWIPDNN-VVKIPRSIDYRKKGMVTPVKNQLSCGSCWAFSSAGALEGQLAK 155
Query: 138 STSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKF 197
+T ++ +LS Q +VDC ++ N GC GG + N YV+ GG+ EE YPY G+ C +
Sbjct: 156 TTGKLIDLSPQNLVDC--VTENNGCGGGYMTNAFEYVEENGGIDTEEAYPYLGQDGQCAY 213
Query: 198 KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD 257
+ + +P DE AL + VGP+AV I+A+ TFQ Y G+Y D C D
Sbjct: 214 NASGMGAQCRGFKEIPEGDEWALTKAVVKVGPVAVGIDATLSTFQFYQRGVYYDPNCNKD 273
Query: 258 YVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+NHA+L VGY + + WI+KN WS WG GY+ + R N CGIAN A Y ++
Sbjct: 274 DINHAVLAVGYGQTAKGMKFWIVKNSWSESWGKQGYIMMARNRGNACGIANLASYPIM 331
>gi|189053498|dbj|BAG35664.1| unnamed protein product [Homo sapiens]
Length = 334
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 167/287 (58%), Gaps = 13/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG HG+T+ N D+ + + M + + R+ V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ C +C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLF---LDLPKSVDWRKKGYVTPVKNQKQCVSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY ICK++ N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
++V+ P E AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY
Sbjct: 225 FTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
+ NS W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 285 GFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 331
>gi|348531513|ref|XP_003453253.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 172/303 (56%), Gaps = 10/303 (3%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K++K Y + ++++K W SN K + HN A GL Y L + +D+ Y K ++
Sbjct: 32 KFEKSYDSPSEETQRKQIWLSNRKLVLKHNALADLGLKSYHLGMTYFADMENEEYKKLIS 91
Query: 76 RLTHSRIRRTLVRSPES----NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ +L R + + ++PD +DWR+KG++T NQ+ CG+C+AFS A+
Sbjct: 92 QGCLGSFNASLPRRGSTFNRLPKGTVLPDTVDWRKKGYVTKVKNQQQCGSCWAFSATGAL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ FK T + LS QQ+VDCS GN GC GG + N Y++ GG+ E YPY+
Sbjct: 152 EGQHFKKTGRLVYLSEQQLVDCSRNFGNRGCDGGWMNNAFKYIKDNGGIQTEASYPYQAM 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C + PN V I + V DE ALK +AT+GPI+++++AS +FQLY SG+YD+
Sbjct: 212 DGLCHY-NPNSVGAICNGYVDVSPDEEALKEAVATIGPISIAMDASHESFQLYQSGVYDE 270
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
C Y++H ML+VGY W++KN W WG GY+ + R N+CGIA A Y
Sbjct: 271 HRCNDYYLSHGMLVVGYGTEGGLDYWLIKNSWGLGWGKMGYIKMVRNKRNQCGIATAASY 330
Query: 307 ALI 309
L+
Sbjct: 331 PLV 333
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 112/305 (36%), Positives = 180/305 (59%), Gaps = 10/305 (3%)
Query: 15 KKYKKDYRKKATDSK---KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
K +K+ K+ +D++ ++ W+ N +K+ HN +A G+H Y L N +D+ ++
Sbjct: 29 KLWKEANNKRYSDAEEHVRRATWEGNLQKVQEHNLQADLGVHTYWLGMNKYADMTVTEFV 88
Query: 72 KEMTRLTHS-RIRRTLVRSPES-NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
K M + R +RT R S N + +PD +DWR+KG++T +Q CG+C+AFS
Sbjct: 89 KVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTG 148
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ FK T ++ LS Q +VDCS GN+GC GG + Y++ G+ E+ YPY+
Sbjct: 149 ALEGQHFKQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSYPYE 208
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C+FK N+ + ++ + +DE AL+ +ATVGPI+V+I+A +FQLY G+Y
Sbjct: 209 AVDNQCRFKAANVGATDTGFTDITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVY 268
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYA 304
++ C+ ++H +L VGY +S W++KN W WGD GY+ + R N+CGIA A
Sbjct: 269 NEPFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQCGIATAA 328
Query: 305 VYALI 309
Y L+
Sbjct: 329 SYPLV 333
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/291 (39%), Positives = 163/291 (56%), Gaps = 16/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+K W+ N K I HNQE QG H +++ N DL + + M ++ V
Sbjct: 48 RKAVWKKNMKMIELHNQEYSQGKHSFSMAMNAFGDLTSEEFRQMMNGFQRQENKKGKVF- 106
Query: 90 PESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
+E++ IP +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 107 ---HETIFASIPPSVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSE 163
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
Q +VDCS GN GC GG + N YV GGL EE YPY G C + N + +
Sbjct: 164 QNLVDCSQPEGNRGCHGGLMDNAFQYVLDVGGLDSEESYPYTGLVGTCNYNPKNSAANET 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ LP Q E+AL +AT+GPI+V+++AS +FQ Y SGIY + C S+ V+H +L+VG
Sbjct: 224 GFVDLPKQ-ENALMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVG 282
Query: 268 YTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y W++KN W HWG NGY+ + K NN CGIA A Y +
Sbjct: 283 YGFEGADSDDNKYWLVKNSWGKHWGINGYIKMAKDQNNHCGIATMASYPTV 333
>gi|23452059|gb|AAN32912.1| cathepsin [Danio rerio]
Length = 310
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 114/293 (38%), Positives = 165/293 (56%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR---TL 86
+++ W+ N K I HN G+H Y L NH D+ + + M H + RR +L
Sbjct: 21 RRIFWKKNLKXIEMHNLXHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFRGSL 80
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E +P+ LDWREKG++TP +Q +CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 81 FMEPXFIE---VPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLS 137
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + YV+ GL EE YPY G C F N +
Sbjct: 138 EQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAAN 197
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L
Sbjct: 198 DTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLA 257
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS +WGD GY+Y+ K +N CGIA A Y L+
Sbjct: 258 VGYGFEGEDVDGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPLV 310
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 218 bits (555), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 113/305 (37%), Positives = 172/305 (56%), Gaps = 15/305 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K+Y+ + + +++ W+ N KKI HN E G H Y+L NH D+ + + M
Sbjct: 36 HSKNYQHEKEEGWRRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNG 95
Query: 76 -RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
+L + + +L P + E+ P +DWRE+G++TP +Q CG+C+AFS A++GQ
Sbjct: 96 YKLQQRKFKGSLFLEPNNMEA---PKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQ 152
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQS 193
+F+ T ++ LS Q +VDCS GN GC GG + Y+Q GL EE YPY G
Sbjct: 153 MFRKTQKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQ 212
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C +K + + + +P EHAL +A+VGP++V+I+A +FQ Y SGIY ++
Sbjct: 213 PCNYKAEFSAANDTGFMDIPSGKEHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKE 272
Query: 254 CTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
C+S+ ++H +L VGY + WI+KN WS WGD GY+ + K N CGIA A
Sbjct: 273 CSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAA 332
Query: 305 VYALI 309
Y L+
Sbjct: 333 SYPLV 337
>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 331
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 111/299 (37%), Positives = 161/299 (53%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K+Y + ++ W+ N + I HNQEA G+H YTL N D+ ++ MT
Sbjct: 35 HTKEYITVEEEGIRRAIWEKNLRMIEAHNQEAALGMHTYTLGMNQFGDMTQEEVVERMTG 94
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L V + +P +D+R+KG +T NQ CG+C+AFS A++GQ+
Sbjct: 95 LQMPLNPEPRVPMETDGSLIKLPKSVDYRKKGMVTSVKNQGSCGSCWAFSSVGALEGQLA 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K T + +LS Q +VDC ++ N GC GG + N YVQ GG+ E YPY G+ C+
Sbjct: 155 KKTGNLVDLSPQNLVDC--VTENDGCGGGYMTNAFKYVQENGGIDSEAAYPYMGEDQPCR 212
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + I + +P DEHAL V L GP++V I+AS ++F Y GIY D C
Sbjct: 213 YNVSGLAAQIKGYKEVPEGDEHALAVALFKAGPVSVGIDASQNSFLYYQKGIYFDRNCNK 272
Query: 257 DYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ +NHA+L VGY N+ WI+KN W WG+ GY+ + R N CGIAN A Y ++
Sbjct: 273 EDINHAVLAVGYGVNAKGKKFWIVKNSWGETWGNKGYVLMARNRGNVCGIANLASYPVM 331
>gi|348542776|ref|XP_003458860.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 115/306 (37%), Positives = 178/306 (58%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y + +S +K W +N K + HN A QG Y L + +D+ Y K ++
Sbjct: 32 KFGKSYDSPSEESHRKQIWLTNRKHVLMHNILADQGFKSYRLGMTYFADMENEEYKKLVS 91
Query: 76 R-----LTHSRIRR--TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R S RR T +R PE + +PD +DWRE+G++T +Q+ CG+C+AFS
Sbjct: 92 RGCLGSFNASLPRRGSTFLRLPEG---IDLPDAVDWREQGYVTGVKDQKQCGSCWAFSAT 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F+ T + LS QQ+VDCS GN GC GG + + Y++ GG+ E YPY
Sbjct: 149 GALEGQHFRKTGILVSLSEQQLVDCSGAYGNEGCNGGWMDSAFRYIEANGGIDTEASYPY 208
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + +C++ ++ S + + DE ALK +AT+GP++V+I+AS +FQ Y SG+
Sbjct: 209 EAEDWLCRYNPASVGATCSGYVDVNKYDEEALKEAVATIGPVSVAIDASHASFQFYTSGV 268
Query: 249 YDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
YD+ C+S ++H +L VGY T N W++KN W WG+ GY+ + R +N+CGIA+
Sbjct: 269 YDEPGCSSIELDHGVLAVGYGTENGHDYWLVKNSWGRGWGEMGYIKMSRNKHNQCGIASA 328
Query: 304 AVYALI 309
A Y L+
Sbjct: 329 ASYPLV 334
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 176/314 (56%), Gaps = 17/314 (5%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
++W +F KKY + A +K W+ N KKI HN E H +TL NHL
Sbjct: 26 QQWQAWKLFHTKKYTTVTEEGA----RKAIWRDNLKKIQKHNAEG----HSFTLAMNHLG 77
Query: 64 DLHP---RHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
DL R++ M R +S + + + V +PD +DWR++G++TP NQ CG
Sbjct: 78 DLTQDEFRYFYTGM-RSHYSNYTKKQGSAFLAPSHVQVPDTVDWRKEGYVTPVKNQGQCG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ FK T ++ LS Q +VDCS GN GC GG + Y++ GG+
Sbjct: 137 SCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKYIKENGGI 196
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
EE YPY+ + C+F++ NI + + + DE ALK TVGPI+V+I+A +
Sbjct: 197 DTEESYPYEARNDRCRFQKSNIGAVDTGFVDVTHGDEEALKTAAGTVGPISVAIDAGHMS 256
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-N 295
FQ Y SG+Y++ C+S ++H +L+VGY + W++KN W WG GY+ + R N
Sbjct: 257 FQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQGSDYWLVKNSWGERWGMEGYIMMSRNKN 316
Query: 296 NRCGIANYAVYALI 309
N+CG+A A Y L+
Sbjct: 317 NQCGVATQASYPLV 330
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 175/305 (57%), Gaps = 14/305 (4%)
Query: 16 KYKKDYRKK---ATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
K+K +RK + +++ W+ N K I HN E +QG H +T+ N D+ + K
Sbjct: 31 KWKATHRKLYGLNEEGRRRAIWEKNMKMIERHNWEHRQGKHSFTMAMNAFGDMTNEEFRK 90
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M + + ++ V S L P +DWREKG++T NQ CG+C+AFS A++
Sbjct: 91 TMNGFQNQKHKKGKVFLDAG--SALTPHSVDWREKGYVTAVKNQGHCGSCWAFSATGALE 148
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+F+ TS++ LS Q +VDCS GN GC GG + N Y++ GGL EE YPY GK
Sbjct: 149 GQMFRKTSKLISLSEQNLVDCSWPEGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKD 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
CK+K + + + + +P Q E AL +ATVGPI+V I+AS +FQ Y++GIY +
Sbjct: 209 GSCKYKPQSSAANDTGYVDIPKQ-EKALMKAVATVGPISVGIDASHESFQFYSTGIYFEP 267
Query: 253 ACTSDYVNHAMLLVGY-------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
C+S+ ++H +L+VGY W++KN W + WG +GY+ + K NN CGIA A
Sbjct: 268 QCSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMA 327
Query: 305 VYALI 309
Y ++
Sbjct: 328 SYPVV 332
>gi|149510440|ref|XP_001518002.1| PREDICTED: cathepsin K-like [Ornithorhynchus anatinus]
Length = 618
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 114/301 (37%), Positives = 168/301 (55%), Gaps = 12/301 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K ++ ++L W+ N + I HN E G+H + L NHL D+ ++
Sbjct: 319 KKTHQKQYNSKEDETSRRLVWEKNLQYISAHNLEFSLGIHTFELAMNHLGDMTSEEVVRT 378
Query: 74 MTRLTHSRIR---RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
MT L R + SP+ E PD +D+R+KG++TP NQ CG+C+AFS A
Sbjct: 379 MTGLKVPPARTQSNDTLYSPDWAERA--PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGA 436
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K T + +LS Q +VDC ++ N GC GG + N YV G+ E+ YPY G
Sbjct: 437 LEGQLKKKTGRLLDLSPQNLVDC--VASNDGCGGGYMTNAFQYVHDNRGIDSEDAYPYVG 494
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C++ + +P DE ALK +A VGP+AV+I+AS +FQ Y+ G+Y
Sbjct: 495 QDEPCRYSPTGKAAKCRGYREVPVGDEKALKRAVARVGPVAVAIDASLSSFQFYSKGVYF 554
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
DE C +NHA+L VGY WI+KN W WG+ GY+ + R NN CGIA+ A
Sbjct: 555 DENCNGANLNHALLAVGYGAQKGAKHWIIKNSWGEEWGNKGYVLMARNKNNACGIASLAS 614
Query: 306 Y 306
+
Sbjct: 615 F 615
>gi|318816588|ref|NP_001187996.1| cathepsin L precursor [Ictalurus punctatus]
gi|308324547|gb|ADO29408.1| cathepsin L [Ictalurus punctatus]
Length = 334
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 174/303 (57%), Gaps = 9/303 (2%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y+ +S++K W N K + HN A QG+ Y L + +D+ + Y + +
Sbjct: 32 KFGKIYKSVEEESQRKNTWLENRKLVLVHNMLADQGIKSYRLGMTYFADMDNQEYRQSVF 91
Query: 76 RLTHSRIRRTLVRSPES----NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ RT + ++PD +DWR+KG++ +Q++CG+C+AFS ++
Sbjct: 92 KGCLGSFNRTKGHRASTFLLQAGGAVLPDTVDWRDKGYVAEVKDQKNCGSCWAFSATGSL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS QQ+VDCS GN+GC GG + Y++ G+ EE YPY+
Sbjct: 152 EGQTFRKTGKLVSLSEQQLVDCSGKYGNMGCGGGLMDLAFEYIEDNKGIDTEESYPYEAT 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C+FK + + + + +DE+AL+ +A +GPI+V+I+A +FQLY SGIY++
Sbjct: 212 DGDCRFKPATVGATCTGYVDINSEDENALQKAVANIGPISVAIDAGHISFQLYGSGIYNE 271
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C+S+ ++H +L VGY ++ W++KN W WGD GY+ + R NN+CGIA A Y
Sbjct: 272 PNCSSEDLDHGVLAVGYGTDNQQDYWLVKNSWGLDWGDQGYIKMTRNKNNQCGIATAASY 331
Query: 307 ALI 309
L+
Sbjct: 332 PLV 334
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 114/301 (37%), Positives = 168/301 (55%), Gaps = 9/301 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K+YR +A + +++ W+ N + + HN E GLH Y L NH+ D+
Sbjct: 32 KKAHGKEYRHQAEEGQRRATWEKNLRLVMLHNLEHSLGLHSYQLGMNHMGDMTSEDVAAL 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+T L R+ ++ PD +DWREKG +T NQ CGAC+AFS A++
Sbjct: 92 LTGL---RVPYGHNQTSTYRRRGGAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEA 148
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+ T ++ LS Q +VDCS++ GN GC GG + Y+ G+ EE YPY +
Sbjct: 149 QVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNG 208
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ S + LP DE ALK +A VGP++V+I+A+ TF LY SG+YDD
Sbjct: 209 TCQYNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPR 268
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYAL 308
CT + VNH +L+VGY ++ W++KN W +GD GY+ + R + N CGIA+YA Y
Sbjct: 269 CTQE-VNHGVLVVGYGTLNEKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYPQ 327
Query: 309 I 309
I
Sbjct: 328 I 328
>gi|348531523|ref|XP_003453258.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 341
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 178/306 (58%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K++K Y ++ ++++K W +N K + HN A QGL Y L +D+ Y + ++
Sbjct: 39 KFEKSYDSESDEAQRKQIWLNNRKHVLVHNILADQGLKSYRLGMTQFADMENEEYKRLVS 98
Query: 76 R-----LTHSRIRR--TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ S RR T R P+ ++PD +DWR+KG++T NQ DCG+C+AFS
Sbjct: 99 QGCLHSFNSSLPRRGSTFFRLPKG---TVLPDTVDWRDKGYVTNVQNQMDCGSCWAFSAT 155
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++GQ F+ T ++ LS QQ+VDCS GN GC GG + + Y+Q GG+ EE YPY
Sbjct: 156 GSLEGQHFRKTGKLVSLSKQQLVDCSGEFGNEGCNGGLMDSAFQYIQANGGIDTEESYPY 215
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C++ + + + + P +E LK +AT+GPI+V+I+A +FQ Y SG+
Sbjct: 216 EAEDGKCRYNPKSTGATCTGYVDVQPANEETLKEAVATIGPISVAIDAFHPSFQFYESGV 275
Query: 249 YDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
YD+ C+S ++HA+L VGY T N W++KN WG+ GY+ + R +N+CGIA
Sbjct: 276 YDEPDCSSTMLDHAVLAVGYGTENGLDYWLVKNSAGVGWGEKGYIKMSRNKSNQCGIATA 335
Query: 304 AVYALI 309
A Y L+
Sbjct: 336 ASYPLV 341
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 110/301 (36%), Positives = 171/301 (56%), Gaps = 6/301 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++++Y + ++ W+ N + I HN+EA G+H + + NHL D+ +++
Sbjct: 31 KSTHRREYNGLGEEGIRRAIWEKNMRMIEAHNEEAALGIHSFEMGMNHLGDMTSEEVVEK 90
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
MT L + + IP +D+R+KG +T NQ CG+C+AFS A A++G
Sbjct: 91 MTGLQIPMNQERSFTLAMDDMPSKIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEG 150
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+ KST ++ +LS Q +VDCS GN GC GG + YV G+ + YPY G+
Sbjct: 151 QLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHGIDSDASYPYTGRDE 210
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ + SS+ LP DE+ALK LAT+GPI+V+I+A F Y SG+Y+D +
Sbjct: 211 QCRYNPATRAANCSSYQFLPEGDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPS 270
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
CT + VNH +L VGY ++ W++KN W +GD GY+ + R N+CGIA YA Y +
Sbjct: 271 CTQE-VNHGVLAVGYGSLNGQDYWLVKNSWGSTFGDQGYIRMARNTGNQCGIALYACYPV 329
Query: 309 I 309
+
Sbjct: 330 M 330
>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
Length = 331
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 110/299 (36%), Positives = 162/299 (54%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K+Y + ++ W+ N + I HNQEA G+H Y L N+L D+ ++M
Sbjct: 35 HNKEYNGLDEEGIRRAIWEKNMRMIEAHNQEAALGMHSYELGMNNLGDMTSEEVAEKMMG 94
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L R N +P +D+R KG +TP NQ CG+C+AFS A++GQ+
Sbjct: 95 LQVPLNRDRGNTFVPDNTVERLPKSIDYRRKGMVTPVKNQGSCGSCWAFSSVGALEGQLM 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+T ++ +LS Q +VDC ++ N GC GG + N NYV+ G+ E YPY G+ C
Sbjct: 155 KTTGKLVDLSPQNLVDC--VTENNGCGGGYMTNAFNYVRDNQGIDSEAAYPYIGQDETCA 212
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + + +P +E AL V +A VGP++V I+A+ TFQ Y G+Y D C
Sbjct: 213 YNVSGMTASCRGYKEIPEGNERALTVAVAKVGPVSVGIDATLSTFQFYQKGVYYDRNCNK 272
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
D +NHA+L VGY + WI+KN WS WG+ GY+ + R N CGIAN A Y ++
Sbjct: 273 DDINHAVLAVGYGVTPKGKKYWIVKNSWSESWGNKGYILMARNRGNLCGIANLASYPIM 331
>gi|440893559|gb|ELR46281.1| Cathepsin L1 [Bos grunniens mutus]
Length = 330
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 116/292 (39%), Positives = 163/292 (55%), Gaps = 21/292 (7%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL---HPRHYIKEMTRLTHSRIRRTL 86
+K W+ N K I HNQE QG H +++ N D+ RH + R + + + T+
Sbjct: 48 RKAVWKKNMKMIELHNQEYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQRQKNKKGKETI 107
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
S IP +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 108 FAS--------IPPSMDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLS 159
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
Q +VDCS GN GC GG + N YV GGL EE YPY G C + N +
Sbjct: 160 EQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGTCLYNPNNSAANE 219
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + LP Q E AL +AT+GPI+V+++A +FQ Y SGIY + C+S+ V+HA+L+V
Sbjct: 220 TGFVDLPKQ-EKALMKAVATLGPISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVV 278
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY W++KN W HWG +GY+ + K NN CGIA A Y +
Sbjct: 279 GYGFEGADSDDNKYWLVKNSWGEHWGMDGYIKMAKDRNNHCGIATMASYPTV 330
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 171/304 (56%), Gaps = 8/304 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y ++ +KL W+ N + HN + G Y L N +DL ++
Sbjct: 32 KNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEEFVAM 91
Query: 74 MTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
MT + + S SN +P +DWR KG++TP +Q CG+C+AFS ++
Sbjct: 92 MTGFRVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ FK+T ++ LS Q +VDCS GN GC GG + Y+ AGG+ EE YPYK
Sbjct: 152 EGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAV 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C FK+ NI ++ ++ + E AL+ +A +GPI+V+I+AS +FQLY SG+Y++
Sbjct: 212 DGECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNE 271
Query: 252 EACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
C+S ++H +L VGY S WI+KN W+ WG NGY+++ R +N+CGIA A
Sbjct: 272 PDCSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRNKDNQCGIATQAS 331
Query: 306 YALI 309
Y L+
Sbjct: 332 YPLV 335
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 177/304 (58%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K++ K+Y+ + + ++ W+ N + I HN EA G+H Y L NH+ D+ ++
Sbjct: 34 KKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDMTEEEILQS 93
Query: 74 MTRLTHSRIRRTLVRSPES---NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
L ++ L R P + + +PD +DWR+KG++T NQ CG+C+AFS A
Sbjct: 94 FASL---KVPADLKREPSAFVASSGTPVPDTVDWRQKGYVTQVKNQGSCGSCWAFSSVGA 150
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ ++T ++ +LS Q +VDCS GN GC GG + YV G+ + YPY+G
Sbjct: 151 LEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGIDSDTSYPYQG 210
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
Q C + + + +S LP DE LK +A +GPI+V+I+A+ +F L+ SG+Y+
Sbjct: 211 VQGTCHYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSFILWRSGVYN 270
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D CT +NHA+L+VGY ++ W++KN W +G+NGY+ + R NN+CGIA Y
Sbjct: 271 DLTCTQK-INHAVLVVGYGTLDGQDYWLVKNSWGTRFGENGYIRMSRNRNNQCGIALYGC 329
Query: 306 YALI 309
Y ++
Sbjct: 330 YPIM 333
>gi|403300987|ref|XP_003941193.1| PREDICTED: cathepsin L2 [Saimiri boliviensis boliviensis]
Length = 333
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 169/290 (58%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E +G HG+T+ N D+ + + M + + + V R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSRGKHGFTMAMNAFGDMTNEEFRQVMVCFRNQKHKNGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 GPLL---LDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL E YPY+ K ICK+K N V + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMNYAFRYVKENGGLDSEASYPYEAKDGICKYKPENSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V+P ++ +K +ATVGPI+V+++AS +FQ Y SGIY ++ C+S ++H +L+VGY
Sbjct: 225 FVVIPTHEKELMKA-VATVGPISVAVDASHSSFQFYKSGIYFEKKCSSKNLDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG NGY+ + K NN CGIA A Y ++
Sbjct: 284 GFEGANSKDNKYWLIKNSWGPEWGLNGYIKIAKDQNNHCGIATAASYPVV 333
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 176/304 (57%), Gaps = 23/304 (7%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y ++ ++ + W+ N +I +N +++ + LR NH D+ + +M
Sbjct: 34 HNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVI----LRMNHFGDMTNTEFRAKMNG 89
Query: 77 LTHSRIRRTLVRSPESNESVLIPDH------LDWREKGFITPDWNQEDCGACYAFSIASA 130
L L+ ++ + L+P H +DWR +G++TP NQ CG+C+AFS A
Sbjct: 90 L--------LLHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGA 141
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ FK T + LS Q +VDCS GN GC GG + N +Y++ GG+ E YPY+G
Sbjct: 142 LEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEG 201
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C++ + +I D + + +P DE ALK +ATVGP++V+I+AS +FQ Y SG+YD
Sbjct: 202 QDGTCRYSKSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYD 261
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
+ C+ ++H +L+VGY ++ W++KN W WG GY+Y+ R N N+CGIA+ A
Sbjct: 262 EPQCSPSALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRNNQNQCGIASKAS 321
Query: 306 YALI 309
Y L+
Sbjct: 322 YPLV 325
>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 168/304 (55%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K+Y+ + + ++ W+ N + I HN EA +H Y L NH+ D+ +
Sbjct: 31 KKTHGKNYQTEVEELGRREVWERNLQLISLHNLEASMDMHTYDLGMNHMGDMTQEEIAQS 90
Query: 74 MTRLTHSRIRRTLVRSPES---NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
L + L R P + + IPD DWREKG++T Q CG+C+AFS A
Sbjct: 91 FASLL---VPADLKREPSAFAGSSGAPIPDTFDWREKGYVTGVKMQGSCGSCWAFSSVGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K+T ++ +LS Q +VDCS GN GC GG + YV G+ ++ YPYKG
Sbjct: 148 LEGQLMKTTGKLIDLSPQNLVDCSSKYGNKGCHGGFMTKAFQYVIDNQGIASDQSYPYKG 207
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
Q C + + S +S LP DE LK LAT+GPI+V I+A+ +F Y SG+Y+
Sbjct: 208 VQQQCIYNPAQRAANCSRYSFLPEGDEGVLKEALATIGPISVGIDATRPSFAFYRSGVYN 267
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D CT NHA+L VGY ++ W++KN W WGD GY+ + R +N+CGIA Y
Sbjct: 268 DPTCTKK-TNHAVLAVGYGTLGGQDYWLVKNSWGLSWGDQGYIRMSRNKDNQCGIALYGC 326
Query: 306 YALI 309
Y ++
Sbjct: 327 YPVM 330
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/302 (36%), Positives = 174/302 (57%), Gaps = 15/302 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K+Y K ++ + WQ+N KKI THN+ G H + L NHL D+ + +
Sbjct: 36 HGKEYPNKNEETMRNFIWQNNLKKIVTHNE----GKHSFKLAMNHLGDMTSLEISQTLLG 91
Query: 77 LTHSRIRRTLVRS----PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
L + + + P +N V + D +DWR KG++TP NQ CG+C+AFS A++
Sbjct: 92 LKLKKHAESQPKGATFLPPAN--VKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ T ++ LS Q +VDCS GN GC GG + N Y++ GG+ E+ YPY K
Sbjct: 150 GQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLAKD 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
+C + + I + + +P DE+AL+ LA+VGPI+++I+AS TF Y G+YDD
Sbjct: 210 GVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYDDP 269
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYA 307
C+S ++H +L VGY ++ W++KN W WG+ GY+ + R + ++CG+A+ A Y
Sbjct: 270 DCSSTRLDHGVLAVGYGTDDGKDYWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKASYP 329
Query: 308 LI 309
L+
Sbjct: 330 LV 331
>gi|327322928|gb|AEA48885.1| cathepsin S [Oplegnathus fasciatus]
Length = 337
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 175/303 (57%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ ++K Y+ + D +++ W+ N I HN EA GLH Y L NH+ DL ++
Sbjct: 38 KRTHEKKYQNEGEDVRRRELWEKNLMLITMHNLEASMGLHTYELSMNHMGDLTQEEILQS 97
Query: 74 MTRLTH-SRIRRTLVRSPESNES-VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
L+ + I+R SP + S +PD +DWREKG +T Q CG+C+AFS A A+
Sbjct: 98 FATLSPPTDIQR--APSPFAGTSGAAVPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGAL 155
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K+T ++ +LS Q +VDCS GN GC GG + YV G+ + YPY G+
Sbjct: 156 EGQLAKTTGKLLDLSPQNLVDCSSKYGNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQ 215
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + S +S LP DE ALK LAT+GPI+V+I+A+ +F Y SG+YDD
Sbjct: 216 SQQCHYNPAYRAANCSRYSSLPEGDEGALKEALATIGPISVAIDATRPSFTFYRSGVYDD 275
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+ T + VNH +L VGY ++ W++KN W +GD G++ + R N++CGIA Y Y
Sbjct: 276 QTYTRN-VNHGVLAVGYGTLNGKDYWLVKNGWGSTFGDKGFIRMARNKNDQCGIALYGCY 334
Query: 307 ALI 309
++
Sbjct: 335 PIM 337
>gi|226821419|gb|ACO82385.1| cathepsin K [Lutjanus argentimaculatus]
Length = 330
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 106/299 (35%), Positives = 165/299 (55%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K+Y + ++ W+ N + I HNQEA G+H Y + NHL D+ ++MT
Sbjct: 34 HRKEYNGLDEEGIRRAVWEKNMRMIEAHNQEAALGMHSYEMAMNHLGDMTSEEVSEKMTG 93
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L + ++ +P ++D+R+KG +T NQ CG+C+AFS A A++GQ+
Sbjct: 94 LLVPLNHKRSFTMALDDDVNRLPKYIDYRKKGMVTSVKNQGSCGSCWAFSSAGALEGQLA 153
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K T ++ +LS Q +VDC ++ N GC GG + YV GG+ EE YPY G+ C+
Sbjct: 154 KKTGQLVDLSPQNLVDC--VTENDGCGGGYMTKAFQYVADNGGIDSEEAYPYIGEDQPCR 211
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + + +P +EHAL V L GP++V I+A+ +FQ Y+ G+Y D +C
Sbjct: 212 YNATGMAAQCKGYKEIPEGNEHALAVALFKAGPVSVGIDATLSSFQFYSKGVYYDPSCNK 271
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ +NHA+L VGY + WI+KN W WG GY+ + R N CGIAN A Y ++
Sbjct: 272 EDINHAVLAVGYGVTGKGKKYWIVKNSWGESWGKGGYILMARNRGNLCGIANLASYPIM 330
>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
Length = 337
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 174/303 (57%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y+ + + ++ W+ N I HN EA GLH Y L NH+ D+ P +
Sbjct: 38 KKTHEKKYQNEVEEFSRRRLWEKNLMLITMHNLEASMGLHTYELGMNHMGDMTPEEIWQS 97
Query: 74 MTRLTH-SRIRRTLVRSPESNES-VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
LT + I+R SP + S IPD +DWREKG +T Q CG+C+AFS A+
Sbjct: 98 FATLTPPTDIQR--APSPFAGSSGADIPDTMDWREKGCVTSVKTQGSCGSCWAFSAVGAL 155
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K T ++ +LS Q +VDCS GN GC GG + + YV G+ + YPY G+
Sbjct: 156 EGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVIDNQGIDSDASYPYTGR 215
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + SS++ LP DE ALK LAT+GPI+V+I+A+ F Y SG+Y+D
Sbjct: 216 SDQCHYNPSYRAANCSSYNFLPEGDEGALKQALATIGPISVAIDATRPRFIFYRSGVYND 275
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+C+ + VNH +L VGY ++ W++KN W +GD GY+ + R N++CGIA Y Y
Sbjct: 276 PSCSQE-VNHGVLAVGYGTLNGQDYWLVKNSWGTKFGDQGYIRMARNQNDQCGIAMYGCY 334
Query: 307 ALI 309
++
Sbjct: 335 PIM 337
>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
Length = 334
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 172/303 (56%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y+ ++ ++ W +N K I HN EA GLH Y L NH+ DL ++
Sbjct: 35 KKTHGKSYKNDVENAHRRELWGNNLKMITVHNLEASMGLHTYELGMNHMGDLTEEEIMQF 94
Query: 74 MTRLTH-SRIRRTLVRSPESNES-VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
LT + I+R SP + S IPD +DWREKG +T Q CG+C+AFS A A+
Sbjct: 95 FASLTPPTDIQR--APSPFAGASGSGIPDTMDWREKGCVTKVKMQGACGSCWAFSAAGAL 152
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ KST ++ +LS Q +VDCS GN GC GG + YV G+ + YPY G+
Sbjct: 153 EGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHGIDSDASYPYIGR 212
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + SS+ LP DE+ALK LATVGPI+V+I+A F Y SG+Y+D
Sbjct: 213 DDQCHYNPATRAANCSSYQFLPEGDENALKQGLATVGPISVAIDARRPRFSFYRSGVYND 272
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT VNH +L VGY ++ W++KN W +GD GY+ + R N+CGIA Y Y
Sbjct: 273 PSCTQK-VNHGVLAVGYGTLNGQDYWLVKNSWGTTFGDQGYIRMARNTGNQCGIALYPCY 331
Query: 307 ALI 309
++
Sbjct: 332 PVM 334
>gi|27681979|ref|XP_225125.1| PREDICTED: cathepsin 7-like [Rattus norvegicus]
gi|109505372|ref|XP_001065135.1| PREDICTED: cathepsin 7-like [Rattus norvegicus]
Length = 331
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/301 (37%), Positives = 169/301 (56%), Gaps = 15/301 (4%)
Query: 18 KKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL 77
+K Y + + ++ L W+ N K I H E ++ +T+ N D+ +EM +
Sbjct: 37 EKTYSPEEEEQRRAL-WEENVKMIKQHTVENGLWMNNFTIEMNQFCDMTD----EEMRMM 91
Query: 78 THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
S +V IP LDWR+ G++TP Q CGAC+ F++A +I+GQ+FK
Sbjct: 92 IDSSAPTLRKEKRVQKRNVEIPKTLDWRKDGYVTPVRRQGACGACWGFAVAGSIEGQLFK 151
Query: 138 STSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKF 197
T ++ LS+Q +VDCS G +GC GG + N YV+ GGL E YPY+ K+ C++
Sbjct: 152 KTGKLSPLSVQNLVDCSRSFGTMGCNGGRIYNAFQYVKNNGGLEAEATYPYEAKEGNCRY 211
Query: 198 KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD 257
+ VV ++ + V+ P++E AL L +GPIAV I+A +F+ YA GIY + C D
Sbjct: 212 RPEKSVVKVTRFLVV-PRNEEALINALVNIGPIAVGIDAQHESFKKYAGGIYHEPNCKRD 270
Query: 258 YVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
NH+MLLVG+ R W++KN + WG+ GYM + RG NN CGIA+YA+Y +
Sbjct: 271 SPNHSMLLVGFGYEGQESEGRKYWLVKNSYGEQWGEKGYMKIPRGQNNYCGIASYAMYPV 330
Query: 309 I 309
+
Sbjct: 331 L 331
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 170/300 (56%), Gaps = 7/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N + HN++ +GL Y L N DL P + +
Sbjct: 34 HKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNG 93
Query: 77 LTHSRI--RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
+R R + P + +P +DWREKG +TP NQ CG+C+AFS +++GQ
Sbjct: 94 YRGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQ 153
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
F T + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YPY+ +
Sbjct: 154 HFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGE 213
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C+FK+ N+ + + + E LK +ATVGP++V+I+AS +FQLY+ G+YD+ C
Sbjct: 214 CRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETEC 273
Query: 255 TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+S+ ++H +L+VGY + W++KN W+ WGDNGY+ + R +N+CGIA+ A Y L+
Sbjct: 274 SSEQLDHGVLVVGYGVEDGKKYWLVKNSWAESWGDNGYIKMSRDKDNQCGIASAASYPLV 333
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 116/290 (40%), Positives = 164/290 (56%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HN+E QG HG+T+ N D+ + + M + + ++ + +
Sbjct: 48 RRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKMFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E IP +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFAE---IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDIS 207
+VDCS GN GC GG + N YV+ GGL EE YPY G+ + C +K P
Sbjct: 165 NLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYK-PECSAAND 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E AL +AT+GPI+V+I+A +FQ Y SGIY D C+S ++H +L+VG
Sbjct: 224 TGFVDLPQREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVG 283
Query: 268 Y-------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 284 YGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>gi|157278117|ref|NP_001098157.1| cathepsin S precursor [Oryzias latipes]
gi|50251130|dbj|BAD27582.1| cathepsin S [Oryzias latipes]
Length = 327
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 118/305 (38%), Positives = 169/305 (55%), Gaps = 15/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y + + ++ W+ N + I HN E GLH Y L NHL DL I
Sbjct: 29 KKTYSKTYSHEIEEFGRRRIWEENLEMISVHNLEVSLGLHSYELAMNHLGDLTIEELIAS 88
Query: 74 MTRLTH----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+T RI LV+ S +P+ +DWRE G +T Q CG+C+AFS
Sbjct: 89 LTGTVAPVGLERIHYDLVKINTS-----VPESVDWREGGLVTSVKTQGRCGSCWAFSAVG 143
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K+T + LS Q +VDCS GN GC GG + N YV G+ + YPY
Sbjct: 144 ALEGQLKKTTGILTSLSPQNLVDCSTKYGNYGCKGGFMSNAFQYVIKNQGISSDAAYPYI 203
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
GK+ CK+ + + + ++ LP DE ALKV +AT+GPI+V+I+AS F Y G+Y
Sbjct: 204 GKRDKCKYDSKHRAANCTGYNFLPKGDEFALKVGVATIGPISVAIDASRPKFLFYRHGVY 263
Query: 250 DDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
D +C+ + VNH +L+VGY T N W++KN W +GD GY+ + R N+CGIA YA
Sbjct: 264 KDHSCSHN-VNHGVLVVGYGTENGEDYWLVKNSWGERYGDGGYIKMARNRRNQCGIALYA 322
Query: 305 VYALI 309
+ ++
Sbjct: 323 CFPVM 327
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 113/315 (35%), Positives = 176/315 (55%), Gaps = 16/315 (5%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+ EW I +K+Y K Y+ + +++++L W+SN I HN A +G H + + N
Sbjct: 24 DNEWNIF----KKQYNKLYQNEE-EARRRLVWESNLDFITLHNLAADRGEHTFWVGMNEY 78
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP---ESNESVLIPDHLDWREKGFITPDWNQEDC 119
D+ + K M R+R +P N +PD +DWR KG++TP NQ C
Sbjct: 79 GDMTNEEFTKTMNGY---RMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQC 135
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G+C++FS +++GQ FK T ++ LS Q +VDCS GN GC GG + + Y++ G
Sbjct: 136 GSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNG 195
Query: 180 LMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
+ E YPYK + C+FK ++ + + + +DE ALK +ATVGPI+V+I+AS
Sbjct: 196 IDTEASYPYKARDGKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHM 255
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG- 294
+FQLY +G+Y D C+ ++H +L VGY +++ W++KN W WG GY+ + R
Sbjct: 256 SFQLYRTGVYHDWFCSQTKLDHGVLAVGYGTEDSKDYWLVKNSWGESWGQKGYIQMSRNR 315
Query: 295 NNRCGIANYAVYALI 309
N CGIA A Y +
Sbjct: 316 RNNCGIATSASYPTV 330
>gi|308322281|gb|ADO28278.1| cathepsin L [Ictalurus furcatus]
Length = 359
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 113/305 (37%), Positives = 178/305 (58%), Gaps = 10/305 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++K+ K Y+ +S++K WQ NHK + HN A +G+ Y L N+ +D+ + Y +
Sbjct: 29 KQKFGKIYKSVEEESQRKKTWQENHKLVMNHNILADKGIKSYRLGMNYFADMSNQEYRQS 88
Query: 74 MTRLTHSRIRRTLVRSPESNESVL----IPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ + S RTL S + + +P+ ++W + G++T Q+ C +C+AFS
Sbjct: 89 VFKGCLS-FNRTLNHSAATFLRQVGGPALPNTVNWTQMGYVTEVEEQKQCNSCWAFSATG 147
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ FK T ++ LS QQ+VDCS GN GC GG + YV+ GGL EE YPY+
Sbjct: 148 ALEGQTFKKTGKLVSLSKQQLVDCSKKFGNNGCKGGLMNWAFEYVKENGGLHTEESYPYE 207
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
K C+ + V + + +DE+AL+ +AT+GPI+V+I+A+ +FQLY SG+Y
Sbjct: 208 AKDGSCRDNLGTVGVTCTGHVQINSEDENALQEAVATIGPISVAIDANHTSFQLYESGLY 267
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
D+ C+ +NH +L VGY ++ W++KN W +WGD GY+ + R NN+CGIA A
Sbjct: 268 DEPDCSCTDMNHGVLAVGYGTDDGKDYWLIKNSWGINWGDKGYIKMSRNKNNQCGIATAA 327
Query: 305 VYALI 309
Y L+
Sbjct: 328 SYPLV 332
>gi|337255596|gb|AEI61876.1| cathepsin K [Gadus morhua]
Length = 331
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/299 (35%), Positives = 163/299 (54%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+++DY + ++ W+ N + I HN+EA G+H Y + NHL D+ ++M
Sbjct: 35 HRRDYNGLDEEGIRRAVWEKNSRMIAAHNEEAALGMHSYEMGMNHLGDMTSEEVSEKMMG 94
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L R + ++ +P +D+R+KG +T NQ CG+C+AFS A A++GQ+
Sbjct: 95 LLVPLNRDLTFTAAFDDKLEKLPKSVDYRKKGMVTSVKNQGSCGSCWAFSSAGALEGQLA 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+T + +LS Q +VDC ++ N GC GG + N +YV GG+ +E YPY G+ C
Sbjct: 155 KTTGTLRDLSPQNLVDC--VTENSGCGGGYMTNAFSYVMQNGGIDSDESYPYVGQDQQCG 212
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
F + + + +P DE AL V L GP++V I+A TFQ Y G+Y D C +
Sbjct: 213 FNVSGVAAECKGYKQIPVGDERALAVALFKAGPVSVGIDAGLGTFQFYQHGVYYDRNCNA 272
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ +NHA+L VG+ + WI+KN W WG GY+ + R N CGIAN A Y ++
Sbjct: 273 EDINHAVLAVGFGVTAKGKKYWIIKNSWGEDWGHKGYILMARNRGNLCGIANMASYPIM 331
>gi|54020916|ref|NP_001005702.1| cathepsin K (pycnodysostosis) precursor [Xenopus (Silurana)
tropicalis]
gi|49671274|gb|AAH75275.1| cathepsin K (pycnodysostosis) [Xenopus (Silurana) tropicalis]
Length = 329
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 171/304 (56%), Gaps = 14/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y + Y + + +++ W+ N I HN+E QGLH Y L NHL D+ +++
Sbjct: 32 KKTYHRQYNGQLDEIRRRQIWEKNLNLISQHNKEFSQGLHTYDLAMNHLGDMTSEEVVQK 91
Query: 74 MTRLT---HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M L + R T + PE N IP+++D+R+KG++TP NQ CG+C+AFS A
Sbjct: 92 MMGLKVPPNHRPNNTYI--PEWNSR--IPEYIDYRKKGYVTPVHNQGICGSCWAFSSVGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ K T ++ LS Q +VDC + N GC GG + N YV+ GG+ + +YPY G
Sbjct: 148 LEGQLMKKTGKLVSLSPQNLVDCD--TDNYGCEGGYMTNAFGYVRDNGGIDSDAEYPYVG 205
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C + + + +P E ALK +A VGP++VSI+AS +FQ Y G+Y
Sbjct: 206 QDEGCHYNPADKAATCKGYKEIPVGSEKALKRAVANVGPVSVSIDASLPSFQFYKKGVYY 265
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D +C D VNHA+L+VGY WI+KN W WG GY+ L R N CGIA+ A
Sbjct: 266 DSSCNPDAVNHAVLVVGYGNEKGIKHWIIKNSWGDWWGKKGYVLLARDKKNACGIASLAS 325
Query: 306 YALI 309
+ ++
Sbjct: 326 FPVM 329
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 160/289 (55%), Gaps = 12/289 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HNQE QG HG+T+ N D+ + + M + + ++ V
Sbjct: 48 RRAVWEKNMKMIEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFRNQKHKKGKVFQ 107
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
S + +P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPS--FLEVPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQN 165
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
+VDCS GN GC GG + Y++ GGL EE YPY CK+ RP V +
Sbjct: 166 LVDCSRPQGNEGCDGGLMDYAFQYIKENGGLDSEESYPYDAMDESCKY-RPEYSVANDTG 224
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
V P++E AL +ATVGPI+V+I+A +FQ Y G+Y + C+SD V+H +L+VGY
Sbjct: 225 FVDIPKEEKALMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYG 284
Query: 269 -------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA A Y +
Sbjct: 285 YEETESDNNKFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYPTV 333
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 107/299 (35%), Positives = 167/299 (55%), Gaps = 7/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y +S++ ++ N +I HN+ G Y L N +DL ++
Sbjct: 86 HDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAEFVN-FNG 144
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L + + T S S ++++PD +DWR KG++T NQ CG+C+AFS +++GQ F
Sbjct: 145 LKMTNLNNTKCSSHLSANNIVVPDSVDWRSKGYVTKVKNQGACGSCWAFSATGSLEGQYF 204
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ ++ LS Q+VDCS GN GC GG + N YV+ GG+ E DYPYK +Q C
Sbjct: 205 RKNGKLVPLSESQLVDCSGSFGNEGCNGGFMENAFKYVKSVGGIESESDYPYKARQRTCA 264
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
F + ++ +S + E +LK ++ VGP++V+I+A +FQLYA G+YD+ C++
Sbjct: 265 FDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVSVAIDAGHSSFQLYAGGVYDEPLCST 324
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+NH +L VGY ++ WI+KN W WG GY+ + R NN+CGIA+ A Y L+
Sbjct: 325 SRLNHGVLCVGYGTSLQGKDYWIVKNSWGVRWGVEGYIKMSRNKNNQCGIASEASYPLV 383
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 109/292 (37%), Positives = 167/292 (57%), Gaps = 12/292 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+S ++ W+ N K I HN E QG H +T+ N D+ + + MT + + +
Sbjct: 45 ESLRRAVWEKNLKMIEQHNLEYSQGKHTFTMGMNAFGDMTNEDFRQMMTGFQNQKYNKGE 104
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
V P + + +P+ +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 105 VFQPP--QPLEVPESVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLS 162
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
Q +VDCS N GC GG + YV+ GGL EE YPY+ +S C++ N +
Sbjct: 163 EQNLVDCSQPQHNSGCKGGLVIKAFQYVKDNGGLDSEESYPYEEMESTCRYSPGNSAATV 222
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + + P +E AL+ +A+VGPI+V+I+A H+FQ Y GI + C+ ++NHA+L+V
Sbjct: 223 TGFKHI-PAEEKALEKAVASVGPISVAIDAHHHSFQFYTGGILHEPNCSPKWLNHAVLVV 281
Query: 267 GY--------TRNSWILKNWWSHHWGDNGY-MYLKRGNNRCGIANYAVYALI 309
GY W++KN W WG GY M K NN CGIA+ A+Y ++
Sbjct: 282 GYGVMQEGSNNNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPIV 333
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 112/293 (38%), Positives = 165/293 (56%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR---TL 86
+++ W+ N +KI HN E G H Y L N D+ + + M H + RR +L
Sbjct: 48 RRMVWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYKHKKERRFRGSL 107
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E +P+ LDWREKG++TP +Q +CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 108 FMEPNFLE---VPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLS 164
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GL EE YPY G C + +
Sbjct: 165 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAAN 224
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P EHAL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L
Sbjct: 225 DTGFVDIPSGKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLA 284
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS +WGD GY+Y+ K +N CGIA A Y L+
Sbjct: 285 VGYGFEGEDVDGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPLV 337
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 106/308 (34%), Positives = 171/308 (55%), Gaps = 14/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y+ K + + + SNHK I HN E + G H + L N +D+ + + M
Sbjct: 49 KHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMN 108
Query: 76 RLTHSRIRRTLVRSP--------ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
R+ P E ++V IPD +DWR++G++T +Q CG+C+AFS
Sbjct: 109 GFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSA 168
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ +K T ++ LS Q +VDC + + GC GG + YV+ G+ E YP
Sbjct: 169 TGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEASYP 228
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
YKG+ C+FK ++ + + +P +E L+ +ATVGP++V+I+A+ FQ Y+ G
Sbjct: 229 YKGRDGRCRFKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQFYSHG 288
Query: 248 IYDDEACTSDYVNHAMLLVGYT-----RNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIA 301
+Y D +C+ +Y++H +L VGY + +I+KN WS WGD+GY+ + +R NN CGIA
Sbjct: 289 VYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSRRKNNNCGIA 348
Query: 302 NYAVYALI 309
A Y +
Sbjct: 349 TMASYPFV 356
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 115/291 (39%), Positives = 162/291 (55%), Gaps = 16/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+K W+ N K I HNQE QG H +++ N D+ + M R+ +
Sbjct: 48 RKAVWKKNMKMIELHNQEYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQ----RQKNKKG 103
Query: 90 PESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
E +E++ IP +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 104 KEFHETIFASIPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSE 163
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
Q +VDCS GN GC GG + N YV GGL EE YPY G C + N + +
Sbjct: 164 QNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGTCLYNPNNSAANET 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ LP Q E AL +A +GPI+V+++A +FQ Y SGIY + C+S+ V+HA+L+VG
Sbjct: 224 GFVDLPKQ-EKALMKAVANLGPISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVG 282
Query: 268 YTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y W++KN W HWG NGY+ + K NN CGIA A Y +
Sbjct: 283 YGFEGADSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYPTV 333
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 109/302 (36%), Positives = 171/302 (56%), Gaps = 13/302 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K+Y+ + + + + N I HN++ Y L N D+ ++ TR
Sbjct: 38 KEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEYGDMLHHEFVS--TRNG 95
Query: 79 HSRIRRTLVR------SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
R R+ R PE E +P +DWR+KG +TP NQ CG+C+AFS +++
Sbjct: 96 FRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 155
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ + ++ LS Q +VDCS GN GC GG + N Y++ GG+ E+ YPY G
Sbjct: 156 GQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTD 215
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C FK+ ++ + + +P +EH LK +ATVGPI+V+I+AS +FQ Y+ G+YD+
Sbjct: 216 GTCHFKKSDVGATDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEP 275
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+S+ ++H +L+VGY ++ W++KN W WGD GY+Y+ R +N+CGIA+ A Y
Sbjct: 276 ECSSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYP 335
Query: 308 LI 309
L+
Sbjct: 336 LV 337
>gi|354504280|ref|XP_003514205.1| PREDICTED: cathepsin M-like [Cricetulus griseus]
gi|344250849|gb|EGW06953.1| Cathepsin M [Cricetulus griseus]
Length = 333
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 175/304 (57%), Gaps = 15/304 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K+Y + + +K+ W+ N K + +N E +QG G+T+ N D+ + K MT
Sbjct: 35 KYEKNYSLE-EEGQKRTVWEENLKLVKLYNAEYKQGKKGFTVEMNAFGDMTGEEFRKMMT 93
Query: 76 RL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
+ +R +R ++ P + +P +DWR KG++TP Q C +C+AFS+A AI+GQ
Sbjct: 94 VIPVQTRRKRKSIQPPMVS---YVPKFVDWRRKGYVTPVKIQGRCNSCWAFSVAGAIEGQ 150
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
+F+ T + LS Q +VDCS GNLGC G+ L YVQ GL E YPY+ K+
Sbjct: 151 MFRKTRRLVSLSPQNLVDCSRPEGNLGCYEGNTYYALKYVQHNRGLEAEATYPYEAKEGP 210
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++ + ++ + + ++E AL +AT+GPI+V I+A +F+LY GIY + C
Sbjct: 211 CRYHPEHSAARVTDF-MFVSKNEKALMHAVATIGPISVGIDAGHESFKLYKGGIYYEPNC 269
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
+S+ +NH++LLVGY R W++KN WG NGYM + R NN CGIA YA+
Sbjct: 270 SSEVINHSVLLVGYGYEGRESDGRKYWLIKNSHGERWGMNGYMKIARDRNNHCGIATYAI 329
Query: 306 YALI 309
Y +
Sbjct: 330 YPRV 333
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 107/300 (35%), Positives = 179/300 (59%), Gaps = 13/300 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K Y +A D ++ + W+ + I+ HN EA G H ++L N DL +H M+
Sbjct: 31 HSKTYATEAEDMRRFI-WERHLNMINQHNIEADLGKHTFSLGMNEYGDL-TQHEYAAMSG 88
Query: 76 -RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
++ S + + + E++ +P +DWREKG++TP NQ CG+C+AFS +++GQ
Sbjct: 89 YKMAKSSVGSSFLEP----ENLQVPKTVDWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQ 144
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
+F+ T + +S Q +VDCS GN+GC+GG + N Y++ G+ E+ YPY+
Sbjct: 145 VFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGE 204
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++K+ + V S + +P DE AL+ +A+VGP++V+I+AS +FQ Y +G+Y + C
Sbjct: 205 CRYKKSDSVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDASHTSFQFYKTGVYTEANC 264
Query: 255 TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+S ++H +L+VGY ++ W++KN W WG+ GY+ L R N+CGIA+ A Y L+
Sbjct: 265 SSTQLDHGVLVVGYGVENGQDYWLVKNSWGASWGEAGYIKLARNHGNQCGIASQASYPLL 324
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 110/304 (36%), Positives = 172/304 (56%), Gaps = 11/304 (3%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ + YR + + ++ W +N K + HN A QG+ Y L +D+ Y K +
Sbjct: 33 KFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEY-KSLI 91
Query: 76 RLTHSRIRRTLVRSPESN-----ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
L R T S E +P +DWR+KG++T +Q+ CG+C+AFS +
Sbjct: 92 SLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGS 151
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ F+ T ++ LS QQ+VDCS GN+GC GG + Y+Q GG+ E+ YPY+
Sbjct: 152 LEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKSYPYEA 211
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C+FK N+ + + + DE ALK +AT+GP++V I+AS +FQLY SG+YD
Sbjct: 212 EDGQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQLYDSGVYD 271
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
++ C+S ++H +L VGY ++ W++KN W WG GY+ + R +N+CGIA A
Sbjct: 272 EQDCSSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDNQCGIATAAS 331
Query: 306 YALI 309
Y L+
Sbjct: 332 YPLV 335
>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
Length = 294
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 169/292 (57%), Gaps = 9/292 (3%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
++ ++ W SN K + HN A QG+ Y L +D+ Y + ++ +
Sbjct: 3 EAARRQIWLSNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYKRLISLGCLGAFNASA 62
Query: 87 VRSPES----NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
R + E +P +DWR+KG++T +Q+ CG+C+AFS +++GQ ++ T ++
Sbjct: 63 PRKGSAFFRLAEGTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLEGQNYRKTGKL 122
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS QQ+VDCS GN+GC GG + + Y+Q GG+ EE YPY+ + C+FK NI
Sbjct: 123 VSLSEQQLVDCSGDYGNMGCGGGLMDSAFKYIQENGGIDTEESYPYEAEDGKCRFKPQNI 182
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
+ + + DE ALK +AT+GP++V+I+AS +FQLY SG+YD+ C+S+ ++H
Sbjct: 183 GAKCTGYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDELECSSEDLDHG 242
Query: 263 MLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+L VGY ++ W++KN W WG GY+ + R N+CGIA+ A Y L+
Sbjct: 243 VLAVGYGTDNGQDYWLVKNSWGLGWGQKGYIMMSRNKHNQCGIASMASYPLV 294
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 110/300 (36%), Positives = 173/300 (57%), Gaps = 8/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K Y A + ++L W++N I HN E GLH YTL N+ +DL + + M
Sbjct: 35 FGKQY-STAEEITRRLAWEANVAIIRQHNLEHDLGLHTYTLGLNNYADLTNAEFNQVMNG 93
Query: 76 -RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
R+ S+ + R+ + V +P +DWR KG++TP +Q CG+C+AFS +++GQ
Sbjct: 94 LRVNASQTKSANRRTYVAPVGVELPTSVDWRTKGYVTPIKDQGQCGSCWAFSSTGSLEGQ 153
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
F T ++ LS Q + DCS GN+GC GG + Y++ G+ E YPYK
Sbjct: 154 HFAKTGQLVSLSEQNLTDCSQKQGNMGCNGGLMDQAFTYIKENNGIDTESSYPYKAVDEK 213
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C FK ++ + ++ + QDE+AL+ +ATVGPI+V+I+AS +FQLY SG Y++ AC
Sbjct: 214 CHFKAADVGATDTGYTDIAQQDENALQSAIATVGPISVAIDASHSSFQLYRSGAYNERAC 273
Query: 255 TSDYVNHAMLLVGYT----RNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++ ++H +L VGY ++ +I+KN W WG GY+++ R NN+CGIA + Y +
Sbjct: 274 SATQLDHGVLAVGYDSEDGKDYYIVKNSWGTSWGQKGYIWMTRNKNNQCGIATMSTYPTV 333
>gi|348531585|ref|XP_003453289.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 366
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 169/292 (57%), Gaps = 15/292 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL--- 86
+K W +N K + HN A QGL Y L H +D+ Y + +++ +L
Sbjct: 78 RKQIWLNNRKHVLMHNILADQGLKSYHLGMTHFADMEHEEYKQLISQSFLGSFNASLPQR 137
Query: 87 ----VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
R PE + +PD +DWR+KG++T +Q+ CG+C+AFS ++GQ F+ T ++
Sbjct: 138 GSAFFRLPEGTD---LPDTVDWRDKGYVTEVKDQKICGSCWAFSTTGVLEGQHFRKTGKL 194
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS QQ++DCS GN GC GGS++ Y+Q GG+ E YPY+ K C++K I
Sbjct: 195 VSLSEQQLMDCSHSFGNNGCNGGSVKRAFQYIQANGGIDTEASYPYEAKGQQCRYKPDGI 254
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
+ + + P +E ALK +AT+GPI+V I+AS ++F+ Y SG+YD+ C+ +NH
Sbjct: 255 GAKCTGYVEVKPSNEDALKEAVATIGPISVGIDASHNSFRFYQSGVYDEPDCSKTVLNHD 314
Query: 263 MLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+L VGY T N W++KN W WGD GY+ + R +N+CGIA+ A Y L+
Sbjct: 315 VLAVGYGTENGHDYWLIKNSWGIRWGDKGYIKMSRNKSNQCGIASDATYPLV 366
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 171/304 (56%), Gaps = 17/304 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K+Y +K + +++ W+ N +KI HN E G+H Y L NH D++ + + M
Sbjct: 38 KNYHEK-EEGWRRMIWEKNLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYK 96
Query: 79 HSRIRR---TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
H R+ +L P E +P LDWREKG++TP +Q +CG+C+AFS A++GQ+
Sbjct: 97 HKTERKFKGSLFMEPNFLE---VPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQM 153
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSI 194
F+ ++ LS Q +VDCS GN GC GG + Y++ GL EE YPY G
Sbjct: 154 FRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQP 213
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C + + + + +P EHAL +A+VGP++V+I+A +FQ Y SGIY ++ C
Sbjct: 214 CHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKEC 273
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S+ ++H +L+VGY + WI+KN WS WGD GY+Y+ K N CGIA A
Sbjct: 274 SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAAS 333
Query: 306 YALI 309
Y L+
Sbjct: 334 YPLV 337
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 111/298 (37%), Positives = 163/298 (54%), Gaps = 6/298 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y+ + D ++ W+ N I HN EA GLH Y L NH+ DL ++
Sbjct: 41 HGKKYQTEVEDVSRRELWEKNLMLITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFAT 100
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L+ + +PD +DWREKG +T Q CG+C+AFS A A++GQ+
Sbjct: 101 LSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLA 160
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+T ++ +LS Q +VDCS GN GC GG + YV G+ + YPY G+ C+
Sbjct: 161 KTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVIDNQGIDSDASYPYTGRNGECR 220
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + S +S LP +E ALK LA +GPI+V+I+A+ TF Y SG+Y+D C S
Sbjct: 221 YNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNC-S 279
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
VNH +L VGY ++ W++KN W +GD GY+ + R N++CGIA Y Y ++
Sbjct: 280 QKVNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPIM 337
>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 121/304 (39%), Positives = 165/304 (54%), Gaps = 16/304 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
YKK Y D ++ + W+ N K I HNQE QG HG+T+ N D+ + + M
Sbjct: 36 YKKPYAVNEEDWRRAV-WEKNVKMIERHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNG 94
Query: 77 LTHSRIRR-TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ + ++ L P IP +DW +KG++TP NQ CG+C+AFS A++GQ+
Sbjct: 95 FQNQKHKKGKLFYEPVFGH---IPTSVDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-I 194
F+ T ++ LS Q +VDCS GN GC GG + N YVQ GGL EE YPY +
Sbjct: 152 FRKTGKLVSLSEQNLVDCSRREGNEGCNGGLMDNAFQYVQDNGGLDSEESYPYLATDTHT 211
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C +K P + V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY + C
Sbjct: 212 CNYK-PECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHESFQFYKSGIYYEPGC 270
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S ++H +LLVGY WI+KN W WG NGY+ + K NN CGIA A
Sbjct: 271 SSKDLDHGVLLVGYGFEGKDSENNKFWIVKNSWGTSWGTNGYVKMAKDQNNHCGIATAAS 330
Query: 306 YALI 309
Y +
Sbjct: 331 YPTV 334
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 104/298 (34%), Positives = 164/298 (55%), Gaps = 12/298 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y ++ + W+ N ++I HN + + L N D+ +
Sbjct: 34 HNKAYSHDGEETVRYTIWKDNERRIREHNLQGGD----FLLEMNQFGDMTNNEFKDFNGY 89
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L+H + + +P S + PD +DWR +G++TP +Q CG+C+AFS +++GQ F
Sbjct: 90 LSHKHVSGSTFLTPNS---FVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNF 146
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K T ++ LS Q +VDCS GN GC GG + N Y++ G+ E YPY K C
Sbjct: 147 KKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKDGKCA 206
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
F +PN+ + + +P DE+ LK +A+VGPI+V+I+AS +FQ Y G+Y++ C+S
Sbjct: 207 FTKPNVAATDTGFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSS 266
Query: 257 DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
++H +L+VGY S W++KN W+ WGD GY+ + R N+CGIA A Y L+
Sbjct: 267 TELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMSRNAKNQCGIATNASYPLV 324
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 167/304 (54%), Gaps = 16/304 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
YKK Y D ++ + W+ N K I HNQE QG HG+T+ N D+ + + M
Sbjct: 36 YKKPYAANEEDWRRAV-WEKNMKMIERHNQEYSQGKHGFTMTMNAFGDMTNEEFRQVMNG 94
Query: 77 L-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
RI+ L+ P IP +DW +KG++TP +Q CG+C+AFS A++GQ+
Sbjct: 95 FQNQKRIQGKLLYEPVFGH---IPKSVDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSI 194
F+ T ++ LS Q +VDCS GN GC GG + N Y++ GGL EE YPY +
Sbjct: 152 FRKTGKLVSLSEQNLVDCSRREGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYTAMDKQD 211
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++ + + + +PPQ E AL +ATVGPI+V+++A +FQ Y SGIY D C
Sbjct: 212 CRYNPKYSAANDTGFVDIPPQ-EKALMKAVATVGPISVAVDAGHESFQFYKSGIYYDSNC 270
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S +NH +L+VGY W++KN W WG +GY+ + K NN CGIA A
Sbjct: 271 SSKDLNHGVLVVGYGFEGIDSANNRYWLVKNSWGTGWGTDGYIKMAKDRNNHCGIATAAS 330
Query: 306 YALI 309
Y +
Sbjct: 331 YPTV 334
>gi|118404242|ref|NP_001072435.1| cathepsin K precursor [Xenopus (Silurana) tropicalis]
gi|113197688|gb|AAI21683.1| hypothetical protein MGC147539 [Xenopus (Silurana) tropicalis]
Length = 331
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 109/302 (36%), Positives = 166/302 (54%), Gaps = 8/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K Y + + ++L W+ N K I HN E +GLH Y + NHL D+ ++
Sbjct: 32 KRTYHKQYNGQMDEGLRRLIWEKNFKMITAHNLEYSEGLHTYEMAMNHLGDMTSEEVVRT 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT L H R R T S ++ IPD +D+R+KG++TP NQ CG+C+AFS A++
Sbjct: 92 MTGLWVHRRNRSTNFTSEDNTAQEKIPDSIDYRKKGYVTPIRNQGSCGSCWAFSSVGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ K ++ +LS Q +VDC + N GC GG + N YV+ G+ E YPY G+
Sbjct: 152 GQLKKKKGKLVDLSPQNLVDC--VKKNDGCGGGYMTNAFEYVRDNKGIDSENAYPYVGED 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + + E ALK + VGP++V I+A +FQ Y+ G+Y D+
Sbjct: 210 QECMYNATGKAASCKGFKEVQEGSEKALKKAVGLVGPVSVGIDAGLSSFQFYSKGVYYDK 269
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYA 307
C ++ +NHA+L VGY WI+KN W WG+ GY+ + R +N CGI++ A Y
Sbjct: 270 DCNAENINHAVLAVGYGTQKKTKYWIVKNSWGEDWGNKGYILMAREKDNACGISSLASYP 329
Query: 308 LI 309
++
Sbjct: 330 VM 331
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/298 (37%), Positives = 164/298 (55%), Gaps = 6/298 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y+ + D ++ W+ N I HN EA GLH Y L NH+ DL ++
Sbjct: 41 HGKKYQTEVEDVSRRELWEKNLMLITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFAT 100
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L+ + +PD +DWREKG +T Q CG+C+AFS A A++GQ+
Sbjct: 101 LSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLA 160
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+T ++ +LS Q +VDCS GN GC GG + + YV G+ + YPY G+ C+
Sbjct: 161 KTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVIDNQGIDSDASYPYTGRNGECR 220
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + S +S LP +E ALK LA +GPI+V+I+A+ TF Y SG+Y+D C S
Sbjct: 221 YNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNC-S 279
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
VNH +L VGY ++ W++KN W +GD GY+ + R N++CGIA Y Y ++
Sbjct: 280 QKVNHGVLAVGYGTLDGQDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPIM 337
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 167/303 (55%), Gaps = 9/303 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KKY K+Y+ K + +++ W+ N + + HN E GLH Y L NHL D+
Sbjct: 33 KKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNHLGDMTSEEVTAL 92
Query: 74 MT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
MT ++ S+ R + + S PD +DWREKG +T NQ CG+C+AFS A+
Sbjct: 93 MTGLKIPVSQSRNSTLYWARQGASA--PDTVDWREKGCVTNVKNQGSCGSCWAFSAVGAL 150
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+ Q+ T + LS Q +VDCS GN GC GG + YV + G+ E YPY G+
Sbjct: 151 ECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYNNGIDSEASYPYTGQ 210
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S + LP +E ALK +A GP++V+I+AS +F L+ G+YDD
Sbjct: 211 SGTCRYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDASRPSFFLFRKGVYDD 270
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CTS ++NH +L+VGY W++KN W +GD GY+ + R +NRCGIA+ Y
Sbjct: 271 PSCTSAHINHGVLVVGYGTEDGIDYWLVKNSWGVSFGDQGYIKIARNHDNRCGIASQCTY 330
Query: 307 ALI 309
L+
Sbjct: 331 PLM 333
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 110/283 (38%), Positives = 162/283 (57%), Gaps = 10/283 (3%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
+Q N I HN+E +QG H Y L NH DL ++ E + + V + ++N
Sbjct: 47 FQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFL-ERSNGFQGGVSGGDVFTFDTN 105
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
V P + +W KG +TP +Q CG+C+AFS +++GQIF ++ LS QQ+VDC
Sbjct: 106 APV--PSYANWTAKGAVTPVKDQGKCGSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDC 163
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GNLGC GG + N Y G+ E+ YPY K + CK+K+ V ISS+ +
Sbjct: 164 SGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYTAKDNDCKYKKSMSVATISSFKDVK 223
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
+DE LK+ +A VGP++V+I+AS FQ Y SG+Y DE C+S+ ++H +L VGY +
Sbjct: 224 HKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGVYYDENCSSEVLDHGVLAVGYGTDKK 283
Query: 273 -----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W+ WG NGY+ + R +N CGIA A Y ++
Sbjct: 284 SGMDFWLVKNSWAASWGLNGYIKMARNKDNNCGIATMASYPIV 326
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 108/302 (35%), Positives = 173/302 (57%), Gaps = 7/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
+K + K Y +++L ++S KI+ HN GL Y + N +D+ +
Sbjct: 23 KKVHGKSYGHDEEHFRRQLFYKS-VAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRNF 81
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ + ++ +R R + +P +DWREKG++TP NQ CG+C+AFS +++
Sbjct: 82 KGLKFDATKTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLE 141
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ FK+T ++ LS Q +VDCS + GN GC GG + N Y+Q GG+ EE YPY GK
Sbjct: 142 GQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKD 201
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C F ++ + + +P +DE AL+ +A+VGP++V+I+AS +FQ Y G+YD+
Sbjct: 202 GDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEP 261
Query: 253 ACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
+C+ ++H +L+VGY T N W++KN W WG +GY+ + R N+CGIA+ A Y
Sbjct: 262 SCSFSQLDHGVLVVGYGTENGVDYWLVKNSWGPTWGQDGYIKMMRNKENQCGIASMASYP 321
Query: 308 LI 309
+
Sbjct: 322 TV 323
>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
Length = 331
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/287 (38%), Positives = 164/287 (57%), Gaps = 14/287 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRTLVR 88
++ W+ N K I HNQE QG H +T+ N DL + + M L + R + +
Sbjct: 48 RRAVWEKNLKVIKQHNQEYSQGKHSFTMAMNAFGDLTNEEFKQVMNGLKSQKRKEGNVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+P E+ P +DWR+KG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 APPFAET---PSSVDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC+GG + YV+ GGL EE YPY+ + CK+K + +
Sbjct: 165 NLVDCSQAEGNEGCSGGLMDYAFQYVKDNGGLDSEESYPYRAQDESCKYKPEQSAANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ + P++E +LK+ +ATVGPI+ +I+AS TFQ Y GIY D C+S+ ++H +L+VGY
Sbjct: 225 FMDIHPEEE-SLKLAVATVGPISAAIDASLSTFQFYHKGIYYDPDCSSENLDHGILVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
WI+KN W WG GY+ + K +N CGIA A +
Sbjct: 284 GSQGEDSEKQKYWIVKNSWGTDWGTQGYILMAKDRDNHCGIATAASF 330
>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
Length = 334
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 118/291 (40%), Positives = 162/291 (55%), Gaps = 15/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN+E QG HG+T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIDLHNREYSQGQHGFTMAMNAFGDMTNEEFRQVMNGFRNQKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E IP +DW KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFAE---IPKSVDWTLKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDIS 207
+VDCS GN GC GG + N YV+ GGL EE YPY G + CK+K P
Sbjct: 165 NLVDCSRSQGNEGCNGGLMDNAFQYVKENGGLDSEESYPYLGTDTDSCKYK-PECSAAND 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VG
Sbjct: 224 TGFVDIPQREKALMKAVATVGPISVAIDAGHQSFQFYKSGIYYDPDCSSKDLDHGVLVVG 283
Query: 268 Y--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 284 YGFEGTDSNNNKFWIVKNSWGPEWGTNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 176/324 (54%), Gaps = 19/324 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + K+Y + K + K++ ++ H+ I HNQ +QG Y LR N
Sbjct: 22 LVREEWSAFKLEHSKRYDSEVEDKF---RMKIYLENKHR-IAKHNQRFEQGAVSYKLRPN 77
Query: 61 HLSDLHPRHYIKEMT---------RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFIT 111
+D+ ++ M + H + R + + + V PDH+DWR+KG +T
Sbjct: 78 KYADMLSHEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVT 137
Query: 112 PDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTL 171
+Q CG+C+AFS A++GQ F+ T + LS Q ++DCS GN GC GG + N
Sbjct: 138 EVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAF 197
Query: 172 NYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIA 231
Y++ GG+ E+ YPY+G C++ N D + +P DE L +ATVGP++
Sbjct: 198 KYIKDNGGIDTEKAYPYEGVDDKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVS 257
Query: 232 VSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDN 286
V+I+AS +FQ Y+ G+Y DE C+S ++H +++VGY + W++KN W WGD
Sbjct: 258 VAIDASQESFQFYSDGVYYDENCSSTDLDHGVMVVGYGTDEQGGDYWLVKNSWGRTWGDL 317
Query: 287 GYMYLKRG-NNRCGIANYAVYALI 309
GY+ + R NN CGIA+ A Y L+
Sbjct: 318 GYIKMARNKNNHCGIASSASYPLV 341
>gi|348513412|ref|XP_003444236.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 328
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 169/303 (55%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K +KK Y + + ++L W+ N K I+ HN EA LH Y L NH D+ P ++
Sbjct: 30 KKTHKKVYSHQIEEWGRRLIWEMNLKMINLHNLEASLNLHTYELAINHFGDMTP---VEI 86
Query: 74 MTRLTHSRIRRTL--VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
LT +R+ L V + + +P +DWR+KG +TP Q CGAC+AFS A A+
Sbjct: 87 TGTLTGTRVPSDLEMVAASSIKVNASLPASVDWRDKGLVTPVKTQGSCGACWAFSAAGAL 146
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ KST + LS Q ++DC+ GN GC GG + YV G+ E+ YPY G+
Sbjct: 147 EGQLKKSTGILRSLSTQNLIDCTTDYGNRGCNGGLIARAFKYVVDNQGIASEDAYPYIGR 206
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+ CK+ + S + LP DE ALK +A VGP+AV+I+AS F Y G+Y D
Sbjct: 207 HNQCKYNPLYRAANCSGYYCLPRGDEFALKEAVALVGPVAVAIDASRPQFHFYHRGVYMD 266
Query: 252 EACTSDYVNHAMLLVGYTR----NSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
CT VNH L+VGY R + W++KN W +G+ GY+ + R NN+CGI ++A +
Sbjct: 267 NTCTQK-VNHGSLVVGYGREKGQDYWLVKNSWGVQFGEEGYIKMARNRNNQCGITHHACF 325
Query: 307 ALI 309
+
Sbjct: 326 PFV 328
>gi|9931986|ref|NP_064680.1| cathepsin R precursor [Mus musculus]
gi|23813621|sp|Q9JIA9.1|CATR_MOUSE RecName: Full=Cathepsin R; Flags: Precursor
gi|9623188|gb|AAF90051.1|AF245399_1 cathepsin R [Mus musculus]
gi|12837970|dbj|BAB24023.1| unnamed protein product [Mus musculus]
gi|12852278|dbj|BAB29345.1| unnamed protein product [Mus musculus]
gi|16445015|gb|AAK00507.1| cathepsin R precursor [Mus musculus]
gi|71682221|gb|AAI00339.1| Cathepsin R [Mus musculus]
gi|148709367|gb|EDL41313.1| cathepsin R [Mus musculus]
Length = 334
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 163/306 (53%), Gaps = 18/306 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y K + K++ W+ K I HN+E G +G+T++ N D + K M
Sbjct: 35 KYNKSYSLK-EEKLKRVVWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMI 93
Query: 76 RL---THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ TH + + R S ++P +DWR+KG++TP Q DC AC+AF++ AI+
Sbjct: 94 EISVWTHREGKSIMKREAGS----ILPKFVDWRKKGYVTPVRRQGDCDACWAFAVTGAIE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
Q T ++ LS+Q +VDCS GN GC GG N YV GGL E YPY+GK
Sbjct: 150 AQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPYEGKD 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ N +I+ + L PQ E L +AT+GPI I+AS +F+ Y GIY +
Sbjct: 210 GPCRYNPKNSKAEITGFVSL-PQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEP 268
Query: 253 ACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
C+SD V H +L+VGY W++KN W WG GYM L K NN CGIA+Y
Sbjct: 269 NCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHCGIASY 328
Query: 304 AVYALI 309
A Y I
Sbjct: 329 AHYPTI 334
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 116/302 (38%), Positives = 168/302 (55%), Gaps = 9/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y + D +++ W+ N I HN EA GL Y L NH+ DL ++
Sbjct: 39 KKSHGKTYPNEVEDVRRRELWERNLMLITKHNLEASMGLQTYDLSMNHMGDLTTEEIMQS 98
Query: 74 MTRLTH-SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
LT + I+R +P +P +DWR +G +T Q CG+C+AFS A A++
Sbjct: 99 YATLTPPADIQR--APAPFVGSGADVPVSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALE 156
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ K+T ++ +LS Q +VDCS+ GN GC GG + YV G+ E YPY+G+
Sbjct: 157 GQLAKTTGKLVDLSPQNLVDCSLKYGNKGCNGGFMDRAFQYVIDNKGIDSEASYPYRGQL 216
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + S +S LP DE ALK LAT+GPI+V+I+A+ TF Y SG+Y+D
Sbjct: 217 QQCSYNPSYRAANCSRYSFLPEGDEGALKNALATIGPISVAIDATRPTFAFYRSGVYNDP 276
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
CT VNH +L VGY S W++KN W +GD GY+ + R N++CGIA Y Y
Sbjct: 277 TCTQR-VNHGVLAVGYGTESGQDYWLVKNSWGTSFGDKGYIRMSRNKNDQCGIALYCSYP 335
Query: 308 LI 309
++
Sbjct: 336 IM 337
>gi|84028184|sp|Q9R014.2|CATJ_MOUSE RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
protein; AltName: Full=Cathepsin P; AltName:
Full=Catlrp-p; Flags: Precursor
gi|5306071|gb|AAD41898.1|AF158182_1 preprocathepsin P [Mus musculus]
gi|12838143|dbj|BAB24099.1| unnamed protein product [Mus musculus]
gi|74199838|dbj|BAE20748.1| unnamed protein product [Mus musculus]
gi|74355544|gb|AAI03770.1| Cathepsin J [Mus musculus]
gi|148709363|gb|EDL41309.1| cathepsin J, isoform CRA_a [Mus musculus]
Length = 334
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 172/306 (56%), Gaps = 15/306 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY K Y K ++ ++ W+ N + I HN+E G + +T++ N D + K
Sbjct: 33 KTKYAKSYSPK-EEALRRAVWEENMRMIKLHNKENSLGKNNFTMKMNKFGDQTSEEFRKS 91
Query: 74 MTRLTHSRIRRTLVRSPESNE-SVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ + I + N S+ +PD+ DWRE+G++TP NQ CG+C+AF+ A AI+
Sbjct: 92 IDNIP---IPAAMTDPHAQNHVSIGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIE 148
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+F T + LS+Q ++DCS GN GC G+ YV GL E YPY+GK
Sbjct: 149 GQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLEAEATYPYEGKD 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+++ N +I+ + LPP +E L V +A++GP++ +I+AS +F+ Y GIY +
Sbjct: 209 GPCRYRSENASANITDYVNLPP-NELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYYEP 267
Query: 253 ACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
C+S +VNHA+L+VGY N W++KN W WG NGYM + + NN CGIA+
Sbjct: 268 NCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGYMQIAKDHNNHCGIASL 327
Query: 304 AVYALI 309
A Y I
Sbjct: 328 ASYPNI 333
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 177/314 (56%), Gaps = 16/314 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ ++ WI ++ K Y D ++ + W+ N ++I HN + + L+ N
Sbjct: 22 VKDESWIQWKMYHNKVYSHD----GEETVRYTIWKDNERRIREHNLKGGD----FLLKMN 73
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+ + L+H + + +P + + PD +DWR +G++TP +Q CG
Sbjct: 74 QFGDMTNSEFKAFNGYLSHKHVNGSTFLTPNN---FVAPDTVDWRNEGYVTPVKDQGQCG 130
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ FK T ++ LS Q +VDCS GN GC GG + N Y++ G+
Sbjct: 131 SCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGI 190
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
E YPY + C FK+P++ + + LP +E+ LK +A+VGPI+V+I+AS +
Sbjct: 191 DSEASYPYTAEDGKCVFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHES 250
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR-GN 295
FQ Y+SG+Y++ +C+S ++H +L+VGY S W++KN W+ WGD GY+ ++R
Sbjct: 251 FQFYSSGVYNEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAK 310
Query: 296 NRCGIANYAVYALI 309
N+CGIA A Y L+
Sbjct: 311 NQCGIATKASYPLV 324
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 181/318 (56%), Gaps = 15/318 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ ++ N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+H + + + +++ L+ S +++++ +P +DWR ++ +Q +CG
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y++ GGL
Sbjct: 137 SCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGL 196
Query: 181 MKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
EE YPY CKF ++ + + + +EHALK +ATVGP++V+I+A
Sbjct: 197 DTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSVAIDAGHE 256
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMYLK 292
+FQ Y+SG+YD+ C+++ ++H +L+VGY NS WI+KN W +WGD GY+ +
Sbjct: 257 SFQFYSSGVYDEPQCSTEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNWGDQGYIMMS 316
Query: 293 RG-NNRCGIANYAVYALI 309
R NN+CGIA A Y L+
Sbjct: 317 RNKNNQCGIATSASYPLV 334
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 169/302 (55%), Gaps = 13/302 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K+Y + + + + N KI HN++ Y L N DL ++ TR
Sbjct: 59 KEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVS--TRNG 116
Query: 79 HSRIRRTLVRS------PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
R R+ R PE E +P +DWR+KG +TP NQ CG+C+AFS +++
Sbjct: 117 FKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 176
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ T + LS Q +VDCS GN GC GG + N Y++ GG+ E YPY G
Sbjct: 177 GQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTD 236
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
IC F++ ++ + + +P +E LK +ATVGP++V+I+AS +FQ Y+ G+YD+
Sbjct: 237 GICHFEKSDVGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEP 296
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+S+ ++H +L+VGY ++ W++KN W WGD+GY+Y+ R N+CGIA+ A Y
Sbjct: 297 ECSSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYP 356
Query: 308 LI 309
L+
Sbjct: 357 LV 358
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/293 (37%), Positives = 163/293 (55%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M + T + + +L
Sbjct: 49 RRMVWEKNLKKIEIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSL 108
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P + P +DWREKG++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 109 FMEPNY---LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLS 165
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y+Q GL EE YPY G + C +K +
Sbjct: 166 EQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAAN 225
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P EHA+ +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 226 ETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLV 285
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA + Y L+
Sbjct: 286 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|149755226|ref|XP_001494409.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 334
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/291 (40%), Positives = 166/291 (57%), Gaps = 15/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N + I HNQE QG HG+T+ N D+ + + M + + ++ V
Sbjct: 48 RRAVWEKNMRMIELHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKGRVFL 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E +P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFLE---VPKTVDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ-SICKFKRPNIVVDIS 207
+VDCS GN GC GG + N YV+ GGL EE YPY K+ + C +K P
Sbjct: 165 NLVDCSRAEGNQGCNGGLMDNAFQYVKDNGGLDSEESYPYLAKEGNNCNYK-PEYSAAND 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VG
Sbjct: 224 TGYVDIPQKEKALMKAVATVGPISVAIDAGHESFQFYKSGIYYDPDCSSKDLDHGVLVVG 283
Query: 268 Y---TRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y R+S WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 284 YGFEGRDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/300 (35%), Positives = 173/300 (57%), Gaps = 7/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N +DL P ++K M
Sbjct: 34 HKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHEFVKMMNG 93
Query: 77 LTHSRI--RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
R+ R + P + +P +DWR+KG +TP +Q CG+C+AFS +++GQ
Sbjct: 94 YQGKRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQ 153
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
F T ++ LS Q +VDCS GN GC GG + N+ NY++ GG+ E+ YPY+ +
Sbjct: 154 HFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAEDGD 213
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++K+ ++ + + + E L+ +ATVGP++V+I+AS +FQLY+ G+YD+ C
Sbjct: 214 CRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNC 273
Query: 255 TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+S+ ++H +L VGY + W++KN W+ WG +GY+ + R NN+CGIA+ A Y L+
Sbjct: 274 SSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDGYILMSRDKNNQCGIASSASYPLV 333
>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
Length = 336
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 182/320 (56%), Gaps = 17/320 (5%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ ++ N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFTFEKNTIKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTR--LTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQED 118
D+H + + + L ++ + L+ S +++++ +P +DWR ++ +Q +
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKVNKPLLGSEVGDNDDNGTLPKSVDWRNSAMVSEVKDQGE 136
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
CG+C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y++ G
Sbjct: 137 CGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANG 196
Query: 179 GLMKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GL EE YPY CKF ++ + + + +EHALK +ATVGPI+V+I+A
Sbjct: 197 GLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPISVAIDAG 256
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMY 290
+FQ Y+SG+YD+ C+S+ ++H +L+VGY NS WI+KN W +WGD GY+
Sbjct: 257 HESFQFYSSGVYDEPQCSSEQLDHGVLVVGYGAMNDNSHQAFWIVKNSWGPNWGDQGYIM 316
Query: 291 LKRG-NNRCGIANYAVYALI 309
+ R +N+CGIA A Y L+
Sbjct: 317 MSRNKDNQCGIATSASYPLV 336
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 177/311 (56%), Gaps = 14/311 (4%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F +K ++K+Y + +S +K + N K+I HN +QG + L+ NHL+D+
Sbjct: 27 FTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHE 86
Query: 70 Y------IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
Y + ++ +++++ P V + +DWR KG +TP NQ CG+C+
Sbjct: 87 YSDVYLGFNKSSKANNNKLQSYTFIPPAH---VTLNKEVDWRTKGAVTPVKNQGHCGSCW 143
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS A++GQ F+ T ++ LS Q +VDCS GN GC GG + N Y++ G+ E
Sbjct: 144 AFSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTE 203
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
+ YPY+G+ C+F++ +I S + + DE AL +AT+GPI+V+I+AS +FQ
Sbjct: 204 KSYPYEGEDETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQF 263
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRC 298
Y+ G+Y + C+S+ ++H +L+VGY + W++KN W WGD GY+ + R +N C
Sbjct: 264 YSEGVYYEPECSSENLDHGVLVVGYGVEDNQKYWLVKNSWGTQWGDGGYIKMARDQDNNC 323
Query: 299 GIANYAVYALI 309
GIA A Y L+
Sbjct: 324 GIATQASYPLV 334
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 112/290 (38%), Positives = 166/290 (57%), Gaps = 21/290 (7%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HN E G HG+T+ N D+ + + M + + + V
Sbjct: 172 RRAVWEKNMKMIEMHNHEYSNGKHGFTMGMNAFGDMTNEEFRQVMNGFRNQKQKSGKV-- 229
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
+ +L+ P +DWREKGF+TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 230 --FHAPLLLQAPKSVDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSE 287
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
Q +VDCS GNLGC GG + N Y++ GGL EE YPYKG C++K V + +
Sbjct: 288 QNLVDCSRRQGNLGCQGGLMDNAFQYIKDNGGLDSEESYPYKGMDGTCQYKAEWAVANDT 347
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ E AL +A+VGPI+V+I+A +FQ Y GIY + C+S+ ++H +L+VG
Sbjct: 348 GF-------EKALMKAVASVGPISVAIDAGHASFQFYKDGIYYEPDCSSENLDHGVLVVG 400
Query: 268 Y---TRNS----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y RNS W++KN W WG NGY+ + K NN CG+A+ A Y ++
Sbjct: 401 YGVEKRNSNDKYWLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASYPVV 450
>gi|149755237|ref|XP_001495795.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 339
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/288 (40%), Positives = 163/288 (56%), Gaps = 17/288 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL---THSRIRRTL 86
++ W+ N + I HNQE QG HG+T+ N D+ + + M L TH + R +
Sbjct: 48 RRAVWEKNMRMIELHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGLHNQTHKKGR--V 105
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
R P S E +P +DWR+KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 106 FREPLSAE---LPKSVDWRKKGYVTPVKNQGLCGSCWAFSATGALEGQMFRKTGKLVSLS 162
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
Q +VDCS GN GC+GG + YV+ GGL E+ YPY + CK+K P
Sbjct: 163 EQNLVDCSWAQGNEGCSGGLMDYAFQYVKDNGGLDSEKSYPYLAEDGFCKYK-PEYSAAN 221
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + Q E L +ATVGPI+ I+AS +FQ Y GIY D C+S Y++H +L+V
Sbjct: 222 DTGFLDIQQQEKFLMEAVATVGPISAGIDASLESFQFYKEGIYYDPDCSSKYLDHGVLVV 281
Query: 267 GY------TRNS-WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
GY +RN W++KN W WG NGY+ + K N CGIA A Y
Sbjct: 282 GYGFEGKDSRNKYWLVKNSWGEDWGMNGYIKMAKDRENHCGIATMASY 329
>gi|344258279|gb|EGW14383.1| Cathepsin L1 [Cricetulus griseus]
Length = 295
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/295 (35%), Positives = 169/295 (57%), Gaps = 11/295 (3%)
Query: 22 RKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THS 80
+ + + +K+ W++N K I HN++ +G HG+ L N DL + + MT +
Sbjct: 5 KSQDEEGQKRAVWENNRKMIELHNEDYTKGKHGFHLEMNAFGDLTNTEFRQLMTGFQSMG 64
Query: 81 RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
+ + P + +P +DWR+ G++TP +Q CGAC+AFS ++ GQ+F T
Sbjct: 65 TTEMNVFQEPRLGD---VPKSVDWRKHGYVTPVKDQGSCGACWAFSAVGSLVGQMFWKTG 121
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
++ LS Q +VDCS GN+GC GG ++N YV GGL E YPY+ + + C++
Sbjct: 122 KLVPLSEQNLVDCSWSHGNIGCHGGLMQNAFQYVMDNGGLDTSESYPYESRNTTCRYNPE 181
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
N +++ + V P +E++L +A VGPI+ +I+ H+FQ Y G+Y + C+S ++
Sbjct: 182 NSAANVTGF-VKIPANEYSLMKAVAIVGPISAAIDTKHHSFQFYRGGMYYEPECSSSNLD 240
Query: 261 HAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
HA+L+VGY S W++KN W +WG NGY+ + R NN CGIA YA+Y +
Sbjct: 241 HAVLVVGYGEESDGRKYWLVKNSWGTYWGMNGYIKMARDRNNNCGIATYAMYPTV 295
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 169/305 (55%), Gaps = 16/305 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ KDY +K + +++ W+ N KKI HN E G H Y L NH D+ + + M
Sbjct: 36 HNKDYHEK-EEGWRRMVWEKNLKKIELHNLEHSLGKHSYRLAMNHFGDMPHEEFRQVMNG 94
Query: 77 LTHS--RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
H +IR +L P E+ P LDWREKG++TP +Q CG+C+AFS A++GQ
Sbjct: 95 YKHKVRKIRGSLFMEPNFLEA---PSKLDWREKGYVTPVKDQGQCGSCWAFSTTGAMEGQ 151
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQS 193
F+ T ++ LS Q +VDCS GN GC GG + Y++ GGL E+ YPY G
Sbjct: 152 QFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGGLDTEKFYPYLGTDDQ 211
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C + + + + +P EHAL + VGP++V+I+A +FQ Y SGIY +
Sbjct: 212 PCHYDPSYSAANDTGFVDIPSGKEHALMKAVTAVGPVSVAIDAGHESFQFYQSGIYYEAD 271
Query: 254 CTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
C+S+ ++H +L+VGY + WI+KN WS WG+ GY+Y+ K +N CGIA A
Sbjct: 272 CSSEDLDHGVLVVGYGYEGENVDGKKYWIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAA 331
Query: 305 VYALI 309
Y L+
Sbjct: 332 SYPLV 336
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/293 (37%), Positives = 163/293 (55%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M + T + + +L
Sbjct: 49 RRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSL 108
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P + P +DWREKG++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 109 FMEPNY---LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLS 165
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y+Q GL EE YPY G + C +K +
Sbjct: 166 EQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGAN 225
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P EHA+ +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 226 ETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLV 285
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA + Y L+
Sbjct: 286 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 109/299 (36%), Positives = 169/299 (56%), Gaps = 7/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y +S++ ++ N +KI HN+ G Y L N SDL ++K
Sbjct: 63 HDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEFVK-YNG 121
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L + ++ S + +++ PD +DWR+KG++T NQ CG+C++FS +++GQ F
Sbjct: 122 LKKTSLKDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEGQHF 181
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ + ++ LS Q+VDCS GN GC GG + N Y++ GGL EEDYPYK KQ CK
Sbjct: 182 RKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPKQGTCK 241
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
F + + + E ALK ++ VGP++V+I+AS +FQ YA G+YD+ C+S
Sbjct: 242 FDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVYDEPECSS 301
Query: 257 DYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ ++H +L VGY + WI+KN W WG++GY+ + R N+CGIA A Y L+
Sbjct: 302 EQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRNKKNQCGIATQASYPLV 360
>gi|431897851|gb|ELK06685.1| Cathepsin L1 [Pteropus alecto]
Length = 331
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/290 (40%), Positives = 166/290 (57%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSR-IRRTLVR 88
++ W+ N K I HNQE +QG H +T+ N D+ + K M L + + + L +
Sbjct: 46 RRAVWEKNMKMIELHNQEHRQGKHSFTMAINAFGDMTNEEFRKLMNGLQNQKHWKGKLFQ 105
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E IP +DWR+KG++TP +Q CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 106 EPPFPE---IPPSVDWRQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQ 162
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + N YV+ GGL EE YPY + CK+K P S
Sbjct: 163 NLVDCSQSQGNEGCDGGLMDNAFQYVKDNGGLDSEESYPYLARDESCKYK-PEFSAANDS 221
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V + E +L +A+VGPI+V I+AS +FQ Y GIY + C+S+ +NH +L+VGY
Sbjct: 222 GFVDIHKQERSLMKAVASVGPISVGIDASYSSFQFYEKGIYYEPECSSEDLNHGVLVVGY 281
Query: 269 -------TRNS-WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+N WI+KN W +WG NGY+ + K NN CGIA A Y ++
Sbjct: 282 GFERAESNKNKYWIVKNSWGTNWGMNGYINMAKDQNNHCGIATAASYPIV 331
>gi|148709375|gb|EDL41321.1| mCG12216, isoform CRA_a [Mus musculus]
Length = 333
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 169/303 (55%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y + + K+ + W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KHGKPYSLEEEEQKRAV-WEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + + SV +P ++W+++G++TP Q C +C+A S+ AI+GQ+
Sbjct: 94 EIPVPTVKKG--KSVQKHLSVNLPKFINWKKRGYVTPVRTQGRCNSCWAISVTGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GN GC G+ L YV GGL E YPY+ K+ C
Sbjct: 152 FQKTGQLIPLSVQNLVDCSRPQGNRGCYVGNTYRALKYVVENGGLESEATYPYEEKEGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + + P++E AL +AT+GPI+V+I+A +F Y GIY + C+
Sbjct: 212 RYNPENSTASITGFDFV-PENEDALMNAVATIGPISVAIDARHESFLFYKRGIYHEPNCS 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
S V HAMLLVGY R WI+KN WG GYM + R N CGIA YA+Y
Sbjct: 271 SSVVTHAMLLVGYGFVGNESEGRKYWIVKNSMGTKWGSKGYMKIARDQGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 179/318 (56%), Gaps = 15/318 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ ++ N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+H + + + +++ L+ S +++++ +P +DWR ++ +Q +CG
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y++ GGL
Sbjct: 137 SCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGL 196
Query: 181 MKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
EE YPY CKF ++ + + + +EHALK +ATVGP++V+I+A
Sbjct: 197 DTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHE 256
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMYLK 292
+FQ Y+SG+YD+ C+++ ++H +L VGY NS WI+KN W WGD GY+ +
Sbjct: 257 SFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMS 316
Query: 293 RG-NNRCGIANYAVYALI 309
R NN+CGIA A Y L+
Sbjct: 317 RNKNNQCGIATSASYPLV 334
>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 179/318 (56%), Gaps = 15/318 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ ++ N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+H + + + +++ L+ S +++++ +P +DWR ++ +Q +CG
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y++ GGL
Sbjct: 137 SCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGL 196
Query: 181 MKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
EE YPY CKF ++ + + + +EHALK +ATVGP++V+I+A
Sbjct: 197 DTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHE 256
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMYLK 292
+FQ Y+SG+YD+ C+++ ++H +L VGY NS WI+KN W WGD GY+ +
Sbjct: 257 SFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMS 316
Query: 293 RG-NNRCGIANYAVYALI 309
R NN+CGIA A Y L+
Sbjct: 317 RNKNNQCGIATSASYPLV 334
>gi|7770062|ref|NP_036137.1| cathepsin J precursor [Mus musculus]
gi|6467374|gb|AAF13142.1|AF136272_1 cathepsin J precursor [Mus musculus]
gi|15418834|gb|AAK58455.1| cathepsin J [Mus musculus]
gi|148709364|gb|EDL41310.1| cathepsin J, isoform CRA_b [Mus musculus]
Length = 333
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 112/306 (36%), Positives = 172/306 (56%), Gaps = 16/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY K Y + ++ ++ W+ N + I HN+E G + +T++ N D + K
Sbjct: 33 KTKYAKSYSPE--EALRRAVWEENMRMIKLHNKENSLGKNNFTMKMNKFGDQTSEEFRKS 90
Query: 74 MTRLTHSRIRRTLVRSPESNE-SVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ + I + N S+ +PD+ DWRE+G++TP NQ CG+C+AF+ A AI+
Sbjct: 91 IDNIP---IPAAMTDPHAQNHVSIGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIE 147
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+F T + LS+Q ++DCS GN GC G+ YV GL E YPY+GK
Sbjct: 148 GQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLEAEATYPYEGKD 207
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+++ N +I+ + LPP +E L V +A++GP++ +I+AS +F+ Y GIY +
Sbjct: 208 GPCRYRSENASANITDYVNLPP-NELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYYEP 266
Query: 253 ACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
C+S +VNHA+L+VGY N W++KN W WG NGYM + + NN CGIA+
Sbjct: 267 NCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGYMQIAKDHNNHCGIASL 326
Query: 304 AVYALI 309
A Y I
Sbjct: 327 ASYPNI 332
>gi|224809458|ref|NP_001019580.2| cathepsin S, b.1 precursor [Danio rerio]
gi|63101450|gb|AAH95788.1| Cathepsin S, b.1 [Danio rerio]
gi|77748418|gb|AAI07613.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 177/303 (58%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y + + ++ W+ N + I HN EA G+H Y L NH+ DL ++
Sbjct: 31 KKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMNHMGDLTTEEILQT 90
Query: 74 MTRLTH--SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ LTH S +R + S+ +PD LDWREKG+++ Q CG+C+AFS A+
Sbjct: 91 LA-LTHVPSGFKRQIANIVGSSGDA-VPDSLDWREKGYVSSVKMQGACGSCWAFSSVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K+T ++ +LS Q +VDCS GN GC GG + + YV GG+ + YPY+G
Sbjct: 149 EGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGGIASDSAYPYRGV 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
Q C + + + + + DE+ALK +A+VGPI+V+I+A+ F LY SG+Y+D
Sbjct: 209 QQQCSYSSSQRAANCTKYYFVRQGDENALKQAVASVGPISVAIDATRPQFVLYHSGVYND 268
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C S VNHA+L+VGY ++ W++KN W +GD GY+ + R NN CGIA+YA Y
Sbjct: 269 PTC-SKRVNHAVLVVGYGTLSGQDYWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACY 327
Query: 307 ALI 309
++
Sbjct: 328 PVM 330
>gi|148669360|gb|EDL01307.1| mCG12220, isoform CRA_b [Mus musculus]
Length = 335
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 171/303 (56%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 37 KYGKAYSLEE-EGQKRAVWEDNMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMI 95
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+AFS+ AI+GQ+
Sbjct: 96 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQM 153
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GN GC G+ L+YV GGL E YPY+ K C
Sbjct: 154 FRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGSC 213
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N +I+ + + P++E AL +A++GPI+V+I+A +F Y GIY + C+
Sbjct: 214 RYSPENSTANITGFEFV-PKNEDALMNAVASIGPISVAIDARHASFLFYKRGIYYEPNCS 272
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
S V H+MLLVGY R W++KN WG+ GYM + R N CGIA YA+Y
Sbjct: 273 SSVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 332
Query: 307 ALI 309
+
Sbjct: 333 PRV 335
>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 179/318 (56%), Gaps = 15/318 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ ++ N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+H + + + +++ L+ S +++++ +P +DWR ++ +Q +CG
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y++ GGL
Sbjct: 137 SCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGL 196
Query: 181 MKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
EE YPY CKF ++ + + + +EHALK +ATVGP++V+I+A
Sbjct: 197 DTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHE 256
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMYLK 292
+FQ Y+SG+YD+ C+++ ++H +L VGY NS WI+KN W WGD GY+ +
Sbjct: 257 SFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMS 316
Query: 293 RG-NNRCGIANYAVYALI 309
R NN+CGIA A Y L+
Sbjct: 317 RNKNNQCGIATSASYPLV 334
>gi|31981229|ref|NP_071721.2| cathepsin M precursor [Mus musculus]
gi|12837968|dbj|BAB24022.1| unnamed protein product [Mus musculus]
gi|12838184|dbj|BAB24116.1| unnamed protein product [Mus musculus]
gi|16445013|gb|AAK00506.1| cathepsin M precursor [Mus musculus]
gi|148669359|gb|EDL01306.1| mCG12220, isoform CRA_a [Mus musculus]
gi|148921940|gb|AAI46421.1| Cathepsin M [synthetic construct]
gi|151555373|gb|AAI48858.1| Cathepsin M [synthetic construct]
Length = 333
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 171/303 (56%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KYGKAYSLEE-EGQKRAVWEDNMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+AFS+ AI+GQ+
Sbjct: 94 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GN GC G+ L+YV GGL E YPY+ K C
Sbjct: 152 FRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N +I+ + + P++E AL +A++GPI+V+I+A +F Y GIY + C+
Sbjct: 212 RYSPENSTANITGFEFV-PKNEDALMNAVASIGPISVAIDARHASFLFYKRGIYYEPNCS 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
S V H+MLLVGY R W++KN WG+ GYM + R N CGIA YA+Y
Sbjct: 271 SSVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
Length = 334
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 113/291 (38%), Positives = 165/291 (56%), Gaps = 15/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HNQE QG HG+++ N D+ + + M + + ++ V
Sbjct: 48 RRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKVF- 106
Query: 90 PESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
+ES++ +P +DWREKG++T NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 107 ---HESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSE 163
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
Q +VDCS GN GC GG + N YV+ GGL EE YPY G+++ +P
Sbjct: 164 QNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAAND 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VG
Sbjct: 224 TGFVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVG 283
Query: 268 Y--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y + WI+KN W WG NGY+ + K NN CGI+ A Y +
Sbjct: 284 YGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>gi|348513249|ref|XP_003444155.1| PREDICTED: cathepsin K-like [Oreochromis niloticus]
Length = 330
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 110/301 (36%), Positives = 167/301 (55%), Gaps = 5/301 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y + ++ W+ N + HNQEA G H +TL NHL+D+ ++
Sbjct: 30 KTKHGKVYDNQTEIDFRRAVWEKNVHLVLRHNQEASAGKHSFTLGLNHLADMTAEEINEK 89
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+ L + E +P ++DWR++G + P NQ CG+C+AFS A++G
Sbjct: 90 LNGLKLEETVNFTNGTFEDVSDSPLPVNVDWRKEGLVGPVRNQGLCGSCWAFSSLGALEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+ K T + LS Q +VDCS GNLGC GG + +YV GG+ E YPY+ K
Sbjct: 150 QLKKRTGTLVSLSPQNLVDCSTQDGNLGCRGGYITKAYSYVIRNGGVDSESFYPYEHKNG 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ S +S+LP DE L+ LA+VGPI+V++NA +F +Y+ G+Y+ +
Sbjct: 210 KCRYSVQGRAGYCSKFSILPEGDEKMLQKVLASVGPISVAVNAMLESFHMYSGGLYNVPS 269
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C +NHA+LLVGY ++ W++KN W WG+ GY+ L R NN CGIA++ VY
Sbjct: 270 CNPKLINHAVLLVGYGTDAGQDYWLVKNSWGTAWGEGGYIRLARNKNNLCGIASFPVYPT 329
Query: 309 I 309
+
Sbjct: 330 V 330
>gi|47076309|emb|CAD89795.1| putative cathepsin L protease [Meloidogyne incognita]
Length = 383
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 176/310 (56%), Gaps = 15/310 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
++K+ K Y + D+++ L + S + I H ++ +G + + ENH++D+ Y K
Sbjct: 75 KEKHGKSYPNQDEDNERMLAYLSAKQFIEKHQRDYTEGRVSFQVGENHMADVPFNQYRKL 134
Query: 73 -EMTRLTHSRIRRTLVRS---PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
RL + R S P N IP+ +DWR+KG +T NQ CG+C+AFS
Sbjct: 135 NGFKRLLGDAVTRKNASSTFLPPLN-MYAIPESVDWRDKGLVTSVKNQGMCGSCWAFSAT 193
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ + + LS Q ++DC+ GN+GC GG + N Y++ G+ E Y
Sbjct: 194 GALEGQHSRKLGTLVSLSEQNLIDCTKGEPYGNMGCNGGLMDNAFQYIEDNKGVDTENSY 253
Query: 187 PYKGKQSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
PYK K C FKR N+ + + LP DE LK+ +AT GPI+V+I+A +FQLYA
Sbjct: 254 PYKAKNGKKCLFKRSNVGATDTGYVDLPSGDEDKLKIAVATQGPISVAIDAGHRSFQLYA 313
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
G+YD+EAC+ D + H +L+VGY + W++KN W HWG+NGY+ + R +N+CG
Sbjct: 314 HGVYDEEACSPDNLGHGVLVVGYGTDDIHGDYWLVKNSWGEHWGENGYIRMSRNKDNQCG 373
Query: 300 IANYAVYALI 309
IA+ A Y L+
Sbjct: 374 IASKASYPLV 383
>gi|63101996|gb|AAH95694.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 117/303 (38%), Positives = 177/303 (58%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y + + ++ W+ N + I HN EA G+H Y L NH+ DL ++
Sbjct: 31 KKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMNHMGDLTTEEILQT 90
Query: 74 MTRLTH--SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ LTH S +R + S+ +PD LDWREKG+++ Q CG+C+AFS A+
Sbjct: 91 LA-LTHVPSGFKRQIANIVGSSGDA-VPDSLDWREKGYVSSVKMQGACGSCWAFSSVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+ K+T ++ +LS Q +VDCS GN GC GG + + YV GG+ + YPY+G
Sbjct: 149 EGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGGIASDSAYPYRGV 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
Q C + + + + + DE+ALK +A+VGPI+V+I+A+ F LY SG+Y+D
Sbjct: 209 QQQCSYSSSQRAANCTKYYFVRQGDENALKQAVASVGPISVAIDATRPQFVLYHSGVYND 268
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C S VNHA+L+VGY ++ W++KN W +GD GY+ + R NN CGIA+YA Y
Sbjct: 269 PTC-SKRVNHAVLVVGYGTLSGQDHWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACY 327
Query: 307 ALI 309
++
Sbjct: 328 PVM 330
>gi|27960487|gb|AAO27847.1|AF456463_1 cathepsin M [Mus musculus]
gi|16323039|gb|AAL15416.1| cathepsin M [Mus musculus]
Length = 333
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 171/303 (56%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KYGKAYSLE-EEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+AFS+ AI+GQ+
Sbjct: 94 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPVRTQGRCNSCWAFSVTGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T + LS+Q +VDCS GN GC G+ L+YV GGL E YPY+ K+ C
Sbjct: 152 FRKTGPLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEATYPYEEKEGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N +I+ + + P++E AL +A++GPI+V+I+A +F Y GIY + C+
Sbjct: 212 RYSPENSTANITGFEFV-PKNEDALMNAVASIGPISVAIDARHASFLFYKRGIYYEPNCS 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
S V H+MLLVGY R W++KN WG+ GYM + R N CGIA YA+Y
Sbjct: 271 SSVVTHSMLLVGYGFAGRESDGRKYWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 177/318 (55%), Gaps = 15/318 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ ++ N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+H + + + +++ L+ S +S+++ +P +DWR ++ +Q +CG
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDSDDNGTLPKSVDWRNSHMVSEVKDQGECG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y+ GGL
Sbjct: 137 PCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIPANGGL 196
Query: 181 MKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
EE YPY CKF ++ + + + +EHALK +ATVGP++V+I+A
Sbjct: 197 DTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHE 256
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMYLK 292
+FQ Y+SG+YD+ C+++ ++H +L VGY NS WI+KN W WGD GY+ +
Sbjct: 257 SFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMS 316
Query: 293 RG-NNRCGIANYAVYALI 309
R NN+CGIA A Y L+
Sbjct: 317 RNKNNQCGIATSASYPLV 334
>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 178/318 (55%), Gaps = 15/318 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ ++ N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+H + + + +++ L+ S +++++ +P +DWR ++ +Q +CG
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y+ GGL
Sbjct: 137 SCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITANGGL 196
Query: 181 MKEEDYPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
EE YPY CKF ++ + + + +EHALK +ATVGP++V+I+A
Sbjct: 197 DTEESYPYTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHE 256
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMYLK 292
+FQ Y+SG+YD+ C+++ ++H +L VGY NS WI+KN W WGD GY+ +
Sbjct: 257 SFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMS 316
Query: 293 RG-NNRCGIANYAVYALI 309
R NN+CGIA A Y L+
Sbjct: 317 RNKNNQCGIATSASYPLV 334
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 109/293 (37%), Positives = 163/293 (55%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTL 86
+++ W+ N KKI HN E G H + L NH D+ + + M + T + + +L
Sbjct: 49 RRMVWEKNLKKIEIHNLEHTMGKHSHRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSL 108
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P + P +DWREKG++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 109 FMEPNY---LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLS 165
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y+Q GL EE YPY G + C +K +
Sbjct: 166 EQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAAN 225
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P EHA+ +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 226 ETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLV 285
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA + Y L+
Sbjct: 286 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|463046|gb|AAA49207.1| cysteine proteinase [Cyprinus carpio]
Length = 331
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 108/293 (36%), Positives = 165/293 (56%), Gaps = 6/293 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y K + ++ W+ N I HN E GLH Y L NH+ D+ ++
Sbjct: 31 KKTHNKFYSSKDEELGRRELWERNLGLITLHNLEDLHGLHSYDLGMNHMGDMTTEEILQT 90
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+ + + + IPD LDWREKG+++ NQ CG+C+AFS A++G
Sbjct: 91 LATIRVPPGFKRQTAEFVGSSGAAIPDSLDWREKGYVSSVKNQGACGSCWAFSSVGALEG 150
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+ K+T ++ +LS Q +VDCS GN GC GG + YV GG+ E YPY+G Q
Sbjct: 151 QLMKTTGKLVDLSPQNLVDCSSSYGNYGCGGGLMSAAFQYVIDNGGIDSESSYPYEGVQG 210
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ + + + + + DE ALK +A +GPI+V+I+A+ F LY SG+Y+D +
Sbjct: 211 QCRYNPSQLAANCTKYYYVRQGDEEALKQAVANIGPISVAIDATHPQFILYRSGVYNDPS 270
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
CT++ +NHA+L VGY ++ W++KN W +GD GY+ + R NN CGIA
Sbjct: 271 CTTN-INHAVLAVGYGAIAGQDFWLVKNSWGTGFGDGGYIRMARNQNNMCGIA 322
>gi|348531521|ref|XP_003453257.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 333
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 111/306 (36%), Positives = 174/306 (56%), Gaps = 16/306 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K++K Y + ++ +K W +N K + HN A QGL Y L +D+ Y + ++
Sbjct: 32 KFEKSYDSDSEEAHRKQIWLNNRKLVLVHNILADQGLKSYRLGMTQFADMENEEYKRLVS 91
Query: 76 RLTHSRIR-------RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R T +R PE + +PD +DWR+KG++T NQ CG+C+AFS
Sbjct: 92 RGCLGSFNTSLHHRGSTFLRLPEGTD---LPDTVDWRDKGYVTDVQNQMQCGSCWAFSAI 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F+ T ++ LS QQ+VDCS GN GC GG + Y+Q GG+ E YPY
Sbjct: 149 GALEGQNFRKTGKLVSLSKQQLVDCSQSFGNHGCNGGWMDWAFKYIQATGGIDTEASYPY 208
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ ++ C + + + + + P +E ALK +AT+GPI+++++AS +FQ Y SG+
Sbjct: 209 EAEEGNCHYNPETVGATCTGYVDVSP-NEDALKEAVATIGPISIAMDASHESFQFYQSGV 267
Query: 249 YDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
YD+ +C + +HAML VGY T N W++KN + WG+ GY+ + R +N+CGIA+
Sbjct: 268 YDEPSCITSRFSHAMLAVGYGTENGHDYWLVKNSFGLGWGEKGYIKMSRNKSNQCGIASK 327
Query: 304 AVYALI 309
A Y L+
Sbjct: 328 ASYPLV 333
>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 178/318 (55%), Gaps = 15/318 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
NKEW + + ++ K Y +A + ++ + N KI HN A G+H YTL N
Sbjct: 21 NKEWEMWKL----QHGKQYETEAEEYSRRFILEKNTVKIAEHNIRASLGMHSYTLAMNKF 76
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+H + + + +++ L+ S +++++ +P +DWR ++ +Q +CG
Sbjct: 77 GDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQGECG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ T ++ +LS QQ+VDCS GN GC GG + Y++ GGL
Sbjct: 137 SCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGL 196
Query: 181 MKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
EE YPY CKF ++ + + + +EHALK +ATVGP++V+I+A
Sbjct: 197 DTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHE 256
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY---TRNS----WILKNWWSHHWGDNGYMYLK 292
+FQ Y+SG+YD+ C+++ ++H +L VGY NS WI+KN W WGD GY+ +
Sbjct: 257 SFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMS 316
Query: 293 RG-NNRCGIANYAVYALI 309
R NN+CGIA A Y L+
Sbjct: 317 RNKNNQCGIATSASYPLV 334
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/282 (39%), Positives = 163/282 (57%), Gaps = 17/282 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNES----- 95
I HN+E + G + + N ++DL Y K L R+RR S +SN +
Sbjct: 78 IEEHNKEHRLGRKTFEMGLNEIADLPFSQYRK----LNGYRMRRQFGDSMQSNGTKFLVP 133
Query: 96 --VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
V IP+ +DWRE+G +TP NQ CG+C+AFS A++GQ ++T ++ LS Q +VDC
Sbjct: 134 FNVQIPESVDWREEGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDC 193
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ E+ YPY G+++ C FKR + D + LP
Sbjct: 194 STKYGNHGCNGGLMDLAFEYIKENHGVDTEDSYPYVGRETKCHFKRNTVGADDKGFVDLP 253
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
DE ALK +AT GPI+++I+A +FQLY G+Y DE C+S+ ++H +LLVGY +
Sbjct: 254 EGDEEALKKAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPE 313
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W WG+ GY+ + R NN CG+A A Y L+
Sbjct: 314 AGDYWLVKNSWGPTWGEKGYIRIARNRNNHCGVATKASYPLV 355
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 170/304 (55%), Gaps = 24/304 (7%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y K T+ ++++ W+SN K + HN + + G+T+ N +DL +
Sbjct: 30 KYNKVYETKETELERQIIWESNKKFVENHNANSDK--FGFTVAMNEFADLDAGEF----- 82
Query: 76 RLTHSRIRRTLVRSPESNES--------VLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
RI L+ P S S V +PD +DW+EKG +TP NQ CG+C++FS
Sbjct: 83 ----GRIFNGLLPRPSSYNSTNIYKPSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFSS 138
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ F +T + LS QQ++DCS GN GC GG + N+ Y++ G E++YP
Sbjct: 139 TGSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYP 198
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + +C++ VV S+ +P DE +LK +A VGPI+V+I+AS +FQLY SG
Sbjct: 199 YTAENGVCRYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYNSG 258
Query: 248 IYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+Y C+S ++H +L +GY ++ W++KN W WG GY+ + R NN CGIA
Sbjct: 259 VYYASTCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGMEGYIKMSRNRNNNCGIAT 318
Query: 303 YAVY 306
A Y
Sbjct: 319 QASY 322
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 166/298 (55%), Gaps = 9/298 (3%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y+ +A ++K+ ++ N +KI HN E +QG+H YT N +D+ + +
Sbjct: 32 KHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLA 91
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
++ ++ + + V +P+ +DWR + +TP +Q CG+C+AF++ + +G
Sbjct: 92 TQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAY 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
ST ++ S QQ+VDC+ N GC GG L +T Y+Q GL E DYPY G C
Sbjct: 152 ALSTGKLTRFSEQQLVDCT-TDLNYGCDGGYLDDTFPYIQ-TNGLELESDYPYTGYDGYC 209
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ +V +SS+ V P +E AL + T GP+A++INA Q Y SGI DD+ C
Sbjct: 210 SYESSKVVTKVSSY-VSVPANEQALLEAVGTAGPVAIAINADD--LQFYFSGIIDDKYCD 266
Query: 256 SDYVNHAMLLVGYT----RNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+Y++H +L VGY R+ W++KN W WG++GY RG N CG+ AVY LI
Sbjct: 267 PEYLDHGVLAVGYDSENGRDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPLI 324
>gi|327289219|ref|XP_003229322.1| PREDICTED: cathepsin K-like, partial [Anolis carolinensis]
Length = 289
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/285 (39%), Positives = 159/285 (55%), Gaps = 20/285 (7%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W+ N K I+THN E G H + L NHL D+ +++MT L + L R P SN
Sbjct: 10 WEKNLKYINTHNLEFSLGRHTFELAMNHLGDMTSEELVQKMTGL-----KVPLSRKP-SN 63
Query: 94 ESVLIPD-------HLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
+++ IPD +D+R+KG++TP NQ CG+C+AFS A++ Q+ T ++ LS
Sbjct: 64 DTLYIPDWEERVPDAVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEAQLKMKTGKLLNLS 123
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
Q +VDC +S N GC GG + N YV G+ ++ YPY G+ C +
Sbjct: 124 PQNLVDC--VSNNDGCGGGYMTNAFEYVHVNRGIDSDDTYPYIGQDENCMYNPTGKAAKC 181
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ +P DE ALK +A GP++V I+AS +FQ Y+ G+Y DE C +D +NHA+L V
Sbjct: 182 RGYKEIPEGDEKALKRAVARKGPVSVGIDASLASFQFYSRGVYYDENCNADNINHAVLAV 241
Query: 267 GYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
GY WI+KN W WGD GY+ + R NN CGIAN A +
Sbjct: 242 GYGSQKGTKHWIVKNSWGEDWGDKGYILMARNMNNACGIANLASF 286
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 164/286 (57%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVRSPES 92
W+ N K I HN E QG H +T+ N D+ + + M L + ++ + ++P
Sbjct: 52 WKKNMKMIRQHNWEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQMQKHKKGKMFQAPLF 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG++TP +Q CG+C+AFS A++GQ+F+ T ++ LS Q +VD
Sbjct: 112 AK---IPSSVDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + N YV+ GGL EE YPY + CK+K + + + + +
Sbjct: 169 CSQAEGNEGCNGGLMNNAFQYVKDNGGLDSEESYPYHAQDESCKYKPQDSAANDTGFFDI 228
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY---- 268
PQ E AL V +AT GPI+V I+AS TFQ Y GIY D C+S+ ++H +L++GY
Sbjct: 229 -PQQEKALMVAVATKGPISVGIDASHFTFQFYHEGIYYDPDCSSEDLDHGVLVIGYGTEI 287
Query: 269 ----TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ WI+KN W +WG +GY+ + K N CGIA A + ++
Sbjct: 288 GQSINKTYWIVKNSWGANWGIDGYIKMAKDRKNHCGIATMASFPVV 333
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/298 (34%), Positives = 168/298 (56%), Gaps = 5/298 (1%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y + +++L W+ N K I+ HN + G+ L N D+ M
Sbjct: 39 HTKSYTNDMHELERRLVWEENVKMINMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNG 98
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
S + + + + ++ +PD +DWR KG++TP NQ CG+C+AFS +++GQ F
Sbjct: 99 YKSSNVTKVQGSTFLTPSNIQVPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTF 158
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K TS++ LS Q +VDCS GN+GC GG + YV G+ E+ YPY + C
Sbjct: 159 KKTSKLVSLSEQNLVDCSRTEGNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCH 218
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+K +++ ++ + DE AL +A+VGP++V+I+AS +FQLY SG+YD+ C+S
Sbjct: 219 YKASCDSAEVTGFTDVTSGDEQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSS 278
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++H +L+VGY ++ W++KN W WG +GY+ + R +N+CGIA A Y L+
Sbjct: 279 SELDHGVLVVGYGTDGGKDYWLVKNSWGETWGLSGYIKMSRNKSNQCGIATSASYPLV 336
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 164/294 (55%), Gaps = 22/294 (7%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HN+E QG H + L N DL + + M L R
Sbjct: 48 RRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGLKIQNPR------ 101
Query: 90 PESNESVLIP-----DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
E N L+P +DWREKG++TP +Q CG+C+AFS A++GQ+F+ T ++
Sbjct: 102 -EGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVS 160
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
LS Q +VDCS GN GC GG + N YV+ GGL EE YPY + CK+K
Sbjct: 161 LSEQNLVDCSRAEGNAGCNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQSAA 220
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+ + ++ + QDE +L +++ATVGPI+V+I+AS TF+ Y GIY D C+S+ ++H +L
Sbjct: 221 NDTGFADI-HQDEESLMLSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSEDLDHGVL 279
Query: 265 LVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+VGY +N WI+KN W WG GY+ + K N CGIA A + ++
Sbjct: 280 VVGYGSDEREAENKNYWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASFPIV 333
>gi|405958752|gb|EKC24846.1| Cathepsin L1 [Crassostrea gigas]
Length = 290
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 107/284 (37%), Positives = 167/284 (58%), Gaps = 11/284 (3%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W++N I+ HN E Q+G H YTL N +DL ++ R R ++ P+++
Sbjct: 9 WEANLDYINQHNDEFQRGAHSYTLGLNEFADLSHEEFLHLYGGGI--RPRDSVSSDPDTD 66
Query: 94 ---ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
++ +P +DWR++G++ P NQ CG+C+AF+ A++GQ+ T ++ LS+QQ+
Sbjct: 67 IVVDTSGLPLEVDWRKEGWVGPIGNQFACGSCWAFTATGALEGQVRNKTGKLIVLSVQQM 126
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWS 210
+DCS GN GC GG + Y+ GG+ YPYK + CKF + +V + +
Sbjct: 127 MDCSEKWGNHGCEGGLMDAAFKYIHDVGGIESNASYPYKPAEEKCKFNKSAVVAKVKGYK 186
Query: 211 VLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-- 268
L P+ E +L V +ATVGPI+ +++AS +FQLY SG+YD+ C+S V+H++++VGY
Sbjct: 187 DL-PKSEESLMVAVATVGPISAALDASHSSFQLYKSGVYDEPNCSSGQVDHSLVVVGYGL 245
Query: 269 --TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ WI KN W WGD GY+ L K NN+CGIAN Y ++
Sbjct: 246 MDGKKYWIAKNSWGTSWGDKGYILLSKDKNNQCGIANTLSYPIL 289
>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
Length = 333
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/299 (36%), Positives = 164/299 (54%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y K YR + + W+ N I+ HN +A QG H Y L N DL Y + T
Sbjct: 37 YNKTYRAHEEPVRYSV-WKDNFLAINRHNSKADQGFHTYWLAMNEYGDLTNEEYFRLRTG 95
Query: 77 L-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
L ++ I R + +N S P +DWR KG++TP NQ CG+CYAFS A++GQ
Sbjct: 96 LKINANIERRGLVFKYTNLSEY-PSEVDWRSKGYVTPVKNQGGCGSCYAFSATGAVEGQH 154
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS Q +VDCS GN GC GG + + Y++ G+ EE YPY+ + C
Sbjct: 155 FRKTGKLVSLSEQNIVDCSFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEAYPYEARDGPC 214
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+F+R + + + LP DE AL+ + T+GPI+V+I+ F+ Y G++D+ C+
Sbjct: 215 RFRRSEVGATVRGYVDLPENDEIALQHAVTTIGPISVAIDGHHFNFRFYHHGVFDNPNCS 274
Query: 256 SDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NH +L+VGY TR+ W++KN W WG GY+ + R N N+C I A Y ++
Sbjct: 275 KTKINHGVLVVGYGTRDGLDYWLVKNSWGERWGAEGYILMSRNNDNQCCITCAASYPIV 333
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 163/290 (56%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIEQHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY-NPKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+++A +FQ Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|390994425|gb|AFM37362.1| cathepsin L2 [Dictyocaulus viviparus]
Length = 352
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/282 (39%), Positives = 160/282 (56%), Gaps = 17/282 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNES----- 95
I HN E + G + + N+++DL E +L R RR S N +
Sbjct: 75 IEEHNHEHRLGRKTFEMGLNNIADLP----FSEYRKLNGYRHRRLFGDSMRKNGTKFLVP 130
Query: 96 --VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
V +PD +DWRE +TP NQ CG+C+AFS A++GQ F++T ++ LS Q +VDC
Sbjct: 131 FNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDC 190
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ EE YPY GK+ C FK+ +I + + LP
Sbjct: 191 STKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIGAEDRGFVDLP 250
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
DE ALKV +AT GPI+++I+A +FQLY G+Y DE C+S+ ++H +LLVGY +
Sbjct: 251 EGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPE 310
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
WI+KN W WG+ GY+ + R NN CG+A A Y L+
Sbjct: 311 AGDYWIIKNSWGTKWGEKGYVRIARNRNNHCGVATKASYPLV 352
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 164/294 (55%), Gaps = 22/294 (7%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HN+E QG H + L N DL + + M L R
Sbjct: 48 RRAVWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGLKIQNPR------ 101
Query: 90 PESNESVLIP-----DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
E N L+P +DWREKG++TP +Q CG+C+AFS A++GQ+F+ T ++
Sbjct: 102 -EGNMFQLLPFAETPSSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVS 160
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
LS Q +VDCS GN GC GG + N YV+ GGL EE YPY + CK+K
Sbjct: 161 LSEQNLVDCSRAEGNAGCNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQSAA 220
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+ + ++ + QDE +L +++ATVGPI+V+I+AS TF+ Y GIY D C+S+ ++H +L
Sbjct: 221 NDTGFADI-HQDEESLMLSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSEDLDHGVL 279
Query: 265 LVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+VGY +N WI+KN W WG GY+ + K N CGIA A + ++
Sbjct: 280 VVGYGSDEREAENKNYWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASFPIV 333
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 59/111 (53%), Gaps = 9/111 (8%)
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
RP + V PQ E A+ + +A GP++ +I AS +FQ GIY D C+S+
Sbjct: 386 RPECSAADVTGPVNVPQQEEAVMLAVAAGGPVSAAIRASLGSFQFCKEGIYYDPNCSSED 445
Query: 259 VNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
++H +L+VGY +N WI+KN W WG GYM L R +N C I
Sbjct: 446 LDHGVLVVGYGSDEREAENKNYWIVKNSWGTDWGLQGYMLLVRDWDNHCEI 496
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/276 (37%), Positives = 160/276 (57%), Gaps = 7/276 (2%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT--RLTHSRIRRTLVRSPESNESVLI 98
+ HN + +GL Y L N DL P + K + R ++ +R P + +
Sbjct: 58 VAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSL 117
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWR+KG +TP NQ CG+C+AFS +++GQ F+ T ++ LS Q +VDCS G
Sbjct: 118 PTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFG 177
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N Y++ GG+ EE +PY + CKFK+ ++ + + + E
Sbjct: 178 NQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQDGDCKFKKADVGATDAGFVDIQQGSED 237
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
LK +ATVGP++V+I+AS +FQLY+ G+YD+ C+S ++H +L VGY + W+
Sbjct: 238 DLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGVKNGKKYWL 297
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+KN W WGDNGY+ + R +N+CGIA+ A Y L+
Sbjct: 298 VKNSWGGDWGDNGYILMSRDKDNQCGIASSASYPLV 333
>gi|22653680|sp|Q9JL96.1|CATM_MOUSE RecName: Full=Cathepsin M; Flags: Precursor
gi|7715970|gb|AAF68224.1|AF202528_1 cathepsin M [Mus musculus]
Length = 333
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 171/303 (56%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + K M
Sbjct: 35 KYGKAYSLE-EEGQKRAVWEDNMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+AFS+ AI+GQ+
Sbjct: 94 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDCS GN GC G+ L+YV GGL E YPY+ K C
Sbjct: 152 FRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGSC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N +I+ + + P++E AL +A++GPI+V+I+A +F Y GIY + C+
Sbjct: 212 RYSPENSTANITGFEFV-PKNEDALMNAVASIGPISVAIDARHASFLFYKRGIYYEPNCS 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
S V H+MLLVGY R W++KN WG+ GYM + R N CGIA YA+Y
Sbjct: 271 SCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 169/303 (55%), Gaps = 8/303 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY--I 71
+ KY K Y + + +K W+SN + + HN A QG Y L N +DL+ + +
Sbjct: 23 KGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEFMAL 82
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
K + + ++ + + ++ + V +P +DWR +G++TP +Q CG+C++FS ++
Sbjct: 83 KGSSGILQAKDQSS-TQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFSATGSL 141
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F T + LS QQ+VDCS GN GC+GG + + +Y++ AGG+ E YPY +
Sbjct: 142 EGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAYPYTAQ 201
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C F + V + +P DE +L + TVGP+AV+I+AS + FQLY SG+YD
Sbjct: 202 NGRCHFDQSKAVATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYESGVYDR 261
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C+S ++H +L GY W++KN W WG GY+ + R +N+CGIA A Y
Sbjct: 262 SRCSSSSLDHGVLAAGYGTEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQCGIATMACY 321
Query: 307 ALI 309
L+
Sbjct: 322 PLV 324
>gi|21483188|gb|AAK77918.1| cathepsin L 1 [Dictyocaulus viviparus]
Length = 347
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 112/282 (39%), Positives = 160/282 (56%), Gaps = 17/282 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNES----- 95
I HN E + G + + N+++DL E +L R RR S N +
Sbjct: 70 IEEHNHEHRLGRKTFEMGLNNIADLP----FSEYRKLNGYRHRRLFGDSMRKNGTKFLVP 125
Query: 96 --VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
V +PD +DWRE +TP NQ CG+C+AFS A++GQ F++T ++ LS Q +VDC
Sbjct: 126 FNVKVPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDC 185
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ EE YPY GK+ C FK+ +I + + LP
Sbjct: 186 STKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIGAEDRGFVDLP 245
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
DE ALKV +AT GPI+++I+A +FQLY G+Y DE C+S+ ++H +LLVGY +
Sbjct: 246 EGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPE 305
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
WI+KN W WG+ GY+ + R NN CG+A A Y L+
Sbjct: 306 AGDYWIIKNSWGTKWGEKGYVRIARNRNNHCGVATKASYPLV 347
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 165/293 (56%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL---THSRIRRTL 86
+++ W+ N KKI HN E G H Y+L NH D+ + + M + ++R +L
Sbjct: 47 RRMVWEKNLKKIELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLKSQRKLRGSL 106
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWR+KG++TP +Q CG+C+AFS A++GQ F+ T + LS
Sbjct: 107 FMEPNFLEA---PRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLS 163
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GGL EE YPY G + C + +
Sbjct: 164 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSAN 223
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A+VGP++V+I+A +FQ Y SGIY D+ C+S+ ++H +L+
Sbjct: 224 DTGFVDVPSGSERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLV 283
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS +WGD GY+Y+ K N CGIA A Y L+
Sbjct: 284 VGYGFEGKDVDGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPLV 336
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/293 (37%), Positives = 162/293 (55%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M + T + + +L
Sbjct: 49 RRMVWEKNLKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQTTERKFKGSL 108
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P + P +DWREKG++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 109 FMEPNY---LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLS 165
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y+Q GL EE YPY G + C +K +
Sbjct: 166 EQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGAN 225
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P EHA+ +A VGP++V+I+A +FQ Y GIY ++ C+S+ ++H +L+
Sbjct: 226 ETGFVDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLV 285
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA + Y L+
Sbjct: 286 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPLV 338
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 111/282 (39%), Positives = 163/282 (57%), Gaps = 17/282 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNES----- 95
I HN+E + G + + N ++DL Y K L R+RR S +SN +
Sbjct: 77 IEEHNKEHRLGRKTFEMGLNEIADLPFSQYRK----LNGYRMRRQFGDSLQSNGTKFLVP 132
Query: 96 --VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
V IP+ +DWRE+G +TP NQ CG+C+AFS A++GQ ++T ++ LS Q +VDC
Sbjct: 133 FNVQIPESVDWREEGLVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDC 192
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ E+ YPY G+++ C FKR + D + LP
Sbjct: 193 STKYGNHGCNGGLMDLAFEYIKENHGVDTEDSYPYVGRETKCHFKRNAVGADDKGFVDLP 252
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
DE ALK +AT GPI+++I+A +FQLY G+Y DE C+S+ ++H +LLVGY +
Sbjct: 253 EGDEEALKKAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPE 312
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W WG+ GY+ + R NN CG+A A Y L+
Sbjct: 313 AGDYWLVKNSWGPTWGEKGYIRIARNRNNHCGVATKASYPLV 354
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 166/304 (54%), Gaps = 12/304 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ KDY ++ +S +++ W+ N K I HN + G H Y L N D+ + + M
Sbjct: 51 HSKDYHER-EESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNG 109
Query: 77 LTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
H + R S S L P +DWREKG++TP +Q CG+C+AFS A++GQ
Sbjct: 110 YKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQH 169
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSI 194
F+ T ++ LS Q +VDCS GN GC GG + YVQ GG+ EE YPY K
Sbjct: 170 FRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED 229
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++K + + + +P E AL +A+VGP++V+I+A +FQ Y SGIY + C
Sbjct: 230 CRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDC 289
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S+ ++H +L+VGY + WI+KN W WGD GY+Y+ K N CGIA A
Sbjct: 290 SSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAAS 349
Query: 306 YALI 309
Y L+
Sbjct: 350 YPLV 353
>gi|354504703|ref|XP_003514413.1| PREDICTED: cathepsin R-like [Cricetulus griseus]
gi|344245863|gb|EGW01967.1| Cathepsin R [Cricetulus griseus]
Length = 333
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 114/305 (37%), Positives = 172/305 (56%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y ++ + +K+ W+ N K I + E G++ +T+ N DL K
Sbjct: 33 KKSYDKTYSQE-EERQKRAVWEDNVKMIKLLSMENGLGMNNFTVEMNEFGDLTGEEMRKM 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
MT + +R + + V I LDWR +G++ P Q CGAC+AF++A +I+G
Sbjct: 92 MTDSSVLTLRNG--KHIQKRGDVKIHKTLDWRTQGYVGPVRRQNGCGACWAFALAGSIEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+FK T ++ +LS+Q ++DCS G GC GG L N YV+ GGL E YPY+ K+
Sbjct: 150 QMFKKTGKMTQLSVQNLIDCSRTYGTNGCKGGRLYNAFQYVKNNGGLEAEATYPYESKEG 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+++ VV I+ + V+ P++E AL L T GPIAV I+A +F YA G+Y +
Sbjct: 210 RCRYRAERSVVKITRFLVV-PRNEEALMNALVTHGPIAVGIDAGHESFTNYAGGMYHEPN 268
Query: 254 CTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
C D H++LLVG+ R W++KN +WG+NGYM + R NN CGIA+YA
Sbjct: 269 CRRDSPTHSVLLVGFGYEGRESEGRKYWLIKNSHGENWGENGYMKIPRDQNNYCGIASYA 328
Query: 305 VYALI 309
+Y ++
Sbjct: 329 MYPVL 333
>gi|39930363|ref|NP_058817.1| cathepsin J precursor [Rattus norvegicus]
gi|84028185|sp|Q63088.2|CATJ_RAT RecName: Full=Cathepsin J; AltName: Full=Cathepsin L-related
protein; AltName: Full=Cathepsin P; AltName:
Full=Catlrp-p; Flags: Precursor
gi|28196048|gb|AAL26793.2| cathepsin P [Rattus norvegicus]
gi|66910531|gb|AAH97263.1| Cathepsin J [Rattus norvegicus]
gi|149039736|gb|EDL93852.1| cathepsin J [Rattus norvegicus]
Length = 334
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 173/307 (56%), Gaps = 17/307 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY K Y + K+ + W+ N K I HN+E G +G+T+ N +D + K
Sbjct: 33 KTKYAKSYSPVEEELKRAV-WEENLKMIQLHNKENGLGKNGFTMEMNAFADTTGEEFRKS 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
++ + + V +P + + V I P+ DWR++G++TP NQ CG+C+AF+ AI
Sbjct: 92 LSDI----LIPAAVTNPSAQKQVSIGLPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAI 147
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+F T + LS+Q ++DCS GN GC G+ NYV GL E YPY+GK
Sbjct: 148 EGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCRWGTAHQAFNYVLKNKGLEAEATYPYEGK 207
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N +I+ + LPP +E L V +A++GP++ +I+AS +F+ Y+ G+Y +
Sbjct: 208 DGPCRYHSENASANITGFVNLPP-NELYLWVAVASIGPVSAAIDASHDSFRFYSGGVYHE 266
Query: 252 EACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIAN 302
C+S VNHA+L+VGY N W++KN W WG NG+M + K NN CGIA+
Sbjct: 267 PNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSWGEEWGINGFMKIAKDRNNHCGIAS 326
Query: 303 YAVYALI 309
A + I
Sbjct: 327 QASFPDI 333
>gi|187960088|ref|NP_062414.3| cathepsin 8 precursor [Mus musculus]
gi|8917581|gb|AAF81277.1|AF250840_1 EPCS68 [Mus musculus]
gi|16445019|gb|AAK00509.1| cathepsin 2 precursor [Mus musculus]
gi|26340210|dbj|BAC33768.1| unnamed protein product [Mus musculus]
gi|148709369|gb|EDL41315.1| cathepsin 8 [Mus musculus]
Length = 333
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/305 (37%), Positives = 169/305 (55%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++K+ K+Y + + +K+ W+ N K + HN E QG +T+ N D+ Y K
Sbjct: 33 KRKFNKNYSME-EEGQKRAVWEENMKLVKQHNIEYDQGKKNFTMDVNAFGDMTGEEYRKM 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+T + R+ +S + +P +DWR++G +TP NQ C +C+AFS A AI+G
Sbjct: 92 LTDIPVPNFRKK--KSIHQPIAGYLPKFVDWRKRGCVTPVKNQGTCNSCWAFSAAGAIEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+F+ T ++ LS Q +VDCS + GN GC GS L YV GL E YPYKG
Sbjct: 150 QMFRKTGKLVPLSTQNLVDCSRLEGNFGCFKGSTFLALKYVWKNRGLEAESTYPYKGTDG 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ I+S+S + E L +AT+GPI+V I+A +F+LY GIY +
Sbjct: 210 HCRYHPERSAARITSFSFV-SNSEKDLMRAVATIGPISVGIDARHKSFRLYREGIYYEPK 268
Query: 254 CTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
C+S+ +NH++L+VGY W++KN WG NGYM L RG NN CGIA+YA
Sbjct: 269 CSSNIINHSVLVVGYGYEGKESDGNKYWLIKNSHGEQWGMNGYMKLARGRNNHCGIASYA 328
Query: 305 VYALI 309
VY +
Sbjct: 329 VYPRV 333
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 112/293 (38%), Positives = 162/293 (55%), Gaps = 14/293 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N K I HN + G H Y L NH D+ + + M HS+ + R
Sbjct: 48 RRMIWEKNLKMIQLHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKHSKTEKKY-RG 106
Query: 90 PESNES--VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
E E +++P +DWREKG++TP +Q CG+C+AFS +++GQ F+ T ++ LS
Sbjct: 107 SEFLEPNFLVVPKSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSE 166
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDI 206
Q +VDCS GN GC GG + Y+ GG+ EE YPY K C +K +
Sbjct: 167 QNLVDCSRPEGNQGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAAND 226
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P E AL +A VGP++V+I+AS TFQ Y SGIY D C+S+ ++H +L+V
Sbjct: 227 TGFVDVPEGHERALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVV 286
Query: 267 GY---------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY + WI+KN WS WGD GY+ + K NN CGIA A Y L+
Sbjct: 287 GYGFEGTDDDNKKKYWIVKNSWSDKWGDKGYILMAKDRNNHCGIATAASYPLV 339
>gi|45946482|gb|AAH68241.1| Cathepsin 8 [Mus musculus]
Length = 333
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/305 (37%), Positives = 169/305 (55%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++K+ K+Y + + +K+ W+ N K + HN E QG +T+ N D+ Y K
Sbjct: 33 KRKFNKNYSME-EEGQKRAVWEENMKLVKQHNIEYDQGKKNFTMDVNAFGDMTGEEYRKM 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+T + R+ +S + +P +DWR++G +TP NQ C +C+AFS A AI+G
Sbjct: 92 LTDIPVPNFRKK--KSIHQPIAGYLPKFVDWRKRGCVTPVKNQGTCNSCWAFSAAGAIEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+F+ T ++ LS Q +VDCS + GN GC GS L YV GL E YPYKG
Sbjct: 150 QMFRKTGKLVPLSTQNLVDCSRLEGNFGCFKGSTFLALKYVWKNRGLEAESTYPYKGTDG 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ I+S+S + E L +AT+GPI+V I+A +F+LY GIY +
Sbjct: 210 HCRYHPERSAARITSFSFV-SNSEKDLMRAVATIGPISVGIDARHKSFRLYREGIYYEPK 268
Query: 254 CTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
C+S+ +NH++L+VGY W++KN WG NGYM L RG NN CGIA+YA
Sbjct: 269 CSSNIINHSVLVVGYGYEGKESDGNKYWLIKNSHGEQWGMNGYMKLARGRNNHCGIASYA 328
Query: 305 VYALI 309
VY +
Sbjct: 329 VYPRV 333
>gi|62945374|ref|NP_001017509.1| uncharacterized protein LOC498688 precursor [Rattus norvegicus]
gi|60552853|gb|AAH91563.1| Similar to cathepsin R [Rattus norvegicus]
gi|149039732|gb|EDL93848.1| similar to cathepsin R [Rattus norvegicus]
Length = 334
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 114/308 (37%), Positives = 164/308 (53%), Gaps = 18/308 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KKY K Y + + ++ + W+ N K I HN E G +G+T+ N D + K
Sbjct: 33 KKKYDKSYSLEEEELRRAV-WEENLKMIKLHNGENGLGKNGFTMEINEFGDTTGEEFRKM 91
Query: 74 MTRL---THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M TH + + R+ S + P +DWR+KG++TP Q +C AC+AFS+ A
Sbjct: 92 MVEFPVQTHREGKSIMKRAAGS----IFPKFVDWRKKGYVTPVRRQGNCNACWAFSVTGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
I+ Q + ++ LS+Q +VDCS GN GC GG N YV GGL E YPY+G
Sbjct: 148 IEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLQSEATYPYEG 207
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K C++ N +I+ + L P+ E L V +AT+GPI+ I+AS +F+ Y GIY
Sbjct: 208 KDGPCRYNPKNSSAEITGFVSL-PESEDILMVAVATIGPISAGIDASHESFKFYKKGIYH 266
Query: 251 DEACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIA 301
+ C+S+ V H +L+VGY W++KN W WG GYM + K NN C IA
Sbjct: 267 EPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLIKNSWGKQWGIRGYMKITKDKNNHCAIA 326
Query: 302 NYAVYALI 309
+YA Y I
Sbjct: 327 SYAHYPTI 334
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/305 (35%), Positives = 166/305 (54%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ + K Y + ++ W++N + HN G+H YTL N +DL R
Sbjct: 34 KRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGA---GIHSYTLGMNIFADLTHEEFKRF 90
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ L R + P +N L PD +DWR G +TP +Q CG+C++FS
Sbjct: 91 YLGTKVDLNRPRSNFSSTFIPTANVGAL-PDSVDWRTAGIVTPVKDQGQCGSCWSFSTTG 149
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+++GQ + T ++ LS Q +VDCS GN GC GG + + Y+ G+ E YPY
Sbjct: 150 SVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYT 209
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
K CKF N+ +SS+ + E L+ +ATVGP++V+I+AS ++FQLY SG+Y
Sbjct: 210 AKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSGVY 269
Query: 250 DDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKR-GNNRCGIANYA 304
+++ C+S ++H +L GY T N W++KN W WG GY+++ R NN+CGIA A
Sbjct: 270 NEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNANNQCGIATSA 329
Query: 305 VYALI 309
Y ++
Sbjct: 330 SYPIV 334
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 176/314 (56%), Gaps = 16/314 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ ++ WI ++ K Y D ++ + W+ N ++I HN + + L+ N
Sbjct: 22 VKDESWIQWKMYHNKVYSHD----GEETVRYTIWKDNERRIREHNLKGGD----FILKMN 73
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+ + L+H + + +P + + PD +DWR +G++TP +Q CG
Sbjct: 74 QFGDMTNSEFKAFNGYLSHKHVNGSTFLTPNN---FVAPDTVDWRNEGYVTPVKDQGQCG 130
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ FK T ++ LS Q +VDCS GN GC GG + N Y++ G+
Sbjct: 131 SCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGI 190
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
E YPY + C FK+ ++ + + +P +E+ LK +A+VGPI+V+I+AS +
Sbjct: 191 DSEASYPYTAEDGKCVFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHES 250
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR-GN 295
FQ Y+SG+Y++ +C+S ++H +L+VGY S W++KN W+ WGD GY+ ++R
Sbjct: 251 FQFYSSGVYNEPSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAK 310
Query: 296 NRCGIANYAVYALI 309
N+CGIA A Y L+
Sbjct: 311 NQCGIATKASYPLV 324
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 170/318 (53%), Gaps = 16/318 (5%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+K W + + QK Y K A + +++ W+ N K I HN E GLH Y L N
Sbjct: 76 DKHWELWKNWHQKSYHK-----AEEGWRRMVWEENLKVIELHNLEQSLGLHTYQLGMNQF 130
Query: 63 SDLHPRHYIKEMTRLTH-SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGA 121
DL + + + H S R + V +P +DWR+ G++TP NQ CG+
Sbjct: 131 GDLTNEEFQQMLISERHFSEGNRINGSAFLEVNYVQVPTSVDWRDHGYVTPVKNQGHCGS 190
Query: 122 CYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
C+AFS A++GQ+F+ + + LS Q +VDCS GN GC GG + Y+ G+
Sbjct: 191 CWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQQGNQGCNGGIVDFAFQYILENRGID 250
Query: 182 KEEDYPYKGKQSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
E+ YPY K + C FK ++ + +PP E AL +ATVGP++V+I+A P +
Sbjct: 251 SEDCYPYTAKDTAQCAFKPECATARVTGFVDIPPHSEEALMKAVATVGPVSVAIDAHPTS 310
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL- 291
F+ Y SGI+ + C+S+ +NHA+L+VGY + WI+KN W WGD+GY YL
Sbjct: 311 FRFYQSGIFYEPKCSSERLNHAVLVVGYGYEGEDEAGKKYWIVKNSWGKQWGDHGYFYLS 370
Query: 292 KRGNNRCGIANYAVYALI 309
K N CGIA A Y L+
Sbjct: 371 KDRGNHCGIATTASYPLL 388
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/291 (38%), Positives = 160/291 (54%), Gaps = 16/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N + I HN E QG H +TL NH D+ + + M H + + +
Sbjct: 48 RRAVWEKNLRMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQHQKHKTGKMY- 106
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
E +L+ P +DWREKG++T NQ CG+C+AFS +++GQ+F T + LS
Sbjct: 107 ---QEPLLLQLPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSE 163
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
Q +VDCS GN GC GG + YV+ GL E+ YPY GK CK+K P +
Sbjct: 164 QNLVDCSRPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYK-PELSAAND 222
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E ++ LATVGP++V+I+A +FQ Y GIY D C+S +NH +LLVG
Sbjct: 223 TGFVDVPQREKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVG 282
Query: 268 YTRNS--------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
Y ++ W++KN W WG +GY+ + R NN CG+A A Y L+
Sbjct: 283 YGTDASETGKGDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPLV 333
>gi|301769891|ref|XP_002920367.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281346353|gb|EFB21937.1| hypothetical protein PANDA_009084 [Ailuropoda melanoleuca]
Length = 333
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/297 (37%), Positives = 167/297 (56%), Gaps = 16/297 (5%)
Query: 24 KATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIR 83
K + +++ W+ N K I HN+E QG H +T+ N DL + + + L +
Sbjct: 42 KDEEGQRRRVWEKNMKMIDQHNEEYSQGQHSFTMAMNAFGDLTSEEFKQVLNDLKIQKPE 101
Query: 84 R-TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
+ ++P E IP +DWREKG++TP Q C +C+AFS A++GQ+F+ T ++
Sbjct: 102 EGNVFQAPLFAE---IPASVDWREKGYVTPVKYQGHCQSCWAFSATGALEGQMFRKTGKL 158
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS Q +VDCS N GC GG + N YV+ GGL E YPY G+ CK++
Sbjct: 159 VSLSEQNLVDCSWPQNNDGCRGGLMDNAFRYVKDNGGLDSAESYPYLGRNESCKYRPEKS 218
Query: 203 VVDISS-WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNH 261
++++ WSV +D L T+ATVGP++ ++++S H+FQ Y GIY D C S+ +NH
Sbjct: 219 AANLTTFWSVSNKED--GLMTTVATVGPVSAAVDSSLHSFQFYKKGIYYDPNCRSNRLNH 276
Query: 262 AMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
A+L+VGY + WI+KN W +WG GYM L K +N CGIA A + ++
Sbjct: 277 AVLVVGYGFEGEESENKKYWIIKNSWGTNWGMKGYMLLAKDRDNHCGIATMASFPVV 333
>gi|291383488|ref|XP_002708302.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 344
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 114/290 (39%), Positives = 165/290 (56%), Gaps = 16/290 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N + I HN+E QG HG+T+ N D+ + M + +R
Sbjct: 48 RRAVWEKNMQMIEKHNREYSQGKHGFTMAMNAYGDMTNEEFRLMMNGFENQNHKR----G 103
Query: 90 PESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
E + S+L IP LDWRE+G++TP NQE CG+ +AFS A++GQ+F+ T + LS
Sbjct: 104 EEFHNSLLFKIPAFLDWRERGYVTPVKNQELCGSSWAFSATGALEGQMFRKTGRLVSLSE 163
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
Q +VDCS GN GC+GG + YV+ GL EE YPY+ ++ CK+ +++
Sbjct: 164 QNLVDCSWPQGNQGCSGGLMDYAFQYVKDNRGLDSEESYPYEQRKGSCKYNPRFSAANVT 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V +DE AL +ATVGP++V I +P +F Y GIY D C+S+ VNHA+L+VG
Sbjct: 224 GF-VDVSKDEKALMEAVATVGPVSVGIATTPESFLFYEGGIYYDPKCSSENVNHAVLVVG 282
Query: 268 Y------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYAL 308
Y ++N+ W++KN W WG GYM + K NN CGIA A Y L
Sbjct: 283 YGFEEVGSKNNKYWLIKNSWGKDWGMGGYMKMAKDQNNHCGIATAASYPL 332
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/292 (36%), Positives = 162/292 (55%), Gaps = 12/292 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM----TRLTHSRI 82
D ++ W+ N K I HN E G H + L N D+ + + M + + R
Sbjct: 45 DGWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMNKFGDMTTEEFKQVMNGYNSNGSQKRT 104
Query: 83 RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
+ +L R P + +P +DWREKG++TP NQ CG+C+AFS +++GQ F T ++
Sbjct: 105 KGSLYREPLLAQ---LPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKL 161
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS Q +VDCS GN GC+GG + N YV+ GG+ E+ YPY G+ + CK++
Sbjct: 162 VSLSEQNLVDCSTSEGNNGCSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNECKYRAECS 221
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
+++ + +P +E AL +A VGPI+V+I+A +FQ Y SG+Y + C+S ++H
Sbjct: 222 GANVTGFVDIPSMNERALMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQLDHG 281
Query: 263 MLLVGYTR----NSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+L+VGY WI+KN W WG GY+ + K NN CGIA A Y +
Sbjct: 282 VLVVGYGSIGKDEYWIVKNSWGEEWGKKGYVLMAKFRNNHCGIATAASYPQV 333
>gi|318054062|ref|NP_001187179.1| cathepsin S precursor [Ictalurus punctatus]
gi|190351079|gb|ACE75948.1| cathepsin S [Ictalurus punctatus]
Length = 329
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 173/304 (56%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y + + ++ W+ N + I HN EA G+H Y L NH+ D+ R I +
Sbjct: 30 KKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDM-AREEILQ 88
Query: 74 M---TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M TR+ + RR+ S SV PD +DWREKG++T NQ CG+C+AFS A A
Sbjct: 89 MFAGTRVPPNLTRRSSTFVASSGISV--PDSVDWREKGYVTEVKNQGSCGSCWAFSAAGA 146
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ ++T +++ LS Q +VDCS GN GC GG + YV GG+ +E YPY
Sbjct: 147 LEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTEAFQYVIDNGGIDSDEAYPYTA 206
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C++ + + SS++ + DE ALK +AT+GPI+V+I+A+ F LY SG+Y+
Sbjct: 207 MDGQCRYDQAQRAANCSSYNYVSQGDEEALKQAVATIGPISVAIDATRPMFILYHSGVYN 266
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D+ T + + VGY + W++KN W +GD GY+ + R N CGIANYA
Sbjct: 267 DQTSTP-WFTFWVQDVGYGSLNGEDYWLVKNSWGPRFGDGGYIRIARNKGNMCGIANYAC 325
Query: 306 YALI 309
Y L+
Sbjct: 326 YPLM 329
>gi|61200410|gb|AAX39778.1| cathepsin R [Mus musculus]
Length = 335
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 118/307 (38%), Positives = 163/307 (53%), Gaps = 19/307 (6%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y K + K++ W+ K I HN+E G +G+T++ N D + K M
Sbjct: 35 KYNKSYSLK-EEKLKRVVWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMI 93
Query: 76 RL---THSRIRRTLVRSPESNESVLIPDHLDWR-EKGFITPDWNQEDCGACYAFSIASAI 131
+ TH + + R S ++P +DWR +KG++TP Q DC AC+AF++ AI
Sbjct: 94 EISVWTHREGKSIMKREAGS----ILPKFVDWRTKKGYVTPVRRQGDCDACWAFAVTGAI 149
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+ Q T ++ LS+Q +VDCS GN GC GG N YV GGL E YPY+GK
Sbjct: 150 EAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPYEGK 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N +I+ + L PQ E L +AT+GPI I+AS +F+ Y GIY +
Sbjct: 210 DGPCRYNPKNSKAEITGFVSL-PQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHE 268
Query: 252 EACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIAN 302
C+SD V H +L+VGY W++KN W WG GYM L K NN CGIA+
Sbjct: 269 PNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHCGIAS 328
Query: 303 YAVYALI 309
YA Y I
Sbjct: 329 YAHYPTI 335
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 165/304 (54%), Gaps = 12/304 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ KDY ++ +S +++ W+ N K I HN + G H Y L N D+ + + M
Sbjct: 17 HNKDYHER-EESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTEEFRQLMNG 75
Query: 77 LTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
H + R S S L P +DWREKG++TP +Q CG+C+AFS A++GQ
Sbjct: 76 YAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQH 135
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-I 194
F+ T ++ LS Q +VDCS GN GC GG + YVQ GG+ EE YPY K
Sbjct: 136 FRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED 195
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++K + + + +P E AL +A VGP++V+I+A +FQ Y SGIY + C
Sbjct: 196 CRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC 255
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S+ ++H +L+VGY + WI+KN W WGD GY+Y+ K N CGIA A
Sbjct: 256 SSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAAS 315
Query: 306 YALI 309
Y L+
Sbjct: 316 YPLV 319
>gi|355681660|gb|AER96816.1| cathepsin L2 [Mustela putorius furo]
Length = 334
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 113/290 (38%), Positives = 160/290 (55%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN+E QG HG+T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFRNQKHRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E IP +DW +KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFAE---IPKSVDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + Y++ GGL EE YPY + + +P V +
Sbjct: 165 NLVDCSRSQGNQGCNGGLMDFAFQYIKDNGGLDSEESYPYLARDTDSCNYKPEYSVANDT 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VGY
Sbjct: 225 GFVDIPQRERALMKAVATVGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 285 GFEGTDSNNNKFWIVKNSWGPEWGCNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|318065049|ref|NP_001187379.1| cathepsin K precursor [Ictalurus punctatus]
gi|308322859|gb|ADO28567.1| cathepsin K [Ictalurus punctatus]
Length = 331
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 167/300 (55%), Gaps = 11/300 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K+Y D+ ++ W+ N + I +HNQE + GLH Y L NHL D+ +++
Sbjct: 36 HRKEYNGLGEDAIRRSVWEKNMRLIESHNQEYELGLHTYELGMNHLGDMTTEEVAEKLLG 95
Query: 77 LTHSRIRRTL-VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
L L P+S + +P +D+R+ G++TP NQ CG+C+AFS A++GQ+
Sbjct: 96 LQVPMDNDPLNTYYPDSLDK--LPKSIDYRKLGYVTPVRNQGSCGSCWAFSSVGALEGQL 153
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
K+T ++ LS Q +VDC ++ N GC GG + N +YV+ GG+ EE YPY G+ C
Sbjct: 154 MKTTGKLVNLSPQNLVDC--VTENDGCGGGYMTNAFSYVRDNGGIDSEEAYPYVGQDQQC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+ + + + + E+AL +A VGP++V I+A TFQ Y G+Y D C
Sbjct: 212 AYNKSGKAAECRRFKEVKKGSEYALASAVAKVGPVSVGIDAMQSTFQFYKRGVYYDPNCD 271
Query: 256 SDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ +NHA+L VGY + WI+KN W WG GY+ + R NN CGIAN A + ++
Sbjct: 272 KESINHAVLAVGYGATPKGKKHWIVKNSWGEEWGMKGYVLMARNRNNACGIANLASFPVM 331
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 166/306 (54%), Gaps = 11/306 (3%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K + + +A + ++ W+ N K I HN E GLH Y L N DL + + +
Sbjct: 33 KNWHQKSYHEAEEGWRRTVWEENLKAIQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEIL 92
Query: 75 TRLTH-SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T H S+ R + V +P +DWR+ G++TP NQ CG+C+AFS A++G
Sbjct: 93 TGERHFSKGNRINGSAFLEANFVQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEG 152
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+F+ + + LS Q +VDCS GN GC GG + Y+ G+ E+ YPY K +
Sbjct: 153 QLFRKSGRLISLSEQNLVDCSWQQGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDT 212
Query: 194 I-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C FK ++ + +PP E AL +ATVGP++V I+AS +F+ Y SGI+ D
Sbjct: 213 AQCTFKPECATAPVTGFVDIPPHSEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDP 272
Query: 253 ACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
C+S+ ++HA+L+VGY + WI+KN W HWGD GY+Y+ K N CGIA
Sbjct: 273 KCSSESLDHAVLVVGYGYEREDEAGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATV 332
Query: 304 AVYALI 309
A Y L+
Sbjct: 333 ASYPLL 338
>gi|340370270|ref|XP_003383669.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 326
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 165/304 (54%), Gaps = 24/304 (7%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y K T+ ++++ W+SN K + HN + + G+T+ N +DL +
Sbjct: 29 KYNKVYETKETELERQIIWESNKKFVENHNANSDK--FGFTVAMNEFADLDAGEF----- 81
Query: 76 RLTHSRIRRTLVRSPESNES--------VLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
+ I L+ P S S V + D +DWREKG +T NQ CG+C++FS
Sbjct: 82 ----ANIYNGLLPRPASYNSTKLFKKTGVSVGDTVDWREKGAVTEVKNQGKCGSCWSFSS 137
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ F T + LS QQ++DCS GN GC GG + N+ Y++ G M EE YP
Sbjct: 138 TGSLEGQHFLKTGTLSSLSEQQLMDCSTSFGNHGCKGGLMDNSFRYLETVAGDMSEEMYP 197
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + C+++ + + + +P DE ALK +ATVGPI+V+I+A +FQLY G
Sbjct: 198 YTAEDGFCRYRSSEAIAKDTGYKDIPRGDEDALKEAVATVGPISVAIDAGHRSFQLYHEG 257
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
IY + AC+S ++H +L VGY W++KN W WG+ GY+ + R N CGIA
Sbjct: 258 IYYEPACSSTKLDHGVLAVGYGTGEGEEYWLVKNSWGPSWGNEGYVMMSRNRENNCGIAT 317
Query: 303 YAVY 306
A Y
Sbjct: 318 QASY 321
>gi|21483190|gb|AAL14223.1| cathepsin L [Dictyocaulus viviparus]
Length = 347
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 111/282 (39%), Positives = 159/282 (56%), Gaps = 17/282 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI-- 98
I HN E + G + + N+++DL E +L R RR S N + +
Sbjct: 70 IEEHNHEHRLGRKTFEMGLNNIADLP----FSEYRKLNGYRHRRLFGDSMRKNGTKFLVP 125
Query: 99 -----PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
PD +DWRE +TP NQ CG+C+AFS A++GQ F++T ++ LS Q +VDC
Sbjct: 126 FNVKAPDSVDWREHNLVTPVKNQGMCGSCWAFSATGALEGQHFRATGKLVSLSEQNLVDC 185
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ EE YPY GK+ C FK+ +I + + LP
Sbjct: 186 STKYGNHGCNGGLMDLAFEYIKDNHGIDTEEGYPYVGKEMRCHFKKRDIGAEDRGFVDLP 245
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
DE ALKV +AT GPI+++I+A +FQLY G+Y DE C+S+ ++H +LLVGY +
Sbjct: 246 EGDEDALKVAVATQGPISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPE 305
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
WI+KN W WG+ GY+ + R NN CG+A A Y L+
Sbjct: 306 AGDYWIIKNSWGTKWGEKGYVRIARNRNNHCGVATKASYPLV 347
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 112/282 (39%), Positives = 163/282 (57%), Gaps = 17/282 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNES----- 95
I HNQE + G + + N ++DL Y K L R RR S +SN +
Sbjct: 77 IDEHNQEHRLGRKTFEMGLNSIADLPFSQYRK----LNGYRHRRNFGDSMQSNGTKWLAP 132
Query: 96 --VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
V IPD +DWR+KG +T NQ CG+C+AFS A++GQ +++ ++ LS Q +VDC
Sbjct: 133 FNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDC 192
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ EE YPY G+++ C FK+ +I + + LP
Sbjct: 193 STKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKKKDIGAEDKGFVDLP 252
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
DE ALKV +AT GPI+++I+A TFQLY G+Y DE C+S+ ++H +LLVGY +
Sbjct: 253 EGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPE 312
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W WG+ GY+ + R +N CG+A A Y L+
Sbjct: 313 AGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPLV 354
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 112/282 (39%), Positives = 163/282 (57%), Gaps = 17/282 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNES----- 95
I HNQE + G + + N ++DL Y K L R RR S +SN +
Sbjct: 77 IDEHNQEHRLGRKTFEMGLNSIADLPFSQYRK----LNGYRHRRNFGDSMQSNGTKWLAP 132
Query: 96 --VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
V IPD +DWR+KG +T NQ CG+C+AFS A++GQ +++ ++ LS Q +VDC
Sbjct: 133 FNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARASGKMVSLSEQNLVDC 192
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ EE YPY G+++ C FK+ +I + + LP
Sbjct: 193 STKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKKKDIGAEDKGFVDLP 252
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
DE ALKV +AT GPI+++I+A TFQLY G+Y DE C+S+ ++H +LLVGY +
Sbjct: 253 EGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPE 312
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W WG+ GY+ + R +N CG+A A Y L+
Sbjct: 313 AGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPLV 354
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 174/304 (57%), Gaps = 13/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY--I 71
+ ++ K YR + + + WQ+N K I HNQ A G+ GYTL+ N DL + +
Sbjct: 26 KAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHA--GVFGYTLKMNQFGDLENSEFKSL 83
Query: 72 KEMTRLTHS-RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
R++++ R + V + + +P +DW +KG++TP NQ CG+C++FS +
Sbjct: 84 YNGYRMSNAPRKGKPFVPAARVQD---LPASVDWSKKGWVTPVKNQGQCGSCWSFSATGS 140
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ F +T + LS Q +VDCS GN GC GG + + YV G+ E YPY+
Sbjct: 141 MEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYPYRA 200
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
S CKF ++ IS + + E L+V +AT+GP++V+I+AS +FQ Y+SG+YD
Sbjct: 201 VDSTCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSSGVYD 260
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
C+S ++H +L VGY +++ W++KN W WG +GY+ + R NN+CGIA A
Sbjct: 261 PLICSSTNLDHGVLAVGYGTDGSKDYWLVKNSWGASWGMSGYIEMVRNHNNKCGIATSAS 320
Query: 306 YALI 309
Y ++
Sbjct: 321 YPVV 324
>gi|66378018|gb|AAY45870.1| cathepsin L-like cysteine proteinase [Rotylenchulus reniformis]
Length = 369
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/310 (35%), Positives = 177/310 (57%), Gaps = 20/310 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+++++K Y+ + ++++ L + SN + I HNQ ++G +++ ENH++DL Y K
Sbjct: 66 KQQHEKSYKNQQLETERMLAYLSNKQFIDKHNQAFREGKKSFSIGENHIADLPFSEYKK- 124
Query: 74 MTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAF 125
+ RR L + N S IP+ +DWR+K ++T NQ CG+C+AF
Sbjct: 125 -----LNGYRRALGDNLRRNASTFLAPMNIGDIPESVDWRDKQWVTEVKNQGQCGSCWAF 179
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S A++GQ + T ++ LS Q +VDC+ GN+GC GG + N Y++ G+ KE
Sbjct: 180 SATGALEGQHARKTGQLVSLSEQNLVDCTKKYGNMGCNGGLMDNAFQYIKDNEGIDKEMT 239
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPYK K C FKR ++ + + + DE LK+ +AT GP++V+I+A +FQLY
Sbjct: 240 YPYKAKAGRCHFKRNDVGATDTGFFDVAEGDEDKLKLAVATQGPVSVAIDAGHRSFQLYK 299
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
G+Y +E C + ++H +L+VGY + WI+KN WS HWG+ GY+ + NN CG
Sbjct: 300 HGVYFEEECNPEELDHGVLVVGYGTDPEHGDYWIVKNSWSTHWGEQGYIRMAPNRNNNCG 359
Query: 300 IANYAVYALI 309
I ++A Y +
Sbjct: 360 IPSHASYPTV 369
>gi|163914459|ref|NP_001106314.1| cathepsin K precursor [Xenopus laevis]
gi|159155477|gb|AAI54985.1| LOC100127265 protein [Xenopus laevis]
Length = 331
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 167/302 (55%), Gaps = 8/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K Y + + +++L W+ N K I +HN E QGLH Y + N L D+ ++
Sbjct: 32 KRTYHKQYNGQMDELQRRLIWEKNFKMITSHNFEYNQGLHTYEMAMNQLGDMTSEEVVRT 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT L H R + T + +PD +D+R+KG++TP NQ CG+C+AFS A++
Sbjct: 92 MTGLKIHKRNKPTNLTFEHDKAPEKVPDSIDYRKKGYVTPIRNQGSCGSCWAFSSVGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ K ++ LS Q +VDC + N GC GG + N YV+ G+ E+ YPY G+
Sbjct: 152 GQLKKKKGKLVVLSPQNLVDC--VKKNDGCGGGYMTNAFEYVRDNKGIDSEKAYPYVGED 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + + +E ALK +A VGP++V I+A +FQ Y+ G+Y D+
Sbjct: 210 QECMYNVSGRAAACKGYKEVQEGNEKALKKAVALVGPVSVGIDAGLSSFQFYSKGVYYDK 269
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYA 307
C+++ +NHA+L VGY WI+KN W WGD GY+ + K N CGIAN A Y
Sbjct: 270 DCSAEDINHAVLAVGYGTQKKAKYWIVKNSWGEEWGDKGYILMAKDKGNACGIANLASYP 329
Query: 308 LI 309
++
Sbjct: 330 VM 331
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 114/306 (37%), Positives = 171/306 (55%), Gaps = 17/306 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K+Y +K + ++L W+ N +KI HN E G H Y L NH D+ + + M
Sbjct: 35 HSKNYHEK-EEGWRRLVWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNG 93
Query: 76 --RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
R + +L P E+ P +DWR+KG++TP +Q CG+C+AFS A++G
Sbjct: 94 YKRREQRKYSGSLFMEPNFLEA---PRAVDWRDKGYVTPVKDQGQCGSCWAFSTTGALEG 150
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQ 192
Q F+ T ++ LS Q +VDCS GN GC GG + YV+ GL E+ YPYKG
Sbjct: 151 QQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDFYPYKGTDD 210
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ V+ + + +P E AL +A+VGP++V+I+A +FQ Y SGIY ++
Sbjct: 211 QPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEK 270
Query: 253 ACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
C+SD ++H +L+VGY + WI+KN WS WGD G++Y+ K +N CGIA
Sbjct: 271 ECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFIYMAKDRHNHCGIATA 330
Query: 304 AVYALI 309
A Y L+
Sbjct: 331 ASYPLV 336
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 107/299 (35%), Positives = 167/299 (55%), Gaps = 6/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ K + + + N I HN + +GL Y L N DL P + K
Sbjct: 14 HKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHEFAKMFNG 73
Query: 77 LTHSRIRRTLVRSPESN-ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
R R P +N +P +DWR+KG +TP +Q CG+C+AFS +++GQ
Sbjct: 74 YHGERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQH 133
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F + ++ LS Q ++DCS GN GC GG + N Y++ G+ EE YPY+ C
Sbjct: 134 FLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEESYPYEAMDGDC 193
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+FK+ ++ + + + E L+ +ATVGPI+V+I+AS +FQLY+ G+YD+ C+
Sbjct: 194 RFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSEGVYDEPNCS 253
Query: 256 SDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
S+ ++H +L VGY + W++KN W+ WGDNGY+ + R +N+CGIA+ A Y L+
Sbjct: 254 SEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCGIASSASYPLV 312
>gi|213623956|gb|AAI70449.1| LOC100127265 protein [Xenopus laevis]
Length = 331
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 167/302 (55%), Gaps = 8/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K Y + + +++L W+ N K I +HN E QGLH Y + N L D+ ++
Sbjct: 32 KRTYHKQYNGQMDELQRRLIWEKNFKMITSHNFEYNQGLHTYEMAMNQLGDMTSEEVVRT 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT L H R + T + +PD +D+R+KG++TP NQ CG+C+AFS A++
Sbjct: 92 MTGLKIHKRNKPTNLTFEHEKAPEKVPDSIDYRKKGYVTPIRNQGSCGSCWAFSSVGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ K ++ LS Q +VDC + N GC GG + N YV+ G+ E+ YPY G+
Sbjct: 152 GQLKKKKGKLVVLSPQNLVDC--VKKNDGCGGGYMTNAFEYVRDNKGIDSEKAYPYVGED 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + + +E ALK +A VGP++V I+A +FQ Y+ G+Y D+
Sbjct: 210 QECMYNVSGRAAACKGYKEVQEGNEKALKKAVALVGPVSVGIDAGLSSFQFYSKGVYYDK 269
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYA 307
C+++ +NHA+L VGY WI+KN W WGD GY+ + K N CGIAN A Y
Sbjct: 270 DCSAEDINHAVLAVGYGTQKKAKYWIVKNSWGEEWGDKGYILMAKDKGNACGIANLASYP 329
Query: 308 LI 309
++
Sbjct: 330 VM 331
>gi|111036374|dbj|BAF02516.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 106/284 (37%), Positives = 159/284 (55%), Gaps = 11/284 (3%)
Query: 36 SNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRI----RRTLVRSPE 91
+N+ + HN+ GL Y+ N +DL + ++ L + + + + E
Sbjct: 56 NNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAEKYLTLKQTPMEGIWQDMSTQYVE 115
Query: 92 SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
+L+PD +DWR+KG +TP +Q DCG+C+AFS A++GQ+ + T ++ LS QQ+V
Sbjct: 116 RPTRMLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSATGALEGQLKRKTGKLISLSEQQLV 175
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
DCS +GN GC GG + + Y G E DYPY CKF +V +S +
Sbjct: 176 DCSTYTGNEGCNGGDMNDAFRY-WMRNGAESESDYPYTAMDGKCKFNSSKVVTKVSKFVK 234
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+P + E LK+++A VGP++V+I+A+ F LY GIY D C+ Y++HA+L+VGY
Sbjct: 235 VPKKREDQLKLSVAQVGPVSVAIDATSSGFMLYKKGIYQDNTCSQQYLDHAVLVVGYDAD 294
Query: 269 -TRNS-WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
TR WI+KN W WG GY+++ R N CGIA A Y LI
Sbjct: 295 KTRQKYWIVKNSWGEDWGQRGYIWMARDKGNMCGIATMASYPLI 338
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 108/302 (35%), Positives = 166/302 (54%), Gaps = 13/302 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
KDY + + + N KI HN++ + Y L N DL ++ TR
Sbjct: 36 KDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEFVS--TRNG 93
Query: 79 HSRIRRTLVRS------PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
R R R PE E + +P +DWR+KG +TP NQ CG+C+AFS +++
Sbjct: 94 FKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 153
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
G F+ T ++ LS Q +VDCS GN GC GG + N Y++ G+ E YPY
Sbjct: 154 GPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNATD 213
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
+C F R ++ + + +P DE+ LK +A VGP++V+I+AS +FQ Y+ G+YD+
Sbjct: 214 GVCHFNRSDVGATDTGFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYDEP 273
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+S+ ++H +L+VGY ++ W++KN W WGD GY+Y+ R +N+CGIA+ A Y
Sbjct: 274 ECSSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYIYMTRNKDNQCGIASSASYP 333
Query: 308 LI 309
L+
Sbjct: 334 LV 335
>gi|392354135|ref|XP_225128.6| PREDICTED: LOW QUALITY PROTEIN: cathepsin M [Rattus norvegicus]
Length = 333
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 166/303 (54%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K+Y + + +++ W+ N K + HN E QG + +T++ N D+ + K M
Sbjct: 35 KYDKNYSLE-EEGQRRAVWEENMKVVKQHNIEYDQGKNNFTMKVNAFGDMTGEEFRKMMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ R R+ V +S+ +P +DWR +G++T NQ C +C+AFS+A AI+GQ+
Sbjct: 94 DIPVLRFRKKKVX--QSHTGGYLPKFVDWRRRGYVTSVKNQGRCNSCWAFSVAGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T + LS Q +VDCS GN GC G T YV GGL E YPY+G++ C
Sbjct: 152 FRKTGRLVSLSAQNLVDCSRPEGNRGCISGHTFYTFKYVWNNGGLEAESTYPYEGREGHC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ I +S++ +E AL +AT+GPI+V I+AS +F Y+ GIY + C
Sbjct: 212 RYLPERSAARIKGFSIISSTEE-ALMNAVATIGPISVGIDASHESFTFYSGGIYYEPKCR 270
Query: 256 SDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+ VNHA+LLVGY R W++KN WG NGYM L RG N CGIA A Y
Sbjct: 271 NKTVNHAVLLVGYGYEGRESDGRKYWLIKNSHGVGWGMNGYMKLARGWNKHCGIATCAFY 330
Query: 307 ALI 309
+
Sbjct: 331 PRV 333
>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
Length = 334
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 115/291 (39%), Positives = 159/291 (54%), Gaps = 15/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRTLVR 88
++ W+ N K I HNQE QG HG+++ N D+ + + M R + L R
Sbjct: 48 RRAVWEKNKKIIDLHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKRKKGKLFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +P +DW +KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLL---IDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ-SICKFKRPNIVVDIS 207
+VDCS GN GC GG + N Y++ GGL EE YPY S C +K P
Sbjct: 165 NLVDCSRPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYLATDTSSCNYK-PECSAAND 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VG
Sbjct: 224 TGFVDIPQREKALMKAVATVGPISVAIDAGHASFQFYKSGIYYDPDCSSKDLDHGVLVVG 283
Query: 268 Y--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 284 YGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 166/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 32 KKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ R + + N+ +PD +DWREKG +T Q CG+C+AFS A++
Sbjct: 92 MSSLRVPSQWPRNVTYKSDPNQK--LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 150 AQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S + LP E ALK +A GP++V I+AS +F LY +G+Y D
Sbjct: 210 DGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYD 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W H+GD GY+ + R + N CGIANY Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|410910990|ref|XP_003968973.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
Length = 329
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 107/296 (36%), Positives = 164/296 (55%), Gaps = 5/296 (1%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y + ++ W+ N + + HN EA G HG+TL NHL+D+ ++M
Sbjct: 31 KHSKRYDNQTEMVHRRAAWEHNVRLVLRHNLEASAGKHGFTLELNHLADMTAEEVNEKMN 90
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
L + E P +DWR+ G ++P NQ CG+C+AFS A++GQ+
Sbjct: 91 NLKVEEWVPVRNGTFEDKLDSETPQSVDWRKHGLVSPVQNQGYCGSCWAFSSLGALEGQM 150
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
+ T + LS Q ++DCS GNLGC GG + + +Y+ GG+ E YPY+ ++ C
Sbjct: 151 KRKTGFLVPLSPQNLLDCSTSDGNLGCRGGYISKSYSYIIRNGGVDSESFYPYEHQKGKC 210
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ S + +LP DE LK T+A VGP+AV++NA +F LY G+Y+ C
Sbjct: 211 RYSVKGKAGYCSRFHILPQGDEETLKATVARVGPVAVAVNAMLASFHLYRGGLYNVPNCN 270
Query: 256 SDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
++NHA+L+VGY ++ W++KN W WG+ GY+ L R N CGIA++AVY
Sbjct: 271 PKFINHAVLVVGYGSSEGQDFWLVKNSWGSAWGEEGYIRLARNKKNLCGIASFAVY 326
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 111/327 (33%), Positives = 178/327 (54%), Gaps = 25/327 (7%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + +K+Y + +K + K++ ++ HK + HNQ Q+GL Y L+ N
Sbjct: 22 LVREEWNTFKLEHKKQYDSETEEKF---RMKIYAENKHK-VAKHNQRYQKGLVSYRLKTN 77
Query: 61 HLSDLHPRHYIKEMTRLTHS------------RIRRTLVRSPESNESVLIPDHLDWREKG 108
SD+ ++ M + IR SP + V P +DWR+ G
Sbjct: 78 KYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVSPAN---VAAPPTVDWRQHG 134
Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLR 168
+TP +Q CG+C++FS A++GQ F+ + + LS Q ++DCS GN GC GG +
Sbjct: 135 AVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSAYGNNGCNGGLMD 194
Query: 169 NTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVG 228
N Y++ G+ E+ YPY+ C++ N + + +P DEH L + LATVG
Sbjct: 195 NAFKYIKDNDGIDTEKTYPYEAVDDKCRYNPKNSGAEDVGFVDIPAGDEHKLMLALATVG 254
Query: 229 PIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHW 283
P++V+I+AS +FQLY+ G+Y DE C+S+ ++H +L+VGY + W++KN W W
Sbjct: 255 PVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGTDEDGGDYWLVKNSWGPSW 314
Query: 284 GDNGYMYLKRG-NNRCGIANYAVYALI 309
GD GY+ + R +N CGIA+ A Y L+
Sbjct: 315 GDEGYIKMARNRDNHCGIASSASYPLV 341
>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 111/294 (37%), Positives = 163/294 (55%), Gaps = 14/294 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM----TRLTHSRI 82
DS ++ W+ N K I HNQE + G + L N D+ + + + + + R
Sbjct: 45 DSLRRAIWEKNLKMIERHNQEYRAGKQSFQLGMNKFGDMTTEEFQEAINFYNSSASQRRT 104
Query: 83 RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
+R L R P + +P+ +DWRE+G++TP NQ C +C+AFS AI+GQ F+ T E+
Sbjct: 105 KRYLHREPLLAQ---LPESVDWREEGYVTPVKNQGQCLSCWAFSAVGAIEGQWFRKTGEL 161
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LSIQ +VDC+ C GG + YVQ GG+ EE YPY G+ + CK++
Sbjct: 162 VSLSIQNLVDCTTSDSISSCHGGFMDRAFQYVQDNGGIDTEECYPYVGEVNECKYQPECS 221
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
++ + +P DE AL +ATVGPI+V+I+ +F+ Y SG+Y D C+S +NHA
Sbjct: 222 GANVVGFVDIPSMDERALMEAVATVGPISVAIDGGNPSFKFYESGVYYDPQCSSSQLNHA 281
Query: 263 MLLVGY------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
L+VGY R WI+KN W WG+NGY+ + K +N CGIA A Y +
Sbjct: 282 GLVVGYGSEGIDGRKYWIVKNSWGELWGNNGYILMAKDEDNHCGIATEASYPEV 335
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 168/304 (55%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+ +
Sbjct: 26 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCR 85
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L R +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 86 MGALRIPRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 143
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 144 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 203
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 204 TDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 263
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+Y
Sbjct: 264 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCS 322
Query: 306 YALI 309
Y I
Sbjct: 323 YPEI 326
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 112/315 (35%), Positives = 174/315 (55%), Gaps = 20/315 (6%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+ EW+ + K Y + ++ + WQ N +KI HN E H Y + NHL
Sbjct: 24 DSEWVAWKSYHGKSYSDVHEERT----RMAIWQQNLEKIKRHNAED----HSYKMAMNHL 75
Query: 63 SDLHPRH--YIKEMTRLTHSRIRRTLVR-SPESNESVLIPDHLDWREKGFITPDWNQEDC 119
DL Y R H+ +R P SN V IP +DW +KG++T NQ C
Sbjct: 76 GDLTEDEFRYFYLGVRAHHNSTKRGWATYMPPSN--VKIPSSVDWSQKGYVTGVKNQGQC 133
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G+C+AFS +++GQ F+ T + LS Q ++DCS GN GC GG + N Y++ GG
Sbjct: 134 GSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGG 193
Query: 180 LMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
+ E YPY G+Q C F ++ ++ + +P E AL+ +ATVGP++V+++AS
Sbjct: 194 IDTESSYPYLGQQGSCHFSSSHVGARVTGYQDIPQGSEQALQSAVATVGPVSVAVDAS-- 251
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG- 294
+Q Y+SG+YD+ C+S ++H +L++GY ++ W++KN W + WG GY+ + R
Sbjct: 252 QWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNYNGQDYWLVKNSWGYSWGVEGYIMMSRNK 311
Query: 295 NNRCGIANYAVYALI 309
NN+CGIA+ A Y L+
Sbjct: 312 NNQCGIASSASYPLV 326
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 168/304 (55%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+ +
Sbjct: 30 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCR 89
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L R +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 90 MGALRIPRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 147
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 148 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 207
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 208 MDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 267
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+Y
Sbjct: 268 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCS 326
Query: 306 YALI 309
Y I
Sbjct: 327 YPEI 330
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 111/306 (36%), Positives = 175/306 (57%), Gaps = 17/306 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM- 74
K+ K Y +K + ++L +Q N K I +HNQEA G H Y L N +D+ Y+ ++
Sbjct: 30 KHDKVYSEKE-EYARRLIFQDNLKTIESHNQEADTGKHSYWLGVNQFADMTHAEYLNQVI 88
Query: 75 ------TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ LT + R T P ++ + D +DWR+KG +T +Q CG+C+AFS
Sbjct: 89 GGCLITSNLTKTGSRATYRYMP----NMQVNDTVDWRDKGLVTDIKDQGQCGSCWAFSTT 144
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++GQ K+T + LS Q +VDCS GN GC GG + Y+ G+ E+ YPY
Sbjct: 145 GSLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQNKGIDTEQCYPY 204
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K K CKF I +SS++ + DE ALK A +GPI+V I+AS +FQ Y+SG+
Sbjct: 205 KAKNHRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSGV 264
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y++ C+S ++H +L+VGY +++ W++KN W WG+ GY+ + R +N+CG+A
Sbjct: 265 YNEFECSSTKLDHGVLVVGYGTYGSKDYWLVKNSWGTVWGNEGYIMMSRNKDNQCGVATD 324
Query: 304 AVYALI 309
A + ++
Sbjct: 325 ASFPVV 330
>gi|308322193|gb|ADO28234.1| cathepsin K [Ictalurus furcatus]
Length = 331
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 107/300 (35%), Positives = 167/300 (55%), Gaps = 11/300 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K+Y ++ ++ W+ N + I +HNQE + GLH Y L NHL D+ +++
Sbjct: 36 HRKEYNGLGEEAIRRSVWEKNMRLIESHNQEYELGLHTYELGMNHLGDMTTEEVAEKLLG 95
Query: 77 LTHSRIRRTL-VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
L L P+S + +P +D+R+ G++TP NQ CG+C+AFS A++GQ+
Sbjct: 96 LQVPMDNDPLNTYYPDSLDK--LPKSIDYRKLGYVTPVRNQGSCGSCWAFSSVGALEGQL 153
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
K+T ++ LS Q +VDC ++ N GC GG + N +YV+ GG+ EE YPY G+ C
Sbjct: 154 MKTTGKLVNLSPQNLVDC--VTENDGCGGGYMTNAFSYVRDNGGIDSEEAYPYVGQDQQC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+ + + + + E+AL +A VGP++V I+A TFQ Y G+Y D C
Sbjct: 212 AYNKSGKAAECRRFKEVKKGSEYALASAVAKVGPVSVGIDAMQSTFQFYKRGVYYDPNCD 271
Query: 256 SDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ +NHA+L VGY + WI+KN W WG GY+ + R NN CGIAN A + ++
Sbjct: 272 KESINHAVLAVGYGATPKGKKHWIVKNSWGEEWGMKGYVLMARNRNNACGIANLASFPVM 331
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/290 (38%), Positives = 164/290 (56%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG H +T+ N D+ + + M + + R V +
Sbjct: 48 RRAVWEKNMKMIELHNHEYNQGKHSFTMAMNAFGDMTNEEFRQVMNGFQNRKPRNGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P +E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFHEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNQGCDGGLMDYAFQYVQENGGLDSEESYPYEATEESCKY-NPEYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +FQ Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKLEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGY 283
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ NS W++KN W WG +GY+ + K N CGIA+ A Y +
Sbjct: 284 GFERTGSDNSKYWLVKNSWGEKWGMDGYIKMAKDRKNHCGIASAASYPTV 333
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/290 (38%), Positives = 164/290 (56%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN E QG H +T+ N D+ + + M + + R V +
Sbjct: 48 RRAVWEKNMKTIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQNRKPRNGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P +E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLLHEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESCKY-NPKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +FQ Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKLEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGY 283
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ NS W++KN W WG +GY+ + K N CGIA+ A Y +
Sbjct: 284 GFERTGSDNSKYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYPTV 333
>gi|73946536|ref|XP_541257.2| PREDICTED: cathepsin L1 [Canis lupus familiaris]
Length = 333
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 106/291 (36%), Positives = 163/291 (56%), Gaps = 16/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N + I HNQE QG H +TL N D+ + + + + ++ V
Sbjct: 48 RRTVWERNMEMIEQHNQEYSQGEHSFTLAMNAFGDMTNEEFKQVLNDFKIQKHKKGKVFP 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+P E +P +DWRE+G++TP +Q C C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 APLFAE---VPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GGL EE YPY + CK++ ++++
Sbjct: 165 NLVDCSWSQGNRGCNGGLMEYAFQYVKDNGGLDSEESYPYLARNEPCKYRPEKSAANVTA 224
Query: 209 -WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
W +L +E L T+ATVGP++ ++++SP +FQ Y GIY D C++ +NH +L+VG
Sbjct: 225 FWPIL--NEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVG 282
Query: 268 Y--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y + WI+KN W +WG GYM L K +N CGIA A Y ++
Sbjct: 283 YGFEGAESDNKKYWIVKNSWGTNWGMQGYMLLAKDRDNHCGIATRASYPVV 333
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 173/316 (54%), Gaps = 13/316 (4%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ + EW + KKY+ + +KK+ Q+ H I HN + +G Y L+ N
Sbjct: 27 LFDAEWQNFKVHHNKKYEGSTVEAF---RKKIFLQNTHL-IARHNIKHAKGETTYKLKMN 82
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSP-ESNESVLIPDHLDWREKGFITPDWNQEDC 119
D+ ++ M L R RT S ESV +P +DWREKG +TP NQ C
Sbjct: 83 QFGDMLHHEFVSTMNGLL--RSNRTYFGSTWIEPESVSLPKSVDWREKGAVTPVKNQGHC 140
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G+C++FS A++GQ+F+ T E+ LS Q ++DCS GN GC GG + N Y++ G
Sbjct: 141 GSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYIKENHG 200
Query: 180 LMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
+ EE YPY+GKQ C++ + + + + +P +E AL LAT+GP++V+I+AS
Sbjct: 201 IDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERALAKALATIGPVSVAIDASHE 260
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG 294
+FQ Y G+Y+ C S ++H +L VGY ++ +I+KN W WG GY+ + R
Sbjct: 261 SFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDYYIIKNSWGERWGQEGYVLMARN 320
Query: 295 N-NRCGIANYAVYALI 309
+ N CG+A A Y L+
Sbjct: 321 SKNECGVATQASYPLV 336
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 165/304 (54%), Gaps = 12/304 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++KDY ++ + +++ W+ N K I HN + G H Y L N D+ + + M
Sbjct: 141 HRKDYHER-EEGWRRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNG 199
Query: 77 LTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
H + R S + L P +DWREKG++TP +Q CG+C+AFS A++GQ
Sbjct: 200 YVHKKSERKYRGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQH 259
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSI 194
F+ T ++ LS Q +VDCS GN GC GG + YVQ GG+ EE YPY K
Sbjct: 260 FRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED 319
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++K + + + +P E AL +A VGP++V+I+A +FQ Y SGIY + C
Sbjct: 320 CRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDC 379
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S+ ++H +L+VGY + WI+KN W WGD GY+Y+ K N CGIA A
Sbjct: 380 SSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAAS 439
Query: 306 YALI 309
Y L+
Sbjct: 440 YPLV 443
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 168/304 (55%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+ +
Sbjct: 43 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCR 102
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L R +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 103 MGALRIPRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 160
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 161 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 220
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 221 TDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 280
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+Y
Sbjct: 281 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCS 339
Query: 306 YALI 309
Y I
Sbjct: 340 YPEI 343
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 117/304 (38%), Positives = 168/304 (55%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+
Sbjct: 40 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISCR 99
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L SR +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 100 MGALRISRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 157
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 158 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 217
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 218 TDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 277
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+Y
Sbjct: 278 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCS 336
Query: 306 YALI 309
Y I
Sbjct: 337 YPEI 340
>gi|291224872|ref|XP_002732426.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 691
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 164/292 (56%), Gaps = 13/292 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSR---IR 83
D +++ W N K + HN + ++G YT+ N D+ + + M + R
Sbjct: 404 DGVRQMIWSQNKKNVELHNMKYRKGESSYTMEMNQFGDMTNKEFTDMMCGYKGKKQNSPR 463
Query: 84 RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIE 143
+ +P + ++ PD +DWR KG++T +Q CG+C+AFS +++GQ FK+T ++
Sbjct: 464 SSTFLAPSNYKA---PDSVDWRTKGYVTEVKDQGACGSCWAFSTTGSMEGQSFKNTGKLV 520
Query: 144 ELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIV 203
S QQ+VDCS GN+GC GG + Y++ G+ E DYPY K C + V
Sbjct: 521 SFSEQQLVDCSGSYGNMGCGGGLMDQAFAYIE-DYGIEPEADYPYTAKDDPCSYDTSKAV 579
Query: 204 VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAM 263
+ ++ + DE AL+ +ATVGPI+V+I+AS +F+LY SG+YD+ AC+ ++H +
Sbjct: 580 ATNTGYTDIATMDEKALQQAVATVGPISVAIDASHSSFRLYKSGVYDEPACSQTMLDHGV 639
Query: 264 LLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
L VGY WI+KN W WG+ GY+++ R N N+CGIA A Y L+
Sbjct: 640 LAVGYGTTDDGNDYWIVKNSWGSTWGNQGYIHMSRNNDNQCGIATNASYPLM 691
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 173/316 (54%), Gaps = 13/316 (4%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ + EW + KKY+ + +KK+ Q+ H I HN + +G Y L+ N
Sbjct: 22 LFDAEWQNFKVHHNKKYEGSTVEAF---RKKIFLQNTHL-IARHNIKHAKGETTYKLKMN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSP-ESNESVLIPDHLDWREKGFITPDWNQEDC 119
D+ ++ M L R RT S ESV +P +DWREKG +TP NQ C
Sbjct: 78 QFGDMLHHEFVSTMNGLL--RSNRTYFGSTWIEPESVSLPKSVDWREKGAVTPVKNQGHC 135
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G+C++FS A++GQ+F+ T E+ LS Q ++DCS GN GC GG + N Y++ G
Sbjct: 136 GSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYIKENHG 195
Query: 180 LMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
+ EE YPY+GKQ C++ + + + + +P +E AL LAT+GP++V+I+AS
Sbjct: 196 IDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGNERALAKALATIGPVSVAIDASHE 255
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG 294
+FQ Y G+Y+ C S ++H +L VGY ++ +I+KN W WG GY+ + R
Sbjct: 256 SFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDYYIIKNSWGERWGQEGYVLMARN 315
Query: 295 N-NRCGIANYAVYALI 309
+ N CG+A A Y L+
Sbjct: 316 SKNECGVATQASYPLV 331
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 168/304 (55%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+ +
Sbjct: 42 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCR 101
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L R +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 102 MGALRIPRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 159
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 160 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 219
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 220 TDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 279
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+Y
Sbjct: 280 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCS 338
Query: 306 YALI 309
Y I
Sbjct: 339 YPEI 342
>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
Length = 333
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 162/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY-NPKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 168/304 (55%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+ +
Sbjct: 40 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCR 99
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L R +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 100 MGALRIPRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 157
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 158 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 217
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 218 TDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 277
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+Y
Sbjct: 278 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCS 336
Query: 306 YALI 309
Y I
Sbjct: 337 YPEI 340
>gi|188501707|gb|ACD54818.1| cathepsin L precursor-like protein [Adineta vaga]
Length = 331
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 106/281 (37%), Positives = 166/281 (59%), Gaps = 13/281 (4%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHS---RIRRTLVRSPESN 93
N I+ HN E GLH YTL N D+ + K+M S I + +PE+
Sbjct: 56 NVAMINKHNLEVDLGLHSYTLAMNQFGDMTNEEFRKQMNGYQMSSEDEINSQIFSAPEN- 114
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
++P+ +DWR KG++T +Q CG+C+AFS A++GQ F TS++ LS Q +VDC
Sbjct: 115 --FVLPNDVDWRTKGYVTYVKDQGQCGSCWAFSTTGALEGQHFAKTSQLIPLSEQNLVDC 172
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S++ N GC GG+ +Y++ G+ EE YPY ++ IC+F + +I I+ + +
Sbjct: 173 SLL--NFGCNGGNQDLAYDYIKMHHGISSEESYPYWAERDICQFNKIHIAATITGHARIK 230
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----T 269
+E L+ +AT+GPIAVSI+A +FQ Y+SG+YD+ C++ ++HA+L VGY +
Sbjct: 231 KHNETDLQAAIATIGPIAVSIDAGQGSFQFYSSGVYDEPNCSTKRLDHAVLAVGYGTLNS 290
Query: 270 RNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++ +I+KN W WG GY+ + R N+CGIA A+Y L+
Sbjct: 291 KDYYIVKNSWGSSWGIRGYILMSRNKQNQCGIATSALYPLV 331
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 176/335 (52%), Gaps = 41/335 (12%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++++DY + + ++ W+ N + I HN E G H + + N D+ + +
Sbjct: 33 KAQHRRDYGE--NEDWRRAIWEKNLRSIEMHNLEYSAGKHSFQMEMNKFGDMTNEEFRQV 90
Query: 74 MTRLTHSRIRR----------TLVRSPES---------------------NESVL--IPD 100
M + R++R LV+ P+S E +L IP
Sbjct: 91 MNGFSTHRVQRRTKGRLFREPLLVQIPKSVDWRDKGYVTPVKNQLVRRLFREPLLVQIPK 150
Query: 101 HLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNL 160
+DWR+KG++TP NQ CG+C+AFS +++GQ F+ T ++ LS Q +VDCS GN
Sbjct: 151 SVDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNS 210
Query: 161 GCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHAL 220
GC GG + N YV+ GG+ EE YPY C++K +I+ + +P + E AL
Sbjct: 211 GCQGGLMDNAFEYVKENGGIDTEESYPYIAADDTCQYKPQYSGANITGYVDIPSRMEKAL 270
Query: 221 KVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WIL 275
+ +ATVGPI+V+I+A +FQ Y SG+Y + C+S+ ++H +L VGY WI+
Sbjct: 271 EKAVATVGPISVAIDAGHSSFQFYRSGVYYEPECSSEDLDHGVLAVGYGVQGKNGKYWIV 330
Query: 276 KNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
KN W WGD+GY+ + R NN CGIA A Y +
Sbjct: 331 KNSWGEEWGDSGYILMARDRNNHCGIATAASYPEV 365
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 117/304 (38%), Positives = 168/304 (55%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+
Sbjct: 40 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISCR 99
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L SR +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 100 MGALRISRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 157
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 158 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 217
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 218 MDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 277
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+Y
Sbjct: 278 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCS 336
Query: 306 YALI 309
Y I
Sbjct: 337 YPEI 340
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 167/304 (54%), Gaps = 9/304 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y ++ ++L WQ N + HN + G Y L N +DL ++
Sbjct: 32 KNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGMNQFADLKNEEFVSL 91
Query: 74 MT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M R S+ R P SN +P +DWR KG++TP NQ CG+C+AFS ++
Sbjct: 92 MNGFRGNSSKATRGSTFLPPSN-VFDMPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSL 150
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ FK T ++ LS Q +VDCS GN+GC GG + Y+ GG+ E YPY
Sbjct: 151 EGQHFKKTGKLVSLSEQNLVDCSGKEGNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAM 210
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C F + NI + ++ + E AL++ +A+VGPI+V+I+AS +FQLY SG+Y++
Sbjct: 211 DGQCHFNKANIGATDTGYTDVTTGSESALQMAVASVGPISVAIDASHQSFQLYKSGVYNE 270
Query: 252 EACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
AC+S ++H +L VGY +S + + W WG NGY+++ R +N+CGIA A
Sbjct: 271 PACSSTLLDHGVLAVGYGTSSDGTDYFFFFHSWGAAWGMNGYLWMSRNKDNQCGIATKAS 330
Query: 306 YALI 309
Y L+
Sbjct: 331 YPLV 334
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/305 (37%), Positives = 170/305 (55%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y++K + ++L W+ N K + HN E +H Y+L NH+ D+ + +
Sbjct: 41 KKFHGKQYKEKNEEEARRLIWEKNLKLVMLHNLEYSLEMHSYSLGMNHMGDMTSEEVLGQ 100
Query: 74 M--TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M R+ R R + +S N + +PD +DWREKG +T Q CG+C+AFS A+
Sbjct: 101 MRPLRVPSQRHRNSTYKS---NPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGAL 157
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+ Q+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 158 EAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASYPYK 217
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
C + + S + LP DE ALK +A GP++V I+AS +F LY SG+Y
Sbjct: 218 AVAEKCHYDSKSRAATCSRYMELPSGDEEALKEAVANKGPVSVGIDASHPSFFLYKSGVY 277
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYA 304
D+ +CT + VNH +L+VGY ++ W++KN W H+GD GY+ + R N N+CGIA+Y
Sbjct: 278 DEPSCTEN-VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIASYG 336
Query: 305 VYALI 309
Y I
Sbjct: 337 SYPEI 341
>gi|410990008|ref|XP_004001242.1| PREDICTED: cathepsin L1 isoform 1 [Felis catus]
Length = 333
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 110/291 (37%), Positives = 167/291 (57%), Gaps = 16/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRTLVR 88
++ W+ N K I HN+E QG H +T+ N D+ + + M L R + + +
Sbjct: 48 RRAVWERNMKMIEQHNREHSQGKHTFTMAMNAFGDMTNEEFRQVMNGLKIQKRKKWKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+P E IP +DWREKG++TP +Q C C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 APFFVE---IPSSVDWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN G +GG + + YV+ GGL EE YPY + CK++ N V +++
Sbjct: 165 NLVDCSQTEGNEGYSGGLIDDAFQYVKDNGGLDSEESYPYHAQGDSCKYRPENSVANVTD 224
Query: 209 -WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
W + P E+ L +TLA VGPI+ +I+AS TF+ Y GIY D +C+S+ V+H +L+VG
Sbjct: 225 YWDI--PSKENELMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVG 282
Query: 268 Y--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y + WI+KN W WG +GY+ + K +N CGIA+ A + +
Sbjct: 283 YGADGTETENKKYWIIKNSWGTDWGMDGYIKMAKDRDNHCGIASLASFPTV 333
>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
Length = 333
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 160/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE QG H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKY-NPEYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|410303012|gb|JAA30106.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|4503155|ref|NP_001903.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|22202619|ref|NP_666023.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081592|ref|NP_001244900.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081594|ref|NP_001244901.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|332832229|ref|XP_003312197.1| PREDICTED: cathepsin L1 isoform 2 [Pan troglodytes]
gi|332832233|ref|XP_001137800.2| PREDICTED: cathepsin L1 isoform 1 [Pan troglodytes]
gi|397470218|ref|XP_003806728.1| PREDICTED: cathepsin L1 isoform 1 [Pan paniscus]
gi|397470220|ref|XP_003806729.1| PREDICTED: cathepsin L1 isoform 2 [Pan paniscus]
gi|397470222|ref|XP_003806730.1| PREDICTED: cathepsin L1 isoform 3 [Pan paniscus]
gi|410042824|ref|XP_003951515.1| PREDICTED: cathepsin L1 [Pan troglodytes]
gi|115741|sp|P07711.2|CATL1_HUMAN RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|29715|emb|CAA30981.1| pro-(cathepsin L) [Homo sapiens]
gi|190418|gb|AAA66974.1| preprocathepsin L precursor [Homo sapiens]
gi|31873292|emb|CAD97637.1| hypothetical protein [Homo sapiens]
gi|48146223|emb|CAG33334.1| CTSL [Homo sapiens]
gi|119583135|gb|EAW62731.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583136|gb|EAW62732.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583137|gb|EAW62733.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583138|gb|EAW62734.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583140|gb|EAW62736.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|208965934|dbj|BAG72981.1| cathepsin L1 [synthetic construct]
gi|410303006|gb|JAA30103.1| cathepsin L1 [Pan troglodytes]
gi|410303008|gb|JAA30104.1| cathepsin L1 [Pan troglodytes]
gi|410303010|gb|JAA30105.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|60827856|gb|AAX36816.1| cathepsin L [synthetic construct]
Length = 334
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
Length = 333
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 160/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE QG H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKY-NPEYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|47169476|tpe|CAE48375.1| TPA: cathepsin Q-like 2 [Rattus norvegicus]
Length = 342
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/313 (38%), Positives = 174/313 (55%), Gaps = 24/313 (7%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K Y + + K++ W+ N KKI HN+E G + Y + N+ +DL + +T
Sbjct: 35 KYEKLYSPE-EELLKRVVWEENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMIT 93
Query: 76 RLT-------HSRIRRTLVRSPESNE---SVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
+T S +R L SP N +P +DWR++G++T Q C +C+AF
Sbjct: 94 GITLPINNTMKSLWKRAL-GSPFPNSWYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAF 152
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
+A AI+GQ+FK T ++ LS+Q +VDCS GN GC GG+ N YV GGL E
Sbjct: 153 PVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEAT 212
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPYKGK+ +CK+ N I+ + L P+DE L LAT GP+A I+AS +F +
Sbjct: 213 YPYKGKEGLCKYNPKNAYAKITRFVAL-PEDEDVLMDALATKGPVAAGIHASHGSFH-FV 270
Query: 246 SGIYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNN 296
SGIY + C ++ VNHA+L+VGY N W++KN W WG GYM + K NN
Sbjct: 271 SGIYHEPKC-NNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNN 329
Query: 297 RCGIANYAVYALI 309
CGIA +A Y ++
Sbjct: 330 HCGIATFAQYPIV 342
>gi|346574377|gb|AEO36960.1| silicatein-alpha 3 [Baikalospongia fungiformis]
Length = 324
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 168/298 (56%), Gaps = 9/298 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+++ Y + + ++ W +N K I HN A L GYTL N DL + +
Sbjct: 31 HQRSYESQLQEMERHSIWIANKKYIEHHNANAD--LFGYTLAMNGFGDLTSAEFTERF-- 86
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
LTH +R+ +++ ES + V D LDWR +G +T +Q CG+ YAF+ A A++G
Sbjct: 87 LTHKHSQRSGLQTFESPKGVTYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGATA 146
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ ++ LS Q ++DCS+ GN GC+GG + YV GG+ E YPYKGKQS C+
Sbjct: 147 LAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKGKQSSCQ 206
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ N+ + + E L +A+VGPIAV+++AS + F Y SG++D C++
Sbjct: 207 YNSKNVGAISTGVVKIASGSETDLLSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCST 266
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NHAML+ GY ++ W++KN W WG++GY+ + R N+CGIA+ A+Y ++
Sbjct: 267 SKLNHAMLVTGYGSTNGKDYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 324
>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 160/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE QG H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQLMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKY-NPEYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|28194645|gb|AAO33584.1|AF479266_1 cathepsin P [Mesocricetus auratus]
Length = 286
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/288 (37%), Positives = 158/288 (54%), Gaps = 12/288 (4%)
Query: 31 KLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSP 90
+L W+ N K + HN E QG HG+T+ N D+ Y MT + T V+S
Sbjct: 1 RLVWEENMKMVKMHNGEDAQGRHGFTMEMNAFGDMTAEEYRSMMTDIPVPAA--TKVKSE 58
Query: 91 ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
+ +P DW +KGF+TP Q CG+C+AF+ AI+GQ+F T + LS+Q +
Sbjct: 59 PNTLLDDLPKSEDWTKKGFVTPVRKQGPCGSCWAFAATGAIEGQMFWKTGNLTTLSVQNL 118
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWS 210
VDCS GN GC G+ YV GGL EE YPY+ K+ C++ N I+ +
Sbjct: 119 VDCSKPQGNNGCMQGNAYRAYKYVLHNGGLEAEETYPYEAKEGPCRYNPENSRAYITEFV 178
Query: 211 VLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-- 268
LP +++ L V +AT+GP++ +++AS +F+ Y GIY + C+S NHA+L+VGY
Sbjct: 179 TLPANEDY-LMVAVATIGPVSAAVDASHDSFRFYNGGIYHEPNCSSYVTNHAVLVVGYGF 237
Query: 269 ------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
N W++KN W WG NGYM + K NN C IA++A + I
Sbjct: 238 EGNETDGNNYWLIKNSWGEGWGINGYMKIAKDRNNHCAIASFASFPNI 285
>gi|410256886|gb|JAA16410.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|339765072|gb|AEK01110.1| cathepsin L [Cristaria plicata]
gi|397880684|gb|AFO67888.1| cathepsin L [Cristaria plicata]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 107/282 (37%), Positives = 159/282 (56%), Gaps = 7/282 (2%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR-LTHSRIRRTLVRSPES 92
W+ N I+ HN +A QG+H Y L N DL Y + T + + I R+ +
Sbjct: 53 WKENVLAINRHNSKADQGVHTYWLSMNEYGDLTNEEYFRLRTGFIMNGNIERSGSIFKYT 112
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
N S P +DWR KG++T +Q CG+CYAFS A++GQ F+ T ++ LS Q +VD
Sbjct: 113 NLSEY-PRQVDWRRKGYVTRVKDQGGCGSCYAFSATGALEGQHFRKTGKLVSLSEQNIVD 171
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + + Y++ G+ KEE YPY+ + C+F+R + + L
Sbjct: 172 CSFKEGNKGCKGGLMDKSFTYIKNNNGIDKEEAYPYEARDGPCRFRRSEVGATDRGYVDL 231
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRN 271
P DE AL+ +AT+GPI+V+I+ F+ Y G++D+ C+ +NH +L+VGY TRN
Sbjct: 232 PENDETALRHAVATIGPISVAIDGHHFNFRFYDHGVFDNPNCSKTKINHGVLVVGYGTRN 291
Query: 272 S---WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
W++KN W WG GY+ + R N N+C IA A Y ++
Sbjct: 292 GLDYWMVKNSWGRGWGAKGYILMSRNNDNQCCIACAASYPIV 333
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 171/303 (56%), Gaps = 12/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y + +K+ W++N K I+ HN++ +G HG++L N DL + +
Sbjct: 33 KTKHGKTYNTNE-EGQKRAVWENNMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFREL 91
Query: 74 MTRLTHSRIRRTLV-RSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT + T + R P + IP LDWRE G++TP NQ CG+C+AFS +++
Sbjct: 92 MTGFQSMGPKETTIFREPFLGD---IPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLE 148
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQIFK T ++ LS Q +VDCS GNLGC GG + YV+ GL E Y Y+ +
Sbjct: 149 GQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQD 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
+C++ +++ + V P E L +A+VGP++V I++ +F+ Y+ G+Y +
Sbjct: 209 GLCRYNPKYSAANVTGF-VKVPLSEDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEP 267
Query: 253 ACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
C+S ++HA+L+VGY S W++KN W WG +GY+ + K NN CGIA YA+Y
Sbjct: 268 DCSSTEMDHAVLVVGYGEESDGGKYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIY 327
Query: 307 ALI 309
+
Sbjct: 328 PTV 330
>gi|148745204|gb|AAI42984.1| Cathepsin L1 [Homo sapiens]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 166/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 43 KKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISL 102
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ R + + N+ +PD +DWREKG +T Q CG+C+AFS A++
Sbjct: 103 MSSLRVPSQWPRNVTYKSDPNQK--LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALE 160
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 161 AQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAM 220
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S + LP E ALK +A GP++V I+AS +F LY +G+Y D
Sbjct: 221 DGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYD 280
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W H+GD GY+ + R + N CGIA+Y Y
Sbjct: 281 PSCTQN-VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYPSY 339
Query: 307 ALI 309
I
Sbjct: 340 PEI 342
>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 160/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE QG H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVADNGGLDSEEAYPYEATEESCKY-NPEYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 112/315 (35%), Positives = 172/315 (54%), Gaps = 19/315 (6%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KEW + ++ K Y ++ ++L WQ N + HN + G Y L N +
Sbjct: 29 KEW-------KNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFA 81
Query: 64 DLHPRHYIKEMTRL---THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
DL + ++ MT S+ + P +N L P +DWR KG++TP +Q CG
Sbjct: 82 DLQNKEFVAMMTGFRVNGTSKAAKGSTFLPPNNVGKL-PKTVDWRTKGYVTPVKDQGQCG 140
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ FK T ++ LS Q +VDCS N GC GG + Y+ AGG+
Sbjct: 141 SCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCS--DKNYGCNGGLMDRAFQYIIDAGGI 198
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
EE YPY C FK N+ ++ ++ + E AL+ +A +GPI+V+I+AS +
Sbjct: 199 DTEESYPYIAMDGNCHFKTANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFS 258
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG- 294
FQLY SG+Y++ C+S ++H +L VGY + WI+KN W+ WG NGY+++ R
Sbjct: 259 FQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRNK 318
Query: 295 NNRCGIANYAVYALI 309
+N+CGIA A Y L+
Sbjct: 319 DNQCGIATQASYPLV 333
>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
Length = 335
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/308 (37%), Positives = 166/308 (53%), Gaps = 17/308 (5%)
Query: 16 KYKKDYRK---KATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
++K YR+ + ++ W+ N K I HN+E Q HG+T+ N D+ + +
Sbjct: 31 QWKATYRRLYGADEEGWRRAVWEKNRKMIELHNREYSQRKHGFTMAMNAFGDMTNEEFRQ 90
Query: 73 EMTRLTHSRIRRT--LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M + R L R P E IP +DWR+KG++TP NQ CG+C+AFS A
Sbjct: 91 VMNGFLKQKQHRNGRLFREPLFAE---IPSSVDWRQKGYVTPVKNQGQCGSCWAFSANGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+F+ T ++ LS Q +VDCS GN GC GG + N YV+ GL EE YPY G
Sbjct: 148 LEGQMFRKTGKLVSLSEQNLVDCSHSQGNQGCNGGLMDNAFQYVKDNKGLDSEESYPYLG 207
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
++S RP + V PQ E L +ATVGPI+V+I+A +FQ Y+ GIY
Sbjct: 208 RESNTCNYRPEYSAANDTGFVDIPQHERGLMKAVATVGPISVAIDAGHSSFQFYSEGIYY 267
Query: 251 DEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+ C+S ++H +L+VGY + WI+KN W WG +GY+ + R +N CGIA
Sbjct: 268 EPNCSSKDLDHGVLVVGYGSEGAQSDSNKFWIVKNSWGTGWGMSGYVKMARDQSNHCGIA 327
Query: 302 NYAVYALI 309
A Y +
Sbjct: 328 TAASYPTV 335
>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
Length = 333
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/306 (37%), Positives = 165/306 (53%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSK---KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
K+K YR+ ++ ++ W+ N K I HN E +G HGYT+ N D+ + +
Sbjct: 31 KWKSTYRRLYGTNEEEWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMNAFGDMTNEEFRQ 90
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ H + R+ V + + +P +DWREKG +TP NQ CG+C+AFS A++
Sbjct: 91 LVNGYKHQKHRKGKVF--QEPLMLQLPKSVDWREKGCVTPVKNQGQCGSCWAFSACGALE 148
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ T + LS Q +VDCS GN GC GG + YV GL EE YPY+ K
Sbjct: 149 GQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKGLDSEESYPYEAKD 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
CK+K P + V PQ E AL +ATVGPIA++I+AS +FQ Y+SGIY +
Sbjct: 209 GTCKYK-PEFAAANDTGYVDIPQLEKALMKAVATVGPIAIAIDASHPSFQFYSSGIYYEP 267
Query: 253 ACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
C+S ++H +L+VGY + WI+KN W WG G+ ++ K NN CG+A
Sbjct: 268 NCSSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFHIAKDKNNHCGVATA 327
Query: 304 AVYALI 309
A Y +
Sbjct: 328 ASYPTV 333
>gi|410256882|gb|JAA16408.1| cathepsin L1 [Pan troglodytes]
gi|410256884|gb|JAA16409.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGGEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|15214962|gb|AAH12612.1| Cathepsin L1 [Homo sapiens]
gi|61363426|gb|AAX42388.1| cathepsin L [synthetic construct]
gi|123988681|gb|ABM83856.1| cathepsin L [synthetic construct]
gi|123999196|gb|ABM87178.1| cathepsin L [synthetic construct]
Length = 333
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNVKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|164519063|ref|NP_001002813.2| cathepsin Q-like 2 precursor [Rattus norvegicus]
gi|67678196|gb|AAH97257.1| Ctsql2 protein [Rattus norvegicus]
gi|149039735|gb|EDL93851.1| rCG24202 [Rattus norvegicus]
Length = 343
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 171/313 (54%), Gaps = 23/313 (7%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K Y + + K++ W+ N KKI HN+E G + Y + N+ +DL + +T
Sbjct: 35 KYEKLYSPE-EELLKRVVWEENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMIT 93
Query: 76 RLT-------HSRIRRTLVRSPESNE---SVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
+T S +R L SP N +P +DWR++G++T Q C +C+AF
Sbjct: 94 GITLPINNTMKSLWKRAL-GSPFPNSWYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAF 152
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
+A AI+GQ+FK T ++ LS+Q +VDCS GN GC GG+ N YV GGL E
Sbjct: 153 PVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEAT 212
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPYKGK+ +CK+ N I+ + L P+DE L LAT GP+A I+ + + Y
Sbjct: 213 YPYKGKEGLCKYNPKNAYAKITRFVAL-PEDEDVLMDALATKGPVAAGIHVVYSSLRFYK 271
Query: 246 SGIYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNN 296
GIY + C ++ VNHA+L+VGY N W++KN W WG GYM + K NN
Sbjct: 272 KGIYHEPKC-NNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNN 330
Query: 297 RCGIANYAVYALI 309
CGIA +A Y ++
Sbjct: 331 HCGIATFAQYPIV 343
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 176/316 (55%), Gaps = 10/316 (3%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ NK+ ++ ++ +KK Y + + ++L W+ N I HN A +G H Y L +N
Sbjct: 19 ILNKDLDGDWVLYKQTHKKTYSQDE-EQMRRLIWEDNVNYIQKHNLAADRGEHTYWLGQN 77
Query: 61 HLSDLHPRHYIKEMT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQED 118
+D+ + M +++ +R + L SP SN L PD +DWR++G++T NQ
Sbjct: 78 EYADMTIFEFRAIMNGYKMSANRTKGDLYMSP-SNIGDL-PDSVDWRKEGYVTDIKNQGH 135
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
CG+C++FS +++GQ FK++ ++ LS Q +VDCS GN GC GG + N Y++
Sbjct: 136 CGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSKKEGNHGCQGGLMDNAFRYIESNK 195
Query: 179 GLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
G+ EE YPY K C FK N+ + + +P E L+ +ATVGPI+V I+A
Sbjct: 196 GIDTEESYPYTAKNGFCHFKAENVGATDTGYVDIPHMQEDKLQEAVATVGPISVGIDAGH 255
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG 294
+FQLY G+Y + AC+S ++H +L VGY S W++KN W WG GY+ + R
Sbjct: 256 KSFQLYREGVYSEPACSSSKLDHGVLAVGYGTESGDDYWLVKNSWGTSWGMQGYVMMARN 315
Query: 295 -NNRCGIANYAVYALI 309
+N CGIA A Y +
Sbjct: 316 KHNMCGIATQASYPKV 331
>gi|253796148|gb|ACT35690.1| cathepsin L-like cysteine proteinase [Ditylenchus destructor]
Length = 376
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 174/301 (57%), Gaps = 13/301 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK--EMTR 76
K + + T++++ L + S+ + I HN++ +QG + L N ++DL Y K R
Sbjct: 79 KSFYDEDTENERMLAFLSSQQHIKKHNEQYEQGKVSFKLDANSIADLPFSEYQKLNGYRR 138
Query: 77 LTHSRIRRTLVR--SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
+ +RR R +P + V +P+ +DWR+ G++T NQ CG+C+AFS +++GQ
Sbjct: 139 IYGDPLRRNSSRFLAPHN---VEVPESMDWRDHGYVTEVKNQGMCGSCWAFSATGSLEGQ 195
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
+S + LS Q +VDCS GN GC GG + Y++ G+ E YPYK +Q
Sbjct: 196 HKRSKGTLVSLSEQNLVDCSAAYGNNGCNGGLMDFAFQYIKENHGIDTETSYPYKARQKK 255
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C F+R ++ D + + LP DE LK+ +AT GPI+V+I+A +FQLY +G+Y ++ C
Sbjct: 256 CHFQRSSVGADDTGFMDLPEGDEDQLKIAVATQGPISVAIDAGHRSFQLYKTGVYYEKEC 315
Query: 255 TSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
+S+ ++H +L+VGY + WI+KN W WG+ GY+ + R NN CGIA A Y L
Sbjct: 316 SSEQLDHGVLVVGYGTDPDHGDYWIVKNSWGTTWGEQGYVRMARNKNNHCGIATKASYPL 375
Query: 309 I 309
+
Sbjct: 376 V 376
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/305 (35%), Positives = 173/305 (56%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K+Y +K ++ + + W N K I HN++ G YT N DL Y +
Sbjct: 26 KRTYGKEYTQKE-EALRHMIWNVNLKMIQMHNEKYMSGKSTYTQNMNQFGDLTNEEYREL 84
Query: 74 MTRLTHSRIRRTLVRSPES---NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M + + +T++ P + + P +DWR +G++T +Q CG+C+AFS +
Sbjct: 85 M--CGYKKSNKTVISKPSTFLLPSNYRAPASIDWRTQGYVTDVKDQGACGSCWAFSSTGS 142
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ FK T ++ LS QQ+VDCS GN+GC GG + +Y++ G E+ YPY G
Sbjct: 143 LEGQTFKKTGKLVPLSEQQLVDCSGDYGNMGCGGGWMDQAFSYIKDKG-EESEDGYPYTG 201
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + +V + ++ +P DE+AL+ +ATVGPI+V+I+A+ +FQ Y SG+YD
Sbjct: 202 TDDTCVYDASKVVATDTGYTDIPEMDENALQQAVATVGPISVAIDATHSSFQFYESGVYD 261
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ C+ ++HA+L VGY + WI+KN WS WG GY+ + R +N+CGIA+ A
Sbjct: 262 EPECSQTNLDHAVLAVGYGTSEEGLDYWIVKNSWSTGWGMQGYIEMSRNKDNQCGIASKA 321
Query: 305 VYALI 309
Y ++
Sbjct: 322 SYPVV 326
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 108/299 (36%), Positives = 161/299 (53%), Gaps = 6/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y + + + N I HN + +GL Y L N DL + K
Sbjct: 34 HKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNG 93
Query: 77 LTHSRIRRTLVRSPESN-ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
R R P +N +P +DWR+KG +TP +Q CG+C+AFS +++GQ
Sbjct: 94 YRGQRTSRGSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQH 153
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F E+ LS Q +VDCS GN GC GG + N Y++ G+ EE YPY+ C
Sbjct: 154 FLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKC 213
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+FK+ ++ + + + E LK +ATVGPI+V+I+A +FQLY+ G+YD+ C+
Sbjct: 214 RFKKEDVGATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECS 273
Query: 256 SDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
S+ ++H +L VGY + W++KN W WGDNGY+ + R NN+CGIA+ A Y L+
Sbjct: 274 SEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPLV 332
>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
Length = 334
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 169/301 (56%), Gaps = 6/301 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + + Y + + ++ W+ N + I+ HN EA GLH Y L NH+ DLH +
Sbjct: 35 KKTHSRTYESEVEEGSRREVWEKNLRLINMHNLEASMGLHTYELGMNHMGDLHAEEILPT 94
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+T LT R+ ++E +P +DWR+KG +T Q CG+C+AFS A A++G
Sbjct: 95 LTTLTPPSERQRGQSIFVTSEGADLPQTVDWRDKGLVTSVKKQGSCGSCWAFSAAGALEG 154
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+ K+T + +LS Q +VDCS GN GC GG + YV G+ E YPY+G+
Sbjct: 155 QLAKTTGRLVDLSPQNLVDCSGKYGNHGCNGGYMHRAFQYVIDNQGIDSEASYPYRGQVQ 214
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C + + S + L DE L+ +A++GPI+V+I+A F Y SG+YDD +
Sbjct: 215 QCHYNPAFRAANCSQYRFLTQGDEGNLQAAVASIGPISVAIDAKQPKFYFYKSGVYDDPS 274
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C S +NHA+L VGY ++ W++KN W +GD GY+ + R N++CGIA +A + +
Sbjct: 275 C-SQTINHAVLAVGYGTLNGQDYWLVKNSWGVKFGDKGYIRMVRNKNDQCGIAQFACFPI 333
Query: 309 I 309
+
Sbjct: 334 M 334
>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 161/303 (53%), Gaps = 17/303 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
KDY + + ++ W++N K + HN A + G+TL N +DL +
Sbjct: 32 KDYSSEKEELYRQTIWEANKKIVLEHNANADK--WGWTLEMNAFADLESSEFAAMYNGYR 89
Query: 79 HSRIRRTLVR--SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
S + R P N +PD +DWR KG +TP NQ+ CG+C+AFS +++GQ F
Sbjct: 90 RSARKSNATRYHVPTGN---ALPDTVDWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTF 146
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ LS QQ+VDCS GN GC GG + N Y++ GG+ E YPY+ K C+
Sbjct: 147 LKKGTLPSLSEQQLVDCSDKYGNHGCQGGLMDNAFKYIEANGGIDSEASYPYEAKNGKCR 206
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
F++ + + + +P D L+ +A VGPI+V+++AS +FQLYA+G+YD C+S
Sbjct: 207 FQQSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLYAAGVYDPLLCSS 266
Query: 257 DYVNHAMLLVGY----------TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
++H +L VGY + W++KN W WG GY + R +N+CGIA A Y
Sbjct: 267 TRLDHGVLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASY 326
Query: 307 ALI 309
+
Sbjct: 327 PTV 329
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 157/290 (54%), Gaps = 10/290 (3%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N KI HN E G H Y L NH D+ + + M R+ +
Sbjct: 47 RRMIWEKNLNKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRKTERKAIGSL 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
++ P +DWREKG++TP +Q CG+C+AFS A+ZGQ F+ ++ LS Q
Sbjct: 107 FMEPNFMVAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQN 166
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GL E+ YPY G C + V+ +
Sbjct: 167 LVDCSRPEGNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTG 226
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ +P EHAL +A+VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L VGY
Sbjct: 227 FVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGY 286
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 287 GFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|10946820|ref|NP_067420.1| cathepsin 6 precursor [Mus musculus]
gi|9931384|gb|AAG02172.1|AF223401_1 cathepsin-6 [Mus musculus]
gi|12838129|dbj|BAB24093.1| unnamed protein product [Mus musculus]
gi|16445021|gb|AAK00510.1| cathepsin 6 precursor [Mus musculus]
gi|68534635|gb|AAH99455.1| Cathepsin 6 [Mus musculus]
gi|148709368|gb|EDL41314.1| cathepsin 6 [Mus musculus]
Length = 334
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 110/306 (35%), Positives = 169/306 (55%), Gaps = 14/306 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K+Y+K Y + ++ + W+ N + I HN E G + +TL+ N DL P K
Sbjct: 33 KKQYEKSYTMEEEGLRRAI-WEENMRMIKLHNWENSLGKNNFTLKMNEFGDLTPEELRKM 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M S +R ++R + ++P +DWR+KG++T Q+ C +C+AF++ AI+
Sbjct: 92 MNNFPIWSHKKRKIIRKRAVGD--VLPKFVDWRKKGYVTRVRRQKFCNSCWAFAVNGAIE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+FK T ++ LS+Q +VDC+ GN GC G YV GGL E YPY+GK+
Sbjct: 150 GQMFKKTGKLTPLSVQNLVDCTKTQGNDGCQWGDPYIAYEYVLNNGGLEAEATYPYEGKE 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ N +I+ + L P+ E L +AT+GPI+ +++AS + F Y GIY
Sbjct: 210 GPCRYNPKNSKAEITGFVSL-PESEDILMEAVATIGPISAAVDASFNRFSFYDGGIYHQP 268
Query: 253 ACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
C+++ VNHA+L+VGY W++KN W WG GYM + R NN CGIA Y
Sbjct: 269 NCSNNTVNHAVLVVGYGTEGNETDGNKYWLIKNSWGRRWGIGGYMKIIRDQNNHCGIATY 328
Query: 304 AVYALI 309
A Y ++
Sbjct: 329 AHYPIV 334
>gi|213623960|gb|AAI70453.1| Hypothetical protein LOC100127265 [Xenopus laevis]
Length = 331
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 109/302 (36%), Positives = 166/302 (54%), Gaps = 8/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K Y + + +++L W+ N K I +HN E QG H Y + N L D+ ++
Sbjct: 32 KRTYHKQYNGQMDELQRRLIWEKNFKMITSHNFEYNQGPHTYEMAMNQLGDMTSEEVVRT 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT L H R + T + +PD +D+R+KG++TP NQ CG+C+AFS A++
Sbjct: 92 MTGLKIHKRNKPTNLTFEHDKAPEKVPDSIDYRKKGYVTPIRNQGSCGSCWAFSSVGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ K ++ LS Q +VDC + N GC GG + N YV+ G+ E+ YPY G+
Sbjct: 152 GQLKKKKGKLVVLSPQNLVDC--VKKNDGCGGGYMTNAFEYVRDNKGIDSEKAYPYVGED 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + + +E ALK +A VGP++V I+A +FQ Y+ G+Y D+
Sbjct: 210 QECMYNVSGRAAACKGYKEVQEGNEKALKKAVALVGPVSVGIDAGLSSFQFYSKGVYYDK 269
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYA 307
C+++ +NHA+L VGY WI+KN W WGD GY+ + K N CGIAN A Y
Sbjct: 270 DCSAEDINHAVLAVGYGTQKKAKYWIVKNSWGEEWGDKGYILMAKDKGNACGIANLASYP 329
Query: 308 LI 309
++
Sbjct: 330 VM 331
>gi|354504282|ref|XP_003514206.1| PREDICTED: cathepsin J-like [Cricetulus griseus]
gi|344250851|gb|EGW06955.1| Cathepsin J [Cricetulus griseus]
Length = 334
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 169/306 (55%), Gaps = 15/306 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KKY+K Y ++ K+ + W+ N + I THN E QG HG+T+ N D+ Y
Sbjct: 33 KKKYEKSYSQEEEVWKRAV-WEKNMQMIRTHNGEDGQGKHGFTVEMNAFGDMTGEEYRTF 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+T + + ++ V++P N+ +P DW +KGF+TP Q CG+C+AF+ AI+
Sbjct: 92 LTDIPVPAAVKVKSVQNPLLND---LPKSEDWTKKGFVTPVRKQGQCGSCWAFAAIGAIE 148
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+F T + LS+Q ++DCS GN GC G + YV GGL EE YPY+ K
Sbjct: 149 GQMFWRTGNLTTLSVQNLLDCSKPQGNNGCVRGDAYSAYQYVLHNGGLEAEETYPYEAKD 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ N I+ LP +++ L V ++ +GP+A +I+AS +F+ Y GIY +
Sbjct: 209 GPCRYNPNNSRAYITEVVSLPAHEDYLL-VAVSMIGPVAAAIDASHDSFRFYRGGIYHEP 267
Query: 253 ACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
C+S NHA+L+VGY N W++KN W WG GYM + K NN C IA+Y
Sbjct: 268 NCSSYLTNHAVLVVGYGFEGNETDGNNYWLIKNSWGEEWGMKGYMKIAKDQNNHCAIASY 327
Query: 304 AVYALI 309
A + I
Sbjct: 328 ASFPNI 333
>gi|291383486|ref|XP_002708337.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 105/290 (36%), Positives = 163/290 (56%), Gaps = 12/290 (4%)
Query: 29 KKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR 88
+++ W+ N + I HN E QG G+++ N D+ + + M H ++ V
Sbjct: 47 RRRAVWEKNMRMIELHNGEYSQGKRGFSMAMNAYGDMTSEEFRQVMNGFHHQPDKKEKVF 106
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+ V P +DWR+KG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 107 GKAVFQEV--PSSVDWRDKGYVTPVKNQGRCGSCWAFSATGALEGQMFRKTGRLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
++DCS +GN GC GG + YV+ GGL E+ YPY+ + +C++ V + +
Sbjct: 165 NLIDCSWPAGNYGCRGGLPDHAFQYVKDNGGLDSEDSYPYEARDGLCRYSPQESVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ +P Q+E AL +ATVGPIAV+I+AS +F Y GIY + C+ + ++HA+L+VGY
Sbjct: 225 FVQIPEQEE-ALMEAVATVGPIAVAIDASHSSFLFYKEGIYYEPNCSRENLDHAVLVVGY 283
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ W++KN W WG +GYM + K NN CGIA A Y +
Sbjct: 284 GFEGAESDNQKYWLVKNSWGKGWGMDGYMKMAKDRNNHCGIATAASYPTV 333
>gi|443694581|gb|ELT95681.1| hypothetical protein CAPTEDRAFT_173171 [Capitella teleta]
Length = 342
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 172/304 (56%), Gaps = 19/304 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K Y K ++ L W+ N + I HN + G H +++ N LSDL P Y ++
Sbjct: 44 KETYGKSYDMKEDVVRRSL-WEGNLRHISMHNVKHDLGKHSFSMGINELSDLTPSEY-RQ 101
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
L + RT + + E V P+H+DWR+KG++TP NQ CG+C+AFS +++G
Sbjct: 102 RLGLRPALGERTGKKFVYNGEKV--PEHVDWRDKGYVTPVKNQGACGSCWAFSSTGSLEG 159
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q F+ T ++ LS Q +VDC+ GN GC GG + N NYV+ G+ E YPY+G
Sbjct: 160 QHFRLTGQLVSLSEQNLVDCTKKYGNAGCNGGWMDNAFNYVKANNGIDTEAFYPYEGHDD 219
Query: 194 ICKF------KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
C + K N + + DE ALK +ATVGP++V I+A+ +FQLY SG
Sbjct: 220 WCGYDGSPGHKGANCTGHVD----VQQGDELALKQAVATVGPVSVGIDATHRSFQLYKSG 275
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
IYD+ AC++ +HA+L+VGY W++KN W WG +GY+ + R N+C IA+
Sbjct: 276 IYDEVACSNSSTDHAVLVVGYGSQGGHDYWLVKNSWGTSWGMDGYIMMSRNKGNQCAIAS 335
Query: 303 YAVY 306
YA Y
Sbjct: 336 YASY 339
>gi|383410403|gb|AFH28415.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 159/290 (54%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE QG H +T+ N D+ + + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YV GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKY-NPEYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W+ KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNSKYWLGKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|431917800|gb|ELK17041.1| Cathepsin L1 [Pteropus alecto]
Length = 334
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 111/291 (38%), Positives = 163/291 (56%), Gaps = 15/291 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN+E Q HG+T+ N D+ + + M + + ++ V R
Sbjct: 48 RRAVWEKNMKMIELHNREYSQRKHGFTMAMNAFGDMTNEEFRQIMNGFQNQKHKKGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + IP +DWR+KG++TP NQ CG+C+AFS +++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFAQ---IPPSVDWRQKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDIS 207
+VDCS GN GC GG + N Y++ GGL EE YPY K+S C +K P
Sbjct: 165 NLVDCSRSQGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYLAKESDTCNYK-PEYSAAND 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E +L +ATVGPI+V+I+A +FQ Y GIY + C+S ++H +L++G
Sbjct: 224 TGFVDIPQREKSLMKAVATVGPISVAIDAGHSSFQFYNKGIYYEPDCSSKDLDHGVLVIG 283
Query: 268 Y--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y + WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 284 YGSEGGDPKSNKFWIVKNSWGPEWGMNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 167/306 (54%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ + Y + + K+ W N + + HN A QG Y L +DL + + +
Sbjct: 32 KFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEEFKQTVF 91
Query: 76 RLTHSRIRRTLVRSPESNESVL-------IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ + P S L +P +DWR+ GF+TP NQ CG+C++FS
Sbjct: 92 GVCLGSFNAS---KPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCGSCWSFSST 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F+ T + LS Q++VDCS GN GC GG + N Y+ GG+ E+ YPY
Sbjct: 149 GALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTEDSYPY 208
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+G+ C+ I + + +P +EHALK +AT GP++V+I+AS +FQLY SG+
Sbjct: 209 EGQVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSFQLYHSGV 268
Query: 249 YDDEACTSDYVNHAMLLVG----YTRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
Y++ C+ ++HA+L+VG Y ++ W++KN W WGD GY+ + R N+CGIA+
Sbjct: 269 YNNPYCSGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQGYIKMSRNRYNQCGIASA 328
Query: 304 AVYALI 309
A + L+
Sbjct: 329 ASFPLV 334
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 162/293 (55%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR---TL 86
++L W+ N KKI HN E G H Y L NH D+ + + M H R+ +L
Sbjct: 48 RRLVWEKNLKKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYKHKAERKFKGSL 107
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWREKG++TP +Q +CG+C+AFS A++GQ F T ++ LS
Sbjct: 108 FLEPNFLEA---PRSVDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLS 164
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +V+CS GN GC GG + YV+ GL E+ YPY G C + +
Sbjct: 165 GQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAAN 224
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P +E AL +A+VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L
Sbjct: 225 DTGFVDIPSGNERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLA 284
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS +WGD GY+Y+ K N CGIA A Y L+
Sbjct: 285 VGYGFQGEDVDGKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPLV 337
>gi|47213723|emb|CAF95154.1| unnamed protein product [Tetraodon nigroviridis]
Length = 334
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 107/298 (35%), Positives = 172/298 (57%), Gaps = 6/298 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y+ + + ++ W+SN + I+ HN EA GLH Y L NH+ D ++
Sbjct: 37 KKTHDKMYQSEVEERSRRELWESNLRLINMHNLEASMGLHTYQLGMNHMGDWSQEEIVQA 96
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T+LT + + +++ +P +DWR KG +T Q CG+C+AFS A A++G
Sbjct: 97 GTKLTPPSDHQRGLAYFDASGRADLPATVDWRNKGLVTSVKMQGSCGSCWAFSAAGALEG 156
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+ K+T ++ +LS Q +VDC+ GN GC GG + +T YV G+ E YPY G++
Sbjct: 157 LLAKTTGKLVDLSPQNLVDCTRKYGNHGCNGGYMHHTFQYVIDNHGIDSEASYPYTGQEG 216
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
+C++ + S + L DE AL+ +AT+GPI+V I+A+ H F Y SG+Y+D
Sbjct: 217 VCRYNPAFRAANCSHYWFLRQGDEGALQEAVATIGPISVGIDATRHQFVYYRSGVYNDPG 276
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C S VNHA+L VGY ++ W++KN W +G++GY+ + R N++CGIA + +
Sbjct: 277 C-SQTVNHAVLAVGYGTDNGQDYWLVKNSWGVGFGEDGYIRMARNKNDQCGIAQFPCF 333
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 110/293 (37%), Positives = 161/293 (54%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M R ++ R +L
Sbjct: 47 RRMVWEKNLKKIELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAETKARGSL 106
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWR+ G++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 107 FLEPNFLEA---PKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLS 163
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + YV+ GL E+ YPY G C + V+
Sbjct: 164 EQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVN 223
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 224 DTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLV 283
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 284 VGYGFQGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|193786743|dbj|BAG52066.1| unnamed protein product [Homo sapiens]
Length = 333
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 160/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + M + + R+ V +
Sbjct: 48 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEELRQVMNGFQNRKPRKGKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q
Sbjct: 108 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 165 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY-NPKYSVANDT 223
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 224 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 284 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 161/293 (54%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHS---RIRRTL 86
+++ W+ N KKI HN + G H Y L NH D+ + + M H +++ +L
Sbjct: 49 RRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNEEFRQLMNGYKHKAERKVKGSL 108
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P LDWR+KG++TP +Q CG+C+AFS A++GQ F+ T ++ +LS
Sbjct: 109 FLEPNFLEA---PRSLDWRDKGYVTPVKDQGQCGSCWAFSATGALEGQQFRKTGKMVQLS 165
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +V+CS GN GC GG + YV+ GL EE YPY G C + V+
Sbjct: 166 EQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEESYPYLGTDDQKCHYDPRYNAVN 225
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + + EHAL + VGPI+V+I+A +FQ Y SGIY + C+S+ ++H +LL
Sbjct: 226 DTGFVDIKSGSEHALMKAVTAVGPISVAIDAGHESFQFYQSGIYYEPECSSEELDHGVLL 285
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 286 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYVYMAKDRQNHCGIATAASYPLV 338
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 164/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y +K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 32 KKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYELGMNHLGDMTSEEVISS 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ R + N+ +PD LDWREKG +T Q CG+C+AFS A++
Sbjct: 92 MSSLRVPSQWPRNVTYKSSPNQK--LPDSLDWREKGCVTEVKYQGACGSCWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS + GN GC GG + Y+ G+ E YPYK
Sbjct: 150 AQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S + LP E ALK +A GP++V I+A +F LY +G+Y D
Sbjct: 210 DGRCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDAKQTSFFLYKTGVYYD 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIAN+ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGSLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIANFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|189571697|ref|NP_001121688.1| cathepsin 8 precursor [Rattus norvegicus]
Length = 333
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 113/305 (37%), Positives = 169/305 (55%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY+K+Y + + +K+ W+ N K + HN E Q +T+ N +D+ + K
Sbjct: 33 KTKYEKNYSLE-EEGQKRAVWEENMKVVKQHNIEYDQEKKNFTMELNAFADMTGEEFRKM 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
MT + +R+ +S +P +DWR +G++T NQ C +C+AFS+A AI+G
Sbjct: 92 MTNIPVQNLRKK--KSIHQPIFRYLPKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+F+ T + LS Q +VDCS GN GC GS L YV GGL E YPY+GK+
Sbjct: 150 QMFRKTGRLVSLSPQNLVDCSRPEGNHGCHMGSTLYALKYVWSNGGLEAESTYPYEGKEG 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ ++ +S + + E AL +AT+GPI+V I+AS +F+ Y GIY +
Sbjct: 210 PCRYLPRRSAARVTGFSTV-ARSEEALMHAVATIGPISVGIDASHVSFRFYRRGIYYEPR 268
Query: 254 CTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
C+S+ +NH++L+VGY R W++KN WG NGYM L RG NN CGIA Y
Sbjct: 269 CSSNRINHSVLVVGYGYEGRESDGRKYWLIKNSHGVGWGMNGYMKLARGWNNHCGIATYG 328
Query: 305 VYALI 309
Y +
Sbjct: 329 FYPRV 333
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 106/292 (36%), Positives = 163/292 (55%), Gaps = 14/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M H R + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNRTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++GF+TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + AC+S ++HA+L+V
Sbjct: 225 TGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVV 284
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CG+A A Y L+
Sbjct: 285 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPLM 336
>gi|118425914|gb|ABK90856.1| cathepsin-L-like cysteine peptidase [Radix peregra]
Length = 324
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 178/317 (56%), Gaps = 19/317 (5%)
Query: 2 TNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENH 61
T W I K Y D D ++ WQ+N +KI HN+ +GL Y L EN
Sbjct: 18 TEANWAIFKAKHNKTYSGD-----EDIIRRYIWQTNLQKIEAHNELYAKGLSTYFLGENK 72
Query: 62 LSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHL----DWREKGFITPDWNQE 117
+D+ + + ++ L R+ + L +P S + D L DWR++G++T +Q
Sbjct: 73 YADMTNEEFRRTLSGL---RVDKEL--TPGDFVSGMFKDSLPTAVDWRKEGYVTEVKDQG 127
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+C+AFS +++GQ FK+T ++ LS +VDCS GN GC GG + N Y+
Sbjct: 128 QCGSCWAFSTTGSLEGQHFKATKQLVSLSESNLVDCSKKWGNQGCNGGLMDNAFKYIADN 187
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
G+ E+ YPYK + C FK+ N+ + + E AL+ +AT+GPI+V+I+AS
Sbjct: 188 KGIDTEKSYPYKPEDRKCNFKKANVGATDKLYKDITSGSEDALQEAVATIGPISVAIDAS 247
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKR 293
+FQLY+ G+Y+++AC++ ++H +L VGY ++N WI+KN W WG +GY+++ R
Sbjct: 248 HDSFQLYSGGVYNEKACSTKTLDHGVLAVGYDSKNGDDYWIVKNSWGKSWGIDGYIWMSR 307
Query: 294 G-NNRCGIANYAVYALI 309
N+CGIA A Y ++
Sbjct: 308 NKKNQCGIATMASYPVV 324
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 110/305 (36%), Positives = 169/305 (55%), Gaps = 12/305 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y ++ ++L WQ N + HN + G Y L N +DL ++
Sbjct: 32 KNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEEFVAM 91
Query: 74 MTRLTHSRIRRTLVRS---PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
MT S + S P +N L P +DWR KG++TP +Q CG+C+AFS +
Sbjct: 92 MTGFRVSGTSKAAKGSTFLPPNNVGEL-PKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGS 150
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ FK+T ++ LS Q +VDCS + GC GG + Y+ AGG+ E YPYK
Sbjct: 151 VEGQHFKATGKLVSLSEQNLVDCS--GRDAGCDGGFMDRAFQYIIDAGGIDTEASYPYKA 208
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C FK+ N+ ++ ++ + E AL+ +A VGPI+V+I+AS +FQ Y SG+Y+
Sbjct: 209 VDGKCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHYKSGVYN 268
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ C S ++H +L VGY +S WI+KN W+ WG NGY+++ R +N+CGIA A
Sbjct: 269 EPGCDSTVLDHGVLAVGYGTSSDGTDYWIVKNSWAETWGMNGYVWMSRNKDNQCGIATNA 328
Query: 305 VYALI 309
Y L+
Sbjct: 329 SYPLV 333
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 112/292 (38%), Positives = 161/292 (55%), Gaps = 18/292 (6%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N + I HN E QG HG+T+ N D+ + + M + + ++ + R
Sbjct: 48 RRAVWEKNMRMIELHNGEYSQGKHGFTMGMNAYGDMTNEEFRQVMNGFQNQKHKKGKMFR 107
Query: 89 SPESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P +L+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 108 DP-----LLLQYPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLS 162
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL EE YPY+G CK+K P V
Sbjct: 163 EQNLVDCSHPQGNQGCNGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYK-PECSVAN 221
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ V P E AL +ATVGPI+ +I+A +FQ Y SGIY D C+S ++H +L+V
Sbjct: 222 DTGFVDIPGHEKALLRAVATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGILVV 281
Query: 267 GY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
GY W++KN W WGD GY+ + R +N CGIA A Y +
Sbjct: 282 GYGFEGTNSNATKYWLVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYPTV 333
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 112/294 (38%), Positives = 161/294 (54%), Gaps = 17/294 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT----RLTHSRIRRT 85
+++ W+ N KKI HN E G H Y L NH D+ + + M R T + + +
Sbjct: 47 RRMVWEKNLKKIELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQRKTERKFKGS 106
Query: 86 LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEEL 145
L P E+ P LDWR+KG++TP +Q CG+C+AFS A++GQ F+ T ++ L
Sbjct: 107 LFMEPNFLEA---PRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSL 163
Query: 146 SIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI-VV 204
S Q +VDCS GN GC GG + YV+ GL E+ YPY G PN
Sbjct: 164 SEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSA 223
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+ + + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L
Sbjct: 224 NDTGFVDVPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVL 283
Query: 265 LVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 284 VVGYGYEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 337
>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
Length = 333
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 110/292 (37%), Positives = 159/292 (54%), Gaps = 18/292 (6%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HNQE QG H +T+ N D+ + + M + + R+ V
Sbjct: 48 RRAVWEKNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKVF- 106
Query: 90 PESNESVLI---PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
+ +L P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS
Sbjct: 107 ----QELLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLS 162
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV GGL EE YPY+ + CK+ P V
Sbjct: 163 EQNLVDCSWPQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKY-NPEYSVAN 221
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+V
Sbjct: 222 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVV 281
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 282 GYGFESTESDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPTV 333
>gi|312386083|gb|ADQ74586.1| silicatein alpha 3 [Lubomirskia baicalensis]
Length = 330
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 105/298 (35%), Positives = 168/298 (56%), Gaps = 9/298 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+++ Y + + ++ W +N K I HN A L GYTL N DL + +
Sbjct: 37 HQRSYESQLQEMERHSIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLMSAEFTERY-- 92
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
LTH +R+ +++ ES + V D LDWR +G +T +Q CG+ YAF+ A A++G
Sbjct: 93 LTHKHSQRSGLQTFESPKGVTYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGATA 152
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ ++ LS Q ++DCS+ GN GC+GG + YV GG+ E YPYKGK+S C+
Sbjct: 153 LAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKGKKSSCQ 212
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ N+ + + E L +A+VGPIAV+++AS + F Y SG++D C++
Sbjct: 213 YNSKNVGAISTGVVKIASGSETDLLSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCST 272
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NHAML+ GY ++ W++KN W WG++GY+ + R N+CGIA+ A+Y ++
Sbjct: 273 SKLNHAMLVTGYGSTNGKDYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 330
>gi|94448668|emb|CAI91572.1| silicatein a3 [Lubomirskia baicalensis]
Length = 344
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 105/298 (35%), Positives = 168/298 (56%), Gaps = 9/298 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+++ Y + + ++ W +N K I HN A L GYTL N DL + +
Sbjct: 51 HQRSYESQLQEMERHSIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLMSAEFTERY-- 106
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
LTH +R+ +++ ES + V D LDWR +G +T +Q CG+ YAF+ A A++G
Sbjct: 107 LTHKHSQRSGLQTFESPKGVTYADSLDWRTRGVVTSVQSQGQCGSSYAFAAAGALEGATA 166
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ ++ LS Q ++DCS+ GN GC+GG + YV GG+ E YPYKGK+S C+
Sbjct: 167 LAADKLVALSEQNIIDCSVPYGNHGCSGGDVYTAFKYVVDNGGIDTESSYPYKGKKSSCQ 226
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ N+ + + E L +A+VGPIAV+++AS + F Y SG++D C++
Sbjct: 227 YNSKNVGAISTGVVKIASGSETDLLSAVASVGPIAVAVDASVNAFMFYQSGVFDSSTCST 286
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NHAML+ GY ++ W++KN W WG++GY+ + R N+CGIA+ A+Y ++
Sbjct: 287 SKLNHAMLVTGYGSTNGKDYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 344
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 165/304 (54%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 32 KKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISL 91
Query: 74 MT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M+ R+ R +S N + +PD +DWREKG +T Q CG+C+AFS A+
Sbjct: 92 MSCVRVPSQWPRNVTYKS---NPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
+ Q+ T + LS Q +VDCS N GC GG + Y+ G+ E YPYK
Sbjct: 149 EAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKA 208
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
CK+ N S ++ LP DE+ALK +A GP++V+I+A +F Y SG+Y
Sbjct: 209 VDGKCKYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYY 268
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIANY
Sbjct: 269 DPSCTQN-VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPS 327
Query: 306 YALI 309
Y I
Sbjct: 328 YPEI 331
>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
Length = 362
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 159/281 (56%), Gaps = 19/281 (6%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPE------- 91
++I HN++ G Y + N SD+ Y++ H+ +RR + +
Sbjct: 83 ERIEEHNRKYHMGQKSYYMGVNQFSDMSHDEYLR------HNGLRRGNRKYSKGEGCDSY 136
Query: 92 SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
+ + D +DWR+KG++TP NQ CG+C++FS +++GQ F+ T ++ LS QQ+V
Sbjct: 137 TKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLV 196
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
DCS GN GC GG + N Y++ GGL E+DYPY KQ C K+ + + +
Sbjct: 197 DCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGEDDYPYTAKQGKCHLKKSLFKANDTGCTD 256
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRN 271
+ DE ALK LA+VGPI+V+I+AS +FQ Y G+YD+E C+S ++H +L VGY
Sbjct: 257 VESGDEDALKDALASVGPISVAIDASHASFQSYDGGVYDEEECSSQNLDHGVLTVGYGTE 316
Query: 272 S-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
W++KN W WG+ GY+ + R +N+CGIA A Y
Sbjct: 317 ENGGDYWLVKNSWGEMWGEEGYIKMSRNKDNQCGIATQASY 357
>gi|70912393|ref|NP_783171.2| cathepsin R precursor [Rattus norvegicus]
gi|66911479|gb|AAH97484.1| Cathepsin R [Rattus norvegicus]
gi|149039731|gb|EDL93847.1| cathepsin R [Rattus norvegicus]
Length = 334
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 108/305 (35%), Positives = 170/305 (55%), Gaps = 14/305 (4%)
Query: 17 YKKDYRKKAT---DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K +Y K T + ++ W+ N K I HN+E G +G+ + N DL + K
Sbjct: 32 WKTEYEKSYTMEEEGHRRAVWEENMKMIKLHNRENSLGKNGFIMEMNEFGDLTAEEFRKM 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M + R+ + +VL P +DWR+KG++T NQ+ C +C+AF++ AI+G
Sbjct: 92 MVNIPIRSHRKGKIIRKRDVGNVL-PKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEG 150
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+F T ++ LS+Q +VDC+ GN GC G YV GGL E YPYKGK+
Sbjct: 151 QMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKEG 210
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
+C++ + +I+ + L P+ E L +AT+GPI+V+++AS ++F Y G+YD+
Sbjct: 211 VCRYNPKHSKAEITGFVSL-PESEDILMEAVATIGPISVAVDASFNSFGFYKKGLYDEPN 269
Query: 254 CTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
C+++ VNH++L+VGY + W++KN W WG GYM + K NN C IA+YA
Sbjct: 270 CSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKIPKDQNNFCAIASYA 329
Query: 305 VYALI 309
Y +
Sbjct: 330 HYPTV 334
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 164/292 (56%), Gaps = 14/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M TH + + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDPNQTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + AC+S ++HA+L+V
Sbjct: 225 TGFVDIPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVV 284
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CG+A A Y L+
Sbjct: 285 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 112/327 (34%), Positives = 173/327 (52%), Gaps = 22/327 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + K+Y + K + K++ ++ H+ I HNQ +Q L Y L+ N
Sbjct: 22 LVREEWNAFKMEHSKQYDSEVEDKF---RMKIYVENKHR-IAKHNQRFEQRLVSYKLKPN 77
Query: 61 HLSDLHPRHYIKEMT------------RLTHSRIRRTLVRSPESNESVLIPDHLDWREKG 108
+D+ ++ M + HS+ R + + V PDH+DWR+KG
Sbjct: 78 KYADMLHHEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRAATFIAPAHVSYPDHVDWRKKG 137
Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLR 168
+T +Q CG+C+AFS A++GQ F+ T + LS Q +VDCS GN GC GG +
Sbjct: 138 AVTDVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMD 197
Query: 169 NTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVG 228
N Y++ GG+ E+ YPY+ C++ N D + +P DE L +ATVG
Sbjct: 198 NAFKYIKDNGGIDTEKSYPYEAVDDKCRYNPKNSGADDVGFVDIPQGDEEKLMQAVATVG 257
Query: 229 PIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHW 283
PI+V+I+AS TFQ Y+ G+Y DE C+S ++H +++VGY W++KN W W
Sbjct: 258 PISVAIDASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYGTEEEGGDYWLVKNSWGRSW 317
Query: 284 GDNGYMYLKRG-NNRCGIANYAVYALI 309
G+ GY+ + NN CGIA+ A Y L+
Sbjct: 318 GELGYIKMAHNKNNHCGIASSASYPLV 344
>gi|148709376|gb|EDL41322.1| mCG12216, isoform CRA_b [Mus musculus]
Length = 329
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 112/296 (37%), Positives = 164/296 (55%), Gaps = 23/296 (7%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTH--SRIRR 84
+ +K+ W+ N KKI HN E G HG+T+ N D M +TH S+
Sbjct: 44 EEQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGD---------MVTMTHTASQCFP 94
Query: 85 TLVR--SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
T+ + S + + SV +P ++W+++G++TP Q C +C+A S+ AI+GQ+F+ T ++
Sbjct: 95 TVKKGKSVQKHLSVNLPKFINWKKRGYVTPVRTQGRCNSCWAISVTGAIEGQMFQKTGQL 154
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS+Q +VDCS GN GC G+ L YV GGL E YPY+ K+ C++ N
Sbjct: 155 IPLSVQNLVDCSRPQGNRGCYVGNTYRALKYVVENGGLESEATYPYEEKEGSCRYNPENS 214
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
I+ + + P++E AL +AT+GPI+V+I+A +F Y GIY + C+S V HA
Sbjct: 215 TASITGFDFV-PENEDALMNAVATIGPISVAIDARHESFLFYKRGIYHEPNCSSSVVTHA 273
Query: 263 MLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
MLLVGY R WI+KN WG GYM + R N CGIA YA+Y +
Sbjct: 274 MLLVGYGFVGNESEGRKYWIVKNSMGTKWGSKGYMKIARDQGNHCGIATYALYPRV 329
>gi|23956098|ref|NP_062412.1| cathepsin 7 precursor [Mus musculus]
gi|81902493|sp|Q91ZF2.1|CAT7_MOUSE RecName: Full=Cathepsin 7; AltName: Full=Cathepsin 1; Flags:
Precursor
gi|16445017|gb|AAK00508.1| cathepsin 1 precursor [Mus musculus]
gi|40352949|gb|AAH64740.1| Cathepsin 7 [Mus musculus]
gi|148709372|gb|EDL41318.1| cathepsin 7, isoform CRA_a [Mus musculus]
Length = 331
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 162/292 (55%), Gaps = 14/292 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+ +++ W+ N K I H E ++ +T+ N D+ +EM LT S
Sbjct: 45 EKQRRAVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTG----EEMKMLTESSSYPLR 100
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
+ IP LDWR++G++TP Q CGAC+AFS+ + I+GQ+FK T ++ LS
Sbjct: 101 NGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLS 160
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
+Q ++DCS+ G GC GG + YV+ GGL E YPY+ K C+++ VV +
Sbjct: 161 VQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKHCRYRPERSVVKV 220
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + V+ P++E AL L T GPIAV+I+ S +F Y GIY + C D ++H +LLV
Sbjct: 221 NRFFVV-PRNEEALLQALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLV 279
Query: 267 GY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
GY R W+LKN WG+NGYM L RG NN CGIA+YA+Y +
Sbjct: 280 GYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMYPAL 331
>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
Length = 308
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 104/300 (34%), Positives = 173/300 (57%), Gaps = 9/300 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY---IKEMT 75
K Y + ++ + ++ N K + HN+EA G H + ++ N DL + +
Sbjct: 9 KQYNSLSEENARHSIFEENSKIVKQHNEEAAMGKHTFFMKMNKFGDLTTEEFRMIVIGSG 68
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ ++ ++ ES + + D +DWR+KG +T NQE CG+C+AFS +++GQ
Sbjct: 69 FMQSNKTQQAEGGVFESLPGLKVDDTVDWRQKGAVTKVKNQEQCGSCWAFSATGSLEGQH 128
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSI 194
F T+ + LS Q +VDCS GN GC GGS+ Y++ GG+ EE Y Y+G+ +S+
Sbjct: 129 FLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYIKMNGGIDTEECYSYRGRDESM 188
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++K +SS++ + DE AL ++TVGPI+V+I+A +FQLY G+YD+ C
Sbjct: 189 CRYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISVAIDAGHKSFQLYHHGVYDEPKC 248
Query: 255 TSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+S +++H +L VGY ++ W++KN W WG GY+ + R +N+CGIA A+Y ++
Sbjct: 249 SSTHLDHGVLAVGYGSSNGSDYWLVKNSWGTEWGMEGYIMMSRNKHNQCGIATRAIYPVV 308
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 169/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+
Sbjct: 43 KKTYGKQYKEKNEEGVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTAL 102
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + +PD +DWR+KG +T Q CG+C+AFS A++
Sbjct: 103 MSSLRVPSQWQRNVTY--KSNPNQKLPDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALE 160
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS+ N GC GG + Y+ G+ E YPYK
Sbjct: 161 AQVKLKTGKLVSLSAQNLVDCSVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAM 220
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E ALK +A GP++V+I+AS +F LY SG+Y D
Sbjct: 221 DGKCQYDSKYRAATCSRYTELPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYD 280
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
ACT +VNH +L+VGY ++ W++KN W H+GD GY+ + R + N CGIA+YA Y
Sbjct: 281 PACTL-HVNHGVLVVGYGNLNGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYASY 339
Query: 307 ALI 309
I
Sbjct: 340 PEI 342
>gi|27465595|ref|NP_775155.1| testin-2 precursor [Rattus norvegicus]
gi|1174639|sp|P15242.2|TEST2_RAT RecName: Full=Testin-2; AltName: Full=CMB-23; Contains: RecName:
Full=Testin-1; AltName: Full=CMB-22; Flags: Precursor
gi|577430|gb|AAC52162.1| testin [Rattus norvegicus]
gi|149039744|gb|EDL93860.1| testin gene [Rattus norvegicus]
Length = 333
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 165/289 (57%), Gaps = 12/289 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
K+ W+ N K I HN E +G H +T+ N DL ++K MT +I++T +
Sbjct: 48 KRAVWEKNFKMIELHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTGFQRQKIKKTHIF- 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
+ ++ + +P +DWR+ G++TP NQ C + +AFS +++GQ+F+ T + LS Q
Sbjct: 107 -QDHQFLYVPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQN 165
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
++DC + GC+GG ++ YV+ GGL EE YPY+G+ C++ N ++ +
Sbjct: 166 LLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRECRYHAENSAANVRDF 225
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P +E AL +A VGPI+V+++AS +FQ Y SGIY + C ++NHA+L+VGY
Sbjct: 226 VQIPGSEE-ALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYG 284
Query: 269 -------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN W WG GYM L + +N CGIA Y+ Y ++
Sbjct: 285 FEGEESDGNSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333
>gi|42744610|gb|AAH66625.1| Ctssa protein [Danio rerio]
Length = 321
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/301 (34%), Positives = 167/301 (55%), Gaps = 8/301 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K YR + ++ W+ N + I HN+ A GLH YTL N LSD+ + +
Sbjct: 24 KSQHNKTYRNTREERLRRSVWEQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADE-VND 82
Query: 74 MTRLTHSRIRRT-LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L SP S ++ +P ++W E G ++P NQ CG+C+AFS +++
Sbjct: 83 MNGLLEEDFPDVNATFSPPSLQT--LPQRVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLE 140
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
Q+ + T+ + LS Q ++DCS+ GN GC GG L YV G+ YPY+ K+
Sbjct: 141 AQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEHKE 200
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
+C++ + + ++P +E AL+ +A +GP++V INA +F Y SGIY+D
Sbjct: 201 GVCRYSVSGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIYNDP 260
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
C+S +NHA+L+VGY ++ W++KN W WG+NGY+ + R N CGI+++ +Y
Sbjct: 261 KCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSFGIYPT 320
Query: 309 I 309
I
Sbjct: 321 I 321
>gi|27960477|gb|AAO27843.1|AF456459_1 cathepsin R [Rattus norvegicus]
Length = 334
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/308 (36%), Positives = 169/308 (54%), Gaps = 18/308 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KKY K Y + + ++ + W+ N K I HN E G +G+T+ N D + K
Sbjct: 33 KKKYDKSYSLEEEELRRAV-WEENLKMIKLHNGENGLGKNGFTMEINEFGDTTGEEFRKM 91
Query: 74 MTRL---THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M TH + + R+ S + P +DWR+KG++TP Q +C AC+AFS+ A
Sbjct: 92 MVEFPVQTHREGKSIMKRAAGS----IFPKFVDWRKKGYVTPVRRQGNCNACWAFSVTGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
I+ Q T ++ LS+Q +VDCS GN GC G YV GGL E YPYKG
Sbjct: 148 IEAQTIWQTGKLIPLSVQNLVDCSKSQGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKG 207
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K+ +C++ + +I+ + L P+ E L +AT+GPI+V+++AS ++F Y G+YD
Sbjct: 208 KEGVCRYNPKHSKAEITGFVSL-PESEDILMEAVATIGPISVAVDASFNSFGFYKKGLYD 266
Query: 251 DEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIA 301
+ C+++ VNH++L+VGY + W++KN W WG GYM + K NN C IA
Sbjct: 267 EPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKIPKDQNNFCAIA 326
Query: 302 NYAVYALI 309
+YA Y +
Sbjct: 327 SYAHYPTV 334
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 103/298 (34%), Positives = 164/298 (55%), Gaps = 9/298 (3%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y+ +A ++K+ ++ N +KI HN E +QG+H YT N +D+ + +
Sbjct: 32 KHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLA 91
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
++ ++ + + V +P+ +DWR + +TP +Q CG+C++F++ + +G
Sbjct: 92 TQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAY 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
ST ++ S QQ+VDC+ N GC GG L +T Y+Q GL E DYPY G C
Sbjct: 152 ALSTGKLTRFSEQQLVDCT-TDLNYGCDGGYLDDTFPYIQ-TNGLELESDYPYTGYDGSC 209
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+ +V +SS+ V P +E AL + T GP+A++INA Q Y SGI DD+ C
Sbjct: 210 SYDSSKVVTKVSSY-VSVPANEQALLEAVGTAGPVAIAINADD--LQFYFSGIIDDKYCD 266
Query: 256 SDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
++++H +L VGY + W++KN W WG++GY RG N CG+ AVY LI
Sbjct: 267 PEWLDHGVLAVGYNSENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPLI 324
>gi|253722774|pdb|1CJL|A Chain A, Crystal Structure Of A Cysteine Protease Proform
Length = 312
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M L + + R+ V +
Sbjct: 27 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGLQNRKPRKGKVFQ 86
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+ +AFS A++GQ+F+ T + LS Q
Sbjct: 87 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSSWAFSATGALEGQMFRKTGRLISLSEQ 143
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 144 NLVDCSGPEGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDA 202
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 203 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 262
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 263 GFESTESDGNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 312
>gi|432910514|ref|XP_004078393.1| PREDICTED: cathepsin S-like [Oryzias latipes]
Length = 339
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 112/305 (36%), Positives = 166/305 (54%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y + D ++ W+ N I HN E GLH Y L NH+ DL +++
Sbjct: 39 KKTHSKHYPSEEEDVHRRDLWERNLMLITMHNLEYSMGLHTYDLSMNHMGDLTQEEILQQ 98
Query: 74 MTRLTHSRIRRTLVRSPE----SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
L R L R P ++ S IPD +DWR+K +T Q CG+C+AFS
Sbjct: 99 FASL---RPPTNLKREPHLFVGASGSNNIPDEMDWRQKNCVTSVKMQGSCGSCWAFSAVG 155
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ + T ++ +LS Q +VDCS GN GC GG + YV G+ + YPY
Sbjct: 156 ALEGQLCRKTGKLVDLSPQNLVDCSSKYGNHGCNGGFMHQAFQYVIDNQGIDSDAGYPYV 215
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G C + + S +S LP DE ALK +AT+GPI+V+I+A+ F Y SG+Y
Sbjct: 216 GVTQNCHYSSEYRAANCSQYSFLPEGDEGALKEAIATIGPISVAIDATRPRFAFYRSGVY 275
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DD +C+ + VNH +L VGY ++ W++KN W +G+ GY+ + R N++CGIA Y
Sbjct: 276 DDSSCSQN-VNHGVLAVGYGTLNGQDYWLVKNSWGTTFGEQGYIRMARNKNDQCGIAMYG 334
Query: 305 VYALI 309
Y ++
Sbjct: 335 CYPIM 339
>gi|148709373|gb|EDL41319.1| cathepsin 7, isoform CRA_b [Mus musculus]
Length = 358
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 162/292 (55%), Gaps = 14/292 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+ +++ W+ N K I H E ++ +T+ N D+ +EM LT S
Sbjct: 72 EKQRRAVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTG----EEMKMLTESSSYPLR 127
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
+ IP LDWR++G++TP Q CGAC+AFS+ + I+GQ+FK T ++ LS
Sbjct: 128 NGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLS 187
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
+Q ++DCS+ G GC GG + YV+ GGL E YPY+ K C+++ VV +
Sbjct: 188 VQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKHCRYRPERSVVKV 247
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + V+ P++E AL L T GPIAV+I+ S +F Y GIY + C D ++H +LLV
Sbjct: 248 NRFFVV-PRNEEALLQALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLV 306
Query: 267 GY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
GY R W+LKN WG+NGYM L RG NN CGIA+YA+Y +
Sbjct: 307 GYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMYPAL 358
>gi|8917575|gb|AAF81274.1| EPCS24 [Mus musculus]
Length = 329
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 161/289 (55%), Gaps = 14/289 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+ +++ W+ N K I H E ++ +T+ N D+ +EM LT S
Sbjct: 45 EKQRRAVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTG----EEMKMLTESSSYPLR 100
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
+ IP LDWR++G++TP Q CGAC+AFS+ + I+GQ+FK T ++ LS
Sbjct: 101 NGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLS 160
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
+Q ++DCS+ G GC GG + YV+ GGL E YPY+ K C+++ VV +
Sbjct: 161 VQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKHCRYRPERSVVKV 220
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + V+ P++E AL L T GPIAV+I+ S +F Y GIY + C D ++H +LLV
Sbjct: 221 NRFFVV-PRNEEALLQALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLV 279
Query: 267 GY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
GY R W+LKN WG+NGYM L RG NN CGIA+YA+Y
Sbjct: 280 GYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMY 328
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 166/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 29 KKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVISL 88
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ R + SN+ +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 89 MSSLRVPSQWPRNVTYKSNSNQK--LPDSVDWREKGCVTKVKYQGACGACWAFSAVGALE 146
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 147 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAT 206
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S ++ LP E LK +A GP++V+I+A +F LY SG+Y D
Sbjct: 207 DGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSGVYYD 266
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIA+Y Y
Sbjct: 267 PSCTQN-VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 325
Query: 307 ALI 309
I
Sbjct: 326 PEI 328
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/295 (37%), Positives = 166/295 (56%), Gaps = 10/295 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y +A +SK+ + N + I HN +QG Y N +D+ + K M L+
Sbjct: 35 KTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEF-KTMLTLS 93
Query: 79 HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS 138
SR + TL + V IP +DWR++G +T +Q DCG+C+AFSI + +G +
Sbjct: 94 ASR-KPTLETTSYVKTGVEIPSSVDWRKEGRVTGVKDQGDCGSCWAFSITGSTEGAYARK 152
Query: 139 TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK 198
+ ++ LS QQ++DC + + GC GGSL + YV GL EE Y YKG+ CK+
Sbjct: 153 SGKLVSLSEQQLIDCCTDT-SAGCDGGSLDDNFKYV-MKDGLQSEESYTYKGEDGACKYN 210
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
++V +S ++ +P +DE AL +ATVGP++V ++AS Y SGIY+D+ C+
Sbjct: 211 VASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGMDAS--YLSSYDSGIYEDQDCSPAG 268
Query: 259 VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+NHA+L VGY ++ WI+KN W WG+ GY L RG N+CGI+ VY I
Sbjct: 269 LNHAILAVGYGTENGKDYWIIKNSWGASWGEQGYFRLARGKNQCGISEDTVYPTI 323
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 166/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 41 KKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVISL 100
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ R + SN+ +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 101 MSSLRVPSQWPRNVTYKSNSNQK--LPDSVDWREKGCVTKVKYQGACGACWAFSAVGALE 158
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 159 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAT 218
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S ++ LP E LK +A GP++V+I+A +F LY SG+Y D
Sbjct: 219 DGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSGVYYD 278
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIA+Y Y
Sbjct: 279 PSCTQN-VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 337
Query: 307 ALI 309
I
Sbjct: 338 PEI 340
>gi|387915678|gb|AFK11448.1| cathepsin L1 [Callorhinchus milii]
Length = 336
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/298 (36%), Positives = 159/298 (53%), Gaps = 8/298 (2%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
++Y +K DS ++ W+ N K I HN G H + L N D+ + + M
Sbjct: 40 REYMQKE-DSYRRAVWEKNLKTIQLHNLGYSMGKHSFDLAMNQFGDMTTEEFHQLMNGFQ 98
Query: 79 HSRIRRTLVRSPESN--ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
S +S+ +P +DWR+KG++TP NQ CG+C+AFS A++GQ F
Sbjct: 99 QPDTEHLTSTKDVSTRPKSLKLPGSVDWRDKGYVTPVKNQGACGSCWAFSSTGALEGQTF 158
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K T ++ LS Q +VDCS GN GC GG + Y+Q G+ E YPY K+ C
Sbjct: 159 KKTGKLIPLSEQNLVDCSQKQGNHGCNGGMMDRAFTYIQQNNGIDTEASYPYTAKEHPCN 218
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + + DE AL T+AT+GPI+V+I+A +FQ Y SGIY + C S
Sbjct: 219 YDPRHNAATCHGYRYSEQYDEMALAETVATIGPISVAIDAKHISFQFYKSGIYQEPRCQS 278
Query: 257 DYVNHAMLLVGYT----RNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+NHA+L+VGY N WI+KN + WG+ GY+++ K NN CGIA+Y Y L+
Sbjct: 279 YNINHAVLVVGYNSQGGNNYWIVKNSFGSRWGNKGYIWMPKDKNNHCGIASYPTYPLV 336
>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
Length = 335
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 170/306 (55%), Gaps = 15/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY--IKE 73
KY+KDY + K L W N + + HN+ QG YTL NH++DL + +
Sbjct: 33 KYQKDYLSSEDELNKLLTWSKNLETVRKHNELYAQGKKSYTLAMNHMADLSSEEFKALYL 92
Query: 74 MTRLTHSRIRRTLVRSPESNESVLI----PDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ + +++ R + + E I P +DW KG +T NQ CG+C+AFS
Sbjct: 93 VPKFDATKVPR---KGKAAGEHRQIKNDPPSEIDWVRKGHVTAVKNQAQCGSCWAFSSTG 149
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+I+G + ++T ++ S QQ+VDCS GN GC GG + N+ NY+ GL E YPY+
Sbjct: 150 SIEGAVKRATGKLISFSEQQLVDCSTAFGNHGCNGGIMDNSFNYLIHNKGLESEASYPYE 209
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
++ C++K+ ISS++ + DE LK + VGP++++I+AS +F LY SG+Y
Sbjct: 210 AQKKECRYKKALSKGTISSFTDVSQFDEKDLKRAVGLVGPVSIAIDASQFSFHLYDSGVY 269
Query: 250 DDEACTSDYVNHAMLLVGYTR-----NSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
D+E C+ +NH +L VGY + W +KN W++ WG GY+ + R +N+CG+A
Sbjct: 270 DEEDCSQTMLNHGVLAVGYGTTPEGLDYWKVKNSWTNTWGMEGYILMSRNKDNQCGVATV 329
Query: 304 AVYALI 309
A Y ++
Sbjct: 330 ASYPIV 335
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 168/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y+++ + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 32 KKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L S+ +R + SN+ +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MGSLRVPSQWQRNVTYRSNSNQK--LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E ALK +A GP++V+I+AS ++F LY SG+Y +
Sbjct: 210 NGKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIA+Y Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|354504701|ref|XP_003514412.1| PREDICTED: cathepsin R-like [Cricetulus griseus]
gi|344245862|gb|EGW01966.1| Cathepsin R [Cricetulus griseus]
Length = 333
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 169/305 (55%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y ++ + +K+ W+ N K I + E G++ +T+ N DL K
Sbjct: 33 KKSYDKTYSQE-EERQKRAVWEDNVKMIKLLSMENGLGMNNFTVEMNEFGDLTGEEMKKM 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
MT + +R + + V IP LDWR +G++ P Q CGAC+AF++A++I+
Sbjct: 92 MTDSSVLTLRNG--KHMQRLGDVKIPKTLDWRTQGYVGPVRKQNGCGACWAFAVAASIES 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+FK T ++ +LS+Q ++DC+ GC GG + YV+ GL E YPY+ K+
Sbjct: 150 QLFKKTGKMTQLSVQNLIDCARSYSTYGCKGGLVYGAFLYVKNNKGLEAEATYPYEAKEG 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+++ VV I+ + V+ P++E AL L T GPIAV I+A +F YA GIY +
Sbjct: 210 RCRYRAERSVVKITRFLVV-PRNEEALMNALVTHGPIAVGIDAGHESFTNYAGGIYHEPK 268
Query: 254 CTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
C +D H +LLVG+ + W+LKN WG+NGYM L R NN CGIA+YA
Sbjct: 269 CKTDNPTHGLLLVGFGYEGRESDGKKYWLLKNSHGEKWGENGYMKLPRDQNNYCGIASYA 328
Query: 305 VYALI 309
+Y ++
Sbjct: 329 MYPIL 333
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y +K +++++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYTEKNEETERRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVVSL 91
Query: 74 MTRLTHSR-IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT L R +R + N+ +PD LDWREKG +T Q CG+C+AFS A++
Sbjct: 92 MTCLKVPRQSQRNVTYKSSPNQK--LPDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ +T ++ LS Q +VDCS N GC GG + Y+ G+ E YPYK
Sbjct: 150 AQLKLTTGKLVSLSAQNLVDCSTEKYRNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S ++ LP E ALK +A+ GP++V+I+AS +F LY SG+Y +
Sbjct: 210 DEKCQYDSKNRAATCSKYTELPFGSEEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
ACT VNH +L+VGY + W++KN W ++GD GY+ + R N CGIA+Y+ Y
Sbjct: 270 PACT-QVVNHGVLVVGYGNLNGNDYWLVKNSWGLYFGDKGYIRMARNRENHCGIASYSSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|195382749|ref|XP_002050091.1| GJ20385 [Drosophila virilis]
gi|194144888|gb|EDW61284.1| GJ20385 [Drosophila virilis]
Length = 370
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 104/300 (34%), Positives = 159/300 (53%), Gaps = 10/300 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A ++ + + + N + G Y L N +DL ++K++T L
Sbjct: 72 KSYLSAADRQLREGIFSARKTLVEAKNAAFKSGASTYELAVNAFADLTNAEFLKQLTGLR 131
Query: 79 HSRIRRTLVRS----PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
S ++ P+ +V +PD DWREKG +TP Q +CG+C++F+ AI+G
Sbjct: 132 KSLSGEQRAKAHRIAPKLATNVPLPDSFDWREKGGVTPVKFQGECGSCWSFAATGAIEGH 191
Query: 135 IFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+F+ T ++ LS Q +VDC + G GC GG N++ G+ E YPY K+
Sbjct: 192 VFRKTGKLPNLSEQNLVDCGTVDLGLAGCDGGFQEYAFNFITEQNGIAAGEKYPYVDKKD 251
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CK+K I+ ++ +PP+DE A+K +AT GP+A S+N + LY GIY DE
Sbjct: 252 TCKYKNDISGAQITGFAAIPPKDEQAMKTVVATQGPLACSVNGL-ESLLLYKRGIYADEE 310
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
C VNH++L+VGY ++ WI+KN W WG++GY L RG N CGIA+ Y ++
Sbjct: 311 CNKGEVNHSILVVGYGTEDGQDYWIVKNSWDKAWGEDGYFRLPRGKNFCGIASECSYPVV 370
>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
Length = 333
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 112/286 (39%), Positives = 156/286 (54%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVRSPES 92
W+ N K I HN E +G HG+T+ N D+ + + + H + R+ L + P
Sbjct: 52 WEKNMKMIELHNGEYSEGKHGFTMEMNAFGDMTNEEFRQLVNGYKHQKHRKGKLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ +P +DWREKG +TP NQ CG+C+AFS A++GQ+ T + LS Q +VD
Sbjct: 112 ---LQLPKSVDWREKGCVTPVKNQGQCGSCWAFSACGALEGQMCLKTGVLVSLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + YV GL EE YPY+ K CK+K P + V
Sbjct: 169 CSRGEGNQGCNGGLMDFAFQYVLNNKGLDSEESYPYEAKDGTCKYK-PEFAAANDTGYVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY---- 268
PQ E AL +ATVGPIAV+I+AS +FQ Y+SGIY + C+S ++H +L++GY
Sbjct: 228 IPQLEKALMKAVATVGPIAVAIDASHPSFQFYSSGIYFEPNCSSKDLDHGVLVIGYGFEG 287
Query: 269 ----TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ WI+KN W WG G+ ++ K NN CGIA A Y +
Sbjct: 288 TDSNKKKYWIVKNSWGTGWGMGGFFHIAKDKNNHCGIATAASYPTV 333
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 117/304 (38%), Positives = 164/304 (53%), Gaps = 17/304 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
YKK Y D ++ + W+ N K I HNQE QG HG+T+ N D + + M
Sbjct: 36 YKKVYAVNEEDWRRAV-WEKNMKMIERHNQEYSQGKHGFTMAMNAFGDKTNEEFRQLMNG 94
Query: 77 LTHSRIRR-TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ ++ L P IP +DW +KG++TP +Q CG+C+AFS A++GQ+
Sbjct: 95 FQSQKHKKGKLFYEPVFGH---IPTSVDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI- 194
F+ T ++ LS Q +VDCS GN GC GG + N YV+ GGL EE YPY +
Sbjct: 152 FRKTGKLVSLSEQNLVDCSWREGNEGCNGGLMDNAFQYVKDNGGLDSEESYPYTATDTQD 211
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C++ + + + +PPQ E AL +ATVGPI+V+I+A +FQ Y+SGIY D AC
Sbjct: 212 CRYNPKYSAANDTGFVDIPPQ-EKALMKAVATVGPISVAIDAGQVSFQFYSSGIYFDPAC 270
Query: 255 TSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
VNH +L VGY W++KN W WG +GY+ + K NN CGIA A
Sbjct: 271 RLT-VNHGVLAVGYGFEGTDPDKNKYWLVKNSWGKSWGADGYIKIAKDRNNHCGIARAAS 329
Query: 306 YALI 309
Y +
Sbjct: 330 YPTV 333
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 167/304 (54%), Gaps = 11/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ K + ++L W+ N K I HN E G+H Y + N + D+ +
Sbjct: 40 KKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCR 99
Query: 74 MTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L R +T+ SN + +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 100 MGALRIPRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALE 157
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YPYK
Sbjct: 158 GQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKA 217
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S + LP DE ALK +AT GP++V I+AS +F Y SG+YD
Sbjct: 218 MDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD 277
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
D +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+
Sbjct: 278 DPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASDCS 336
Query: 306 YALI 309
Y I
Sbjct: 337 YPEI 340
>gi|301609080|ref|XP_002934105.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 334
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 167/301 (55%), Gaps = 7/301 (2%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K ++K Y+ + ++ W+ + K I HN E GLH Y + NHL D+ M
Sbjct: 35 KTHQKSYKDAEEERARRTIWEESLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATM 94
Query: 75 TRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T T S +V P+ P DWR KG +TP +Q CG+CYAF A++
Sbjct: 95 TGYTDSGDSLDNVVHVPKEILEAQPPASKDWRTKGCVTPVRSQGSCGSCYAFGAVGALEC 154
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q + T + S Q++VDCS GN GC GG + Y++ G +M+E YPY GK++
Sbjct: 155 QWKRKTGRLVTFSPQELVDCSYTVGNNGCKGGGSNASFTYMKKYG-VMEESAYPYTGKEA 213
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CK ++P+ V + + LP +E LK + TVGP+ V+I++S F++Y SG+Y D
Sbjct: 214 QCKKEKPSNVGVVKQFYRLPTGNEVLLKKAVGTVGPVYVAIDSSRQGFRMYKSGVYYDPY 273
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C++ ++HA+L+VGY++ + W++KN W ++GD GY+ + R NN CGIA A Y +
Sbjct: 274 CSTTSLSHAVLIVGYSKENGQYYWLVKNSWGEYFGDKGYIKMARKRNNHCGIATRAAYPV 333
Query: 309 I 309
+
Sbjct: 334 V 334
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 175/323 (54%), Gaps = 18/323 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + +KKY + ++ + K++ Q+ HK I HNQ +QG + LR N
Sbjct: 22 LVKEEWNAYKLQHRKKYDSETEERL---RLKIYVQNKHK-IAKHNQRFEQGQEKFRLRVN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITP 112
+DL +++ + + ++ +++ + +E V +P +DWREKG +TP
Sbjct: 78 KYTDLLHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTP 137
Query: 113 DWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLN 172
+Q CG+C++FS A++GQ F+ T ++ LS Q +VDCS GN GC GG +
Sbjct: 138 VKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQ 197
Query: 173 YVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
Y++ GG+ E+ YPY+ C + + + +P DE AL +AT GP++V
Sbjct: 198 YIKDNGGIDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSV 257
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNG 287
+I+AS +FQ Y+ G+Y + C S+ ++H +L VGY + W++KN W WGD G
Sbjct: 258 AIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQG 317
Query: 288 YMYLKRG-NNRCGIANYAVYALI 309
Y+ + R +N CGIA A Y L+
Sbjct: 318 YVKMARNRDNHCGIATAASYPLV 340
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 164/292 (56%), Gaps = 14/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M TH + + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDPNQTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPQGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + AC+S ++HA+L+V
Sbjct: 225 TGFVDIPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVV 284
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CG+A A Y L+
Sbjct: 285 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 168/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y+++ + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 40 KKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISL 99
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L S+ +R + SN+ +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 100 MGSLRVPSQWQRNVTYRSNSNQK--LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 157
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 158 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAM 217
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E ALK +A GP++V+I+AS ++F LY SG+Y +
Sbjct: 218 NGKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYE 277
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIA+Y Y
Sbjct: 278 PSCTQN-VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 336
Query: 307 ALI 309
I
Sbjct: 337 PEI 339
>gi|148224682|ref|NP_001086670.1| cathepsin S [Xenopus laevis]
gi|50418223|gb|AAH77285.1| Ctss-prov protein [Xenopus laevis]
Length = 320
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/304 (35%), Positives = 161/304 (52%), Gaps = 9/304 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K+Y ++ D +++ W+ N ++ HN E G+H Y L NHL+D+ +
Sbjct: 18 KNKHTKEYEDESEDLLRRITWEKNLNTVNMHNLEYSMGMHTYELGMNHLADMTSEEIKSK 77
Query: 74 MTRLT---HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
MT L HS + T S +PD +DWREKG ++ NQ CG+C+AFS A
Sbjct: 78 MTGLILPPHSERKATFSSQKNSTLGGKVPDSIDWREKGCVSEVKNQGGCGSCWAFSAVGA 137
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ T +I LS Q +VDCS GN GC+GG + YV G+ + YPY
Sbjct: 138 LEGQLMLKTGKIVSLSPQNLVDCSSKYGNKGCSGGFMTRAFQYVIDNNGIDSDTYYPYHA 197
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C ++ + + P E LK L +GPI+V+I+ + TF LY SG+Y
Sbjct: 198 MDEKCHYELAGKASSCVKYREIVPGTEDNLKQALGNIGPISVAIDGTRPTFFLYKSGVYS 257
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D +C+ + VNH +L VGY ++ W+LKN W +GD GY+ + R N CG+A+Y
Sbjct: 258 DPSCSQE-VNHGVLAVGYGTLNGQDFWLLKNSWGTKYGDQGYVRIARNKENLCGVASYTS 316
Query: 306 YALI 309
Y I
Sbjct: 317 YPEI 320
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 171/321 (53%), Gaps = 19/321 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRE 59
+ EW K+Y+ D T+ +L + N KI HN++ + Y L
Sbjct: 18 LVGAEWSAFKALHGKEYESD-----TEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAM 72
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRTLVRS------PESNESVLIPDHLDWREKGFITPD 113
N D+ ++ TR R R R PE E +P +DWR+KG +TP
Sbjct: 73 NEFGDMLHHEFVS--TRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPV 130
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
NQ CG+C++FS +++GQ F+ ++ LS Q ++DCS GN GC GG + Y
Sbjct: 131 KNQGQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKY 190
Query: 174 VQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVS 233
++ G+ E+ YPY +C F + + + + +P DE+ LK +ATVGP++V+
Sbjct: 191 IKANKGIDTEQSYPYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVA 250
Query: 234 INASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYM 289
I+AS +FQ Y+ G+YD+ C S+ ++H +L+VGY ++ W++KN W WGD GY+
Sbjct: 251 IDASHESFQFYSEGVYDEPECDSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDGGYI 310
Query: 290 YLKRG-NNRCGIANYAVYALI 309
Y+ R +N+CGIA+ A Y L+
Sbjct: 311 YMSRNKDNQCGIASAASYPLV 331
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 166/303 (54%), Gaps = 12/303 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK+Y + + + + N K+ HN ++G Y + N DL + M
Sbjct: 34 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNG 93
Query: 77 LTH-----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
H SR T +N V +P+ +DWREKG ITP +Q CG+C+AFS A+
Sbjct: 94 YQHKKQNSSRAESTFTFMEPAN--VTVPESVDWREKGAITPVKDQGQCGSCWAFSSTGAL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS Q ++DCS GN GC GG + Y++ G+ E YPY+ +
Sbjct: 152 EGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAE 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C++ N + +P +E LK +ATVGP++V+I+AS +FQ Y+ G+Y +
Sbjct: 212 DDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYE 271
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+C SD ++H +L+VGY ++ W++KN WS HWGD GY+ + R N CG+A+ A Y
Sbjct: 272 PSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKMARNRKNHCGVASAASY 331
Query: 307 ALI 309
L+
Sbjct: 332 PLV 334
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 116/312 (37%), Positives = 173/312 (55%), Gaps = 14/312 (4%)
Query: 10 FIFPQK--KYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
F FP++ +KK++ K +++L WQ+N K + HN A++ G+T+ N +
Sbjct: 16 FDFPEEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEK--FGFTVGMNQFA 73
Query: 64 DLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
DL + + + + S + +P +DWR KGF+T NQ CG+C+
Sbjct: 74 DLESSEFGRLYNGYNNKPSMKKAQSKVFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCW 133
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS + ++GQ F +T + LS Q +VDCS GN GC GG + N YV GG+ E
Sbjct: 134 AFSAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTE 193
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWS-VLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
YPYK CKF N+ S +S +LP + E AL+V +A VGPI+V+I+AS +FQ
Sbjct: 194 ASYPYKAVDQKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQ 253
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNR 297
LY SG+Y + AC+ ++H + VGY +S WI+KN W WG GY+++ R NN+
Sbjct: 254 LYKSGVYSESACSQTSLDHGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMSRNKNNQ 313
Query: 298 CGIANYAVYALI 309
CGIA A Y ++
Sbjct: 314 CGIATAASYPIV 325
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 108/292 (36%), Positives = 161/292 (55%), Gaps = 12/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N K I HN + G H Y L NH D+ + + M SR +R S
Sbjct: 47 RRMIWEKNLKMIELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQSRSQRKYKGS 106
Query: 90 PESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+ L P +DWREKG++TP +Q CG+C+AFS A++GQ F+ T ++ LS Q
Sbjct: 107 QFLEPNFLQAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQ 166
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDIS 207
++DCS GN GC GG + Y++ G+ EE YPY GK C +K + +
Sbjct: 167 NLIDCSGPEGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDT 226
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ +P E AL +A VGPI+V+I+AS +FQ Y SG+Y + C S+ ++H +L+VG
Sbjct: 227 GFVDIPEGRERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVG 286
Query: 268 Y---------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
Y + WI+KN WS WGD GY+++ K +N CGIA+ A Y ++
Sbjct: 287 YGYEGTDDDNKKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASYPMV 338
>gi|208972992|dbj|BAG74345.1| silicatein-M4 [Ephydatia fluviatilis]
gi|296168739|emb|CAQ54047.1| silicatein alpha 3 [Ephydatia muelleri]
Length = 327
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 102/298 (34%), Positives = 169/298 (56%), Gaps = 9/298 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+++ Y + + ++ W +N K I HN A L GYTL N DL + +
Sbjct: 34 HQRSYESQLQEMERHAIWVANKKYIEHHNANAD--LFGYTLAMNGFGDLTSAEFTERF-- 89
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
LTH R R+ +++ E+ + V D +DWR +G +T +Q CG+ YAF+ A A++G
Sbjct: 90 LTHKRSERSGLQTFEAPKGVTYADSMDWRTRGAVTSVQSQGSCGSSYAFAAAGALEGANA 149
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ ++ LS Q ++DCS+ GN GC+GG + YV GG+ + YPYKGKQ C+
Sbjct: 150 LAADKLVALSEQNIIDCSVAYGNHGCSGGDVYTAFKYVVDNGGIDTDSSYPYKGKQYSCQ 209
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ N+ + + E L +A+VGPIAV+++A+ ++F Y SG++D +C++
Sbjct: 210 YNSKNLGAVATGVVKITSGSETDLLSAVASVGPIAVAVDATVNSFMFYQSGVFDSSSCST 269
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NHAML+ GY ++ W++KN W WG++GY+ + R N+CGIA+ A+Y ++
Sbjct: 270 TKLNHAMLVTGYGSTNGKDYWLVKNSWGTGWGESGYIKMVRNKYNQCGIASDALYPML 327
>gi|66394764|gb|AAY46196.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 109/307 (35%), Positives = 173/307 (56%), Gaps = 9/307 (2%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
+ QK +K Y + ++++ L + S + I HNQ +G + + ENH++DL Y
Sbjct: 73 YKQKHGRKAYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEYK 132
Query: 72 K--EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
K RL +RR +P+ +DWR+KG++T NQ CG+C+AFS
Sbjct: 133 KLNGYRRLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAFSSTG 192
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++ Q + T ++ LS Q ++DCS GN+GC GG + N Y++ G+ KE DYPYK
Sbjct: 193 ALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELDYPYK 252
Query: 190 GKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C FKR ++ + + + DE LK+ +AT GP +V+I+A +FQLY G+
Sbjct: 253 AKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGV 312
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
Y ++ C+ + ++H +L+VGY ++ WI+KN W HWG+ GY+ + R N CGIA+
Sbjct: 313 YFEKECSPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGEQGYIRMARNRKNNCGIAS 372
Query: 303 YAVYALI 309
+A Y L+
Sbjct: 373 HASYPLV 379
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 169/314 (53%), Gaps = 14/314 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+ EW + K+Y + ++++++ W+ N I HN A +G + + L N
Sbjct: 24 DSEWQLYLKAHGKQYGAE-----EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEY 78
Query: 63 SDLHPRHYIKEMT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+ + M ++ + R +L P SN L PD +DWR KG++TP NQ CG
Sbjct: 79 GDMTNEEFRSTMNGYKMRNGTSRGSLYLPP-SNIGDL-PDTVDWRPKGYVTPIKNQGQCG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C++FS +++GQ FK T ++ LS Q +VDCS GN GC GG + + Y++ G+
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGI 196
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
E YPY+ K C+F N+ S ++ + + E L+ +ATVGPIAV+I+AS +
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMS 256
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN- 295
FQLY SG+Y + C+ ++H +L VGY S W++KN W WG GY+ + R
Sbjct: 257 FQLYKSGVYHEFFCSETRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIMMSRNKR 316
Query: 296 NRCGIANYAVYALI 309
N CGIA A Y +
Sbjct: 317 NNCGIATSASYPTV 330
>gi|308474437|ref|XP_003099440.1| CRE-CPL-1 protein [Caenorhabditis remanei]
gi|308266846|gb|EFP10799.1| CRE-CPL-1 protein [Caenorhabditis remanei]
Length = 337
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 171/305 (56%), Gaps = 14/305 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQS---NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
YK+DY K T ++++ ++ N I HN++ + G + + NH++DL Y K
Sbjct: 35 YKEDYEKDYTGDDEQVYMEAFVKNVIHIDNHNRDHRLGRKTFEMGLNHIADLPFSQYRKL 94
Query: 73 -EMTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
RL SRI+ + N V +PD +DWR+ +T NQ CG+C+AFS A
Sbjct: 95 NGYRRLYGDSRIKNSSSFLAPFN--VQVPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGA 152
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ + ++ LS Q +VDCS GN GC GG + Y++ G+ E+ YPYKG
Sbjct: 153 LEGQHARKLGKLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDTEDSYPYKG 212
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C F + ++ D ++ LP DE LK+ +AT GPI+++I+A +FQLY G+Y
Sbjct: 213 RDMKCHFSKKDVGADDKGYTDLPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYY 272
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C+S+ ++H +LLVGY + W++KN W WG+ GY+ + R NN CG+A A
Sbjct: 273 DEECSSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEKGYIRIARNRNNHCGVATKA 332
Query: 305 VYALI 309
Y L+
Sbjct: 333 SYPLV 337
>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
Length = 334
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 157/290 (54%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HNQE +G HG+ + N D+ + + M + + ++ L
Sbjct: 48 RRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFH 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P V +P +DW +KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLL---VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + N Y++ GGL EE YPY + +P +
Sbjct: 165 NLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDT 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VGY
Sbjct: 225 GFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 285 GFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/309 (33%), Positives = 171/309 (55%), Gaps = 18/309 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++KK Y + + + + N K+ HN+ +QGL+ Y L N DL ++ +
Sbjct: 33 QHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLN 92
Query: 76 RLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ T ++ E +S+ IPD +DWR++G +TP +Q CG+C++FS
Sbjct: 93 GFNRTK---TYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCGSCWSFSA 149
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ F+ T ++ LS Q +VDCS GN GC GG + N Y++ GG+ E YP
Sbjct: 150 TGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNNGGIDTEAAYP 209
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y G+ ++ N + +P DE LK +ATVGPI+++I+AS +FQLY++G
Sbjct: 210 YMGEDEKFRYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGPISIAIDASHESFQLYSNG 269
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS------WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
+Y D C+S ++H +L+VGY + W++KN W WG +GY+ + R +N+CG+
Sbjct: 270 VYSDPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIKMARNQDNQCGV 329
Query: 301 ANYAVYALI 309
A A Y L+
Sbjct: 330 ATQASYPLV 338
>gi|8393221|ref|NP_059016.1| cathepsin S preproprotein [Rattus norvegicus]
gi|399190|sp|Q02765.1|CATS_RAT RecName: Full=Cathepsin S; Flags: Precursor
gi|203650|gb|AAA40994.1| cathepsin S precursor [Rattus norvegicus]
Length = 330
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 169/307 (55%), Gaps = 19/307 (6%)
Query: 17 YKKDYRKKATDSK----KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
+KK ++ TD ++L W+ N K I HN E G+H Y++ NH+ D+ P I
Sbjct: 29 WKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMTPEEVIG 88
Query: 73 EMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M L RI R RS +S+ + +PD +DWREKG +T Q CG+C+AFS A
Sbjct: 89 YMGSL---RIPRPWNRSGTLKSSSNQTLPDSVDWREKGCVTNVKYQGSCGSCWAFSAEGA 145
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
++GQ+ T ++ LS Q +VDCS GN GC GG + Y+ + E YPY
Sbjct: 146 LEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYI-IDTSIDSEASYPY 204
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH-TFQLYASG 247
K C + N S + LP DE ALK +AT GP++V I+ + H +F LY SG
Sbjct: 205 KAMDEKCLYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSG 264
Query: 248 IYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIAN 302
+YDD +CT + +NH +L+VGY ++ W++KN W H+GD GY+ + R N N CGIA+
Sbjct: 265 VYDDPSCTEN-MNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIAS 323
Query: 303 YAVYALI 309
Y Y I
Sbjct: 324 YCSYPEI 330
>gi|66377984|gb|AAY45869.1| cathepsin L-like cysteine proteinase [Globodera pallida]
Length = 379
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 109/307 (35%), Positives = 173/307 (56%), Gaps = 9/307 (2%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
+ QK +K Y + ++++ L + S + I HNQ +G + + ENH++DL Y
Sbjct: 73 YKQKHGRKSYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEYK 132
Query: 72 K--EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
K RL +RR +P+ +DWR+KG++T NQ CG+C+AFS
Sbjct: 133 KLNGYRRLLGDNLRRNASTFLAPINIGDLPESVDWRDKGWVTEVKNQGMCGSCWAFSSTG 192
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++ Q + T ++ LS Q ++DCS GN+GC GG + N Y++ G+ KE DYPYK
Sbjct: 193 ALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELDYPYK 252
Query: 190 GKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C FKR ++ + + + DE LK+ +AT GP +V+I+A +FQLY G+
Sbjct: 253 AKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGV 312
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
Y ++ C+ + ++H +L+VGY ++ WI+KN W HWG+ GY+ + R N CGIA+
Sbjct: 313 YFEKECSPENLDHGVLVVGYGTDAQQGDYWIVKNSWGAHWGEQGYIRMARNRKNNCGIAS 372
Query: 303 YAVYALI 309
+A Y L+
Sbjct: 373 HASYPLV 379
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 170/303 (56%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K YR+K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 32 KKTYSKHYREKIEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISL 91
Query: 74 MTRLT-HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M LT S+ +R + +SN + +PD LDWR+KG +T Q CG+C+AFS A++
Sbjct: 92 MGSLTVPSQWQRNVTY--KSNPNQKLPDSLDWRDKGCVTEVKYQGSCGSCWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS N GC GG + + Y+ G+ E YPYK +
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYKAQ 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E ALK +A GP++V+I+AS +F LY SG+Y D
Sbjct: 210 DGKCQYDSKFRAATCSKYTELPFGSEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYD 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
++CT VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIA+Y Y
Sbjct: 270 QSCTLK-VNHGVLVVGYGNLDGKDYWLVKNSWGLNFGDKGYIRMARNSGNHCGIASYPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|6650391|gb|AAF21819.1|AF098670_1 silicatein beta [Tethya aurantium]
Length = 334
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 165/301 (54%), Gaps = 7/301 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K YR + ++ L W SN + I HN A + G++L NH DL ++ +
Sbjct: 36 KSQHGKSYRSGLQELERHLVWVSNKEYIDRHNANAD--VFGFSLAMNHFGDLSDNEFVDK 93
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T S ++ V+ E+ E V P+ LDWR KG +T NQ DCGA YAFS +++G
Sbjct: 94 YLSYTKSDKKKRNVKMFEAPEGVSYPESLDWRTKGAVTSVKNQGDCGASYAFSAIGSLEG 153
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+ + ++ LS Q V+DCS+ GN GC GG++ NT Y+ G+ + YP+KGKQ+
Sbjct: 154 ALSLAQGKLTYLSEQNVIDCSVAYGNHGCQGGNMYNTYLYILSNDGIDTSDGYPFKGKQT 213
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C + R IS + E L+ +A+ GP+AV+++ S F+ Y G+Y+
Sbjct: 214 SCTYDRSCRGTSISGSIAITSGSESDLQAAVASAGPVAVAVDGSSRAFRFYDYGLYNLPG 273
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYAL 308
C+S ++HA+L+ GY W++KN W +WG +GY+ + R N N+CGIA A Y
Sbjct: 274 CSSYQLSHALLITGYGSFNGNQYWLVKNSWGTNWGMSGYIMMTRNNYNQCGIATDAAYPT 333
Query: 309 I 309
+
Sbjct: 334 L 334
>gi|341878328|gb|EGT34263.1| CBN-CPL-1 protein [Caenorhabditis brenneri]
Length = 336
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 112/305 (36%), Positives = 171/305 (56%), Gaps = 14/305 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQS---NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
YK+DY K+ ++S+++ + ++ N I HN++ + G + + NH++DL Y K
Sbjct: 34 YKEDYDKEYSESEEQTYMEAFVKNVIHIENHNRDHRLGRKTFEMGLNHIADLPFSQYRKL 93
Query: 73 -EMTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
RL SRI+ + N V +PD +DWR+ +T NQ CG+C+AFS A
Sbjct: 94 NGYRRLFGDSRIKNSSSFLAPFN--VQVPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGA 151
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ + ++ LS Q +VDCS GN GC GG + Y++ G+ EE YPYKG
Sbjct: 152 LEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKG 211
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C F + I D + P DE LK+ +AT GPI+++I+A +FQLY G+Y
Sbjct: 212 RDMKCHFNKKTIGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYY 271
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C+S+ ++H +LLVGY + W++KN W WG+ GY+ + R NN CG+A A
Sbjct: 272 DEECSSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEKGYIRIARNRNNHCGVATKA 331
Query: 305 VYALI 309
Y L+
Sbjct: 332 SYPLV 336
>gi|392922426|ref|NP_001256718.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
gi|3879367|emb|CAB07275.1| Protein CPL-1, isoform a [Caenorhabditis elegans]
Length = 337
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 171/305 (56%), Gaps = 14/305 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQS---NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
YK+D+ K+ ++S+++ + ++ N I HN++ + G + + NH++DL Y K
Sbjct: 35 YKEDFDKEYSESEEQTYMEAFVKNMIHIENHNRDHRLGRKTFEMGLNHIADLPFSQYRKL 94
Query: 73 -EMTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
RL SRI+ + N V +PD +DWR+ +T NQ CG+C+AFS A
Sbjct: 95 NGYRRLFGDSRIKNSSSFLAPFN--VQVPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGA 152
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ + ++ LS Q +VDCS GN GC GG + Y++ G+ EE YPYKG
Sbjct: 153 LEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKG 212
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C F + + D + P DE LK+ +AT GPI+++I+A +FQLY G+Y
Sbjct: 213 RDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYY 272
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C+S+ ++H +LLVGY + WI+KN W WG+ GY+ + R NN CG+A A
Sbjct: 273 DEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYIRIARNRNNHCGVATKA 332
Query: 305 VYALI 309
Y L+
Sbjct: 333 SYPLV 337
>gi|5822035|pdb|1CS8|A Chain A, Crystal Structure Of Procathepsin L
Length = 316
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 160/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HNQE ++G H +T+ N D+ + + M + + R+ V +
Sbjct: 31 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ 90
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P E+ P +DWREKG++TP NQ CG+ +AFS A++GQ+F+ T + LS Q
Sbjct: 91 EPLFYEA---PRSVDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQ 147
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + YVQ GGL EE YPY+ + CK+ P V +
Sbjct: 148 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDA 206
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P+ E AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 207 GFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 266
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
S W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 267 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 316
>gi|291383484|ref|XP_002708316.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 107/297 (36%), Positives = 162/297 (54%), Gaps = 26/297 (8%)
Query: 29 KKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR 88
+++ W+ N + I HN E QG G+++ N D+ + + M H
Sbjct: 47 RRRAVWEKNMRMIELHNGEYSQGKRGFSMAMNAYGDMTSEEFRQVMNGFHHQ-------- 98
Query: 89 SPESNESVL-------IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSE 141
P+ E V +P +DWR+KG++TP Q CG+C+AFS A++GQ+F+ T
Sbjct: 99 -PDKKEKVFGKAVFQEVPSSVDWRDKGYVTPVKKQGRCGSCWAFSATGALEGQMFRKTGR 157
Query: 142 IEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN 201
+ LS Q ++DCS +GN GC GG + YV+ GGL E+ YPY+ + C++ P
Sbjct: 158 LVSLSEQNLIDCSWPAGNHGCRGGLTDHAFQYVKDNGGLDSEDSYPYEARNLPCRYD-PQ 216
Query: 202 IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNH 261
V + V P+ E+AL +ATVGPIAV+I+A +FQ Y GIY + C+S + NH
Sbjct: 217 KSVANGTGFVRIPRQENALMEAVATVGPIAVAIDAGHPSFQFYKEGIYYEPNCSSKHHNH 276
Query: 262 AMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
A+L+VGY + W++KN W WG+ GY+ + K NN CGIA++A Y +
Sbjct: 277 AVLVVGYGYEGAESDSNKYWLVKNSWGKRWGEAGYIRIAKDRNNHCGIASHASYPTV 333
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 163/292 (55%), Gaps = 14/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M H + + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNQTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + AC+S ++HA+L+V
Sbjct: 225 TGFVDIPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVV 284
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CG+A A Y L+
Sbjct: 285 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 165/303 (54%), Gaps = 12/303 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK+Y + + + + N K+ HN ++G Y + N DL + M
Sbjct: 38 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNG 97
Query: 77 LTH-----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
H SR T +N V +P+ +DWREKG ITP +Q CG+C+AFS A+
Sbjct: 98 YQHKKQNSSRAESTFTFMEPAN--VEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGAL 155
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS Q ++DCS GN GC GG + Y++ G+ E YPY+ +
Sbjct: 156 EGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAE 215
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C++ N + +P +E LK +ATVGP++V+I+AS +FQ Y+ G+Y +
Sbjct: 216 DDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYE 275
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+C SD ++H +L+VGY ++ W++KN WS HWGD GY+ + R N CG+A A Y
Sbjct: 276 PSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASY 335
Query: 307 ALI 309
L+
Sbjct: 336 PLV 338
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 109/307 (35%), Positives = 166/307 (54%), Gaps = 13/307 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y + + + + N KI HN++ +G Y++ N D+ ++
Sbjct: 31 KAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVS- 89
Query: 74 MTRLTHSRIRRTLVRS------PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
TR R + R PE+ E +P +DWR KG +TP NQ CG+C+AFS
Sbjct: 90 -TRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSA 148
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ F+ + + LS Q +VDCS GN GC GG + N Y++ G+ E+ YP
Sbjct: 149 TGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYP 208
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y G C FK+ + S + + E LK +ATVGPI+V+I+AS +FQ Y+ G
Sbjct: 209 YNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 248 IYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+YD+ C S+ ++H +L+VGY T N W++KN W WGD GY+ + R N+CGIA+
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIAS 328
Query: 303 YAVYALI 309
A Y L+
Sbjct: 329 SASYPLV 335
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 110/313 (35%), Positives = 174/313 (55%), Gaps = 14/313 (4%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW + K YK + + + K+ + N KKI HN + +QG Y + NH
Sbjct: 25 EEWHVFKAMHGKTYKNQFEEMF---RMKI-FMDNKKKIEAHNAKYEQGEVSYKMMMNHFG 80
Query: 64 DLHPRHYIKEMT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGA 121
DL + M +++ R + P ++ +P +DWR+KG +TP +Q CG+
Sbjct: 81 DLMVHEFKALMNGFKMSPDTKRNGELYFPSNSN---LPKTVDWRQKGAVTPVKDQGQCGS 137
Query: 122 CYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
C++FS +++GQ+F T ++ LS Q +VDCS GN GC GG + YV G+
Sbjct: 138 CWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGID 197
Query: 182 KEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
E YPY+ +++ C+FK+ + +P DE AL+ LATVGPI+V+I+A+ +F
Sbjct: 198 TEASYPYEARENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGSF 257
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NN 296
Q Y+ G+Y++ C+S ++H +L VGY T N W++KN W WG+NGY+ + R +N
Sbjct: 258 QFYSKGVYNEPNCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENGYIKIARNHSN 317
Query: 297 RCGIANYAVYALI 309
CGIA+ A Y L+
Sbjct: 318 HCGIASMASYPLV 330
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 111/302 (36%), Positives = 165/302 (54%), Gaps = 9/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAMD 209
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ S ++ LP E LK +A GP+ V ++AS +F LY SG+Y D
Sbjct: 210 QKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHSSFFLYRSGVYYDP 269
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
ACT + VNH +L++GY W++KN W ++G+ GY+ + R N CGIA+Y Y
Sbjct: 270 ACTQN-VNHGVLVIGYGDLNGEEYWLVKNSWGSNFGERGYIRMARNKGNHCGIASYPSYP 328
Query: 308 LI 309
I
Sbjct: 329 EI 330
>gi|268560858|ref|XP_002638172.1| C. briggsae CBR-CPL-1 protein [Caenorhabditis briggsae]
Length = 336
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 171/305 (56%), Gaps = 14/305 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQS---NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
YK+D+ K+ T+S+++ + ++ N I HN++ + G + + NH++DL Y K
Sbjct: 34 YKEDFDKEYTESEEQTYMEAFVKNVIHIENHNRDHRLGRKTFEMGLNHIADLPFSQYRKL 93
Query: 73 -EMTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
RL SRI+ + N V +PD +DWR+ +T NQ CG+C+AFS A
Sbjct: 94 NGYRRLFGDSRIKNSSSFLAPFN--VQVPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGA 151
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ + ++ LS Q +VDCS GN GC GG + Y++ G+ EE YPYKG
Sbjct: 152 LEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKG 211
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C F + + D + P DE LK+ +AT GPI+++I+A +FQLY G+Y
Sbjct: 212 RDMKCHFNKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYY 271
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
DE C+S+ ++H +LLVGY + W++KN W WG+ GY+ + R NN CG+A A
Sbjct: 272 DEECSSEELDHGVLLVGYGTDPEHGDYWLVKNSWGTGWGEKGYIRIARNRNNHCGVATKA 331
Query: 305 VYALI 309
Y L+
Sbjct: 332 SYPLV 336
>gi|157819967|ref|NP_001099569.1| cathepsin 7 precursor [Rattus norvegicus]
gi|374110484|sp|D3ZZ07.1|CAT7_RAT RecName: Full=Cathepsin 7; Flags: Precursor
gi|149039730|gb|EDL93846.1| cathepsin 7 (predicted) [Rattus norvegicus]
Length = 331
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 108/292 (36%), Positives = 162/292 (55%), Gaps = 14/292 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+ +++ W+ N K I H + ++ +T+ N D+ +EM +T S
Sbjct: 45 EKQRRAVWEENVKMIKWHTMQNGLWMNNFTIEMNEFGDMTG----EEMRMMTDSSALTLR 100
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
+V IP LDWR+ G + P +Q CGAC+AFS+A++I+ Q+FK T ++ LS
Sbjct: 101 NGKHIQKRNVKIPKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLS 160
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDI 206
+Q ++DC++ GN C+GG YV+ GGL E YPY+ K C+++ VV I
Sbjct: 161 VQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGGLEAEATYPYEAKLRHCRYRPERSVVKI 220
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + V+ P++E AL L T GPIAV+I+ S +F+ Y GIY + C D ++H +LLV
Sbjct: 221 ARFFVV-PRNEEALMQALVTYGPIAVAIDGSHASFKRYRGGIYHEPKCRRDTLDHGLLLV 279
Query: 267 GY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
GY R W+LKN WG+ GYM L R NN CGIA+YA+Y L+
Sbjct: 280 GYGYEGHESENRKYWLLKNSHGEQWGERGYMKLPRDQNNYCGIASYAMYPLL 331
>gi|28194643|gb|AAO33583.1|AF479265_1 cathepsin P [Meriones unguiculatus]
Length = 334
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 170/311 (54%), Gaps = 25/311 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL------HP 67
++KYKK+Y + ++ + W+ N + + HN E G +G+T+ N DL +P
Sbjct: 33 KEKYKKNYSPEVEAVRRAI-WEENMRIVKLHNGENGLGKNGFTMELNSFGDLTGGEFRNP 91
Query: 68 RHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
I LT R + +V +P +W +G++TP NQ CG+C+AF+
Sbjct: 92 MADIPVPAALTVERKDKKIVDG--------LPKFKNWINEGYVTPVRNQGTCGSCWAFAA 143
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
AI+GQ+F T ++ LS+Q +VDCS GN GCA GS YV GL E YP
Sbjct: 144 TGAIEGQMFWKTGKLTPLSVQNLVDCSEKQGNKGCAQGSAFRAFMYVNETKGLQDEISYP 203
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+GKQ C++ N ++ + +L PQ+E L V +A++GP+A +++AS +F+ Y G
Sbjct: 204 YEGKQGTCRYNSSNSRAYVTDFRLL-PQNEIYLLVAVASIGPVAAAVDASQDSFRFYRGG 262
Query: 248 IYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRC 298
IY + C+ VNHA+L+VGY ++ W++KN W +WG GYM + R NN C
Sbjct: 263 IYYEPKCSQYSVNHAVLVVGYGYEGNETDGKDYWLIKNSWGENWGMRGYMKIARDRNNHC 322
Query: 299 GIANYAVYALI 309
GIA+ A + I
Sbjct: 323 GIASQASFVDI 333
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 163/290 (56%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVR 88
++ W+ N + I HN E G HG+T+ N D+ + + + H + ++ L +
Sbjct: 48 RRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q
Sbjct: 108 EPLM---LQIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + Y++ GGL EE YPY+ K CK++ V + +
Sbjct: 165 NLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +L+VGY
Sbjct: 225 F-VDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG +GY+ + K NN CG+A A Y ++
Sbjct: 284 GYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>gi|410990010|ref|XP_004001243.1| PREDICTED: cathepsin L1 isoform 2 [Felis catus]
Length = 337
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 111/295 (37%), Positives = 167/295 (56%), Gaps = 20/295 (6%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRTLVR 88
++ W+ N K I HN+E QG H +T+ N D+ + + M L R + + +
Sbjct: 48 RRAVWERNMKMIEQHNREHSQGKHTFTMAMNAFGDMTNEEFRQVMNGLKIQKRKKWKVFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+P E IP +DWREKG++TP +Q C C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 APFFVE---IPSSVDWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY----KGKQSICKFKRPNIVV 204
+VDCS GN G +GG + + YV+ GGL EE YPY K CK++ N V
Sbjct: 165 NLVDCSQTEGNEGYSGGLIDDAFQYVKDNGGLDSEESYPYHAQVKRASYSCKYRPENSVA 224
Query: 205 DISS-WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAM 263
+++ W + P E+ L +TLA VGPI+ +I+AS TF+ Y GIY D +C+S+ V+H +
Sbjct: 225 NVTDYWDI--PSKENELMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGV 282
Query: 264 LLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
L+VGY + WI+KN W WG +GY+ + K +N CGIA+ A + +
Sbjct: 283 LVVGYGADGTETENKKYWIIKNSWGTDWGMDGYIKMAKDRDNHCGIASLASFPTV 337
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 105/299 (35%), Positives = 173/299 (57%), Gaps = 12/299 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT--- 75
K+Y+ + + ++ + +N K+I HN + +QG Y ++ NH DL H IK +
Sbjct: 36 KNYKNQFEEMFRRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLM-SHEIKALMNGF 94
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
++T + R + P +++ +P +DWR+KG +TP +Q CG+C++FS +++GQI
Sbjct: 95 KMTPNTKREGKIYFPSNDK---LPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQI 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F ++ LS Q ++DCS GN GC GG + YV G+ E YPY+ + C
Sbjct: 152 FLKKGKLVSLSEQNLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYAC 211
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+FK+ + + +P DE AL+ LATVGPI+V+I+AS +F Y+ G+Y++ C+
Sbjct: 212 RFKKDKVGGTDKGYVDIPEGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCS 271
Query: 256 SDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
S ++H +L VGY T N W++KN W WG++GY+ + R +N CGIA+ A Y ++
Sbjct: 272 SYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCGIASMASYPIV 330
>gi|288548564|gb|ADC52430.1| cathepsin L1 cysteine protease [Pinctada fucata]
Length = 331
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 164/313 (52%), Gaps = 12/313 (3%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
++EW I K Y D + ++L W+ N I HN+ A +G H + L N
Sbjct: 25 DQEWAIYKDMFAKNYVADEERM-----RRLVWEDNIDYIEKHNRRADRGEHKFWLGTNEY 79
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGAC 122
+D+ + M + SN L PD +DWR+KG++TP NQ CG+C
Sbjct: 80 ADMTIDEFKAIMNGFIMQNGTKGDTYMSPSNIGDL-PDKVDWRDKGYVTPVKNQGHCGSC 138
Query: 123 YAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMK 182
++FS +++GQ FKST ++ LS Q ++DCS GN GC GG + Y+Q G+
Sbjct: 139 WSFSATGSLEGQHFKSTGKLVSLSEQNLIDCSKKEGNHGCKGGLMDFAFEYIQKNDGIDT 198
Query: 183 EEDYPYKGKQSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
E+ YPY K I C+FK+ ++ LP Q E AL+ +ATVGPI+V+++A +F
Sbjct: 199 EQSYPYTAKDGIECRFKKADVGATDKGKVDLPRQSEKALQEAVATVGPISVAMDAGHRSF 258
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN-N 296
QLY GIY + C+S ++H +L VGY W++KN W WG G+ L R + N
Sbjct: 259 QLYKRGIYTEPMCSSTKLDHGVLAVGYGSEGEGDYWLVKNSWGATWGMEGFFMLARNHRN 318
Query: 297 RCGIANYAVYALI 309
CGIA A Y +
Sbjct: 319 ECGIATQASYPKV 331
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 105/290 (36%), Positives = 161/290 (55%), Gaps = 11/290 (3%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M H R +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNRTSQGPL 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
+ P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S Q
Sbjct: 107 FMEPKFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQN 166
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I+
Sbjct: 167 LVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITG 226
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ +P +E AL +A VGP++V+I+AS + Q Y SGIY + ACTS ++HA+L+VGY
Sbjct: 227 FVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQ-LDHAVLVVGY 285
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN WS WGD GY+Y+ K NN CGIA A Y L+
Sbjct: 286 GYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 164/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+
Sbjct: 33 KKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTSL 92
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + NE +PD LDWREKG +T Q CGAC+AFS A++
Sbjct: 93 MSSLRVPSQWQRNVTYKSNPNEK--LPDSLDWREKGCVTEVKYQGSCGACWAFSAVGALE 150
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T + LS Q +VDCS N GC GG + Y+ G+ + YPYK
Sbjct: 151 AQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDNNGIDSDASYPYKAM 210
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S ++ LP E LK +A GP++V+I+AS +F LY SG+Y D
Sbjct: 211 DGKCRYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAIDASHPSFFLYKSGVYYD 270
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R + N CGIANY Y
Sbjct: 271 PSCTQN-VNHGVLVVGYGNLNGKDYWLVKNSWGINFGDKGYIRMARNSGNHCGIANYCSY 329
Query: 307 ALI 309
I
Sbjct: 330 PEI 332
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 200 bits (509), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 110/306 (35%), Positives = 163/306 (53%), Gaps = 17/306 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y +K + +++ W+ N +KI HN E G H + L NH D+ + + M
Sbjct: 35 HSKKYHEK-EEGWRRMVWEKNLQKIELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNG 93
Query: 77 L---THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T + +L P + P +DWREKG++TP +Q CG+C+AFS A++G
Sbjct: 94 YKLKTQRKFTGSLFMEPNF---MTAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEG 150
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQ 192
Q F+ T ++ LS Q +VDCS GN GC GG + YV GL E+ YPY G
Sbjct: 151 QQFRKTGKLVSLSEQNLVDCSRPEGNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDD 210
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + + + +P EHAL +A+VGP++V+I+A +FQ Y SGIY ++
Sbjct: 211 QPCHYDPLYNSANDTGFVDVPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEK 270
Query: 253 ACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
C+S+ ++H +L VGY + WI+KN W WGD GY+Y+ K N CGIA
Sbjct: 271 ECSSEELDHGVLAVGYGFEGEDKMGKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATA 330
Query: 304 AVYALI 309
A Y L+
Sbjct: 331 ASYPLV 336
>gi|47213724|emb|CAF95155.1| unnamed protein product [Tetraodon nigroviridis]
Length = 336
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 166/304 (54%), Gaps = 13/304 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL-----HPRHYI 71
++++Y + ++ ++ W+ N + HNQEA G+H Y L NHL D+ + +
Sbjct: 35 HRREYGSQNEEAIRRAVWEKNMHVVEAHNQEAALGMHSYELAMNHLGDMVSSAGTSKEVL 94
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
++MT L + + + +P +LD+R+KG +T +Q CG+C+AFS A A+
Sbjct: 95 EKMTGLLVPMVDQRNATMALNGSVQRLPRNLDYRKKGAVTAVKDQGACGSCWAFSSAGAL 154
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+G + K T ++ +LS Q +VDC + N GC GG + N YV GL E YPY G+
Sbjct: 155 EGMLAKKTGKLVDLSPQNLVDC--VKENSGCGGGYMTNAFKYVATNKGLDSEAAYPYVGQ 212
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+ C++K V+ + +P +E L L GP+A+ I+A+ TF LY+ G+Y D
Sbjct: 213 EQPCQYKEAGKAVECRRYEEVPQGNEKLLAYALFKHGPVAIGIDATLTTFHLYSKGVYYD 272
Query: 252 EACTSDYVNHAMLLVGY--TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
C + +NHA+LLVGY TR WI+KN W WG GY+ + R N CGIAN A
Sbjct: 273 PDCNPEDINHAVLLVGYGVTRRGQQYWIVKNSWGTGWGTEGYILMARNRGNLCGIANLAS 332
Query: 306 YALI 309
Y ++
Sbjct: 333 YPIM 336
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 162/292 (55%), Gaps = 14/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + M H + + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDPNQTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + AC+S ++HA+L+V
Sbjct: 225 TGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVV 284
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CGIA A Y L+
Sbjct: 285 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336
>gi|410042826|ref|XP_003951516.1| PREDICTED: cathepsin L1 [Pan troglodytes]
Length = 278
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 156/281 (55%), Gaps = 14/281 (4%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-RSPESNESVL 97
K I HNQE ++G H +T+ N D+ + + M + + R+ V + P E+
Sbjct: 2 KMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-- 59
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q +VDCS
Sbjct: 60 -PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ 118
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC GG + YVQ GGL EE YPY+ + CK+ P V + V P+ E
Sbjct: 119 GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY-NPKYSVANDTGFVDIPKQE 177
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----- 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 178 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDN 237
Query: 273 ---WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 238 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 278
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 168/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYSKQYKEKNEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLSMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + +PD LDWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNVTF--KSNPNQKLPDSLDWREKGCVTDVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS N GC GG + Y+ G+ E YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYKAT 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S ++ LP E ALK +A GP++V I+AS +F LY SG+Y D
Sbjct: 210 DGKCQYDPKNRAATCSKYTELPYGSEDALKEAVANKGPVSVGIDASRPSFFLYKSGVYYD 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT D VNH +L+VGY ++ W++KN W ++G+ GY+ + R + N CGIA++ Y
Sbjct: 270 PSCT-DNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 107/292 (36%), Positives = 163/292 (55%), Gaps = 15/292 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M H R + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNRTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + ACTS ++HA+L+V
Sbjct: 225 TGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR-LDHAVLVV 283
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CGIA A Y L+
Sbjct: 284 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 169/314 (53%), Gaps = 14/314 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+ EW + K+Y + ++++++ W+ N I HN A +G + + L N
Sbjct: 24 DSEWQLYLKAHGKQYGAE-----EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEY 78
Query: 63 SDLHPRHYIKEMT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
D+ + M ++ + R +L P SN L PD +DWR KG++TP NQ CG
Sbjct: 79 GDMTNEEFRSTMNGYKMRNGTSRGSLYLPP-SNIGDL-PDTVDWRPKGYVTPIKNQGQCG 136
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C++FS +++GQ FK T ++ LS Q +VDCS GN GC GG + + Y++ G+
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGI 196
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
E YPY+ K C+F N+ S ++ + + E L+ +ATVGPI+V+I+AS +
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMS 256
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN- 295
FQLY SG+Y + C+ ++H +L VGY S W++KN W WG GY+ + R
Sbjct: 257 FQLYRSGVYHEFFCSETRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIMMSRNKR 316
Query: 296 NRCGIANYAVYALI 309
N CGIA A Y +
Sbjct: 317 NNCGIATSASYPTV 330
>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
Length = 331
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 113/304 (37%), Positives = 169/304 (55%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+
Sbjct: 32 KKTYGKHYQEKNEEQVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVRSL 91
Query: 74 MTRLTHSRIRRTLVR--SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M+ L R+ R +R + +S+ + +PD +DWREKG +T Q CG+C+AFS A+
Sbjct: 92 MSSL---RVPRQWLRNVTYKSDPNQKLPDSVDWREKGCVTEVKYQGACGSCWAFSAVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
+GQ+ T ++ LS Q +VDCS N GC+GG + YV G+ E YPYK
Sbjct: 149 EGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSGGFMTEAFQYVIDNNGIDSETSYPYKA 208
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + N S ++ LP E ALK +A GP++V+++AS +F LY +G+YD
Sbjct: 209 TDEKCHYDSKNRAATCSRYTELPYGSEEALKEAVANKGPVSVAVDASRPSFFLYKNGVYD 268
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D +CT + V H +L VGY ++ W++KN W ++GD GY+ + R N CGIA+Y+
Sbjct: 269 DPSCTQN-VTHGVLAVGYGNLNGKDYWLVKNSWGLYFGDQGYIRMARNKGNHCGIASYSS 327
Query: 306 YALI 309
Y I
Sbjct: 328 YPEI 331
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 162/292 (55%), Gaps = 14/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + M H + + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDPNQTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + AC+S ++HA+L+V
Sbjct: 225 TGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVV 284
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CGIA A Y L+
Sbjct: 285 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 336
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 164/303 (54%), Gaps = 12/303 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK+Y + + + + N K+ HN ++G Y + N DL + M
Sbjct: 38 HKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNG 97
Query: 77 LTH-----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
H SR T +N V +P+ +DWREKG ITP +Q CG+C+AFS A+
Sbjct: 98 YQHKKQNSSRAESTFTFMEPAN--VEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGAL 155
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS Q ++DCS GN GC GG + Y++ G+ E YPY+ +
Sbjct: 156 EGQTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAE 215
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C++ N + +P +E LK +ATVGP++V+I+AS +FQ Y+ G Y +
Sbjct: 216 DGVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYE 275
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+C SD ++H +L+VGY ++ W++KN WS HWGD GY+ + R N CG+A A Y
Sbjct: 276 PSCDSDDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASY 335
Query: 307 ALI 309
L+
Sbjct: 336 PLV 338
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 109/318 (34%), Positives = 166/318 (52%), Gaps = 16/318 (5%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW + I ++KK Y ++ +K + N KI HN+ + G Y L NH
Sbjct: 28 EEWSLFKI----QFKKLYEDIKEETFRKKVYLDNKLKIARHNKLYESGEETYALEMNHFG 83
Query: 64 DLHPRHYIKEMTRLTHSRI--RRTLVRSPE----SNESVLIPDHLDWREKGFITPDWNQE 117
DL Y K M S R +E+V+IP +DWR+KG++TP NQ
Sbjct: 84 DLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQG 143
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+C++FS +++GQ F+ T + LS Q ++DCS GN GC GG + Y++
Sbjct: 144 QCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSN 203
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GL E+ YPY+ + C++ N + +P DE AL LATVGP++++I+AS
Sbjct: 204 KGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDAS 263
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLK 292
FQ Y G++ + C+S ++H +L VG+ + WI+KN W WGD GY+ +
Sbjct: 264 SEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMA 323
Query: 293 RG-NNRCGIANYAVYALI 309
R N CG+A+ A Y L+
Sbjct: 324 RNKKNNCGVASSASYPLV 341
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 107/292 (36%), Positives = 163/292 (55%), Gaps = 15/292 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M H R + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNRTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I
Sbjct: 165 QNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKI 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + ACTS ++HA+L+V
Sbjct: 225 TGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR-LDHAVLVV 283
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CGIA A Y L+
Sbjct: 284 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|6978723|ref|NP_037288.1| cathepsin L1 preproprotein [Rattus norvegicus]
gi|55888|emb|CAA68691.1| prepro-cathepsin L [Rattus norvegicus]
Length = 334
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 163/290 (56%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVR 88
++ W+ N + I HN E G HG+T+ N D+ + + + H + ++ L +
Sbjct: 48 RRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQ 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q
Sbjct: 108 EPLM---LQIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + Y++ GGL EE YPY+ K CK++ V + +
Sbjct: 165 NLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ V PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +L+VGY
Sbjct: 225 F-VDIPQQEKALMKPVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG +GY+ + K NN CG+A A Y ++
Sbjct: 284 GYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 109/318 (34%), Positives = 166/318 (52%), Gaps = 16/318 (5%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW + I ++KK Y ++ +K + N KI HN+ + G Y L NH
Sbjct: 28 EEWSLFKI----QFKKLYEDIKEETFRKKVYLDNKLKIAGHNKLYESGEETYALEMNHFG 83
Query: 64 DLHPRHYIKEMTRLTHSRI--RRTLVRSPE----SNESVLIPDHLDWREKGFITPDWNQE 117
DL Y K M S R +E+V+IP +DWR+KG++TP NQ
Sbjct: 84 DLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSENVVIPKSVDWRKKGYVTPVKNQG 143
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+C++FS +++GQ F+ T + LS Q ++DCS GN GC GG + Y++
Sbjct: 144 QCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSN 203
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GL E+ YPY+ + C++ N + +P DE AL LATVGP++++I+AS
Sbjct: 204 KGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDAS 263
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLK 292
FQ Y G++ + C+S ++H +L VG+ + WI+KN W WGD GY+ +
Sbjct: 264 SEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKKGGDYWIVKNSWGKTWGDEGYIMMA 323
Query: 293 RG-NNRCGIANYAVYALI 309
R N CG+A+ A Y L+
Sbjct: 324 RNKKNNCGVASSASYPLV 341
>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 329
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 164/303 (54%), Gaps = 19/303 (6%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY----- 70
KY K Y K T+ +++ W+SN K + HN + + G+T+ N +DL +
Sbjct: 29 KYNKAYETKETELARQVIWESNKKFVENHNANSDK--FGFTVAMNEFADLGAGEFANIYN 86
Query: 71 --IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
I ++ + VRS + + D +DWR+ G +T NQ CGAC+AFS
Sbjct: 87 GIIPHPPSYNNTNTFKRTVRS-----TFALADSVDWRKSGAVTGVKNQGKCGACWAFSAT 141
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F +T + LS QQ++DCS GN GC GG + N Y++ G M EE YPY
Sbjct: 142 GALEGQHFINTGTLISLSEQQLMDCSSSFGNNGCKGGLMDNAFRYLETVAGDMTEEAYPY 201
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ C++ V + + +P DE AL+ +AT+GPI+VSIN+ +FQLY G+
Sbjct: 202 LAEVGTCRYNSSEAKVKNTVYKDIPEGDEDALQEAVATIGPISVSINSEHSSFQLYDQGV 261
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y + C+S ++H +L++GY + W++KN W +WG +GY+ + R N CGIA
Sbjct: 262 YYEPTCSSSKLDHGVLVIGYGTSDNNDYWLVKNSWGTNWGMDGYIMMSRNKENNCGIATR 321
Query: 304 AVY 306
A Y
Sbjct: 322 ASY 324
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 167/307 (54%), Gaps = 13/307 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ K Y + +S + + N KI HN+ QG H Y L N D+ ++ M
Sbjct: 35 EHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDMLHHEFVSTMN 94
Query: 76 --RLTHSRIRR-----TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R H+ + T E ++ V +P ++DWR KG +TP +Q CG+C+AFS
Sbjct: 95 GFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQCGSCWAFSAT 154
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F+ T ++ LS Q +VDCS GN GC GG + N YV+ GG+ EE YPY
Sbjct: 155 GALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENGGIDTEESYPY 214
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ C + + + + EHALK +ATVGP++V+I+AS +FQ Y+ G+
Sbjct: 215 DAEDEKCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDASHESFQFYSHGV 274
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
Y + C+ + ++H +L+VGY + W++KN W WGD GY+ + R +N+CGIA+
Sbjct: 275 YIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMARNRDNQCGIAS 334
Query: 303 YAVYALI 309
A + L+
Sbjct: 335 SASFPLV 341
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANGTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 110/319 (34%), Positives = 177/319 (55%), Gaps = 24/319 (7%)
Query: 11 IFPQKKYKKDYRK-------KATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+ P +K +D++ + +S++K +++N KKI HN +QG Y + N +
Sbjct: 36 LVPFEKLWQDFKTVHERTYGETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFA 95
Query: 64 DLHPRHYIKEMTRLTHSRIRRTLVR--------SPESNESVLIPDHLDWREKGFITPDWN 115
D+ + M + RT VR SP V +P +DWR++G++TP N
Sbjct: 96 DMEANEFASIMNGFRMNN--RTEVRDHLHANYISPAI--PVSVPAEVDWRKEGYVTPVKN 151
Query: 116 QEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQ 175
Q CG+C+AFS +++GQ F+ T ++ LS Q +VDCS GN GC GG + Y++
Sbjct: 152 QGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIK 211
Query: 176 FAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSIN 235
G E YPY+ C+FK + + ++ LP DE +K +A VGP++V+I+
Sbjct: 212 DNDGDDTEACYPYEAVDGTCRFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAID 271
Query: 236 ASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL 291
AS +FQ+Y SGIY ++ C+ ++HA+L+VGY ++ W++KN W WGD GY+ +
Sbjct: 272 ASHSSFQMYQSGIYVEQECSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKM 331
Query: 292 KRG-NNRCGIANYAVYALI 309
R +N+CGIA+ A Y L+
Sbjct: 332 ARNMDNQCGIASQASYPLV 350
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEEALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 107/300 (35%), Positives = 166/300 (55%), Gaps = 8/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG 93
Query: 76 -RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
R T T + N+S L P +DWR+KG +TP +Q CG+C+AFS +++GQ
Sbjct: 94 HRGTRKTGGSTFLPPANVNDSSL-PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQ 152
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPY+
Sbjct: 153 HFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGE 212
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+ C
Sbjct: 213 CRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPEC 272
Query: 255 TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y L+
Sbjct: 273 SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|348542774|ref|XP_003458859.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 173/303 (57%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK--- 72
K++K Y + ++++K W +N K + HN A GL + L + +D+ Y K
Sbjct: 32 KFEKSYDSPSEETQRKQIWLNNRKLVLKHNALADLGLKSFRLGMTYFADMENEEYKKLGC 91
Query: 73 -EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ R T R P+ ++PD +DWR++G++T +Q++CG+C+AFS A+
Sbjct: 92 LGSFNASLPRHGSTFRRLPKG---TVLPDTVDWRKQGYVTHVKDQKECGSCWAFSATGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ FK T ++ LS QQ+VDCS N GC GG Y+++ GGL EE Y Y+ K
Sbjct: 149 EGQYFKKTGKLVSLSEQQLVDCSRKFRNNGCEGGEPHWAFQYIRYNGGLDTEESYHYEAK 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + ++ S + + P E ALK +AT+GPI+V+I+ S +FQLY SG+YD+
Sbjct: 209 DGQCHYNPDSVGAKCSGYVNVSPF-EDALKEAVATIGPISVAIDISRVSFQLYHSGVYDE 267
Query: 252 EACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C++ +NHA+L VGY T N W++KN W WG+ GY+ + R +N+CGIA A Y
Sbjct: 268 PWCSNINLNHAVLAVGYGTENGHDYWLVKNSWGSEWGNKGYIKMTRNKDNQCGIATEASY 327
Query: 307 ALI 309
L+
Sbjct: 328 PLV 330
>gi|345309264|ref|XP_001507503.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 335
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 171/310 (55%), Gaps = 26/310 (8%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSD-----LHPR--H 69
+ K+Y +A + ++ W+ N + I HN+E QG H Y L NH D LH R
Sbjct: 35 HGKNYSVEAEEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTNEELHERLNG 94
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ ++ S + RS S E P+ +DWR KG++TP NQ CG+C+AFS
Sbjct: 95 FRPDLGGALRSGREQARFRSKTSWEG---PEEVDWRTKGYVTPVKNQGLCGSCWAFSATG 151
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++ +FK+T ++ LS Q +VDCS GN+GC GG YV+ GG+ E+ YPY
Sbjct: 152 ALEALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEYVRANGGIDAEDLYPYL 211
Query: 190 GKQSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
G+ I C++ + +S+ V+ +E AL+ +ATVGP++V+++A P F Y SG
Sbjct: 212 GRDDISCRYSLQGKAGNCTSYMVVDQDNEQALEQAVATVGPVSVAVDARP--FFFYHSG- 268
Query: 249 YDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCG 299
CT VNHAML VGY ++ WILKN WS WG+ GYM L +G NN CG
Sbjct: 269 --SSRCTQK-VNHAMLAVGYGTSKEPGGGQDYWILKNSWSERWGEQGYMRLLKGANNHCG 325
Query: 300 IANYAVYALI 309
+A+ A + ++
Sbjct: 326 VASVASFPVL 335
>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
Length = 344
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 165/312 (52%), Gaps = 18/312 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ K Y + D + + N +I HNQ +Q L Y L+ N +D+ ++ M
Sbjct: 33 EHSKQYDSEVEDKFRMKIYVENKHRITKHNQRFEQRLVSYKLKPNKYADMLHHEFVHTMN 92
Query: 76 RLTHS-----RIRRTLVRSPESNESVLI-------PDHLDWREKGFITPDWNQEDCGACY 123
+ R + + + + I PDH+DWR+KG +T +Q CG+C+
Sbjct: 93 GFNKTAKHGGRNKNVHGKGHDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKDQGKCGSCW 152
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS A++GQ F+ T + LS Q ++DCS GN GC GG + N Y++ GG+ E
Sbjct: 153 AFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTE 212
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
+ YPY+ C++ D + +P DE L +ATVGPI+V+I+AS TFQ
Sbjct: 213 KSYPYEAVDDKCRYNPKESGADDVGFVDIPQGDEEKLMQAVATVGPISVAIDASQETFQF 272
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNR 297
Y+ G+Y DE C+S ++H +++VGY + W++KN W WG+ GY+ + R NN
Sbjct: 273 YSKGVYYDENCSSTDLDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGELGYIKMARNKNNH 332
Query: 298 CGIANYAVYALI 309
CGIA+ A Y L+
Sbjct: 333 CGIASSASYPLV 344
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 164/303 (54%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 41 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 100
Query: 74 MTRLTHSRIRRTLVR--SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M+ L R+ R + +SN + ++PD +DWREKG +T Q CGAC+AFS A+
Sbjct: 101 MSSL---RVPNQWQRNITYKSNPNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGAL 157
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+ Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 158 EAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAT 217
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP+ V ++AS +F LY SG+Y D
Sbjct: 218 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYD 277
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
ACT VNH +L++GY + W++KN W ++G+ GY+ + R N CGIA+Y Y
Sbjct: 278 PACTQK-VNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSY 336
Query: 307 ALI 309
I
Sbjct: 337 PEI 339
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDYAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
Length = 330
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 164/303 (54%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRLTHSRIRRTLVR--SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M+ L R+ R + +SN + ++PD +DWREKG +T Q CGAC+AFS A+
Sbjct: 92 MSSL---RVPNQWQRNITYKSNPNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+ Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 149 EAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAT 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP+ V ++AS +F LY SG+Y D
Sbjct: 209 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYD 268
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
ACT VNH +L++GY + W++KN W ++G+ GY+ + R N CGIA+Y Y
Sbjct: 269 PACTQK-VNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSY 327
Query: 307 ALI 309
I
Sbjct: 328 PEI 330
>gi|2239109|emb|CAA70694.1| cathepsin S-like cysteine proteinase [Heterodera glycines]
Length = 353
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 108/278 (38%), Positives = 157/278 (56%), Gaps = 8/278 (2%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI 98
K I HN ++G + + NHL P Y + S +R + + N S L
Sbjct: 77 KFIDAHNLAFEKGEVSFKVAPNHLMHFTPAQYNRIRGLQMRSNRQRHNMATLAGNSSTL- 135
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF-KSTSEIEELSIQQVVDCSIIS 157
P+ LDWREKG +T +Q DCG+C+AFS AI+G + K S+I LS Q +VDCS
Sbjct: 136 PEKLDWREKGAVTEVKDQGDCGSCWAFSATGAIEGALAQKKASKIISLSEQNLVDCSSKY 195
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC GG + + YV+ GL EE YPY+ C+FK + + S+ L DE
Sbjct: 196 GNEGCDGGLMDSAFEYVRDNNGLDTEESYPYEAVTGKCQFKNETVGGTVVSFKDLKKGDE 255
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----- 272
LK+ +AT+GPI+V+++AS +FQ Y +G+Y + C++ Y++H +LLVGY +
Sbjct: 256 EQLKIAVATIGPISVALDASNLSFQFYKTGVYYERWCSNRYLDHGVLLVGYGTDETHGDY 315
Query: 273 WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W HWG+NGY+ + R N CGIA A Y ++
Sbjct: 316 WLVKNSWGPHWGENGYIRIARNKQNHCGIATMASYPVV 353
>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
Length = 331
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 166/304 (54%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y + NHL+D+ +
Sbjct: 32 KKAYGKQYKEKNEEAARRLIWEKNLKFVTLHNLEHSMGMHSYDVGMNHLADMTSEEVVSL 91
Query: 74 MT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M+ R+ H R + N + +PD +DWRE+G +T Q CGAC+AFS A+
Sbjct: 92 MSSLRIPHQWPRNVTYKL---NPNQKLPDSVDWRERGCVTEVKYQGSCGACWAFSAVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
+ Q+ T + LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 149 EAQLKLKTGNLVSLSAQNLVDCSTTKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKA 208
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + + S ++ LP E ALK +A GP++V+I+AS +F LY SG+Y
Sbjct: 209 MDQKCHYDSKHRAATCSKYTELPFGSEEALKEAVANKGPVSVAIDASHSSFFLYRSGVYY 268
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
+ +CT + VNH +L VGY ++ W++KN W H+G+ GY+ + R + N CGIANY
Sbjct: 269 EPSCTQN-VNHGVLAVGYGNLKGKDYWLVKNSWGIHFGEQGYIRMARNSKNHCGIANYPS 327
Query: 306 YALI 309
Y I
Sbjct: 328 YPEI 331
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
Length = 334
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIEIAKDRDNHCGLATAASYPVV 333
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 26 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 85
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 86 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 142
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 143 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 201
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 202 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 261
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 262 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 307
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 165/303 (54%), Gaps = 20/303 (6%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-----IKE 73
K YR + K+ +Q + +KI HN + GL + L +DL + + I
Sbjct: 32 KSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTEKEFSDMLGISR 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T+ + R+ +L + +P DWREKG +T +Q CG+C++FS ++G
Sbjct: 92 STKSSRPRVIHSLTPVKD------LPSKFDWREKGAVTEVKDQGSCGSCWSFSTTGTVEG 145
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
F T ++ LS Q +VDC+ GC+GG + L Y++ AGG+M E DYPY+G
Sbjct: 146 AYFLKTGKLVSLSEQNLVDCAK-EDCYGCSGGYMDKALEYIETAGGIMSENDYPYEGIDD 204
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+F + IS+++ + DE LK + GPI+V+I+AS FQLY SGI DD +
Sbjct: 205 KCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVAIDAS-FNFQLYDSGILDDSS 263
Query: 254 CTSDY--VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C SD+ +NH +L+VGY ++ WI+KN W WG +GY+++ R NN+CGIA A Y
Sbjct: 264 CYSDFNSLNHGVLVVGYGTEKEQDYWIVKNSWGADWGMDGYIWMSRNKNNQCGIATDATY 323
Query: 307 ALI 309
I
Sbjct: 324 PTI 326
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 170/313 (54%), Gaps = 11/313 (3%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
++EW + + ++ K Y+ ++ +K + + I HN EA +G+H + + N
Sbjct: 19 DREWGMFKV----RHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEY 74
Query: 63 SDLHPRHYIKEMTRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGA 121
+D+ +++ M R + P SN L P +DWR KG++T NQ CG+
Sbjct: 75 ADMPNEEFVRVMNGYKMQEQRPKAPTYMPPSNVGDL-PATVDWRTKGYVTEVKNQGQCGS 133
Query: 122 CYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
C+AFS +++GQ FK +++ LS Q +VDCS GN+GC GG + Y++ G+
Sbjct: 134 CWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDGID 193
Query: 182 KEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
E YPY+ C+F + N+ + + ++ + + E L+ +ATVGPIAV+I+AS +F
Sbjct: 194 TETSYPYEAASGKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASHMSF 253
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NN 296
QLY SG+Y C+ ++H +L VGY +S W++KN W WG GY+ + R +N
Sbjct: 254 QLYKSGVYHYIFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGATWGQQGYIMMSRNRDN 313
Query: 297 RCGIANYAVYALI 309
CGIA A Y +
Sbjct: 314 NCGIATQASYPTV 326
>gi|391226352|gb|AFM38108.1| cathepsin L [Patiria pectinifera]
Length = 327
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 173/317 (54%), Gaps = 20/317 (6%)
Query: 3 NKEWIIIFIFPQKKY---KKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRE 59
++EW + Q+KY ++D+R+ W+ N+K + HNQ G YT+
Sbjct: 21 DQEWQMWKDNNQRKYGAEEEDFRR--------FVWEYNYKMVTEHNQRFALGHTTYTMAM 72
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDC 119
N +DL + K+M ++ V + + +P +DWR KG++T NQ C
Sbjct: 73 NEFADLTSAEFTKKMNGFVMDKVPPKPVNT--FTDVSDLPTTVDWRTKGWVTGVKNQMQC 130
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI-ISGNL-GCAGGSLRNTLNYVQFA 177
G+C+A S +++GQ F T ++ ++S Q +VDC++ S N GC GG++ YV
Sbjct: 131 GSCWALSATGSLEGQTFNKTGKLPDISEQNLVDCAMKPSYNCHGCEGGTMNGAFQYVHDN 190
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
G+ E YPY+ + C+F N+V + ++LP DE AL++ +A VGPI+V+I+AS
Sbjct: 191 MGIDSESSYPYQAEDKKCRFNPANVVATDKTHTLLPAMDEKALQMAVAMVGPISVAIDAS 250
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR 293
+FQ+Y G+YD+ C+ ++H +L VGY + W++KN W WG GY+ + R
Sbjct: 251 HESFQMYHKGVYDEPMCSQTMLDHGVLAVGYGMEDDKAYWLVKNSWGKKWGMKGYIMMSR 310
Query: 294 -GNNRCGIANYAVYALI 309
NN+CGIA A Y L+
Sbjct: 311 FNNNQCGIATNASYPLV 327
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 174/327 (53%), Gaps = 25/327 (7%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW+ + KKY + + + K++ ++ HK I HNQ +QGL Y L N
Sbjct: 23 LVKEEWVAFKMQHDKKYDSEVEDRF---RMKIYAENKHK-IAKHNQLYEQGLVSYKLGPN 78
Query: 61 HLSDLHPRHYIKEMTRLTHSR------------IRRTLVRSPESNESVLIPDHLDWREKG 108
+D+ +I+ M + +R P V PDH+DW +KG
Sbjct: 79 KYTDMLHHEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAH---VKYPDHVDWTKKG 135
Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLR 168
+T +Q CG+C+AFS A++GQ F+ + + LS Q ++DCS GN GC GG +
Sbjct: 136 AVTEVKDQGKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMD 195
Query: 169 NTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVG 228
N Y++ GG+ E+ YPY+G C++ N + + +P DE L +ATVG
Sbjct: 196 NAFKYIKDNGGIDTEKTYPYEGVDDKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVG 255
Query: 229 PIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHW 283
P++V+I+AS ++FQ Y+ G+Y D C+S ++H +L+VGY + W++KN WS W
Sbjct: 256 PVSVAIDASQNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGGDYWLVKNSWSRTW 315
Query: 284 GDNGYMYLKRG-NNRCGIANYAVYALI 309
G+ GY+ + R +N CGIA A Y L+
Sbjct: 316 GELGYIKMARNRDNHCGIATDASYPLV 342
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/311 (34%), Positives = 173/311 (55%), Gaps = 11/311 (3%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW+ + ++ K Y+ + + ++ N +KI HN+ + G Y L+ NH
Sbjct: 24 EEWLAF----KAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFG 79
Query: 64 DLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
DL +H K + +L S ++ + L P +DWR+KG +TP + CG+C+
Sbjct: 80 DLM-QHEFKALNKLKRSAKQQNSGEVFRATGGKL-PAKVDWRQKGAVTPVKDPGQCGSCW 137
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS ++ GQ+F ++ LS QQ+VDCS GN GC GG + Y++ GG+ E
Sbjct: 138 AFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTE 197
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
YPY+ + C++K ++ + + DE+ALK +A +GPI+V+I+A +FQ
Sbjct: 198 GSYPYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQF 257
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRC 298
Y+ GIYD+ C++ ++H +L+VGY T N W++KN W WG+NGY+ + R NN C
Sbjct: 258 YSEGIYDEPFCSNTELDHGVLVVGYGTENGQDYWLVKNSWGPSWGENGYIKIARNHNNHC 317
Query: 299 GIANYAVYALI 309
GIA+ A Y ++
Sbjct: 318 GIASMASYPIV 328
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 101/296 (34%), Positives = 162/296 (54%), Gaps = 11/296 (3%)
Query: 21 YRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL-HPRHYIKEMT-RLT 78
Y ++ ++ +++N I HN E H Y L N +DL +P K + R
Sbjct: 33 YATVGEETARRGIYRANLDFIEKHNSEG----HSYKLAVNKFADLTYPEFAAKYLGLRFD 88
Query: 79 HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS 138
+ ++ S V +PD +DWR G +TP +Q CG+C++FS +++GQ +
Sbjct: 89 ATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGSVEGQHARK 148
Query: 139 TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK 198
T ++ LS Q +VDCS GN GC GG + Y+ G+ E YPY + C+F
Sbjct: 149 TGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYTAQDGTCQFN 208
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
N+ ++S+ + E L+ +ATVGPI+V+I+AS +FQ Y+SG+Y++ AC+S
Sbjct: 209 SANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGVYNEPACSSSQ 268
Query: 259 VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
++H +L VGY + + W++KN W WG +GY+++ R NN+CGIA A Y L+
Sbjct: 269 LDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQCGIATAASYPLV 324
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 162/292 (55%), Gaps = 14/292 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M H R + +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNRTS--QG 104
Query: 90 PESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
P E P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S
Sbjct: 105 PLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSE 164
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDI 206
Q +VDCS GN GC GG + YV+ GL E+ YPY + + C++ V
Sbjct: 165 QNLVDCSRPQGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKS 224
Query: 207 SSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLV 266
+ + +P +E AL +A VGP++V+I+AS + Q Y SGIY + AC+S ++HA+L+V
Sbjct: 225 TGFVDIPSGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVV 284
Query: 267 GYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
GY WI+KN WS WGD GY+Y+ K NN CG+A A Y L+
Sbjct: 285 GYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPLM 336
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/293 (36%), Positives = 160/293 (54%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M R + + + +L
Sbjct: 47 RRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRKSERKFKGSL 106
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWR+ G++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 107 FMEPNFLEA---PRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLS 163
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GL E+ YPY G C + +
Sbjct: 164 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSAN 223
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 224 DTGFIDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLV 283
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 284 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 107/293 (36%), Positives = 158/293 (53%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M R + +L
Sbjct: 12 RRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGYKRKPQRKFTGSL 71
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWR+ G++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 72 FMEPNFLEA---PRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLS 128
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GL E+ YPY G C + +
Sbjct: 129 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSAN 188
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 189 DTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLV 248
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 249 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 301
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 169/322 (52%), Gaps = 24/322 (7%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW + + ++KK Y ++ +K + N KI HN+ + G Y L NH
Sbjct: 28 EEWSLF----KMQFKKLYEDIKEETFRKKVYLDNKLKIARHNKLYESGEETYALEMNHFG 83
Query: 64 DLHPRHYIKEMTRLTHSRIRR----------TLVRSPESNESVLIPDHLDWREKGFITPD 113
DL Y K M S T ++S E+V+IP +DWR+KG++TP
Sbjct: 84 DLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKS----ENVVIPKSIDWRKKGYVTPV 139
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
NQ CG+C++FS +++GQ F+ T + LS Q ++DCS GN GC GG + Y
Sbjct: 140 KNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKY 199
Query: 174 VQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVS 233
++ GL E+ YPY+ + C++ N + + +P DE AL LATVGP++++
Sbjct: 200 IKSNKGLDTEKSYPYEAEDDKCRYNPDNSGATDNGFVDIPEGDEEALMHALATVGPVSIA 259
Query: 234 INASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGY 288
I+AS FQ Y G++ + C+S ++H +L VG+ + WI+KN W WGD GY
Sbjct: 260 IDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKKGGDYWIVKNSWGKTWGDEGY 319
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R N CG+A+ A Y L+
Sbjct: 320 IMMARNKKNNCGVASSASYPLV 341
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 174/322 (54%), Gaps = 17/322 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + +K Y + ++ + K++ Q+ HK I HNQ G Y LR N
Sbjct: 22 LVKEEWNAFKLQHRKNYDSETEERI---RLKIYVQNKHK-IAKHNQRFDLGQEKYRLRVN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTL--VRSPE-----SNESVLIPDHLDWREKGFITPD 113
+DL +++ + + +++L VR E +V +P +DWR+KG +TP
Sbjct: 78 KYADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPV 137
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
+Q CG+C++FS A++GQ F+ T ++ LS Q +VDCS GN GC GG + Y
Sbjct: 138 KDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQY 197
Query: 174 VQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVS 233
++ GG+ E+ YPY+ C F + + +P DE ALK LATVGP++++
Sbjct: 198 IKDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 234 INASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGY 288
I+AS +FQ Y+ G+Y + C S+ ++H +L VGY + W++KN W WGD GY
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGY 317
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R +N CG+A A Y L+
Sbjct: 318 VKMARNHDNHCGVATCASYPLV 339
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/282 (34%), Positives = 155/282 (54%), Gaps = 8/282 (2%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
++ N + I HN + G +TL+ N D+ + M + RR +
Sbjct: 47 FEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEEIVATMNGFLGAPTRRPAAVLKADD 106
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
E+ +P+ +DWR KG +TP +Q+ CG+C+AFS +++GQ F ++ LS Q +VDC
Sbjct: 107 ET--LPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDC 164
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN+GC GG + Y++ G+ E+ YPY+ + C+F N+ + + +
Sbjct: 165 SDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDVE 224
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
E ALK +AT+GPI+V I+AS TF Y +G+Y D+ C+S ++H +L VGY +
Sbjct: 225 HGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDEN 284
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W+ WGD GY+ + R NN CGIA+ A Y L+
Sbjct: 285 GGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQASYPLV 326
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 174/322 (54%), Gaps = 17/322 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + +K Y + ++ + K++ Q+ HK I HNQ G Y LR N
Sbjct: 22 LVKEEWNAFKLQHRKNYDSETEERI---RLKIYVQNKHK-IAKHNQRFDLGQEKYRLRVN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTL--VRSPE-----SNESVLIPDHLDWREKGFITPD 113
+DL +++ + + +++L VR E +V +P +DWR+KG +TP
Sbjct: 78 KYADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPV 137
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
+Q CG+C++FS A++GQ F+ T ++ LS Q +VDCS GN GC GG + Y
Sbjct: 138 KDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQY 197
Query: 174 VQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVS 233
++ GG+ E+ YPY+ C F + + +P DE ALK LATVGP++++
Sbjct: 198 IKDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 234 INASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGY 288
I+AS +FQ Y+ G+Y + C S+ ++H +L VGY + W++KN W WGD GY
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGY 317
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R +N CG+A A Y L+
Sbjct: 318 VKMARNRDNHCGVATCASYPLV 339
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 110/325 (33%), Positives = 169/325 (52%), Gaps = 24/325 (7%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + + ++KK Y ++ +K + N KI HN+ + G Y L N
Sbjct: 25 IIEEEWDLFKV----QFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYETGEETYALEMN 80
Query: 61 HLSDLHPRHYIKEMTRLTHSRI----------RRTLVRSPESNESVLIPDHLDWREKGFI 110
H DL Y K M S T ++S E+V+IP +DWR+KG++
Sbjct: 81 HFGDLMQHEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKS----ENVVIPKSIDWRKKGYV 136
Query: 111 TPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNT 170
TP NQ CG+C++FS +++GQ F+ T + LS Q ++DCS GN GC GG +
Sbjct: 137 TPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLA 196
Query: 171 LNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPI 230
Y++ GL E+ YPY+ + C++ N + +P DE AL LATVGP+
Sbjct: 197 FKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIPEGDEDALVHALATVGPV 256
Query: 231 AVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGD 285
+++I+AS FQ Y G++ + C+S ++H +L VGY + WI+KN W WGD
Sbjct: 257 SIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKNSWGKTWGD 316
Query: 286 NGYMYLKRG-NNRCGIANYAVYALI 309
GY+ + R N CG+A+ A Y L+
Sbjct: 317 QGYIMMARNKKNNCGVASSASYPLV 341
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 163/303 (53%), Gaps = 12/303 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK+Y + + + + N K+ HN ++G Y + N DL + M
Sbjct: 34 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNG 93
Query: 77 LTH-----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
H SR T +N V +P+ +DWREKG ITP +Q CG C+AFS A+
Sbjct: 94 YQHKKQNSSRAESTFTFMEPAN--VEVPESVDWREKGAITPVKDQGQCGPCWAFSSTGAL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ L Q ++DCS GN GC GG + Y++ G+ E YPY+ +
Sbjct: 152 EGQTFRKTGKLVSLREQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAE 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C++ N + +P +E LK +ATVGP++V+I+AS +FQ Y+ G+Y +
Sbjct: 212 DDVCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYE 271
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+C SD ++H +L+VGY ++ W++KN WS HWGD GY+ + R N CG+A A Y
Sbjct: 272 PSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDQGYIKIARNRKNHCGVATAASY 331
Query: 307 ALI 309
L+
Sbjct: 332 PLV 334
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 107/293 (36%), Positives = 160/293 (54%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM---TRLTHSRIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M R + + + +L
Sbjct: 47 RRMVWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRKSERKFKGSL 106
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWR+ G++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 107 FMEPNFLEA---PRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLS 163
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GL E+ YPY G C + +
Sbjct: 164 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSAN 223
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 224 DTGFIDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLV 283
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 284 VGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 104/304 (34%), Positives = 167/304 (54%), Gaps = 10/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y ++ +KL W+ N + HN + G Y L N +DL ++
Sbjct: 32 KNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEEFVAM 91
Query: 74 MTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
MT + + S SN +P +DWR KG++TP +Q CG+C+AFS ++
Sbjct: 92 MTGFRVNGTSKAAKGSTFLPSNNVDKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSL 151
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ FK T ++ LS Q +VDCS N GC GG + Y+ AGG+ E Y Y+
Sbjct: 152 EGQQFKKTGKLVSLSEQNLVDCSYR--NYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAV 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C FK+ N+ ++ ++ + E AL+ +A +GPI+V+I+AS F+ Y SG+Y++
Sbjct: 210 DGNCHFKKANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFYKSGVYNE 269
Query: 252 EACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
C++ + HA+L+VGY S WI+KN W+ WG NGY+++ R +N+CGIA+ A
Sbjct: 270 PGCSTTRLGHAVLVVGYGTTSDGTDYWIVKNSWAKTWGMNGYLWMSRNKDNQCGIASEAS 329
Query: 306 YALI 309
Y ++
Sbjct: 330 YPMV 333
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 107/301 (35%), Positives = 166/301 (55%), Gaps = 10/301 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFN- 92
Query: 77 LTHSRIRRTLVRS---PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
H R+T S P + +P +DWR+KG +TP +Q CG+C+AFS +++G
Sbjct: 93 -GHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEG 151
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPYK
Sbjct: 152 QHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVDG 211
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+
Sbjct: 212 ECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE 271
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYAL 308
C+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y L
Sbjct: 272 CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
Query: 309 I 309
+
Sbjct: 332 V 332
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRIIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|83715950|dbj|BAE54434.1| silicatein [Ephydatia fluviatilis]
Length = 326
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 165/303 (54%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + K Y + + +K W SN K I HN A GYTL NH DL R + +
Sbjct: 28 KDSHGKSYTSELEELEKHSVWLSNRKYIEEHNAHADD--FGYTLAMNHFGDLSEREFKDK 85
Query: 74 MTRLTHSRIRRTL--VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
LTH T V + ++ + + D +DWR KG +T NQ DCGA YAF+ +
Sbjct: 86 F--LTHEPGNYTSRGVATFKAPQGMKYVDSIDWRTKGAVTSVKNQGDCGASYAFAATGTM 143
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+G S + LS Q ++DCS+ GN GC+GG + YV GG+ E Y ++GK
Sbjct: 144 EGANALSNDKQVALSEQNIIDCSVAYGNHGCSGGDTYTAIKYVVDNGGIDTESSYSFRGK 203
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
QS C++ N + + E L +ATVGP+AV+++A+ + F+ Y SG++D
Sbjct: 204 QSSCQYNSKNSGASATGAVSISYGSESDLMSAVATVGPVAVAVDANTNAFRFYQSGVFDS 263
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
C+S +NHAML+ GY ++ W++KN W +WGD+GY+ + R N+CGIA+ A+Y
Sbjct: 264 STCSSTKLNHAMLVTGYGSYNGKDYWLVKNSWGKYWGDSGYIMMVRNKYNQCGIASDALY 323
Query: 307 ALI 309
+++
Sbjct: 324 SML 326
>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 106/293 (36%), Positives = 159/293 (54%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHS---RIRRTL 86
+++ W+ N +KI HN E G H Y L NH D+ + + M H R++ +L
Sbjct: 47 RRVVWEKNLRKIEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHKAERRVKGSL 106
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +D+R+ G+ TP +Q CG+C+AFS A++GQ+F+ ++ LS
Sbjct: 107 FMEPNFIEA---PKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLS 163
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GGL E+ YPY G C + +
Sbjct: 164 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAAN 223
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A +FQ Y SGIY ++ C+S ++H +L+
Sbjct: 224 DTGFVDIPEGKERALMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTELDHGVLV 283
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 284 VGYGFEGEDVDGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336
>gi|432108215|gb|ELK33129.1| Cathepsin L1 [Myotis davidii]
Length = 334
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 161/290 (55%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV-R 88
++ W+ N K I HN+E G+T+ N D+ + + M + + R V R
Sbjct: 48 RRAVWEKNMKMIELHNREYSLRKQGFTMAMNAFGDMTNEEFRQVMNGFQNQKQRNGKVFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + IP +DWR+KG++TP NQ CG+C+AFS +++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFAQ---IPSSVDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + N YV+ GL EE YPY ++S RP +
Sbjct: 165 NLVDCSRAQGNEGCNGGLMDNAFQYVKDNKGLDTEESYPYLARESNTCNYRPEYSAANDT 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V PQ E AL +ATVGPI+V+I+A +FQ Y +GIY + C+S ++H +L+VGY
Sbjct: 225 GFVDIPQREKALLKAVATVGPISVAIDAGHSSFQFYNAGIYYEPNCSSKDLDHGVLVVGY 284
Query: 269 ------TRNS--WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++N+ WI+KN W WG NGY+ + R +N CGIA A Y +
Sbjct: 285 GSEGGESKNNKFWIVKNSWGSGWGMNGYVKMARDQSNHCGIATAASYPTV 334
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 106/307 (34%), Positives = 168/307 (54%), Gaps = 17/307 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y + + + + N K+ HN+ QGL + L N SD+ ++ +
Sbjct: 34 HKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNG 93
Query: 77 LTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
S+ T +RS E +ES+ +P +DWR+ G +TP +Q CG+C++FS
Sbjct: 94 YNRSK---TPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTT 150
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++GQ F+ + ++ LS Q ++DCS GN GC GG + N Y++ GG+ E+ YPY
Sbjct: 151 GSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPY 210
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K + C +K N + + DE LK +ATVGPI+V+I+AS TFQ Y+ G+
Sbjct: 211 KAEDEKCHYKPRNKGATDRGFVDIESGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGV 270
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
Y + C+S+ ++H +L+VGY + W++KN W WGD GY+ + R +N CGIA
Sbjct: 271 YYEPECSSEQLDHGVLVVGYGTDEDGNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIAT 330
Query: 303 YAVYALI 309
A Y L+
Sbjct: 331 QASYPLV 337
>gi|351694995|gb|EHA97913.1| Cathepsin L1 [Heterocephalus glaber]
Length = 278
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 102/277 (36%), Positives = 153/277 (55%), Gaps = 12/277 (4%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI 98
+ I HN E +G HG+T+ N D+ + + M H + ++ ++ + + +
Sbjct: 2 RMIELHNGEYSEGKHGFTMAMNAFGDMTSEEFKQVMNGFQHQKHKKG--KTYQEPLLLQL 59
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWR+KG++TP NQ CG+C+AFS +++GQ+F+ T ++ LS Q +VDCS G
Sbjct: 60 PKSVDWRKKGYVTPVKNQGQCGSCWAFSATGSLEGQMFRKTGQLVSLSEQNLVDCSQPQG 119
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YV+ GL E+ YPY+GK C++K P + + V PQ E
Sbjct: 120 NQGCNGGLMDFAFEYVKENKGLESEKSYPYEGKDGSCRYK-PELSAANDTGFVDIPQREK 178
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +A GPI+V+++A +FQ Y GIY D C+S +NH +L+VGY
Sbjct: 179 ALMKAVAEKGPISVAVDAGLMSFQFYKDGIYFDPECSSKDLNHGVLVVGYGYEEVDTEKN 238
Query: 273 --WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
W++KN W WG GY+ + R NN CGIA A Y
Sbjct: 239 EYWLVKNSWGPEWGAEGYIKIARNRNNHCGIATAASY 275
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 107/300 (35%), Positives = 166/300 (55%), Gaps = 8/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG 93
Query: 77 LTHSRIR--RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
SR T + N+S L P +DWR+KG +TP +Q CG+C+AFS +++GQ
Sbjct: 94 YHGSRKSGGSTFLPPANVNDSSL-PKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQ 152
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPY+
Sbjct: 153 HFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGE 212
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+ C
Sbjct: 213 CRFKKEDVGATDTGYVEIKAGCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPEC 272
Query: 255 TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y L+
Sbjct: 273 SSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
Length = 327
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 174/316 (55%), Gaps = 14/316 (4%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+ EW + K+Y +Y D+ + + N K + HN+EA G H + +R N
Sbjct: 17 DNEWEAFKLLHGKQYN-EYE----DTARHAIFLENCKIVKQHNEEAAMGKHTFFMRMNKF 71
Query: 63 SDL---HPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDC 119
DL R + + +R ++ ES + + D +DWR+KG +T NQE C
Sbjct: 72 GDLTNEEFRMLVIGSGLMQSNRTQQAEGGVFESIPGLKVNDTVDWRQKGAVTKVKNQEQC 131
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G+C+AFS +++GQ F + + LS Q +VDCS GN GC GG + Y++ GG
Sbjct: 132 GSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCKGGLMDQAFKYIKTNGG 191
Query: 180 LMKEEDYPYKGK-QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
+ EE YPYKG+ + C++K +SS+ + DE ALK AT+GPI+V I+AS
Sbjct: 192 IDTEECYPYKGRDERKCEYKASCSGATLSSFVDVKTGDEDALKQASATIGPISVGIDASH 251
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG 294
+FQLY G+Y ++ C+S ++H +L+VGY T++ W++KN W WG GY+ + R
Sbjct: 252 PSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGTQSTKDYWLVKNSWGADWGMEGYIMMSRN 311
Query: 295 -NNRCGIANYAVYALI 309
+N+CGIA A Y ++
Sbjct: 312 KDNQCGIATQASYPVV 327
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 108/298 (36%), Positives = 162/298 (54%), Gaps = 10/298 (3%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K K Y+ ++ + +Q+ +I HN +QGL Y N SD + +
Sbjct: 29 KQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFEQGLETYKKGVNKFSDWTQDEFNAYLG 88
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
H + + P V +P +DWR +G++T NQ DCG+C+AFS+ +++G +
Sbjct: 89 --LHPKPAKLGKGIPYVKTGVSVPASVDWRTEGYVTGVKNQGDCGSCWAFSLTGSVEGAL 146
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
FKST ++ LS QQ+VDC+ + N GC GG L T Y+Q GL E YPYK + C
Sbjct: 147 FKSTGKLVSLSEQQLVDCTYGTVNFGCDGGYLEETFPYIQ-ETGLEAEASYPYKARDGTC 205
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
KF +V I+ + V DE AL AT+GPI+V+++A + YASG++ C+
Sbjct: 206 KFDASKVVTKINDY-VYWYGDEEALLEATATIGPISVAMDA--NYIDSYASGVFSSRLCS 262
Query: 256 SDYVNHAMLLVGYTR----NSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
SD +NH +L+VGY N W++KN W+ WG++GY+ L RG N CGIA Y ++
Sbjct: 263 SDDLNHGVLVVGYGSENGVNYWLVKNSWAEDWGESGYLKLLRGQNECGIAEDDSYPIV 320
>gi|395509415|ref|XP_003758993.1| PREDICTED: cathepsin L1-like, partial [Sarcophilus harrisii]
Length = 323
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/317 (36%), Positives = 172/317 (54%), Gaps = 32/317 (10%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y+K+Y +K +K++ W+ N K I+ N ++G Y L N+L DL + + +
Sbjct: 15 YEKNYTEKEESFRKQV-WEKNMKFINDQNLLYKEGKLSYYLGMNNLGDLTDKEFKIMLNP 73
Query: 77 LTHSRIRRTLVRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
R+RR + N S+ +P +DWREKGFITP Q CG+C+AFS A++GQ
Sbjct: 74 SMLQRVRRD---TTTKNFSIFSHLPKSVDWREKGFITPVRQQGRCGSCWAFSATGAVEGQ 130
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ-S 193
+F T ++ ELS Q ++DCS GC GG++ + Y++ G++ EE YPY K+ S
Sbjct: 131 LFLKTGKLVELSKQNLIDCSKFQ---GCHGGTVTSAFKYIKKNEGIVSEECYPYVAKKNS 187
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
+C ++ V I + VLP +E L +A VGP++VS+NA + Y GIY +
Sbjct: 188 LCSYRSECAAVKIRDYVVLPYGNEEILMEAVAIVGPVSVSLNAQ-KSLHFYKGGIYVEPK 246
Query: 254 CTSDYVNHAMLLVGY-----TRNS---------------WILKNWWSHHWGDNGYMYL-K 292
C Y NHA+LLVGY T++ WILKN W +WG GY+Y+ K
Sbjct: 247 CKPRYTNHALLLVGYGYKEGTKDKKSRLEEENQYEETKFWILKNSWGVNWGKKGYVYIAK 306
Query: 293 RGNNRCGIANYAVYALI 309
NN CG+A A+Y ++
Sbjct: 307 DKNNHCGVATRAIYPIL 323
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 165/303 (54%), Gaps = 12/303 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK+Y + + + + N K+ HN ++G Y + N DL + M
Sbjct: 38 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNG 97
Query: 77 LTH-----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
H SR T +N V +P+ +DWR KG ITP +Q CG+C+AFS A+
Sbjct: 98 YQHKKQNSSRAESTFTFMEPAN--VEVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGAL 155
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F+ T ++ LS Q ++DCS GN GC GG + Y++ G+ E YPY+ +
Sbjct: 156 EGQTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAE 215
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
++C++ N + +P +E LK +ATVGP++V+I+AS +FQ Y+ G+Y +
Sbjct: 216 DNVCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYE 275
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+C SD ++H +L+VGY ++ W++KN WS HWGD GY+ + R N CGIA A Y
Sbjct: 276 PSCDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGIATAASY 335
Query: 307 ALI 309
L+
Sbjct: 336 PLV 338
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 174/324 (53%), Gaps = 19/324 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + QK+Y + K + K++ ++ H I HNQ+ +G + L++N
Sbjct: 22 LVKEEWNAFKMEHQKQYDSEVEDKF---RMKIYAENKHN-IAKHNQKYARGEVSFRLKQN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNE---------SVLIPDHLDWREKGFIT 111
D+ ++ M + + + E +V +PDH+DWR+ G +T
Sbjct: 78 KYGDMLHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATFITPANVHLPDHVDWRKHGAVT 137
Query: 112 PDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTL 171
+Q CG+C++FS A++GQ ++ T+ + LS Q ++DCS GN GC GG + N
Sbjct: 138 EVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDCSAAYGNNGCNGGLMDNAF 197
Query: 172 NYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIA 231
Y++ G+ E+ YPY+G C++ N D + + +P DE L +ATVGP++
Sbjct: 198 KYIKDNRGIDTEKSYPYEGIDDKCRYNPKNTGADDNGFVDIPSGDEGKLMAAVATVGPVS 257
Query: 232 VSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDN 286
V+I+AS +FQ Y+ G+Y DE C+S ++H +L+VGY + W++KN W WGD
Sbjct: 258 VAIDASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDL 317
Query: 287 GYMYLKRG-NNRCGIANYAVYALI 309
GY+ + R +N CGIA A Y L+
Sbjct: 318 GYIKMARNRDNHCGIATAASYPLV 341
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 168/322 (52%), Gaps = 24/322 (7%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW + + ++KK Y ++ +K + N KI HN+ + G Y L NH
Sbjct: 28 EEWSLF----KAQFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYETGEETYALEMNHFG 83
Query: 64 DLHPRHYIKEMTRLTHSRI----------RRTLVRSPESNESVLIPDHLDWREKGFITPD 113
DL Y K M S T ++S E+V++P +DWR+KG++TP
Sbjct: 84 DLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKS----ENVVVPKAIDWRKKGYVTPV 139
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
NQ CG+C++FS +++GQ F+ T + LS Q ++DCS GN GC GG + Y
Sbjct: 140 KNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKY 199
Query: 174 VQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVS 233
++ GL E+ YPY+ + C++ N + +P DE AL LATVGP++++
Sbjct: 200 IKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIPEGDEDALMHALATVGPVSIA 259
Query: 234 INASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGY 288
I+AS FQ Y G++ + C+S ++H +L VGY + WI+KN W WGD GY
Sbjct: 260 IDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGY 319
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R N CG+A+ A Y L+
Sbjct: 320 IMMARNKKNNCGVASSASYPLV 341
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 106/308 (34%), Positives = 166/308 (53%), Gaps = 14/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ K Y ++ + + N +KI HN+ G Y L N D+ ++ M
Sbjct: 35 EHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDMLHHEFVNMMN 94
Query: 76 --RLTHSRIRRTLVRS------PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
R S R E E V++P +DWREKG +T +Q CG+C+AFS
Sbjct: 95 GFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQGSCGSCWAFSA 154
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ ++ T ++ LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 155 TGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVNGGIDTEKSYP 214
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ + C++ N D + + +E+ALK +AT+GP++V+I+AS +FQ Y G
Sbjct: 215 YEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAIDASQDSFQFYQHG 274
Query: 248 IYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+Y D C+++ ++H +L VGY ++ W++KN WS WGD GY+ + R NN CGIA
Sbjct: 275 VYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIKIARNQNNMCGIA 334
Query: 302 NYAVYALI 309
+ A Y L+
Sbjct: 335 SAASYPLV 342
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 107/302 (35%), Positives = 167/302 (55%), Gaps = 12/302 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFN- 92
Query: 77 LTHSRIRRT----LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
H R+T + N+S L P +DWR+KG +TP +Q CG+C+AFS +++
Sbjct: 93 -GHHGTRKTGGSTFLPPANVNDSSL-PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLE 150
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPY+
Sbjct: 151 GQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD 210
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+
Sbjct: 211 GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP 270
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYA 307
C+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y
Sbjct: 271 ECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYP 330
Query: 308 LI 309
L+
Sbjct: 331 LV 332
>gi|75060921|sp|Q5E998.1|CATL2_BOVIN RecName: Full=Cathepsin L2; Flags: Precursor
gi|59858409|gb|AAX09039.1| cathepsin L2 preproprotein [Bos taurus]
Length = 334
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 109/290 (37%), Positives = 156/290 (53%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HNQE +G HG+ + N D+ + + M + + ++ L
Sbjct: 48 RRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFH 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P V +P +DW +KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLL---VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + N Y++ G L EE YPY + +P +
Sbjct: 165 NLVDCSRAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDT 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VGY
Sbjct: 225 GFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 285 GFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 156/284 (54%), Gaps = 6/284 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY K Y ++ ++ + +K+ HN +QGL Y L N +D+H + K
Sbjct: 31 KAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEFRKM 90
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M R ++V ESN + +P +DWR KG +TP NQ CG+C+AFS +++G
Sbjct: 91 MNGYRRGTPRNSVVVHVESN--ITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSLEG 148
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q ++ LS Q++VDCS GN GC GG + + Y++ G+ E+ YPY G+
Sbjct: 149 QHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQSYPYTGEDG 208
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C FK+ ++ ++ + + E L+ AT+GPI+V+I+AS FQLY SG+YD
Sbjct: 209 TCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQLYESGVYDVSD 268
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR 293
C++ ++H +L+VGY + W++KN W WG +GY+ + R
Sbjct: 269 CSTTELDHGVLVVGYGTDDGTAYWLVKNSWGTDWGHHGYIQMSR 312
>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
Length = 331
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 106/304 (34%), Positives = 167/304 (54%), Gaps = 13/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+KK Y + +K+ W++N K I HN++ +G HG++L N DL + +
Sbjct: 33 KTKHKKTYNMN-DEGQKRAVWENNKKMIDLHNEDYLKGKHGFSLEMNAFGDLTNTEFREL 91
Query: 74 MTRLTHSRIRRTL--VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
MT + + + + P + +P +DWR+ G++TP +Q CG+C+AFS ++
Sbjct: 92 MTGFQGQKTKMMMKVFQEPLLGD---VPKSVDWRDHGYVTPVKDQGSCGSCWAFSAVGSL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+F+ T ++ LS+Q +VDCS GN GC GG YV+ GGL YPY+
Sbjct: 149 EGQMFRKTGKLVPLSVQNLVDCSWSQGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPYEAL 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N ++ + V E AL +ATVGPI+V I+ +FQ Y G+Y +
Sbjct: 209 NGTCRYNPKNSAATVTGF-VNVQSSEDALMKAVATVGPISVGIDTKHKSFQFYKEGMYYE 267
Query: 252 EACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
C+S ++HA+L+VGY S W++KN W WG NGY+ + K NN CGIA+ A
Sbjct: 268 PDCSSTVLDHAVLVVGYGEESDGRKYWLVKNSWGRDWGMNGYIKMAKDRNNNCGIASDAS 327
Query: 306 YALI 309
Y ++
Sbjct: 328 YPVV 331
>gi|2239107|emb|CAA70693.1| cathepsin L-like cysteine proteinase [Heterodera glycines]
Length = 374
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 172/305 (56%), Gaps = 9/305 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
++K+ K Y + ++++ L + S + I HN+ ++G + + E H++DL Y K
Sbjct: 70 KQKHGKAYADQEVENERMLTYLSAKQFIDKHNEAYKEGKVSFRVGETHIADLPFSEYQKL 129
Query: 73 -EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
RL +RR +P+ +DWR+KG++T NQ CG+C+AFS A+
Sbjct: 130 NGFRRLMGDSLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAFSATGAL 189
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ + + LS Q ++DCS GN+GC GG + N Y++ G+ KE YPYK K
Sbjct: 190 EGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNKGIDKETAYPYKAK 249
Query: 192 QS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C FKR ++ S ++ + DE LK+ +AT GP++V+I+A +FQLY +G+Y
Sbjct: 250 TGKKCLFKRNDVGATDSGYNDIAEGDEEDLKMAVATQGPVSVAIDAGHRSFQLYTNGVYF 309
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
++ C + ++H +L+VGY + WI+KN W WG+ GY+ + R NN CGIA++A
Sbjct: 310 EKECDPENLDHGVLVVGYGTDPTQGDYWIVKNSWGTRWGEQGYIRMARNRNNNCGIASHA 369
Query: 305 VYALI 309
+ L+
Sbjct: 370 SFPLV 374
>gi|74142447|dbj|BAE31977.1| unnamed protein product [Mus musculus]
Length = 334
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 106/286 (37%), Positives = 158/286 (55%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP N+ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNKGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 106/293 (36%), Positives = 158/293 (53%), Gaps = 16/293 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHS---RIRRTL 86
+++ W+ N +KI HN E G H Y L NH D+ + + M H R++ +L
Sbjct: 47 RRVVWEKNLRKIEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHKAERRVKGSL 106
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +D+R+ G+ TP +Q CG+C+AFS A++GQ+F+ ++ LS
Sbjct: 107 FMEPNFIEA---PKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLS 163
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GGL E+ YPY G C + +
Sbjct: 164 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAAN 223
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A VGP++V+I+A FQ Y SGIY ++ C+S ++H +L+
Sbjct: 224 DTGFVDIPEGKERALMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTELDHGVLV 283
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY + WI+KN WS WGD GY+Y+ K N CGIA A Y L+
Sbjct: 284 VGYGFEGEDVDGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPLM 336
>gi|74219261|dbj|BAE26764.1| unnamed protein product [Mus musculus]
Length = 333
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 162/289 (56%), Gaps = 12/289 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HN E +G H +T+ N DL ++K MT +I+R V
Sbjct: 48 RRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRRQKIKRMHVF- 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
+ ++ + +P ++DWR G++TP NQ C + +AFS +++GQ+FK T + LS Q
Sbjct: 107 -QDHQFLYVPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQN 165
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
++DC + C+GG ++N YV+ GGL EE YPY G C++ N ++ +
Sbjct: 166 LLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATEESYPYIGPDRKCRYHAENSAANVRDF 225
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P ++E AL +A VGPI+V+++AS +FQ Y SGIY + C ++NHA+L+VGY
Sbjct: 226 VQIPGREE-ALMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYG 284
Query: 269 -------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN W WG GY+ + + NN CGIA A Y ++
Sbjct: 285 FEGEESDGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333
>gi|288764223|emb|CAQ03432.1| silcatein 1 [Spongilla lacustris]
gi|296168747|emb|CAQ54051.1| silicatein alpha 3 [Spongilla lacustris]
Length = 327
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 102/298 (34%), Positives = 164/298 (55%), Gaps = 9/298 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K Y + + ++ W +N K I HN A L GYTL N D+ + +
Sbjct: 34 HQKSYDTELLEMERHAIWLANKKYIDHHNANAN--LFGYTLAMNGFGDMTSAEFAESF-- 89
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
LTH ++R +++ E+ + V D +DWR KG +T Q CG+ YAF+ A++G
Sbjct: 90 LTHKHVQRLGLQAFEAPKGVSYADSMDWRTKGVVTSVKTQSQCGSSYAFAAVGALEGASA 149
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+T ++ LS Q ++DCS+ GN GC+GG YV GG+ E YPYKGKQS C+
Sbjct: 150 LATDKLVALSEQNIIDCSVPYGNHGCSGGDTYTAFKYVVDNGGIDTESSYPYKGKQSSCQ 209
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ N + + E L +A+ GP+AV+++AS ++F Y SG++D C++
Sbjct: 210 YNSKNAGATATGVVKIASGSESDLMSAVASGGPVAVAVDASVNSFMFYQSGVFDSSTCSN 269
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NHAML+ GY ++ W++KN W WG++GY+ + R N+CGIA+ A+ ++
Sbjct: 270 TKLNHAMLVTGYGSVNGKDYWLVKNSWGTSWGESGYIRMVRNKYNQCGIASDALIPML 327
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 104/290 (35%), Positives = 159/290 (54%), Gaps = 11/290 (3%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M R +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDPNRTSKGAL 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S Q
Sbjct: 107 FMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQN 166
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I+
Sbjct: 167 LVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITG 226
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ +P +E AL +A VGP++V+I+AS + Q Y SGIY + ACTS ++HA+L+VGY
Sbjct: 227 FVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR-LDHAVLVVGY 285
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN WS WGD GY+Y+ K NN CGIA A Y L+
Sbjct: 286 GYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|296168737|emb|CAQ54046.1| silicatein alpha 2 [Ephydatia muelleri]
Length = 340
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 165/303 (54%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + K Y + + ++ W SN K I HN A GYTL NH DL R + +
Sbjct: 42 KDSHGKSYTSELEELERHSVWLSNRKYIEEHNAHADD--FGYTLAMNHFGDLSEREFKDK 99
Query: 74 MTRLTHSRIRRTL--VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
LTH T V + ++ + + D +DWR KG +T NQ DCGA YAF+ +
Sbjct: 100 F--LTHEPGNYTSRGVATFKAPQGMKYVDSIDWRTKGAVTSVKNQGDCGASYAFAATGTM 157
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+G S + LS Q ++DCS+ GN GC+GG + YV GG+ E Y ++GK
Sbjct: 158 EGANALSNDKQVALSEQNIIDCSVAYGNHGCSGGDTYTAIKYVVDNGGIDTESSYSFRGK 217
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
QS C++ N + + E L +ATVGP+AV+++A+ + F+ Y SG++D
Sbjct: 218 QSSCQYNSKNSGASATGAVSISYGSESDLMSAVATVGPVAVAVDANTNAFRFYQSGVFDS 277
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
C+S +NHAML+ GY ++ W++KN W +WGD+GY+ + R N+CGIA+ A+Y
Sbjct: 278 STCSSTKLNHAMLVTGYGSYNGKDYWLVKNSWGKYWGDSGYIMMVRNKYNQCGIASDALY 337
Query: 307 ALI 309
+++
Sbjct: 338 SML 340
>gi|301609082|ref|XP_002934106.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 333
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 111/302 (36%), Positives = 163/302 (53%), Gaps = 10/302 (3%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K ++K Y+ + ++ W+ K I HN E GLH Y + NHL D+ M
Sbjct: 35 KTHQKTYKDAEEERARRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATM 94
Query: 75 TRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T T SR + P+ P +DWR +G +T NQ CG+CYAF A++
Sbjct: 95 TGYTGSRDSLANMTEVPKEILEAQTPASIDWRTQGCVTSVKNQGSCGSCYAFGTVGALEC 154
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q K + S Q++VDCS GN GC GG L+ + Y++ G+M+E YPY K+
Sbjct: 155 QWKKKMGTLVSFSPQELVDCSYTEGNNGCKGGYLQASFRYMKKY-GIMEESSYPYTAKEG 213
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CK +P+ V + ++ V+P E L L TVGP++V+I+ S F++Y SG+Y D
Sbjct: 214 RCKKDKPSNVGVVKTFYVVPAGKELLLMKVLGTVGPVSVAIDCSREGFRMYKSGVYYDPY 273
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGY--MYLKRGNNRCGIANYAVYA 307
CT+ V+HA+L+VGY ++ W++KN W +GD GY M RGNN C IA++AVY
Sbjct: 274 CTTK-VDHAVLVVGYGTDNGKDYWLVKNSWGVGYGDKGYIKMARNRGNN-CAIASHAVYP 331
Query: 308 LI 309
+
Sbjct: 332 TV 333
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 166/301 (55%), Gaps = 10/301 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFN- 92
Query: 77 LTHSRIRRTLVRS---PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
H R+T S P + +P +DWR+KG +TP +Q CG+C+AFS +++G
Sbjct: 93 -GHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEG 151
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPY+
Sbjct: 152 QHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDG 211
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+
Sbjct: 212 ECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE 271
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYAL 308
C+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y L
Sbjct: 272 CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
Query: 309 I 309
+
Sbjct: 332 V 332
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 110/319 (34%), Positives = 172/319 (53%), Gaps = 19/319 (5%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW + + ++ K Y + + + + N KI HN+ Q G Y L NH
Sbjct: 29 EEWELF----KTQFSKAYNTEIEEKFRMKVFMDNKHKIARHNKLFQNGEVSYELEMNHFG 84
Query: 64 DLHPRHYIKEMTRLTHSRIRRT------LVRSPESNESVLIPDHLDWREKGFITPDWNQE 117
DL ++K + HS R T + P N V +PD +DWR +G +T NQ
Sbjct: 85 DLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAYN--VTVPDSVDWRTEGAVTEVKNQG 142
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+C+AFS +++GQ F++T ++ LS Q ++DCS GN GC+GG + N Y++
Sbjct: 143 QCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGKYGNNGCSGGLMDNAFAYIKSN 202
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
G+ E+ YPY+G C++K + +P DE LK+ +ATVGPI+V+I+AS
Sbjct: 203 KGIDTEQSYPYEGIDDKCRYKPQESGATDKGFVDIPQGDEEKLKLAVATVGPISVAIDAS 262
Query: 238 PHTFQLYASGIYDDEACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL 291
+FQ Y G+Y D+ C + + ++H +L VGY ++ W++KN W WG +GY+ +
Sbjct: 263 HQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGTENGKDYWLVKNSWGKRWGLDGYIKM 322
Query: 292 KRG-NNRCGIANYAVYALI 309
R +N CGIA A Y L+
Sbjct: 323 ARNKHNHCGIATSASYPLV 341
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 166/301 (55%), Gaps = 10/301 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFN- 92
Query: 77 LTHSRIRRTLVRS---PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
H R+T S P + +P +DWR+KG +TP +Q CG+C+AFS +++G
Sbjct: 93 -GHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEG 151
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPY+
Sbjct: 152 QHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDG 211
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+
Sbjct: 212 ECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPE 271
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYAL 308
C+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y L
Sbjct: 272 CSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
Query: 309 I 309
+
Sbjct: 332 V 332
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 173/309 (55%), Gaps = 15/309 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++ +Y + D+ + + + I HNQ+ + GL Y L N D+ ++K M
Sbjct: 33 QHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMN 92
Query: 76 RLTHSR-------IRRTLVRSPE--SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFS 126
+ ++ VR + S +V +P+ +DWR+ G +T +Q CG+C++FS
Sbjct: 93 GFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFS 152
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++GQ F+ + + LS Q ++DCS GN GC GG + N Y++ GG+ E+ Y
Sbjct: 153 TTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTY 212
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY+G C++ N + + +P DE L +ATVGP++V+I+AS +FQLY+S
Sbjct: 213 PYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSS 272
Query: 247 GIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
G+Y++E C+S ++H +L+VGY + W++KN W WG+ GY+ + R NNRCGI
Sbjct: 273 GVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGI 332
Query: 301 ANYAVYALI 309
A+ A Y L+
Sbjct: 333 ASSASYPLV 341
>gi|30017423|ref|NP_835199.1| testin-2 precursor [Mus musculus]
gi|81895036|sp|Q80UB0.1|TEST2_MOUSE RecName: Full=Testin-2; Contains: RecName: Full=Testin-1; Flags:
Precursor
gi|29289939|gb|AAN63093.1| testin precursor [Mus musculus]
gi|38173997|gb|AAH61218.1| RIKEN cDNA 4930486L24 gene [Mus musculus]
Length = 333
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 162/289 (56%), Gaps = 12/289 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HN E +G H +T+ N DL ++K MT +I+R V
Sbjct: 48 RRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRRQKIKRMHVF- 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
+ ++ + +P ++DWR G++TP NQ C + +AFS +++GQ+FK T + LS Q
Sbjct: 107 -QDHQFLYVPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQN 165
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
++DC + C+GG ++N YV+ GGL EE YPY G C++ N ++ +
Sbjct: 166 LLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATEESYPYIGPGRKCRYHAENSAANVRDF 225
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P ++E AL +A VGPI+V+++AS +FQ Y SGIY + C ++NHA+L+VGY
Sbjct: 226 VQIPGREE-ALMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYG 284
Query: 269 -------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN W WG GY+ + + NN CGIA A Y ++
Sbjct: 285 FEGEESDGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333
>gi|148709357|gb|EDL41303.1| RIKEN cDNA 4930486L24 [Mus musculus]
Length = 334
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 162/289 (56%), Gaps = 12/289 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W+ N K I HN E +G H +T+ N DL ++K MT +I+R V
Sbjct: 49 RRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRRQKIKRMHVF- 107
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
+ ++ + +P ++DWR G++TP NQ C + +AFS +++GQ+FK T + LS Q
Sbjct: 108 -QDHQFLYVPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQN 166
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
++DC + C+GG ++N YV+ GGL EE YPY G C++ N ++ +
Sbjct: 167 LLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATEESYPYIGPGRKCRYHAENSAANVRDF 226
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P ++E AL +A VGPI+V+++AS +FQ Y SGIY + C ++NHA+L+VGY
Sbjct: 227 VQIPGREE-ALMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYG 285
Query: 269 -------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN W WG GY+ + + NN CGIA A Y ++
Sbjct: 286 FEGEESDGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 334
>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
Length = 334
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 108/290 (37%), Positives = 155/290 (53%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HNQE +G H + + N D+ + + M + + ++ L
Sbjct: 48 RRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFH 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P V +P +DW +KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLL---VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC GG + N Y++ GGL EE YPY + +P +
Sbjct: 165 NLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDT 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+ ++H +L+VGY
Sbjct: 225 GFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 285 GFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>gi|356984263|gb|AET43955.1| cathepsin L2, partial [Reishia clavigera]
Length = 278
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 156/279 (55%), Gaps = 9/279 (3%)
Query: 17 YKKDYRK---KATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KK Y K +S +++ W+ N KKI HN EA +GLH Y L N L DL + +
Sbjct: 1 FKKTYNKLYSAEDESIRRMIWERNLKKIEEHNLEADRGLHTYRLGMNPLGDLTAKDFSWM 60
Query: 74 MTRLTHSRIRRT-LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ S R P SN L P +DWR KG++TP NQ+ CG+C+AFS +++
Sbjct: 61 LNGYKMSANRTAGATYLPPSNVGDL-PSEVDWRTKGYVTPVKNQKQCGSCWAFSATGSLE 119
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ FK T + LS Q +VDCS GN GC GG + Y++ G+ E+ YPY+
Sbjct: 120 GQHFKKTGTLVSLSEQNLVDCSKKEGNEGCEGGLMDQAFEYIKRNKGIDTEQSYPYRAVD 179
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+F R ++ + ++ + E L+ +ATVGPI+V+I+AS +FQLY SG+Y +
Sbjct: 180 EKCRFSRADVGATDTGYTDIHKGSEKDLQSAVATVGPISVAIDASRDSFQLYKSGVYYEP 239
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNG 287
C+S ++H +L VGY +++ WI+KN W WG G
Sbjct: 240 KCSSTMLDHGVLAVGYGTTDSKDYWIVKNSWGTQWGMKG 278
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 103/278 (37%), Positives = 158/278 (56%), Gaps = 12/278 (4%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT----LVRSPESNESV 96
I HN + +GL Y L N DL + + H R+T + N+S
Sbjct: 58 IARHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFN--GHHGTRKTGGSTFLPPANVNDSS 115
Query: 97 LIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
L P +DWR+KG +TP +Q CG+C+AFS +++GQ F E+ LS Q +VDCS
Sbjct: 116 L-PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQS 174
Query: 157 SGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQD 216
GN GC GG + + Y++ G+ E+ YPY+ C+FK+ ++ + + +
Sbjct: 175 FGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGS 234
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNS 272
E LK +ATVGPI+V+I+AS +FQLY+ G+YD+ C+S+ ++H +L+VGY +
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKY 294
Query: 273 WILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
W++KN W+ WGD GY+ + R NN+CGIA+ A Y L+
Sbjct: 295 WLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332
>gi|62955529|ref|NP_001017778.1| cathepsin K precursor [Danio rerio]
gi|62204416|gb|AAH92901.1| Cathepsin K [Danio rerio]
gi|182889052|gb|AAI64579.1| Ctsk protein [Danio rerio]
Length = 333
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 156/299 (52%), Gaps = 8/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+K++Y +S ++ W+ N I HN+E + G+H Y L NH D+ +++
Sbjct: 37 HKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKVMG 96
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L R + +P +D+R+ G++T NQ CG+C+AFS A++GQ+
Sbjct: 97 LQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLM 156
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+ ++ +LS Q +VDC ++ N GC GG + N YV G+ EE YPY G C
Sbjct: 157 KTKGQLVDLSPQNLVDC--VTENDGCGGGYMTNAFRYVSNNQGIDSEESYPYVGTDQQCA 214
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + + +P +E AL +A VGP++V I+A TF Y SG+Y D C
Sbjct: 215 YNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNCNK 274
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ VNHA+L VGY + WI+KN W WG GY+ + R NN CGIAN A + ++
Sbjct: 275 EDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVLMARNRNNACGIANLASFPVM 333
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 104/290 (35%), Positives = 159/290 (54%), Gaps = 11/290 (3%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+++ W+ N +KI HN E G H + + N D+ + + M R +
Sbjct: 47 RRMIWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDPNRTSKGAL 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
P +DWR++G++TP +Q+ CG+C++FS A++GQ+F+ T ++ +S Q
Sbjct: 107 FMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQN 166
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDISS 208
+VDCS GN GC GG + YV+ GL E+ YPY + + C++ V I+
Sbjct: 167 LVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITG 226
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ +P +E AL +A VGP++V+I+AS + Q Y SGIY + ACTS ++HA+L+VGY
Sbjct: 227 FVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSR-LDHAVLVVGY 285
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN WS WGD GY+Y+ K NN CGIA A Y L+
Sbjct: 286 GYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPLM 335
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 107/302 (35%), Positives = 167/302 (55%), Gaps = 12/302 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFN- 92
Query: 77 LTHSRIRRT----LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
H R+T + N+S L P +DWR+KG +TP +Q CG+C+AFS +++
Sbjct: 93 -GHHGTRKTGGSTFLPPANVNDSSL-PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLE 150
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPY+
Sbjct: 151 GQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD 210
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+
Sbjct: 211 GECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP 270
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYA 307
C+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y
Sbjct: 271 ECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYP 330
Query: 308 LI 309
L+
Sbjct: 331 LV 332
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 105/292 (35%), Positives = 164/292 (56%), Gaps = 9/292 (3%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY---IKEMTRLTHSRIR 83
D + +Q N + + HN+EA G H + +R N D+ + + L ++ +
Sbjct: 36 DGARYAIFQENSRIVKQHNEEAAMGKHTFFMRMNKFGDMTNEEFQMLVIGSGLLYSNKTQ 95
Query: 84 RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIE 143
+T ES + + D +DWR+KG +T NQE CG+C+AFS +++GQ F + +
Sbjct: 96 QTEGGVFESLPGLKVNDTVDWRQKGAVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLV 155
Query: 144 ELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSICKFKRPNI 202
LS Q +VDCS GN GC GG + Y++ GG+ EE YPYKGK + C++K
Sbjct: 156 SLSEQNLVDCSRKEGNKGCQGGLMDQAFKYIKTNGGIDTEECYPYKGKNERKCEYKSSCS 215
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHA 262
+SS+ + DE AL AT+GPI+V I+AS +FQLY G+Y ++ C+S ++H
Sbjct: 216 GATLSSYVDIKTGDEDALMQASATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHG 275
Query: 263 MLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+L+VGY ++ W++KN W WG GY+ + R +N+CGIA A Y ++
Sbjct: 276 VLVVGYGTDGEKDYWLVKNSWGEEWGMEGYIKMSRNKDNQCGIATQASYPVV 327
>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
Length = 325
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 156/285 (54%), Gaps = 13/285 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIR--RTLVRSPE 91
++ N KI HN EAQ GLH Y+L N DL +++ T L T++
Sbjct: 45 FEENRIKIQKHNAEAQNGLHTYSLEMNQYGDLLQSEFLQGYTGLAKGSYSGDNTVIL--- 101
Query: 92 SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
+ S +P +++W + G +T +Q+DCG+C+AFS +++GQ F ++ S QQ+V
Sbjct: 102 -DNSAPVPSYVNWTKNGAVTAVKDQKDCGSCWAFSTTGSVEGQYFIKNKKLLSFSEQQLV 160
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
DCS N GC GG + N Y+ G+ E+ YPY +C + + ISS+
Sbjct: 161 DCSSDFRNEGCNGGWMDNAFKYLIANKGIATEDTYPYTATDGVCVYNKTMAAGRISSFKD 220
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRN 271
+ E LK+ +A +GPI+V+I+AS FQ Y G+Y DE C+S Y++H +L VGY +
Sbjct: 221 VKHGSEDQLKLAVAQIGPISVAIDASSGDFQFYKKGVYVDEECSSKYLDHGVLAVGYGTD 280
Query: 272 S------WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
W++KN WS WGD GY+ + R + N CGIA+ A Y +I
Sbjct: 281 KGTGLDYWLVKNSWSASWGDQGYIKMARNHKNMCGIASLASYPVI 325
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 170/303 (56%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y ++ + ++ W+ N K + HN E G+H Y L NHL+D+ +
Sbjct: 41 KKTYGKQYTEENEEVTRRFIWEKNLKYVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLL 100
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + + +SN + +PD +DWR+KG +T Q CG+C+AFS A++
Sbjct: 101 MSSLRVPSQWQRNV--TFKSNPNQKLPDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALE 158
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS+Q +VDCS N GC GG + Y+ G+ E YPYK
Sbjct: 159 AQLKLKTGKLVSLSVQNLVDCSTGKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAM 218
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S + LP +E ALK +A GP++V+I+AS +F LY SG+Y D
Sbjct: 219 DGKCQYDVKNRAATCSKYVELPFGNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYD 278
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+ACT + VNH +L VGY ++ W++KN W H+G+ GY+ + R + N CGIA+Y Y
Sbjct: 279 KACTLN-VNHGVLAVGYGNYNGKDYWLVKNSWGLHFGEQGYIRMARNSGNHCGIASYPSY 337
Query: 307 ALI 309
I
Sbjct: 338 PEI 340
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 166/304 (54%), Gaps = 14/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y + +K+ W++N K I+ HN++ +G HG++L N DL + +
Sbjct: 33 KTKHGKTYNTNE-EGQKRAVWENNMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFREL 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
MT + + V E L +P +DWR+ G++TP NQ CG+C+AFS ++
Sbjct: 92 MTGFQGQKTKMMKVFP----EPFLGDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSL 147
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ+F+ T ++ LS Q +VDCS GN GC GG YV+ GGL YPY+
Sbjct: 148 EGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEAL 207
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ + + +PP E+AL +ATVGPI+V I+ +FQ Y G+Y +
Sbjct: 208 NGTCRYNPKYSAAKVVGFMSIPPS-ENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYE 266
Query: 252 EACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
C+S +NHA+L+VGY S W++KN W WG +GY+ + + NN CGIA+ A
Sbjct: 267 PDCSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDAS 326
Query: 306 YALI 309
Y ++
Sbjct: 327 YPIV 330
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 103/311 (33%), Positives = 174/311 (55%), Gaps = 10/311 (3%)
Query: 5 EWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSD 64
EW+ +K++ + Y K + ++ ++ N + I HN++ G Y L N +D
Sbjct: 41 EWVSF----KKQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFAD 96
Query: 65 LHPRHY-IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
+ + + R ++ R + + E ++ PD +DWR+KG++T NQ CG+C+
Sbjct: 97 MKNEEFRMYNGLRRDYNYSREVQCSNHLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCW 156
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
+FS +++GQ F + ++ LS QQ+VDCS GN GC GG + Y+ GG+ E
Sbjct: 157 SFSTTGSLEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETE 216
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
E+YPY +Q C FK+ + S + DE LK ++A VGP++++I+AS +FQL
Sbjct: 217 EEYPYDARQERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQL 276
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRC 298
Y+ G+YD+ C+S ++H +L+VGY ++ W++KN W WG GY+ + R +N+C
Sbjct: 277 YSGGVYDEPKCSSTELDHGVLVVGYGTDDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQC 336
Query: 299 GIANYAVYALI 309
G+A A Y L+
Sbjct: 337 GVATQASYPLV 347
>gi|74222595|dbj|BAE38161.1| unnamed protein product [Mus musculus]
Length = 334
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 106/286 (37%), Positives = 157/286 (54%), Gaps = 14/286 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPES 92
W+ N + I HN E G HG+++ N D+ + + + H + ++ L + P
Sbjct: 52 WEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLM 111
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+ IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VD
Sbjct: 112 ---LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + Y++ GGL EE YPY+ K CK+ R V + V
Sbjct: 169 CSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKY-RAEFAVANDTGFVD 227
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PQ E AL +ATVGPI+V+++AS + Q Y+ GIY + C+S ++H +LLVGY
Sbjct: 228 IPQQEKALMKAVATVGPISVAMDASHPSLQFYSLGIYYEPNCSSKNLDHGVLLVGYGYEG 287
Query: 273 --------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K +N CG+A A Y ++
Sbjct: 288 TDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>gi|195093046|ref|XP_001997691.1| GH23906 [Drosophila grimshawi]
gi|193891596|gb|EDV90462.1| GH23906 [Drosophila grimshawi]
Length = 358
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 103/300 (34%), Positives = 158/300 (52%), Gaps = 10/300 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A ++ + + + N G G+ N +DL ++K++T L
Sbjct: 60 KTYLNAADRRLREGLFSARKSLVDATNTAFASGASGFEQAVNAFADLTNAEFLKQLTGLR 119
Query: 79 HSRIRRTLVR----SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
S + SP + V +P DWREKG +TP Q +CG+C++F+ AI+G
Sbjct: 120 KSASGEQSAKAHRLSPTVPKGVRVPQSFDWREKGGVTPVKFQGECGSCWSFATTGAIEGH 179
Query: 135 IFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+F+ T ++ LS Q ++DC + G GC GG N+VQ G+ K + YPY K+
Sbjct: 180 VFRKTGKLPNLSEQNLIDCGKMELGLAGCDGGFQEYAFNFVQEQNGIAKGDSYPYLDKKD 239
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CK+K I+ ++ + P+DE +K +AT GP+A S+N + LY GIYDD+
Sbjct: 240 TCKYKSNISGAQITGFAAIEPKDEATMKTVVATQGPLACSVNGL-ESLLLYKHGIYDDKE 298
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
C + VNH++L+VGY ++ WI+KN W WG+ GY L RG+N CGIA+ Y +I
Sbjct: 299 CNNGEVNHSVLVVGYGSEKGKDFWIVKNSWDKAWGEEGYFRLPRGSNFCGIASECSYPII 358
>gi|195027297|ref|XP_001986520.1| GH21411 [Drosophila grimshawi]
gi|193902520|gb|EDW01387.1| GH21411 [Drosophila grimshawi]
Length = 391
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 103/300 (34%), Positives = 158/300 (52%), Gaps = 10/300 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A ++ + + + N G G+ N +DL ++K++T L
Sbjct: 93 KTYLNAADRRLREGLFSARKSLVDATNTAFASGASGFEQAVNAFADLTNAEFLKQLTGLR 152
Query: 79 HSRIRRTLVR----SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
S + SP + V +P DWREKG +TP Q +CG+C++F+ AI+G
Sbjct: 153 KSASGEQSAKAHRLSPTVPKGVRVPQSFDWREKGGVTPVKFQGECGSCWSFATTGAIEGH 212
Query: 135 IFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+F+ T ++ LS Q ++DC + G GC GG N+VQ G+ K + YPY K+
Sbjct: 213 VFRKTGKLPNLSEQNLIDCGKMELGLAGCDGGFQEYAFNFVQEQNGIAKGDSYPYLDKKD 272
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CK+K I+ ++ + P+DE +K +AT GP+A S+N + LY GIYDD+
Sbjct: 273 TCKYKSNISGAQITGFAAIEPKDEATMKTVVATQGPLACSVNGL-ESLLLYKHGIYDDKE 331
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
C + VNH++L+VGY ++ WI+KN W WG+ GY L RG+N CGIA+ Y +I
Sbjct: 332 CNNGEVNHSVLVVGYGSEKGKDFWIVKNSWDKAWGEEGYFRLPRGSNFCGIASECSYPII 391
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 102/294 (34%), Positives = 167/294 (56%), Gaps = 17/294 (5%)
Query: 29 KKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR 88
++K +++N KKI HN QG Y + N +D+ + + + + RT VR
Sbjct: 62 QRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFASVVNGFRMNN--RTKVR 119
Query: 89 --------SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
SP V +P +DWR++G++TP +Q CG+C++FS A++GQ F+ T
Sbjct: 120 DHLHSHYISPAI--PVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTGALEGQHFRKTG 177
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
++ LS Q ++DCS GN GC GG + Y++ G E+ YPY+ C+FK+
Sbjct: 178 KLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSYPYEAADGPCRFKKE 237
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
+ + ++ LP DE +K +A VGP++V+I+AS +FQ+Y SG+YD+ C + ++
Sbjct: 238 YVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVYDEVECDPEGLD 297
Query: 261 HAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
H +L+VGY ++ W++KN W WGD GY+ + R NN+CGI++ A Y L+
Sbjct: 298 HGVLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQCGISSMASYPLV 351
>gi|94448666|emb|CAI91571.1| silicatein a2 [Lubomirskia baicalensis]
Length = 326
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 109/307 (35%), Positives = 167/307 (54%), Gaps = 19/307 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ Y K Y + + ++ W S+ K I HN A + GY+L NH D+ +
Sbjct: 28 KGSYGKSYASELEELERHSVWLSSRKYIEEHN--AHSDVFGYSLAMNHFGDMSEVEFKDA 85
Query: 74 MTRLTH------SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
LTH SR T ++P+ + V D +DWR KG +T NQ DCGA YAF+
Sbjct: 86 F--LTHEPGNYTSRGIATF-KAPQGMKYV---DSIDWRTKGAVTSVKNQGDCGASYAFAA 139
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
++G S + LS Q ++DCS+ GN GC+GG + YV GG+ E Y
Sbjct: 140 TGTMEGANALSNDKQVSLSEQNIIDCSVPYGNHGCSGGDTYTAIKYVVDNGGIDTESSYS 199
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
++GKQS C++ N + +P E L +ATVGP+AV+++A+ + F+ Y SG
Sbjct: 200 FRGKQSSCQYNSKNSGASATGAVGIPYGSESDLMAAVATVGPVAVAVDANTNAFRFYQSG 259
Query: 248 IYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIAN 302
++D C+S +NHAML+ GY ++ W++KN W +WGDNGY+ + R N+CGIA+
Sbjct: 260 VFDSSTCSSTKLNHAMLVTGYGSYNGKDYWLVKNSWGKYWGDNGYIMMVRNKYNQCGIAS 319
Query: 303 YAVYALI 309
A+Y+++
Sbjct: 320 DALYSML 326
>gi|293342574|ref|XP_002725265.1| PREDICTED: cathepsin Q-like isoform 2 [Rattus norvegicus]
gi|79152841|gb|AAI07914.1| Ctsq protein [Rattus norvegicus]
gi|149039734|gb|EDL93850.1| rCG24269 [Rattus norvegicus]
Length = 343
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 115/314 (36%), Positives = 172/314 (54%), Gaps = 21/314 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY+K Y + + K++ W+ N KKI HN+E G + Y + N +DL +
Sbjct: 33 KMKYEKLYSPEE-ELLKRVVWEENVKKIELHNRENSLGKNTYIMEINDFADLTDEEFKDM 91
Query: 74 MTRLT-------HSRIRRTLVRS-PES-NESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
+T +T S +R L S P S +P +DWR++G++T Q C +C+A
Sbjct: 92 ITGITLPINNTMKSLWKRALGSSLPNSWYWRDALPKFVDWRKEGYVTHVRVQGRCNSCWA 151
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
F + AI+GQ+FK T ++ LS+Q +VDCS GN GC GG+ N YV GGL E
Sbjct: 152 FPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEA 211
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY+GK+ +C++ N I+ + L P++E L +AT GP+A I+ + + Y
Sbjct: 212 TYPYEGKEGLCRYNPNNSSAKITRFVAL-PENEDVLMDAVATKGPVAAGIHVVHSSLRFY 270
Query: 245 ASGIYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGN 295
GIY + C ++YVNHA+L+VGY N W+++N W WG NGYM + K N
Sbjct: 271 KKGIYHEPKC-NNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRN 329
Query: 296 NRCGIANYAVYALI 309
N CGIA +A Y ++
Sbjct: 330 NHCGIATFAQYPIV 343
>gi|312386081|gb|ADQ74585.1| silicatein alpha 2 [Lubomirskia baicalensis]
Length = 326
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 109/307 (35%), Positives = 166/307 (54%), Gaps = 19/307 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ Y K Y + + ++ W SN K HN A + GY+L NH D+ +
Sbjct: 28 KGSYGKSYASELEELERHSVWLSNRKYTEEHN--AHSDVFGYSLAMNHFGDMSEVEFKDA 85
Query: 74 MTRLTH------SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
LTH SR T ++P+ + V D +DWR KG +T NQ DCGA YAF+
Sbjct: 86 F--LTHEPGNYTSRGIATF-KAPQGMKYV---DSIDWRTKGAVTSVKNQGDCGASYAFAA 139
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
++G S + LS Q ++DCS+ GN GC+GG + YV GG+ E Y
Sbjct: 140 TGTMEGANALSNDKQVSLSEQNIIDCSVPYGNHGCSGGDTYTAIKYVVDNGGIDTESSYS 199
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
++GKQS C++ N + +P E L +ATVGP+AV+++A+ + F+ Y SG
Sbjct: 200 FRGKQSSCQYNSKNSGASATGAVGIPYGSESDLMAAVATVGPVAVAVDANTNAFRFYQSG 259
Query: 248 IYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIAN 302
++D C+S +NHAML+ GY ++ W++KN W +WGDNGY+ + R N+CGIA+
Sbjct: 260 VFDSSTCSSTKLNHAMLVTGYGSYNGKDYWLVKNSWGKYWGDNGYIMMVRNKYNQCGIAS 319
Query: 303 YAVYALI 309
A+Y+++
Sbjct: 320 DALYSML 326
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 101/283 (35%), Positives = 159/283 (56%), Gaps = 7/283 (2%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
+ N KI HN EA G+H Y ++ NH DL ++ + ++ +L + N
Sbjct: 54 YMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYANKTASLGGTYIPN 113
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+++ +P H+DWRE+G +TP NQ CG+C++FS A++GQ F+ T ++ LS Q +VDC
Sbjct: 114 KNIQLPTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDC 173
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN GC GG + Y++ G+ E YPY+G C + N + +
Sbjct: 174 SRKFGNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDIGFVDIK 233
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
E LK +A VGPI+V+I+AS +FQ Y+ G+Y + C+S+ ++H +L+VG+ +S
Sbjct: 234 KGSEKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSV 293
Query: 273 -----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN WS WGD GY+ + R N CGIA+ A Y ++
Sbjct: 294 SGEDYWLVKNSWSEKWGDQGYIKMARNKENMCGIASSASYPVV 336
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 113/304 (37%), Positives = 165/304 (54%), Gaps = 19/304 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ Y K Y +A ++K+ +++N + I+ HN E QGLH YT+ N +DL +
Sbjct: 2 KSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEF--- 58
Query: 74 MTRLTHSRIRRTL----VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
M S+ RT+ V P ++E D +DWR KG +TP NQ CG+C++FS
Sbjct: 59 MALYVPSKFNRTMPYNTVYLPATSE-----DSVDWRTKGAVTPIKNQGQCGSCWSFSTTG 113
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+ +G +T + LS QQ+VDCS GN GC GG + + Y+ GL EEDYPY
Sbjct: 114 STEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYT 173
Query: 190 GKQSIC-KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ C K K ISS+S +P +E L +A GP++V+I A FQLY SG+
Sbjct: 174 AQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAK-GPVSVAIEADQSGFQLYKSGV 232
Query: 249 YDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRG---NNRCGIANYAV 305
+D T+ ++H +L+VGYT + WI+KN W WG GY+ +KRG + CGIA
Sbjct: 233 FDGNCGTN--LDHGVLVVGYTDDYWIVKNSWGTTWGVEGYINMKRGVSASGICGIAMQPS 290
Query: 306 YALI 309
Y ++
Sbjct: 291 YPIV 294
>gi|449679414|ref|XP_002161570.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 353
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 120/320 (37%), Positives = 172/320 (53%), Gaps = 21/320 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
M N EW I K+ K Y + ++ K L+W+ N++ I HN E H + + N
Sbjct: 44 MKNPEWRRFKI----KFGKFYSNQDEETSKYLNWKKNNENIINHNSEN----HSFEIGIN 95
Query: 61 HLSDLHPRHYIK---EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQE 117
SDL ++K +L+ S + T S + V IPD +DWR +G++TP NQ
Sbjct: 96 QFSDLTHEEFMKIHGGCLKLSKSIVNFTKEFSLPN--KVNIPDKVDWRTEGYVTPVKNQG 153
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
C +C+AFS A++GQ F+ T + LS Q +VDCS GN GC GG N Y++
Sbjct: 154 LCRSCWAFSTTGALEGQTFRKTGILPTLSEQNLVDCSKSYGNQGCDGGWTNNAFEYIKDN 213
Query: 178 GGLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINA 236
GL E YPY K+ C + S + +P DE ALK +ATVGPIAV+I+A
Sbjct: 214 DGLDSENGYPYDAKELGYCYYDEKYKEASDSGFVEIPYGDEDALKEAVATVGPIAVNIDA 273
Query: 237 SPHTFQLYASGIYDDEACTSDYVN--HAMLLVGYTRNS----WILKNWWSHHWGDNGYMY 290
S +FQ Y SG+Y++ C + N HA+L+VGY W++KN W WGD+GY+
Sbjct: 274 SKPSFQSYKSGVYNEPTCGNGITNLTHAVLVVGYGTEKGHKFWLVKNSWGKTWGDHGYIK 333
Query: 291 LKRG-NNRCGIANYAVYALI 309
+ R +N+CGIA A + L+
Sbjct: 334 MSRNKSNQCGIATRASFPLV 353
>gi|344257452|gb|EGW13556.1| Cathepsin L1 [Cricetulus griseus]
Length = 290
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 103/291 (35%), Positives = 161/291 (55%), Gaps = 13/291 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+ +K+ W++N K I HN++ +G HG+ L N DL + + MT +
Sbjct: 5 EGQKRAVWENNRKMIELHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQSMGTKEMN 64
Query: 87 VRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
V E +L +P +DWR ++TP +Q C +C+AFS +++GQIF+ T ++
Sbjct: 65 V----FQEPLLGDVPKSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLIS 120
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
LS Q +VDCS GN+GC GG + YV+ GL YPY+ + C++ N
Sbjct: 121 LSEQNLVDCSWSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAA 180
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+++ + V P E AL +ATVGPI+V +++ H+F+ Y G+Y + C+S ++HA+L
Sbjct: 181 NVTDF-VKIPISEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLDHAVL 239
Query: 265 LVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+VGY S W++KN W WG NGY+ + R NN CGIA YA+Y +
Sbjct: 240 VVGYGEESDGNKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYPTV 290
>gi|444515095|gb|ELV10757.1| Aryl hydrocarbon receptor nuclear translocator [Tupaia chinensis]
Length = 786
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/272 (39%), Positives = 153/272 (56%), Gaps = 13/272 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 519 KKTYGKQYNSKVDEISRRLTWEKNLKYISIHNLEASLGIHTYELAMNHLGDMTSEEVVQK 578
Query: 74 MTRL----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
MT L +HS TL + PD +D+R+KG++TP NQ CG+C+AFS
Sbjct: 579 MTGLKVPPSHSLSNDTLYIPDWEGRT---PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVG 635
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ+ K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY
Sbjct: 636 ALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV 693
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y
Sbjct: 694 GQDESCMYNPTGKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY 753
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKN 277
DE C+SD VNHA+L VGY WI+KN
Sbjct: 754 YDENCSSDNVNHAVLAVGYGVQKGSKHWIIKN 785
>gi|405977173|gb|EKC41636.1| Cathepsin K [Crassostrea gigas]
Length = 942
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 167/315 (53%), Gaps = 20/315 (6%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLH---WQSNHKKIHTHNQEAQQGLHGYTLRE 59
+KEW + +K+ Y K T+ +K+ W N I+ HN+EA G H Y L
Sbjct: 640 DKEW--------EDFKRIYSKTYTEQDEKIRKSIWIQNIDIINRHNKEADMGHHSYRLGM 691
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDC 119
N D+ + + P +N + +P+ ++W ++G++TP NQ C
Sbjct: 692 NEFGDMTTKEVTGMLNVPKGYATDNVSTFLPPNN--LQLPETVNWTKEGYVTPVKNQGYC 749
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G+C+AF+ ++GQ F+ T ++ LS Q +VDC NLGC GG Y+ GG
Sbjct: 750 GSCWAFATTGGLEGQHFRKTKKLVSLSEQNLVDCC--KENLGCTGGLPVTAYKYIARNGG 807
Query: 180 LMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
+ EE YPY GK C F+ P I + +P DE L+ +A+VGP+ VSI+AS
Sbjct: 808 IDTEESYPYLGKNGNCTFRPPKIGATCQGFVRVPAGDEVGLQKAVASVGPVTVSIDASLK 867
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG- 294
+F LY G+YDD+ C+ NH +L+VGY ++ W++KN W +G +GY+ + R
Sbjct: 868 SFYLYKEGVYDDKKCSKKMFNHFVLIVGYGKHLGKEYWLVKNSWGMSFGMDGYIMMARNQ 927
Query: 295 NNRCGIANYAVYALI 309
+N+CGI+N VY ++
Sbjct: 928 DNQCGISNQPVYPIV 942
>gi|405963298|gb|EKC28885.1| Cathepsin L [Crassostrea gigas]
Length = 265
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 100/269 (37%), Positives = 154/269 (57%), Gaps = 11/269 (4%)
Query: 47 EAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRTLVRSPESNESVLIPDHLDWR 105
EA G H Y L N D+ R + HS + + P + + +PD +DW
Sbjct: 2 EADNGHHSYRLGMNEFGDMTSREMAAMLNGARGHSVVNGSTFLPPNN---LQLPDTVDWS 58
Query: 106 EKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGG 165
++G++TP NQ CG+C+AFS ++GQ ++ T ++ LS Q ++DCS N+GC GG
Sbjct: 59 KEGYVTPVKNQGQCGSCWAFSTTGGLEGQHYRKTGKLVSLSEQNLLDCS--KENMGCNGG 116
Query: 166 SLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLA 225
+ Y++ GG+ EE YPY GK+ C F+ + + + + DE ALK +A
Sbjct: 117 LPQKAYKYIKENGGIDTEESYPYLGKKETCSFRPSEVGATCTGFVQVTAGDELALKKAVA 176
Query: 226 TVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSH 281
+VGPI V I+AS +FQLY G+YD+++C +HA+L+VGY ++ W++KN W
Sbjct: 177 SVGPITVCIDASQPSFQLYKGGVYDEQSCNPIVFDHAVLIVGYGVYQGKDYWLVKNSWGT 236
Query: 282 HWGDNGYMYLKRG-NNRCGIANYAVYALI 309
WG +GY+ + R NN+CGIAN+AVY +
Sbjct: 237 SWGMDGYIMMSRNQNNQCGIANHAVYPTV 265
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 110/300 (36%), Positives = 165/300 (55%), Gaps = 10/300 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y + Y++K + ++L W+ N K + HN E G+H Y L NHL+D+
Sbjct: 40 KKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGMHSYDLGMNHLADMTSEEVSSL 99
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ + + SN+ +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 100 MSSLRVPSQWQANVTYKSNSNQK--LPDSVDWREKGCVTEVKYQGACGACWAFSAVGALE 157
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T + LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 158 AQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSEVSYPYKAM 217
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ + S ++ LP E ALK +A GP++V+I+A +F LY SG+Y D
Sbjct: 218 DGNCRYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHSSFFLYKSGVYYD 277
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY R+ W++KN W ++G+ GY+ + R + N CGIA+Y Y
Sbjct: 278 PSCTQN-VNHGVLVVGYGNLNGRDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIASYPSY 336
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 169/319 (52%), Gaps = 12/319 (3%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
N++ + +F + +KK Y+ + +K W +N KI HN + Y L N
Sbjct: 22 NQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRLEMNEY 81
Query: 63 SDLHPRHYIKEMTRLTHS-RIRR------TLVRSPESNESVLIPDHLDWREKGFITPDWN 115
DL + M + R++R T + + +P +DWR+ G +TP N
Sbjct: 82 GDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHGLVTPVKN 141
Query: 116 QEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQ 175
Q CG+C++FS +++GQ K T ++ LS Q ++DCS GN GC GG + Y++
Sbjct: 142 QGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKYIK 201
Query: 176 FAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSIN 235
GG+ E YPY+ K C+F + + + + DE LK ATVGPI+V+I+
Sbjct: 202 IQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVDIKSGDEEMLKEAAATVGPISVAID 261
Query: 236 ASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL 291
AS +FQ Y++G+Y + AC+S ++H +L+VGY ++ W++KN W WG+ GY+ +
Sbjct: 262 ASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYGTENGKDYWLVKNSWGEGWGEAGYIKM 321
Query: 292 KR-GNNRCGIANYAVYALI 309
R +N+CGIA A Y L+
Sbjct: 322 SRNADNQCGIATQASYPLV 340
>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
Length = 341
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 173/326 (53%), Gaps = 23/326 (7%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + QK+Y + K + K++ ++ HK I HNQ+ +G + +++N
Sbjct: 22 LVKEEWNAFKMEHQKQYDSEVEDKF---RMKIYAENKHK-IAKHNQKFARGQVPFRVKQN 77
Query: 61 HLSDLHPRHYIKEMTRLTH-----------SRIRRTLVRSPESNESVLIPDHLDWREKGF 109
D+ ++ M S R P +N V +PDH+DWR+ G
Sbjct: 78 KYGDMLHHEFVHTMNGFNKTTKNGKGLFGKSAGERGATFIPPAN--VRVPDHVDWRKHGA 135
Query: 110 ITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRN 169
+T +Q CG+C++FS A++GQ ++ T+ + LS Q ++DCS GN GC GG + N
Sbjct: 136 VTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNLIDCSTAYGNNGCNGGLMDN 195
Query: 170 TLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGP 229
Y++ G+ E+ YPY+ C++ N D + +P DE L +ATVGP
Sbjct: 196 AFKYIKDNKGIDTEKSYPYEAVDDKCRYNPRNSGADDVGFIDIPSGDEGKLMAAVATVGP 255
Query: 230 IAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWG 284
++V+I+AS TFQ Y+ G+Y DE C+S ++H +L+VGY + W++KN W WG
Sbjct: 256 VSVAIDASQETFQFYSDGVYFDENCSSTSLDHGVLVVGYGTDENGGDYWLVKNSWGRSWG 315
Query: 285 DNGYMYLKRG-NNRCGIANYAVYALI 309
D GY+ + R +N CGIA A + L+
Sbjct: 316 DLGYIKMARNRDNHCGIATAASFPLV 341
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 103/291 (35%), Positives = 161/291 (55%), Gaps = 13/291 (4%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+ +K+ W++N K I HN++ +G HG+ L N DL + + MT +
Sbjct: 45 EGQKRAVWENNRKMIELHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQSMGTKEMN 104
Query: 87 VRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
V E +L +P +DWR ++TP +Q C +C+AFS +++GQIF+ T ++
Sbjct: 105 V----FQEPLLGDVPKSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLIS 160
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
LS Q +VDCS GN+GC GG + YV+ GL YPY+ + C++ N
Sbjct: 161 LSEQNLVDCSWSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAA 220
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+++ + V P E AL +ATVGPI+V +++ H+F+ Y G+Y + C+S ++HA+L
Sbjct: 221 NVTDF-VKIPISEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLDHAVL 279
Query: 265 LVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+VGY S W++KN W WG NGY+ + R NN CGIA YA+Y +
Sbjct: 280 VVGYGEESDGNKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYPTV 330
>gi|197258082|gb|ACH56225.1| cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 282
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 160/282 (56%), Gaps = 20/282 (7%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--- 97
+ HN QG + + NH++DL E RL + RRT + S +
Sbjct: 6 VKVHNDAYAQGKVSFKIGINHIADLP----FAEYRRL--NGFRRTFGDNIASRNATKWRA 59
Query: 98 -----IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+PD +DWR++G++TP NQ CG+C+AFS +++GQ ++T ++ LS Q +VD
Sbjct: 60 PLNFEVPDAVDWRDEGYVTPVKNQGMCGSCWAFSATGSLEGQHKRATGKLVSLSEQNLVD 119
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN GC GG + YV+ G+ EE YPYK KQ C F++ N+ D + + L
Sbjct: 120 CSADFGNNGCNGGLMDFAFEYVKQNHGIDTEESYPYKAKQKKCHFQKANVGADDTGFVDL 179
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
P DE LK +A+ GP++V+I+A +F+LY +G+Y ++ C+ + ++H +L+VGY +
Sbjct: 180 PEADEEQLKAAVASQGPVSVAIDAGHRSFRLYKTGVYYEKHCSPEQLDHGVLVVGYGTDP 239
Query: 273 -----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
WI+KN W WG+ GY+ + R NN CGIA+ A Y L
Sbjct: 240 EHGDYWIVKNSWGEEWGEKGYVRIARNRNNHCGIASKASYPL 281
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 104/299 (34%), Positives = 164/299 (54%), Gaps = 8/299 (2%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR-- 76
K Y+ ++ ++ ++ N++ I HNQEA G Y + N DL Y++ +
Sbjct: 29 KQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPG 88
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L + ES + + D +DWR+KG +TP +Q CG+C+AFS +++GQ F
Sbjct: 89 LLPLNLSTPSENVFESTPGLQVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHF 148
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSIC 195
T ++ LS Q ++DCS GN GC GG + Y++ GG+ EE YPY K + +C
Sbjct: 149 MKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVC 208
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+K +SS++ + DE AL + TVGP++V+I+AS + + Y SGIYD+ C+
Sbjct: 209 DYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPECS 268
Query: 256 SDYVNHAMLLVGYTR----NSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++H +L VGY + W++KN W WGD GY+ + R NN+CGIA A Y ++
Sbjct: 269 RTKLDHGVLAVGYGSMDGMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCGIATKASYPVV 327
>gi|348531517|ref|XP_003453255.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 330
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 173/303 (57%), Gaps = 13/303 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-- 73
K++K Y + ++++K W +N K + HN A QGL + L + +D+ Y K
Sbjct: 32 KFEKSYDSSSEETQRKQIWLTNRKLVLKHNALADQGLKSFRLGMTYFADMENEEYKKLGC 91
Query: 74 MTRLTHSRIRR--TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ S R TL R P+ ++P +DWRE+G++T +Q+ CG+C+AFS A+
Sbjct: 92 LGSFNASLPCRASTLNRLPKV---TVLPKTVDWREQGYVTDVKHQQQCGSCWAFSATGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ FK T + LS QQ+VDCS N GC GG Y++ GG+ E+ Y Y+ K
Sbjct: 149 EGQHFKKTGTLVPLSEQQLVDCSRKYRNNGCDGGEPNWAFQYIRDNGGVDTEKSYRYEAK 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C+++ +I + + + P +E AL +AT+GPI+VSI+ S +FQLY SG+YD+
Sbjct: 209 DGQCRYRSNSIGAKCNGYVDVSPFEE-ALMEAVATIGPISVSIDDSRVSFQLYQSGVYDE 267
Query: 252 EACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C++ +NHA+L VGY T N W++KN W WG+ GY+ + R N+CGIA A Y
Sbjct: 268 PWCSNINLNHAVLAVGYGTENGHDYWLVKNSWGSGWGNKGYIKMTRNKGNQCGIATEASY 327
Query: 307 ALI 309
L+
Sbjct: 328 PLV 330
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/327 (34%), Positives = 170/327 (51%), Gaps = 22/327 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + +KKY + ++ + K++ Q+ HK I HNQ G + LR N
Sbjct: 22 LVKEEWNAFKLQHRKKYDSESEERI---RMKIYVQNKHK-IAKHNQRYDLGQEKFRLRVN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRI-------RRTLVRSPE-----SNESVLIPDHLDWREKG 108
+DL ++ + S R L+ E +V +P +DWREKG
Sbjct: 78 KYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKG 137
Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLR 168
+TP +Q CG+C++FS A++GQ F+ T ++ LS Q +VDCS GN GC GG +
Sbjct: 138 AVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMD 197
Query: 169 NTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVG 228
N YV+ G+ E+ YPY+ C + I + +P DE ALK LATVG
Sbjct: 198 NAFQYVKDNKGIDTEKAYPYEAIDDECHYNPKAIGATDKGFVDIPQGDEKALKKALATVG 257
Query: 229 PIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHW 283
P++V+I+AS +FQ Y+ G+Y + C S+ ++H +L VGY + W++KN W W
Sbjct: 258 PVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTW 317
Query: 284 GDNGYMYLKRG-NNRCGIANYAVYALI 309
GD GY+ + R N CGIA A Y L+
Sbjct: 318 GDQGYVKMARNRENHCGIATTASYPLV 344
>gi|27960480|gb|AAO27844.1|AF456460_1 cathepsin Q2 [Rattus norvegicus]
Length = 343
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 170/312 (54%), Gaps = 21/312 (6%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY+K Y + + K++ W+ N KKI HN+E G + Y + N +DL + +T
Sbjct: 35 KYEKLYSPEE-ELLKRVVWEENVKKIELHNRENSLGKNTYIMEINDFADLTDEEFKDMIT 93
Query: 76 RLT-------HSRIRRTLVRS-PES-NESVLIPDHLDWREKGFITPDWNQEDCGACYAFS 126
+T S +R L S P S +P +DWR++G++T Q C +C+AF
Sbjct: 94 GITLPINNTMKSLWKRALGSSLPNSWYWRDALPKFVDWRKEGYVTHVRVQGRCNSCWAFP 153
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
+ AI+GQ+FK T ++ LS+Q +VDCS GN GC GG+ N YV GGL E Y
Sbjct: 154 VVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATY 213
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY+GK+ +C++ N I+ + L P++E L +AT GP+A I+ + + Y
Sbjct: 214 PYEGKEGLCRYNPNNSSAKITRFVAL-PENEDVLMDAVATKGPVAAGIHVVHSSLRFYKK 272
Query: 247 GIYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNR 297
GIY + C ++YVNHA+L+VGY N W+++N W WG NGYM + K NN
Sbjct: 273 GIYHEPKC-NNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRNNH 331
Query: 298 CGIANYAVYALI 309
CGIA +A Y +
Sbjct: 332 CGIATFAQYPTV 343
>gi|156554647|ref|XP_001605314.1| PREDICTED: cathepsin L-like [Nasonia vitripennis]
Length = 353
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/310 (34%), Positives = 169/310 (54%), Gaps = 18/310 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+YKK+Y ++ ++ + N +KI HNQ+ GL Y +R N D+ Y M
Sbjct: 46 RYKKNYNGDVEENFRRSVFHENQRKIAEHNQKHDLGLFTYKVRINQFGDMMFEEYKNYMH 105
Query: 76 RLTHSRIRRTLVRSPESNESVL------IPDHLDWREKGFITPDWNQE-DCGACYAFSIA 128
++ + L R P +E + +P+H+DWR++G +TP +Q CG+C+AFS A
Sbjct: 106 AANNTITQ--LKRIPRGDEFIKPKSAENVPEHVDWRQRGAVTPVRDQGLTCGSCWAFSAA 163
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++ Q FK T + LS Q ++DC++ GNLGC GGS + +V GL E +Y Y
Sbjct: 164 GALEAQYFKKTGVLTALSAQNLIDCTMEYGNLGCGGGSAALSFQFVVDQKGLEPEANYSY 223
Query: 189 KGKQSICKFKRPNIVVD--ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
+G+ C + + + +S+ + DE LKV +ATVGP + +I+ S TF+ Y+
Sbjct: 224 EGRTKECPYNTSDDEDEELDASFIYVNGGDEATLKVAVATVGPFSAAIDGSHDTFRFYSE 283
Query: 247 GIYDDEACTSDYVNHAMLLVGYTRNS------WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
G+Y C D ++HA+L+VGY ++ W++KN W WG+ GY + R N CG
Sbjct: 284 GVYYQPECNEDDLDHAVLIVGYGTDNRTDQDFWLVKNSWGETWGEGGYFKVARNRRNHCG 343
Query: 300 IANYAVYALI 309
IA AVY +I
Sbjct: 344 IAAAAVYPVI 353
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 175/322 (54%), Gaps = 27/322 (8%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + + ++KK+Y + + + N +KI HN + Q+G GY L N
Sbjct: 22 LVMEEWQLF----KAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRS------------PESNESVLIPDHLDWREKG 108
SD+ +I S I L + P +N V +P H+DW + G
Sbjct: 78 KYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPAN--VKLPKHVDWVKLG 135
Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLR 168
+TP +Q CG+C+AFS A++G F+ T + LS Q ++DCS GN GC GG +
Sbjct: 136 AVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMD 195
Query: 169 NTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVG 228
YV+ GG+ E YPY+G +C+++ N + ++ +P DE ALK +ATVG
Sbjct: 196 QAFQYVRINGGIDTERSYPYEGNNDVCRYEPENSGAIDTGYTDVPLGDEDALKSAVATVG 255
Query: 229 PIAVSINASPHTFQLYASGIYDDEACTS--DYVNHAMLLVGY------TRNSWILKNWWS 280
P++V+I+AS +FQLY+SG+Y + C + + ++H +L+VGY ++ W++KN W
Sbjct: 256 PVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWG 315
Query: 281 HHWGDNGYMYLKR-GNNRCGIA 301
WG+NGY+ + R +N+CGIA
Sbjct: 316 DSWGENGYIKMARNADNQCGIA 337
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 166/313 (53%), Gaps = 14/313 (4%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KEW+ + K+Y + ++++ W+ N + I HN E QG Y L N
Sbjct: 29 KEWV-------DYHGKEYSAMGEEMERRMIWEDNLRIITKHNLEHSQGKTTYRLGMNEFG 81
Query: 64 DLHPRHYIKEMTRLTHSRIRRTLVRSP-ESNESVLIPDHLDWREKGFITPDWNQEDCGAC 122
D+ ++ T S + + S +E + +PD +DWR +G++TP +Q CG+C
Sbjct: 82 DMTNAEFVATRTMKKMSGVPKVGQGSTFLPSEFLQLPDSVDWRTEGYVTPVKDQGQCGSC 141
Query: 123 YAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMK 182
+AFS A++GQ F T + LS Q +VDCS GN GC GG Y++ GG+
Sbjct: 142 WAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQAEGNDGCNGGWPAWADEYIKSNGGIDT 201
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E YPY+G C ++ ++ I+ ++ + E AL+ LA VGPI+V I+A+ +FQ
Sbjct: 202 EVGYPYEGVDDSCHYRTSDVGATITGFAEVEADSEKALEKALAQVGPISVCIDATQPSFQ 261
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGNNR 297
LY SG+YD+ C+S ++H + VGY + +I+KN W WG GY+++ R +
Sbjct: 262 LYESGVYDEPDCSSTALDHCVTAVGYDSTADGDKYYIVKNSWGTTWGQEGYIWMSRDKQK 321
Query: 298 -CGIANYAVYALI 309
CGIA A Y L+
Sbjct: 322 QCGIATNATYPLV 334
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 172/308 (55%), Gaps = 16/308 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y + + + + N KI HNQ QG + L N +D+ + + M
Sbjct: 33 EHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLHHEFKETMN 92
Query: 76 RLTHSRIRRTLVRSPE--------SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
H+ +R+ L R+ E S +V +P +DWR+ G +T +Q CG+C++FS
Sbjct: 93 GYNHT-MRKEL-RAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCWSFSS 150
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ F+ + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 151 TGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGVDTEKSYP 210
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+G C F + + + + +P DE A+ +AT+GP+AV+I+AS +FQLY+ G
Sbjct: 211 YEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNESFQLYSEG 270
Query: 248 IYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+Y+D C+SD ++H +L+VGY ++ W++KN W WGD GY+ + R +N+CGIA
Sbjct: 271 VYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIA 330
Query: 302 NYAVYALI 309
+ + +
Sbjct: 331 TASSFPTV 338
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 167/306 (54%), Gaps = 18/306 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K Y +K ++ +++ W+ + + I HN E G H + L NH D+ + + M
Sbjct: 36 HGKSYEQK-EETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNG 94
Query: 76 ---RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ TH +++ + P E +P H+DWR++G++TP +Q CG+C+AFS A++
Sbjct: 95 YKYKQTHKKLQGSHFLEPNFQE---VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ T ++ LS Q +V+CS GN GC GG + YV+ GG+ E+ YPY G
Sbjct: 152 GQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTD 211
Query: 193 SI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + + + +P E AL +A VGP++V+I+A +FQ Y SGIY +
Sbjct: 212 DTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFE 271
Query: 252 EACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIAN 302
C+S ++H +L+VGY + WI+KN WS WG NGY+ + K +N CGIA
Sbjct: 272 AECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIAT 331
Query: 303 YAVYAL 308
A Y L
Sbjct: 332 AASYPL 337
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/282 (34%), Positives = 154/282 (54%), Gaps = 8/282 (2%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
++ N + I HN + G +TL+ N D+ + M + RR +
Sbjct: 46 FEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEEIVATMNGFLGAPTRRPAAVLKADD 105
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
E+ +P+ +DWR KG +TP +Q+ CG+C+AFS +++GQ F ++ LS Q +VDC
Sbjct: 106 ET--LPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDC 163
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S N+GC GG + Y++ G+ E+ YPY+ + C+F N+ + + +
Sbjct: 164 SDKFRNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDVE 223
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
E ALK +AT+GPI+V I+AS TF Y +G+Y D+ C+S ++H +L VGY +
Sbjct: 224 HGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDEN 283
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W+ WGD GY+ + R NN CGIA+ A Y L+
Sbjct: 284 GGDFWLVKNSWNTSWGDKGYIKMSRNRNNNCGIASQASYPLV 325
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 106/302 (35%), Positives = 167/302 (55%), Gaps = 12/302 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y+ + + + N I HN + +GL Y L N DL + +
Sbjct: 34 HKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFN- 92
Query: 77 LTHSRIRRT----LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
H R+T + N+S L P +DWR+KG +TP +Q CG+C+AFS +++
Sbjct: 93 -GHHGTRKTGGSTFLPPANVNDSSL-PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLE 150
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
G+ F E+ LS Q +VDCS GN GC GG + + Y++ G+ E+ YPY+
Sbjct: 151 GRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVD 210
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+FK+ ++ + + + E LK +ATVGPI+V+I+AS +FQLY+ G+YD+
Sbjct: 211 GECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP 270
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYA 307
C+S+ ++H +L+VGY + W++KN W+ WGD GY+ + R NN+CGIA+ A Y
Sbjct: 271 ECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYP 330
Query: 308 LI 309
L+
Sbjct: 331 LV 332
>gi|124487918|gb|ABN12042.1| putative cathepsin L precursor [Maconellicoccus hirsutus]
Length = 211
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 137/211 (64%), Gaps = 7/211 (3%)
Query: 106 EKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGG 165
+KG +T +Q DCG+CYAFS +I+GQ F+ + ++ LS QQ++DCS+ GN GC GG
Sbjct: 1 KKGAVTEVKDQGDCGSCYAFSTTGSIEGQQFRKSGTLKSLSEQQIIDCSVKYGNGGCEGG 60
Query: 166 SLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLA 225
+ N NYV GG+ E YPY +++ C +K N +I ++ LP DE LK+ +A
Sbjct: 61 VMENAFNYVIDNGGIDSEGSYPYIDRETQCAYKPENSAANIKDFATLPVGDEEMLKLAVA 120
Query: 226 TVGPIAVSINASPHTFQLYASGIYDDEACTS--DYVNHAMLLVGY----TRNSWILKNWW 279
VGPI+++IN SP +F+LY SG+Y D+ C S D + HA+L+VGY ++ W++KN W
Sbjct: 121 KVGPISIAINTSPRSFKLYKSGVYYDKDCKSDPDDLTHAVLVVGYGTEDGKDYWLVKNSW 180
Query: 280 SHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ WG+NGY+ + R NN CGIA+YA Y +
Sbjct: 181 NTDWGENGYIKMARNKNNHCGIASYATYPTV 211
>gi|148709374|gb|EDL41320.1| cathepsin 7, isoform CRA_c [Mus musculus]
Length = 277
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/221 (43%), Positives = 136/221 (61%), Gaps = 10/221 (4%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
IP LDWR++G++TP Q CGAC+AFS+ + I+GQ+FK T ++ LS+Q ++DCS+
Sbjct: 58 IPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLSVQNLMDCSVSY 117
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
G GC GG + YV+ GGL E YPY+ K C+++ VV ++ + V+ P++E
Sbjct: 118 GTKGCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKHCRYRPERSVVKVNRFFVV-PRNE 176
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--------T 269
AL L T GPIAV+I+ S +F Y GIY + C D ++H +LLVGY
Sbjct: 177 EALLQALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLVGYGYEGHESEN 236
Query: 270 RNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
R W+LKN WG+NGYM L RG NN CGIA+YA+Y +
Sbjct: 237 RKYWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMYPAL 277
>gi|449681105|ref|XP_002158608.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 339
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 106/307 (34%), Positives = 174/307 (56%), Gaps = 17/307 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y+ ++ L+W++N K++ HN + H + N SD+ + K
Sbjct: 39 KAKFGKTYKSNIEEAPSYLNWKNNLKEVERHNSKK----HSFKKGINQFSDMSHEEFRKM 94
Query: 74 MT---RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+L+ + + + SN V+IPD +DWR +G++T NQ CG+C+AFS A
Sbjct: 95 YGGCFKLSKKNVTKGSIFLSPSN--VVIPDSVDWRTEGYVTRVKNQGQCGSCWAFSSTGA 152
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ F+ T ++E+S Q +VDC+ GN C GG + N Y++ G+ E YPY
Sbjct: 153 LEGQTFRKTGVLQEISEQNLVDCTQSYGNEACNGGWMDNAFTYIKDNKGIDSEVGYPYYA 212
Query: 191 KQ-SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C + + V + + +P DE+ALKV +ATVGPI+V+I+A+ +F Y SG+Y
Sbjct: 213 RALGYCYYNQQYNVASDTGFVDIPSGDENALKVAVATVGPISVAIDATKASFMSYQSGVY 272
Query: 250 DDEACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
++ C + + ++HA+L+VGY R+ WI+KN W WGD GY+ + R +N+CGIA
Sbjct: 273 NEPTCGNGIENLDHAVLVVGYGTEEGRDFWIVKNSWDTTWGDQGYIKMSRNMSNQCGIAT 332
Query: 303 YAVYALI 309
A Y ++
Sbjct: 333 KASYPIV 339
>gi|359385048|emb|CBY80149.1| silicatein yellow variant [Tethya aurantium]
Length = 332
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 163/303 (53%), Gaps = 16/303 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y+ + + +K L W SN K I HN A G++L NHL D+ Y ++
Sbjct: 33 KNQHSKSYQTELEELEKHLVWLSNKKYIELHNANAD--TFGFSLAMNHLGDMTDYEYQEK 90
Query: 74 MTRLTHSRIR-----RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+S + + R P + P+ +DWR KG +T NQ DCGA YAFS
Sbjct: 91 YLTYQNSNSKSGNYTKVFQREPWMTD----PETVDWRTKGAVTNIKNQGDCGASYAFSAM 146
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G +T ++ LS Q ++DCS+ GN GC GG++ YV G+ E YPY
Sbjct: 147 GALEGASALATGKLIPLSEQNIIDCSVPYGNHGCKGGNMYIAFKYVIANDGVDSETSYPY 206
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
GKQS C +K N V +S + E L+ +A GP+AV+I+ S + F+ Y SG+
Sbjct: 207 GGKQSSCTYKTQNSVASMSGSIQIKYGSETDLEAAVANNGPVAVAIDGSSNAFRFYFSGV 266
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
YD C+S Y+NHAM++ GY + W+ KN W +WG+ GY+ + R N+CGIA+
Sbjct: 267 YDSSRCSSSYLNHAMVITGYGISGDQEYWLAKNSWGTNWGEEGYVKMARNKYNQCGIASD 326
Query: 304 AVY 306
A +
Sbjct: 327 ASF 329
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 179/323 (55%), Gaps = 21/323 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ ++W + K+Y+ D ++ + K+ +++H + HN+ QGL + L N
Sbjct: 22 LVQEQWGAFKMTHNKQYQSDTEERF---RMKIFMENSHT-VAKHNKLYAQGLVSFKLGIN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITP 112
+D+ +++ + ++ + +RS ES++SV +P +DWR+KG +TP
Sbjct: 78 KYADMLHHEFVQVLNGFNRTK---SGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTP 134
Query: 113 DWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLN 172
+Q CG+C++FS +++GQ F+ + ++ LS Q +VDCS GN GC GG + N
Sbjct: 135 VKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFR 194
Query: 173 YVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
Y++ GG+ E+ YPYK + C +K N + + +E L+ +ATVGP++V
Sbjct: 195 YIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSV 254
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNG 287
+I+AS +FQLY+ G+Y + C++ ++H +L+VGY W++KN W WGD G
Sbjct: 255 AIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQG 314
Query: 288 YMYLKRG-NNRCGIANYAVYALI 309
Y+ + R +N CGIA A Y L+
Sbjct: 315 YIKMARNRDNNCGIATEASYPLV 337
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 114/307 (37%), Positives = 167/307 (54%), Gaps = 20/307 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
YK++Y + ++ + +N +I HN QG YT+ N SD +E+ R
Sbjct: 73 YKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEFSD----KTDEELKR 128
Query: 77 LTHSRIRRTLVRSPESNESVLI----PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
L R +L S + ++ + I P +DWR KG +TP NQ +CG+C+AFS AI+
Sbjct: 129 LRC--FRGSLNASRDGSKYITIAAPPPSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIE 186
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F +T + LS QQ+VDCS GN C GG + N YV+ + G+ E YPY +
Sbjct: 187 GQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGE 246
Query: 193 S-----ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
+ C+F VV ++ + LP LK + GPI+V+INA +F Y SG
Sbjct: 247 TGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSG 306
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+Y D+ C+SD ++H +LLVGY + W++KN W HWG+NGY+ + R NN CG+A+
Sbjct: 307 VYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVAS 366
Query: 303 YAVYALI 309
A Y LI
Sbjct: 367 MASYPLI 373
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 179/323 (55%), Gaps = 21/323 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ ++W + K+Y+ + ++ + K+ +++H + HN+ QGL + L N
Sbjct: 22 LVQEQWGAFKMTHNKQYQSETEERF---RMKIFMENSHT-VAKHNKLYAQGLVSFKLGIN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITP 112
+D+ +++ + ++ + +RS ES++SV +P +DWR+KG +TP
Sbjct: 78 KYADMLHHEFVQVLNGFNRTK---SGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTP 134
Query: 113 DWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLN 172
+Q CG+C++FS +++GQ F+ + ++ LS Q +VDCS GN GC GG + N
Sbjct: 135 VKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFR 194
Query: 173 YVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
Y++ GG+ E+ YPYK + C +K N + + +E L+ +ATVGP++V
Sbjct: 195 YIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSV 254
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNG 287
+I+AS +FQLY+ G+Y + C++ ++H +L+VGY W++KN W WGD G
Sbjct: 255 AIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQG 314
Query: 288 YMYLKRG-NNRCGIANYAVYALI 309
Y+ + R NN CGIA A Y L+
Sbjct: 315 YIKMARNRNNNCGIATEASYPLV 337
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 112/305 (36%), Positives = 163/305 (53%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y ++ + ++L W+ N K + HN E GLH Y L NHLSD+
Sbjct: 47 KKTYGKQYEEQNQEVTRRLIWEKNLKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASL 106
Query: 74 MT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M+ R+ + R T R N + +PD +DWR+KG +T Q CG+C+AFS A+
Sbjct: 107 MSSLRIPNQWSRNTTYRL---NSNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGAL 163
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISG--NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+ Q+ T ++ LS Q +VDCS N GC GG + Y+ G+ + YPYK
Sbjct: 164 EAQLKLKTGKLVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPYK 223
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
K C++ N S ++ LP E ALK +A GP++V I+AS +F LY SG+Y
Sbjct: 224 AKDGKCQYNPANRAATCSRYTELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYKSGVY 283
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
D +CT + VNH +L+ GY ++ W++KN W +GD GY+ + R N CGIAN+
Sbjct: 284 YDPSCTQN-VNHGVLVTGYGNLDGKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIANFP 342
Query: 305 VYALI 309
Y I
Sbjct: 343 SYPEI 347
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 169/309 (54%), Gaps = 11/309 (3%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
F + +K+ Y ++K+ L + +N K+ HN+ Q+G Y + N+ +D
Sbjct: 64 FFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYELR 123
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
K + RI + + S+E +PD +DWR G +TP NQ CG+C+AFS AI
Sbjct: 124 KLRGYRSACRIAKPKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAI 183
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY--- 188
+GQ ++ T+ + LS QQ++DCS GN GC GG + YV+ G+ E YPY
Sbjct: 184 EGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNEGIDSEISYPYISG 243
Query: 189 KGKQSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
G +++ C F NI+ ++ + + DE AL +AT+GP++V+INA +F +Y SG
Sbjct: 244 DGDENVRCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGPVSVAINAGLSSFSMYKSG 303
Query: 248 IYDDEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGI 300
IY D C S+ ++H +LLVGY + W++KN W WGD GY+ LK N CG+
Sbjct: 304 IYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILKDSKNMCGV 363
Query: 301 ANYAVYALI 309
A+ A Y L+
Sbjct: 364 ASAASYPLV 372
>gi|221117518|ref|XP_002157675.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 340
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 116/320 (36%), Positives = 173/320 (54%), Gaps = 19/320 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
M + EW I K+ K Y ++ K L+W+ N++KI HN E + + + N
Sbjct: 29 MKDPEWRRFKI----KFGKFYSSNIEETSKYLNWKINNEKIKNHNSENRF----FKIGMN 80
Query: 61 HLSDLHPRHYIK---EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQE 117
SDL +IK +L S I T + +V IPD +DWR KG++ P NQ
Sbjct: 81 QFSDLTHEEFIKIYGGCFKLPKSFINITKGSTFLPPSNVNIPDEVDWRTKGYVNPVKNQG 140
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+C+AFS A++GQ F+ T + +LS Q +VDC+ GN C GG + N Y+
Sbjct: 141 QCGSCWAFSTTGALEGQTFRKTGVLPDLSEQNLVDCTQSYGNEACNGGWMDNAFKYISDN 200
Query: 178 GGLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINA 236
G+ E YPY K C + + V + + + DE ALKV +ATVGPI+V+I+A
Sbjct: 201 KGIDSEAGYPYYAKALGYCYYNQQFNVASDTGFVDIASGDEDALKVAVATVGPISVAIDA 260
Query: 237 SPHTFQLYASGIYDDEACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMY 290
+ +F Y SG+Y + C + + ++HA+L+VGY R+ W++KN W WGD GY+
Sbjct: 261 TKDSFMRYQSGVYYEPTCGNGLENLDHAVLVVGYGTEDGRDFWLVKNSWDITWGDQGYIK 320
Query: 291 LKRG-NNRCGIANYAVYALI 309
+ R +N+CGIA A Y L+
Sbjct: 321 MSRNMSNQCGIATKASYPLV 340
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
Length = 372
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 169/309 (54%), Gaps = 11/309 (3%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
F + +K+ Y ++K+ L + +N K+ HN+ Q+G Y + N+ +D
Sbjct: 64 FFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYELR 123
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
K + RI + + S+E +PD +DWR G +TP NQ CG+C+AFS AI
Sbjct: 124 KLRGYRSACRIAKPKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAI 183
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY--- 188
+GQ ++ T+ + LS QQ++DCS GN GC GG + YV+ G+ E YPY
Sbjct: 184 EGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNKGIDSEISYPYISG 243
Query: 189 KGKQSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
G +++ C F NI+ ++ + + DE AL +AT+GP++V+INA +F +Y SG
Sbjct: 244 DGDENVRCLFNSTNIMAQVTGYINIHEGDERALMNAVATIGPVSVAINAGLPSFSMYKSG 303
Query: 248 IYDDEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGI 300
IY D C S+ ++H +LLVGY + W++KN W WGD GY+ LK N CG+
Sbjct: 304 IYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILKDSKNMCGV 363
Query: 301 ANYAVYALI 309
A+ A Y L+
Sbjct: 364 ASAASYPLV 372
>gi|449491897|ref|XP_002194340.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 304
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 171/302 (56%), Gaps = 19/302 (6%)
Query: 24 KATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIR 83
+ ++ ++ W+ N ++I HN+E +QG H Y L N DL + + + M T +
Sbjct: 4 QEAEALRRQVWEQNLRRIQQHNREERQGKHSYRLAMNRFGDLTNQEFNELMNGYTP--VP 61
Query: 84 RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIE 143
R +++ ++ P +DWR KG++TP Q DCG+C+AFS A++G F+ T ++
Sbjct: 62 REEPAEFQTSAALESPMKVDWRAKGYVTPVKFQGDCGSCWAFSATGALEGLTFRQTGKLV 121
Query: 144 ELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSICKFKRPNI 202
LS Q ++DC+ GN+GC GGS++ YV+ GGL E YPY G S C+++
Sbjct: 122 ALSEQNLIDCTKNLGNMGCRGGSMQRAFQYVRDNGGLSSERAYPYTGTDNSPCRYEPSYK 181
Query: 203 VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG-IYDDEAC--TSDY- 258
+ S + +P E AL +A VGP++V+++AS +TF+ Y+SG ++ C T Y
Sbjct: 182 AANCSGFRTVPRGSEAALAQAVAAVGPVSVAVDASSYTFRFYSSGTVHSSSPCIFTCPYG 241
Query: 259 ---VNHAMLLVG--------YTRNSWILKNWWSHHWGDNGYMY-LKRGNNRCGIANYAVY 306
+NHAMLLVG Y+ N WILKN WS WG GYM+ LK +N+CG+A+ Y
Sbjct: 242 QQQLNHAMLLVGYNTAHLGNYSVNYWILKNSWSEGWGQKGYMFLLKNAHNQCGVASDGSY 301
Query: 307 AL 308
+
Sbjct: 302 PV 303
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 168/305 (55%), Gaps = 12/305 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
++++ K+Y + T++ L + SN ++I HN Q+G + + NH++DL Y K
Sbjct: 87 KEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHITDLPFEEYRKL 146
Query: 73 --EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
R S T P + + +P H DWR+ G++T NQ CG+C+AFS A
Sbjct: 147 NGYKPRYDDSHRNGTKFLVPFN---INVPGHWDWRDHGYVTEVKNQGMCGSCWAFSATGA 203
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ + + LS Q +VDCS GN GC GG + Y++ G+ E YPYKG
Sbjct: 204 LEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYIKDNHGVDTEASYPYKG 263
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K+ C F + + + + LP DE LK+ +AT GPI+V+I+A +FQ+Y G+Y
Sbjct: 264 KEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVAIDAGHPSFQMYRKGVYY 323
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ C+S+ ++H +L+VGY + WI+KN W WG+ GY+ + R +N CGIA+ A
Sbjct: 324 EPQCSSESLDHGVLVVGYGTDEIDGDYWIVKNSWGPGWGEKGYVRIARNRDNHCGIASKA 383
Query: 305 VYALI 309
Y ++
Sbjct: 384 SYPIV 388
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 167/306 (54%), Gaps = 18/306 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K Y +K ++ +++ W+ + + I HN E G H + L NH D+ + + M
Sbjct: 36 HGKSYEQK-EETWRRMVWEEHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNG 94
Query: 76 ---RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ TH +++ + P E +P H+DWR++G++TP +Q CG+C+AFS A++
Sbjct: 95 YKYKQTHKKLQGSHFLEPNFLE---VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ T ++ LS Q +V+CS GN GC GG + YV+ GG+ E+ YPY G
Sbjct: 152 GQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTD 211
Query: 193 SI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + + + +P E AL +A VGP++V+I+A +FQ Y SGIY +
Sbjct: 212 DTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFE 271
Query: 252 EACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIAN 302
C+S ++H +L+VGY + WI+KN WS WG NGY+ + K +N CGIA
Sbjct: 272 AECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIAT 331
Query: 303 YAVYAL 308
A Y L
Sbjct: 332 AASYPL 337
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 178/323 (55%), Gaps = 21/323 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ ++W + K+Y+ D ++ + K+ +++H + HN+ QGL + L N
Sbjct: 22 LVQEQWGAFKMTHNKQYQSDTEERF---RMKIFMENSHT-VAKHNKLYAQGLVSFKLGIN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITP 112
+D+ +++ + ++ + +RS ES++SV +P +DWR+KG +TP
Sbjct: 78 KYADMLHHEFVQVLNGFNRTK---SGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTP 134
Query: 113 DWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLN 172
+Q CG+C++FS +++GQ F+ + ++ LS Q +VDCS GN GC GG + N
Sbjct: 135 VKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFR 194
Query: 173 YVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
Y++ GG+ E+ YPYK + C +K N + + +E L+ +ATVGP++V
Sbjct: 195 YIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSV 254
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNG 287
+I+AS +FQLY+ G+Y + C+ ++H +L+VGY W++KN W WGD G
Sbjct: 255 AIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQG 314
Query: 288 YMYLKRG-NNRCGIANYAVYALI 309
Y+ + R +N CGIA A Y L+
Sbjct: 315 YIKMARNRDNNCGIATEASYPLV 337
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 102/300 (34%), Positives = 168/300 (56%), Gaps = 8/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K Y + + + N +I HN EA QG H Y ++ NH DL ++ +
Sbjct: 36 HQKGYDSSVEEKLRLKIFMENSLRISRHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNG 95
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
++ + TL + ++++ +P+H+DWRE+G +TP NQ CG+C++FS +++GQ F
Sbjct: 96 YIYNN-KTTLGGTFIPSKNINLPEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDF 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ T ++ LS Q +VDCS GN GC GG + Y+Q G+ E YPY+G C
Sbjct: 155 RKTGKLISLSEQNLVDCSRKYGNNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCH 214
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ N + + E L+ LATVGPI+V+I+AS +FQ Y+ G+Y ++ C+
Sbjct: 215 YDPKNKGGSDIGFVDIKKGSEKDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSP 274
Query: 257 DYVNHAMLLVGYTRNS------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ ++H +L VGY + W++KN WS WG++GY+ + R +N CGIA+ A Y ++
Sbjct: 275 ENLDHGVLAVGYGTDEVTGEDYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPVV 334
>gi|395844675|ref|XP_003795081.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 105/290 (36%), Positives = 161/290 (55%), Gaps = 14/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HN E QG +G+T+ N D+ + + M + + ++ + R
Sbjct: 48 RRAVWEKNMKMIELHNGEYSQGKYGFTMAMNAFGDMTNEEFRQVMNGFQNRKHKKGKMFR 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + P +DWR+KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLFFKP---PKSVDWRKKGYVTPVKNQGQCGSCWAFSSTGALEGQMFRKTGKLISLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
+VDCS GN GC+GG + NYV+ GGL E YPY + CK+K V + +
Sbjct: 165 NLVDCSQRQGNHGCSGGLMNFAFNYVKENGGLDSEVSYPYVARDEKCKYKPEYSVANDTG 224
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ +P Q E AL +A +GPI+++I+AS + Q Y SGIY + C+S ++H +LL+GY
Sbjct: 225 FVNIPTQ-EKALMKAVAIIGPISIAIDASHISIQFYKSGIYYEPNCSSKNLDHGVLLIGY 283
Query: 269 TRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W +KN W WG +G + + K NN CGIA+ A Y +
Sbjct: 284 GFEGTDSDDNKFWFIKNSWGIEWGLDGCIKIAKDKNNHCGIASAASYPTV 333
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKAT 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++AS +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W ++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|346574375|gb|AEO36959.1| silicatein-alpha 2 [Baikalospongia fungiformis]
Length = 316
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 165/306 (53%), Gaps = 17/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ Y K Y + + ++ W SN K I HN A + GY+L NH D+ + +
Sbjct: 18 KGSYGKSYASELEELERHSVWLSNRKYIEEHN--AHSDVFGYSLAMNHFGDMSEVEF--K 73
Query: 74 MTRLTHSRIRRT-----LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
LTH T ++P+ + V + +DWR KG +T NQ DCGA YAF+
Sbjct: 74 DVFLTHEPGNYTSRGIATFKAPQGMKYVYL---IDWRTKGAVTSVKNQGDCGASYAFAAT 130
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
++G S + LS Q ++DCS+ GN GC+GG + YV GG+ E Y
Sbjct: 131 GTMEGANALSNDKQVSLSEQNIIDCSVPYGNHGCSGGDTYTAIKYVVDNGGIDTESSYSL 190
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+GKQS C++ N + +P E L +ATVGP+AV+++A+ + F+ Y SG+
Sbjct: 191 RGKQSSCQYNSNNSGASATGAVGIPYGSESDLMAAVATVGPVAVAVDANTNAFRFYQSGV 250
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
+D C+S +NHAML+ GY ++ W++KN W +WGDNGY+ + R N+CGIA+
Sbjct: 251 FDSSTCSSTKLNHAMLVTGYGSYNGKDYWLVKNSWGKYWGDNGYIMMVRNKYNQCGIASD 310
Query: 304 AVYALI 309
A+Y+++
Sbjct: 311 ALYSML 316
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNQILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAT 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDALHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 167/306 (54%), Gaps = 18/306 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K Y +K ++ +++ W+ + + I HN E G H + L NH D+ + + M
Sbjct: 36 HGKSYEQK-EETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNG 94
Query: 76 ---RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ TH +++ + P E +P H+DWR++G++TP +Q CG+C+AFS A++
Sbjct: 95 YKYKQTHKKLQGSHFLEPNFLE---VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ T ++ LS Q +V+CS GN GC GG + YV+ GG+ E+ YPY G
Sbjct: 152 GQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTD 211
Query: 193 SI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + + + +P E AL +A VGP++V+I+A +FQ Y SGIY +
Sbjct: 212 DTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFE 271
Query: 252 EACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIAN 302
C+S ++H +L+VGY + WI+KN WS WG NGY+ + K +N CGIA
Sbjct: 272 AECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIAT 331
Query: 303 YAVYAL 308
A Y L
Sbjct: 332 AASYPL 337
>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
Length = 333
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 171/315 (54%), Gaps = 18/315 (5%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW+ + +Y K Y+ + + + + N I HN + G + L N +
Sbjct: 28 EEWMAF----KLEYNKVYQDETEEQLRFKIFNYNKLLIARHNLKWAAGKVSFNLAVNKFA 83
Query: 64 DLHPRHYIKEM---TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
DL + M + S + P + + +PD +DWR+ GF+TP +Q CG
Sbjct: 84 DLLDHEFQDLMLGKMSPSGSNFGSSTFLPPVN---LTLPDAVDWRKYGFVTPVKDQGSCG 140
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+AFS +++GQ F+ T ++ LS Q ++DCS GN GC G++ Y+Q G+
Sbjct: 141 SCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCS--PGNNGCKNGAVEYAFRYIQSNKGI 198
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
E YPY+ Q+ C+F+R I + + L P DE L +ATVGPI+V IN+S +
Sbjct: 199 DTEISYPYEAAQNQCRFRRDTIGATSTGFVKLNPGDEMELAQAVATVGPISVLINSSLDS 258
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKR-G 294
F+ Y G+Y+D +C + + HA+L+VGY + W++KN WS HWG+ GY+ +KR
Sbjct: 259 FKFYHDGVYNDPSCNPNKLTHAVLVVGYGTDDRGGDFWLVKNSWSTHWGEQGYVKIKRNA 318
Query: 295 NNRCGIANYAVYALI 309
NN CGIA+ A+Y L+
Sbjct: 319 NNLCGIASNALYPLV 333
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 168/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNI--TYKSNANQILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKAT 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++AS +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W ++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNQILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++AS +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYSREDVLKEAVANKGPVSVGVDASHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W ++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 168/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNI--TYKSNANQILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKAT 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++AS +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W ++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGVLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 111/306 (36%), Positives = 168/306 (54%), Gaps = 18/306 (5%)
Query: 19 KDYRKKATDSK----KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
KD+ KK K +++ W+ N K I HN + G H Y L N D+ + + M
Sbjct: 33 KDWHKKTYAPKEEGWRRVLWEKNLKMIEFHNLDHSLGKHSYRLGMNQFGDMTNEEFKQLM 92
Query: 75 TRLTHSR-IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+ + IR + +P + E+ P +DWR+KG++TP +Q CG+C+AFS A++G
Sbjct: 93 NGYKNQKMIRGSTFLAPNNFEA---PKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q ++ TS++ LS Q +VDCS GN GC GG + YV+ GG+ E+ YPY K
Sbjct: 150 QHYRKTSKLISLSEQNLVDCSRAQGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDD 209
Query: 194 I-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + N + + + + E L +A+VGP++V+I+A +FQ Y SGIY +
Sbjct: 210 QECHYDPNNNSANDTGFVDVQSGCEKDLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEP 269
Query: 253 ACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
C+S+ ++H +L+VGY S WI+KN WS WGDNGY+ + K +N CGIA
Sbjct: 270 ECSSEDLDHGVLVVGYGFESEDVDGKKYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATA 329
Query: 304 AVYALI 309
A Y L+
Sbjct: 330 ASYPLV 335
>gi|86285730|gb|ABC94586.1| silicatein alpha [Hymeniacidon perlevis]
Length = 331
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 162/303 (53%), Gaps = 9/303 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y K + ++ L W SN K I HN A + G+TL N D+ +
Sbjct: 31 KSRHSKIYESKLVELERHLTWVSNKKYIEQHN--ANSHIFGFTLAMNKFGDMSELEWANF 88
Query: 74 MTRLTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
++ + + + ++ + + V P+ +DWR KG +T +Q DCGA YAFS A++
Sbjct: 89 LSYBSDGKSKGNYTKTFQPDPRVHDYPEAIDWRTKGAVTAVKDQGDCGASYAFSAMGALE 148
Query: 133 GQ-IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
G E S Q ++DCSI GN GC GG++ + YV GG+ KE YP+ GK
Sbjct: 149 GAYALAHNGNQESFSEQNIIDCSIPYGNYGCHGGNMYDAFLYVIANGGVAKESAYPFLGK 208
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
QS C + R +S + + E L+ +A VGP+AV+I+ + + F+ Y SG+YD
Sbjct: 209 QSSCNYNRNTRGTGMSGSVAIKSESEDDLQTAVANVGPVAVAIDGANNAFRFYYSGVYDS 268
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
C+S +NHAM++ GY + W+ KN WS +WG +GY+ + RG N+CGIA A Y
Sbjct: 269 SRCSSTSLNHAMVVTGYGTYAGKKYWLAKNSWSTNWGQSGYVMMARGKYNQCGIATDASY 328
Query: 307 ALI 309
+
Sbjct: 329 PTL 331
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNWILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 167/308 (54%), Gaps = 16/308 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y + + + + N KI HNQ QG Y L N +D+ + + M
Sbjct: 34 QHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMN 93
Query: 76 RLTHS-----RIRRTLVRS---PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
H+ R R LV + P ++ V +P +DWRE G +T +Q CG+C+AFS
Sbjct: 94 GYNHTLRQLMRERTGLVGATYIPPAH--VTVPKSVDWREHGAVTGVKDQGHCGSCWAFSS 151
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ F+ + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 152 TGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 211
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+G C F + I + + +P DE +K +AT+GP++V+I+AS +FQLY+ G
Sbjct: 212 YEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEG 271
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+Y++ C ++H +L+VGY + W++KN W WG+ GY+ + R NN+CGIA
Sbjct: 272 VYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQNNQCGIA 331
Query: 302 NYAVYALI 309
+ Y +
Sbjct: 332 TASSYPTV 339
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 97/283 (34%), Positives = 156/283 (55%), Gaps = 9/283 (3%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVRSPES 92
++ N + I HN + G +TL+ N D+ + M + RR T + +
Sbjct: 48 FEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEEFTATMNGFLNVPSRRPTAILRADP 107
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+E+ +P +DWR KG +TP +Q+ CG+C+AFS +++GQ F ++ LS Q +VD
Sbjct: 108 DET--LPKEVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVD 165
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
CS GN+GC GG + Y++ G+ E+ YPY+ + C+F N+ + + +
Sbjct: 166 CSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDV 225
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
E ALK +AT+GPI+V+I+AS +FQ Y G+Y +E C+S ++H +L VGY
Sbjct: 226 EHGSESALKKAVATIGPISVAIDASQPSFQFYHDGVYYEEGCSSTMLDHGVLAVGYGETE 285
Query: 273 -----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W+ WG+ GY+ + R N CGIA+ A Y L+
Sbjct: 286 KGEAYWLVKNSWNTSWGNKGYIQMSRDKKNNCGIASQASYPLV 328
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 167/302 (55%), Gaps = 21/302 (6%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y + D + + N K I HN ++ + + N SDL + ++K T
Sbjct: 34 KKYHNQGEDDFRHYVFLQNIKTIAAHNAKST-----FKMAINEFSDLTRKEFVK-----T 83
Query: 79 HSRIRRTLVRSPESNESVL------IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
++ R ++ +S + + +P +DWR++G++TP NQ CG+C+AFS +++
Sbjct: 84 YNGYRLSMKKSTNKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLE 143
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ T ++ LS Q ++DCS GN GC GG + + Y++ G+ E YPY+G+
Sbjct: 144 GQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPYEGRD 203
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
IC++K+ N + + + E LK +ATVGPI+V+I+AS +F +Y +G+Y +
Sbjct: 204 DICRYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTGVYHEP 263
Query: 253 ACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+ ++H +L+VGY T N W++KN W WG NGY+ + R +N CGIA A Y
Sbjct: 264 ECSQTVLDHGVLVVGYGTENGEDYWLVKNSWGTDWGMNGYIKMSRNRSNNCGIATNASYP 323
Query: 308 LI 309
LI
Sbjct: 324 LI 325
>gi|139002720|dbj|BAF51966.1| cathepsin K [Carassius auratus]
gi|139002725|dbj|BAF51967.1| tartrate-resistant acid phosphatase [Carassius auratus]
Length = 332
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 161/299 (53%), Gaps = 9/299 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+K++Y +S ++ W+ N I HN+E + G+H Y L NH D+ +++
Sbjct: 37 HKREYNGLDEESIRRAIWEKNMLFIEAHNKEYELGIHTYNLGMNHFGDMTLEEVAEKVMG 96
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L + + + +++V +P +D+R+ G++T NQ CG+C+AFS A++GQ+
Sbjct: 97 L-QMPMYQDQTNTFMPDDTVGLPKSIDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLK 155
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+ ++ +LS Q +VDC ++ N GC GG + N YV+ G+ EE YPY G C
Sbjct: 156 KTKGQLVDLSPQNLVDC--VTDNDGCGGGYMTNAFRYVKDNQGIDSEEGYPYVGTDQQCA 213
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + +P +E AL +A VGP++V I+A TF Y SG+Y D C
Sbjct: 214 YNSSARAATCKGFKEIPQGNEKALTAAVAKVGPVSVGIDAMQSTFLYYKSGVYYDPNCNK 273
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
D VNHA+L VGY + WI+KN W WG GY+ + R NN CGIA+ A + ++
Sbjct: 274 DDVNHAVLAVGYGATPKGKKYWIVKNSWGEDWGKKGYVLMARNRNNACGIASLASFPVM 332
>gi|46251290|gb|AAS84611.1| cathepsin L-like cysteine proteinase I variant form precursor
[Heterodera glycines]
Length = 374
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 103/305 (33%), Positives = 170/305 (55%), Gaps = 9/305 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
++K+ K Y + ++++ L + S + I HN+ ++G + + E H++DL Y K
Sbjct: 70 KQKHGKAYADQEVENERMLTYLSAKQFIDKHNEAYKEGKVSFRVGETHIADLPFSEYQKL 129
Query: 73 -EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
RL +RR +P+ +DWR+KG++T NQ CG+C+AFS A+
Sbjct: 130 NGFRRLMGDSLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAFSATGAL 189
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ + + LS Q ++DCS GN+GC GG + N Y++ G+ KE YPYK K
Sbjct: 190 EGQHVRDKGHLVSLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNKGIDKETAYPYKAK 249
Query: 192 QS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C FKR ++ S ++ + DE L++ +AT GP++V+I+A +FQLY +G+Y
Sbjct: 250 TGKKCLFKRNDVGATDSGYNDIAEGDEEDLRMAVATQGPVSVAIDAGHRSFQLYTNGVYF 309
Query: 251 DEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
++ C ++H +L+ GY + WI+KN W WG+ GY+ + R NN CGIA++A
Sbjct: 310 EKECDPQNLDHGVLVEGYGTDPTQGDYWIVKNSWGTRWGEQGYIRMARNRNNNCGIASHA 369
Query: 305 VYALI 309
+ L+
Sbjct: 370 SFPLV 374
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 167/303 (55%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNITY--KSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 210 DLKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|198432217|ref|XP_002130230.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 327
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 98/287 (34%), Positives = 154/287 (53%), Gaps = 6/287 (2%)
Query: 29 KKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR 88
K++L W+ N + + HN E +GLH YT+ +DL + R
Sbjct: 41 KRQLIWEKNLRVVTQHNYEYDEGLHTYTMAMTKFADLENDEFAAMYLPRMRKDSRNGFCS 100
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+ V P +DWR +G++TP NQ CG+C+AFS +++GQ F T + LS Q
Sbjct: 101 AQPVGGFVENPTSIDWRTRGYVTPVKNQLQCGSCWAFSTTGSLEGQHFAKTKNLVSLSEQ 160
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
Q++DCS G+ GC GG + +Y+ AGG+ E DYPY+ + C+F +I ++
Sbjct: 161 QLMDCSFKEGDEGCGGGIMDYAFDYIFLAGGVESEADYPYEARNDHCRFDNSSIAATLTG 220
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ E L+ + ++GP++V+I+AS +FQLY SG+ + C++ ++H +L VGY
Sbjct: 221 CVDVTSGSETQLEKAVGSIGPVSVAIDASHISFQLYGSGVNYEPMCSTTTLDHGVLAVGY 280
Query: 269 TRNS----WILKNWWSHHWGD-NGYMYL-KRGNNRCGIANYAVYALI 309
++ WI+KN W WG NGY+ + K NN CGIA A Y +
Sbjct: 281 GADNGNEYWIVKNSWGEGWGHLNGYIKMSKNRNNNCGIATQASYPTV 327
>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 333
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 159/304 (52%), Gaps = 9/304 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + KDY + D ++++ W+ N ++ HN E G+H Y L NHL+D+ +
Sbjct: 31 KNTHNKDYEDEIEDLQRRITWEKNLNLVNMHNLEYSMGMHTYELGMNHLADMTSEEIKSK 90
Query: 74 MTRLT---HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+T L S + T S +PD +DWR+KG ++ NQ CG+C+AFS A
Sbjct: 91 LTGLILPPQSERQATFSSQKNSTFGGKVPDSIDWRDKGCVSDVKNQGGCGSCWAFSAVGA 150
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ T ++ LS Q +VDCS GN GC GG + YV G+ + YPY
Sbjct: 151 LEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFMTQAFQYVIDNKGIDSDSYYPYHA 210
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C + + ++ + P E LK L ++GPI+V+I+ + +F LY SG+Y
Sbjct: 211 MDEKCHYDPTGKASTCAKYTEIVPGTEDNLKQALGSIGPISVAIDGTRPSFFLYRSGVYS 270
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D C+ + VNH +L VGY ++ W+LKN W +GD GY+ + R N CG+A+Y
Sbjct: 271 DPTCSHE-VNHGVLAVGYGNLNGQDFWLLKNSWGTKYGDQGYVRIARNKGNLCGVASYTC 329
Query: 306 YALI 309
Y I
Sbjct: 330 YPEI 333
>gi|301609086|ref|XP_002934107.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 332
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 166/301 (55%), Gaps = 9/301 (2%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K ++K Y+ + ++ W+ K I HN E GLH Y + N L D+ M
Sbjct: 35 KTHQKTYKDTEEERTRRTIWEETLKFITVHNLEYSLGLHTYEVGMNRLGDMTGEEVAATM 94
Query: 75 TRLTHS-RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T T S + P+ L P +DWR KG +TP Q DCG+CYAFS A++
Sbjct: 95 TGYTGSDHSLANMSHVPKEILEALPPASIDWRTKGCVTPVRKQGDCGSCYAFSTVGAMEC 154
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+ K T + LS Q++VDCS GN GC G+L + Y++ G+M+E Y Y G+++
Sbjct: 155 QVKKKTGRLVILSPQELVDCSYTEGNNGCINGNLLSAFRYME-KYGIMEESSYRYTGQEA 213
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+ +R + + +++ + DE+ LK + TVGP++V+++A + F+ Y SG+Y D
Sbjct: 214 DCR-RRVHTGLKVNAIYNVTDGDENVLKNAVGTVGPVSVAVDARQNGFRTYKSGVYFDPY 272
Query: 254 CTSDYVNHAMLLVGYTR----NSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
CT +V HA+L+VGY R + W++KN W WGD GY+ + R N+ CGIAN A Y
Sbjct: 273 CTL-HVTHAVLVVGYGREYGNDYWLVKNSWGVDWGDQGYVKMARNRNSHCGIANQAFYPT 331
Query: 309 I 309
+
Sbjct: 332 V 332
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 166/307 (54%), Gaps = 12/307 (3%)
Query: 13 PQKKYKKDYRKKATDSKKKLHWQS----NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
P + +K Y + +++ L+ QS N + I HN + + G +TL+ N D+
Sbjct: 18 PWQDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSE 77
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ M + R + +E+ +P H+DWR KG +TP +Q+ CG+C+AFS
Sbjct: 78 EFAATMNGFLNVPTRHPVAILEADDET--LPKHVDWRTKGAVTPVKDQKQCGSCWAFSTT 135
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++GQ F ++ LS Q +VDCS GN+GC GG + Y++ G+ EE YPY
Sbjct: 136 GSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPY 195
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C+F N+ + + + +E++L +A +GPI+V+I+AS +FQ Y G+
Sbjct: 196 EAQDGKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGV 255
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
Y ++ C+S ++H +L +GY W++KN W+ WGD G++ + R N CGIA+
Sbjct: 256 YYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIAS 315
Query: 303 YAVYALI 309
A Y L+
Sbjct: 316 QASYPLV 322
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 166/307 (54%), Gaps = 12/307 (3%)
Query: 13 PQKKYKKDYRKKATDSKKKLHWQS----NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
P + +K Y + +++ L+ QS N + I HN + + G +TL+ N D+
Sbjct: 2 PWQDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSE 61
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ M + R + +E+ +P H+DWR KG +TP +Q+ CG+C+AFS
Sbjct: 62 EFAATMNGFLNVPTRHPVAILEADDET--LPKHVDWRTKGAVTPVKDQKQCGSCWAFSTT 119
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++GQ F ++ LS Q +VDCS GN+GC GG + Y++ G+ EE YPY
Sbjct: 120 GSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPY 179
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C+F N+ + + + +E++L +A +GPI+V+I+AS +FQ Y G+
Sbjct: 180 EAQDGKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGV 239
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
Y ++ C+S ++H +L +GY W++KN W+ WGD G++ + R N CGIA+
Sbjct: 240 YYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGIAS 299
Query: 303 YAVYALI 309
A Y L+
Sbjct: 300 QASYPLV 306
>gi|340368360|ref|XP_003382720.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 326
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 103/297 (34%), Positives = 152/297 (51%), Gaps = 10/297 (3%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y K + +++ W+SN K + HN + + G+T+ N +DL +
Sbjct: 29 KYNKVYETKDIELARQVIWESNKKFVENHNANSDK--FGFTVAMNEFADLDAAEFASIFN 86
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ + V + +DWREKG +T NQ CG+C++FS +++GQ
Sbjct: 87 GFL--SLPNNSTKDFYKKTGVKVAATVDWREKGAVTAIKNQGKCGSCWSFSTTGSLEGQH 144
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F T + LS QQ VDCS GN GC GG++ N Y++ G E YPY + C
Sbjct: 145 FLKTGTLLSLSEQQFVDCSTKFGNHGCKGGTMDNAFRYLETVSGDETEMMYPYTAEDGFC 204
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
KF+ V + +P DE AL+ +ATVGPI+V+I+A +FQLY G+Y + C+
Sbjct: 205 KFRSTEGKVKCEGYKDIPRDDEDALREAVATVGPISVAIDAGHSSFQLYKEGVYYNPTCS 264
Query: 256 SDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
S ++H +L VGY + W++KN W WG GY+ + R N CGIA A Y
Sbjct: 265 STKLDHGVLAVGYGTYEGSEEYWLVKNSWGPSWGMEGYIMMSRNRENNCGIATMASY 321
>gi|50761194|ref|XP_418273.1| PREDICTED: cathepsin L1 [Gallus gallus]
Length = 336
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 166/306 (54%), Gaps = 19/306 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY---IKE 73
Y K+Y +A ++++ W++N ++I HN E QG H + L NH DL + +
Sbjct: 37 YAKEYPGEAELIRREV-WENNLRRIEQHNWEESQGQHTFRLGMNHYGDLMDEEFNQLLNG 95
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+ H T +++ + P +DWR +G++TP NQ CG+C+AFS A++G
Sbjct: 96 FAPVQHEEPALTF----QASAAQKTPAEVDWRMRGYVTPVKNQGHCGSCWAFSATGALEG 151
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ- 192
+F T ++ LS Q ++DCS GN GC GG + YV GG+ E YPY+
Sbjct: 152 LVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMNSEHIYPYQATDT 211
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
S C++ + + S+ ++ E AL+ +ATVGP++V+++AS F Y SGI++
Sbjct: 212 SSCRYNPADRAANCSTVWLVAQGSEAALEQAVATVGPVSVAVDASSFFFHFYKSGIFNSM 271
Query: 253 ACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
C S VNH ML VGY + WILKN WS WG+ GY+ L +G NN CG+AN
Sbjct: 272 FC-SQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYIRLLKGVNNHCGVANQ 330
Query: 304 AVYALI 309
A + L+
Sbjct: 331 ASFPLL 336
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 106/307 (34%), Positives = 163/307 (53%), Gaps = 13/307 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y + + + + N KI HN++ +G Y++ N D+ ++
Sbjct: 31 KAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVS- 89
Query: 74 MTRLTHSRIRRTLVRS------PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
TR R + R PE+ E +P +DWR KG +TP NQ CG+C+AFS
Sbjct: 90 -TRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSA 148
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ F+ + + LS Q +V CS GN GC GG + + Y++ G+ E+ YP
Sbjct: 149 TGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYP 208
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y G C FK+ + S + + E LK +ATVGPI+V+I+AS +FQ Y+ G
Sbjct: 209 YNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 248 IYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+YD+ C S+ ++H +L+VGY T N W +KN W WGD GY+ + R N+CGIA+
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIAS 328
Query: 303 YAVYALI 309
A L+
Sbjct: 329 SASIPLV 335
>gi|14041143|emb|CAA71554.1| cathepsin [Geodia cydonium]
Length = 322
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 159/303 (52%), Gaps = 17/303 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + D ++ W SN K + + E + GYT+ N +DL PR ++
Sbjct: 25 KYNKQYSSQEEDYLRQRVWLSNLKFVEEFDSERE----GYTVAMNEFADLDPREFVSHYN 80
Query: 76 RLTHSRIRRTLVRSPE----SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
L RR S E + +P +DWR KG++T NQ CG+C+AFS ++
Sbjct: 81 GLR----RRPHTSSGEPCTLGEDVSALPTTVDWRTKGYVTGVKNQGQCGSCWAFSATGSL 136
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F +T ++ LS Q +VDCS GN GC GG + YV GG+ E YPY +
Sbjct: 137 EGQHFNATGKLVSLSEQNLVDCSSAEGNEGCNGGLPDDAFKYVIKNGGIDTEASYPYVAR 196
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + NI SS+ + + E L+V ATVGPI V I+AS FQLY G+Y
Sbjct: 197 DEKCHYSSANIGSTCSSYVDIESKSEAQLQVASATVGPIPVGIDASHLGFQLYDGGVYHS 256
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+ C+ ++H +L+VGY ++ W++KN W +WG +G M + R +N CGIA A Y
Sbjct: 257 DLCSQTRLDHGVLVVGYGVYKEKDYWMVKNSWGTNWGISGDMMMSRNRDNNCGIATMASY 316
Query: 307 ALI 309
++
Sbjct: 317 PVV 319
>gi|326934570|ref|XP_003213361.1| PREDICTED: cathepsin L1-like [Meleagris gallopavo]
Length = 332
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 110/306 (35%), Positives = 165/306 (53%), Gaps = 19/306 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY---IKE 73
Y K+Y +A ++++ W+ N ++I HN E QG H + L NH DL + +
Sbjct: 33 YAKEYPGEAEVIRREV-WEKNLRRIEQHNWEESQGQHTFRLGMNHYGDLMDEEFNQLLNG 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T + H T +++ + P +DWR +G++TP NQ CG+C+AFS A++G
Sbjct: 92 FTPVQHEEPALTF----QASAAQKTPAEVDWRVRGYVTPVKNQGHCGSCWAFSATGALEG 147
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ- 192
+F T ++ LS Q ++DCS GN GC GG + YV GG+ E YPY+
Sbjct: 148 LVFNWTRKLAVLSEQNLIDCSRKLGNNGCQGGYMTRAFQYVHDNGGMNSEHVYPYQATDT 207
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
S C++ + + SS ++ E AL+ +ATVGP++V+++AS F Y SGI+
Sbjct: 208 SSCRYNPGDRAANCSSVWLVAQGSEAALEQAVATVGPVSVAVDASSFFFHFYKSGIFSSM 267
Query: 253 ACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
C S VNH ML VGY + WILKN WS WG+ GY+ L +G +N CG+AN
Sbjct: 268 FC-SQTVNHGMLAVGYGTSQEGRKNMSYWILKNSWSEVWGEQGYIRLLKGASNHCGVANQ 326
Query: 304 AVYALI 309
A + L+
Sbjct: 327 ASFPLL 332
>gi|410990002|ref|XP_004001239.1| PREDICTED: cathepsin L1-like isoform 1 [Felis catus]
Length = 333
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/289 (35%), Positives = 160/289 (55%), Gaps = 12/289 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W N K I HN+E Q H +T+ N D+ + + M L + ++ V
Sbjct: 48 RRAVWGENMKMIVQHNREYNQKEHSFTMAMNGFGDMTNEEFRQVMNGLQIQKHKKGKVFQ 107
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
P IP ++WR+KG++TP NQ C + +AFS AI+GQ+F T ++ LS Q
Sbjct: 108 PPLFAE--IPSSVNWRKKGYVTPTKNQGQCASGWAFSATGAIEGQMFGKTHKLVSLSEQN 165
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
++DC+ GN GC+GGS+ N YV+ GGL EE YPY + CK+ + DI+ +
Sbjct: 166 LLDCAWSEGNGGCSGGSMGNAFQYVKDNGGLDSEESYPYHAQLQSCKYNPESSAADITGF 225
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+ P+ E+ L ++A VGP++ +I+AS TF+ Y GIY D C+ + ++H +L+VGY
Sbjct: 226 LTI-PKTEYKLMRSVAIVGPVSAAIDASLDTFRFYDQGIYYDSNCSHEILHHGVLIVGYG 284
Query: 269 -------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ WI+KN W WG +GY+ + K +N CGIA+ A + +
Sbjct: 285 FEGAELDNKKYWIVKNSWGEAWGIDGYILMAKDRDNHCGIASLASFPTV 333
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 169/308 (54%), Gaps = 17/308 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K++ + + + + N KI HNQ QG + L N SD+ + + M
Sbjct: 33 EHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLYHEFKETMN 92
Query: 76 RLTHSRIRRTL--------VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
H+ +R+ L + P +N V IP +DWR+ G +T +Q CG+C+AFS
Sbjct: 93 GYNHT-MRKVLRAQGFSGIIYIPPAN--VQIPKSVDWRQHGAVTAVKDQGHCGSCWAFSS 149
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+A++GQ F+ + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 150 TAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 209
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+G C F + + + + +P DE AL +AT+GP++V+I+AS +FQLY+ G
Sbjct: 210 YEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHESFQLYSEG 269
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+Y++ C + ++H +L+VGY + W++KN W WGD GY+ + R +N+CGIA
Sbjct: 270 VYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIA 329
Query: 302 NYAVYALI 309
+ Y +
Sbjct: 330 TASSYPTV 337
>gi|15826035|pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
gi|15826036|pdb|1FH0|B Chain B, Crystal Structure Of Human Cathepsin V Complexed With An
Irreversible Vinyl Sulfone Inhibitor
Length = 221
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 136/218 (62%), Gaps = 9/218 (4%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DWR+KG++TP NQ+ CG+C+AFS A++GQ+F+ T ++ LS Q +VDCS
Sbjct: 1 LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 60
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC GG + YV+ GGL EE YPY ICK++ N V + ++V+ P E
Sbjct: 61 GNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVAQDTGFTVVAPGKE 120
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY------TRN 271
AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY + N
Sbjct: 121 KALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSDN 180
Query: 272 S--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
S W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 181 SKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 218
>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
Length = 330
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 169/303 (55%), Gaps = 12/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K++K Y +++K+ W++N K I HN++ +G HG+ L N DL + +
Sbjct: 33 KTKHRKTYNMN-EEAQKRAVWENNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFREL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT + T+ + P + +P +DWR+ G++TP +Q CG+C+AFS +++
Sbjct: 92 MTGFQSMGHKEMTIFQEPLLGD---VPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLE 148
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQIF+ T ++ LS Q ++DCS GN+GC GG + YV+ GL E Y Y+
Sbjct: 149 GQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWD 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ V+I+ + V P E AL +A+VGP++V I+ H+F+ Y G Y +
Sbjct: 209 GPCRYDPKYSAVNITGF-VKVPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEP 267
Query: 253 ACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
C+S ++HA+L+VGY S W++KN W WG +GY+ + K +N CGIA YA+Y
Sbjct: 268 DCSSTNLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIY 327
Query: 307 ALI 309
+
Sbjct: 328 PTV 330
>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
Length = 338
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 169/303 (55%), Gaps = 12/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K++K Y +++K+ W++N K I HN++ +G HG+ L N DL + +
Sbjct: 41 KTKHRKTYNMN-EEAQKRAVWENNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFREL 99
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT + T+ + P + +P +DWR+ G++TP +Q CG+C+AFS +++
Sbjct: 100 MTGFQSMGHKEMTIFQEPLLGD---VPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLE 156
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQIF+ T ++ LS Q ++DCS GN+GC GG + YV+ GL E Y Y+
Sbjct: 157 GQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWD 216
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ V+I+ + V P E AL +A+VGP++V I+ H+F+ Y G Y +
Sbjct: 217 GPCRYDPKYSAVNITGF-VKVPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEP 275
Query: 253 ACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
C+S ++HA+L+VGY S W++KN W WG +GY+ + K +N CGIA YA+Y
Sbjct: 276 DCSSTNLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIY 335
Query: 307 ALI 309
+
Sbjct: 336 PTV 338
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 169/303 (55%), Gaps = 12/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K++K Y +++K+ W++N K I HN++ +G HG+ L N DL + +
Sbjct: 52 KTKHRKTYNMNE-EAQKRAVWENNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFREL 110
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT + T+ + P + +P +DWR+ G++TP +Q CG+C+AFS +++
Sbjct: 111 MTGFQSMGHKEMTIFQEPLLGD---VPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLE 167
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQIF+ T ++ LS Q ++DCS GN+GC GG + YV+ GL E Y Y+
Sbjct: 168 GQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWD 227
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ V+I+ + V P E AL +A+VGP++V I+ H+F+ Y G Y +
Sbjct: 228 GPCRYDPKYSAVNITGF-VKVPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEP 286
Query: 253 ACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
C+S ++HA+L+VGY S W++KN W WG +GY+ + K +N CGIA YA+Y
Sbjct: 287 DCSSTNLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIY 346
Query: 307 ALI 309
+
Sbjct: 347 PTV 349
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 173/322 (53%), Gaps = 17/322 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ ++W + +K+YK D +K + K+ +++HK + N+ + GL Y L+ N
Sbjct: 22 LVQEQWGTFKLQHKKQYKSDTEEKF---RMKIFMENSHK-VAKXNKLYEMGLVSYKLKIN 77
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI-------PDHLDWREKGFITPD 113
+D+ ++ + ++ L S + + I P+++DWRE G +T
Sbjct: 78 KYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXV 137
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
+Q CG+C++FS A++GQ F+ T+++ LS Q +VDCS GN GC GG + N Y
Sbjct: 138 KDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGNDGCNGGLMDNAFKY 197
Query: 174 VQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVS 233
V++ G+ E YPY C + + +P DE L +ATVGP++V+
Sbjct: 198 VKYNHGIDTEASYPYHADDEKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPVSVA 257
Query: 234 INASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGY 288
I+AS +FQLY+ G+Y D C+S+ ++H +L+VGY ++ WI+KN W WG+ GY
Sbjct: 258 IDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTDENGQDYWIVKNSWGESWGEQGY 317
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R +N CGIA A Y L+
Sbjct: 318 IKMARNRDNNCGIATQASYPLV 339
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 167/313 (53%), Gaps = 20/313 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
YK++Y + ++ + +N +I HN QG YT+ N SD I +
Sbjct: 73 YKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDKVIGLIIHTICF 132
Query: 77 LTHSRIRR------TLVRSPESNESVLI----PDHLDWREKGFITPDWNQEDCGACYAFS 126
T ++R +L S + ++ + I P +DWR KG +TP NQ +CG+C+AFS
Sbjct: 133 QTDEELKRLRCFRGSLNASRDGSKYITIAAPPPSEIDWRNKGAVTPVKNQGNCGSCWAFS 192
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
AI+GQ F +T + LS QQ+VDCS GN C GG + N YV+ + G+ E Y
Sbjct: 193 ATGAIEGQNFLATGNLVSLSEQQLVDCSSEYGNNACNGGLMDNAFKYVKDSNGIDTEASY 252
Query: 187 PYKGKQS-----ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
PY ++ C+F VV ++ + LP LK + GPI+V+INA +F
Sbjct: 253 PYVSGETGDANPTCRFNLKEAVVRVTGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSF 312
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NN 296
Y SG+Y D+ C+SD ++H +LLVGY + W++KN W HWG+NGY+ + R NN
Sbjct: 313 MSYKSGVYSDDQCSSDDLDHGVLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNN 372
Query: 297 RCGIANYAVYALI 309
CG+A+ A Y L+
Sbjct: 373 LCGVASMASYPLM 385
>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
Length = 330
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 169/303 (55%), Gaps = 12/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K++K Y +++K+ W++N K I HN++ +G HG+ L N DL + +
Sbjct: 33 KTKHRKTYSMNE-EAQKRAVWENNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFREL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT + T+ + P + +P +DWR+ G++TP +Q CG+C+AFS +++
Sbjct: 92 MTGFQSMGHKEMTIFQEPLLGD---VPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLE 148
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQIF+ T ++ LS Q ++DCS GN+GC GG + YV+ GL E Y Y+
Sbjct: 149 GQIFRKTGKLVPLSEQNLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWD 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ V+I+ + V P E AL +A+VGP++V I+ H+F+ Y G Y +
Sbjct: 209 GPCRYDPKYSAVNITGF-VKVPLSEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEP 267
Query: 253 ACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
C+S ++HA+L+VGY S W++KN W WG +GY+ + K +N CGIA YA+Y
Sbjct: 268 DCSSTNLDHAVLVVGYGEESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIY 327
Query: 307 ALI 309
+
Sbjct: 328 PTV 330
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/296 (34%), Positives = 158/296 (53%), Gaps = 7/296 (2%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + ++ ++ N I N+ L G+TL N +D+ + + L
Sbjct: 37 KSYRDGQEELIRRFIFEDNLHTIEEFNR-VNASLAGFTLGVNEFADMTNTEFSNMLLGL- 94
Query: 79 HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS 138
R + ES+ +P +DW +KG++T NQ CG+C+AFS +++GQ+FK
Sbjct: 95 GGRNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQVFKK 154
Query: 139 TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK 198
T ++ LS Q +VDCS GN GC GG + Y++ GG+ E YPY G C+F
Sbjct: 155 TGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDGTCRFL 214
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
+ +S + + DE+ALK +ATVGPI+V+I+AS FQ Y G+Y+ C+S
Sbjct: 215 ENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWFCSSTE 274
Query: 259 VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++H +L+VGY ++ W++KN W WG GY+ + R NRCGIA A Y +
Sbjct: 275 LDHGVLVVGYGTEGGKDYWLVKNSWGSSWGLKGYIKMVRNKKNRCGIATQASYPTV 330
>gi|198432221|ref|XP_002130541.1| PREDICTED: similar to cathepsin L [Ciona intestinalis]
Length = 330
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 159/290 (54%), Gaps = 9/290 (3%)
Query: 29 KKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-IKEMTRLTHSRIRRTLV 87
K++L W+ N + + HN E +GLH YT+ +DL + + + R +
Sbjct: 41 KRQLIWEKNLRVVTQHNYEYDEGLHTYTMAMTKFADLENDEFNTMYLASMPADRKNELVC 100
Query: 88 RSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
+ ++ P +DWR +G++TP NQ CG+C+AFS +++GQ F T ++ LS
Sbjct: 101 KKQTIDKFAQNPTTVDWRTQGYVTPVKNQLQCGSCWAFSATGSLEGQHFAKTKKLVSLSE 160
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
QQ++DCS G+LGC GG Y+ GG+ E +YPY+ K +C+F + ++
Sbjct: 161 QQLIDCSTKQGDLGCGGGYPDWAFAYINQVGGIESETNYPYEAKNDVCRFNVSEVAATLT 220
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD--YVNHAMLL 265
+ P E L+ + ++GP++V I+AS +FQLY SGIY ++ C+S ++H +L
Sbjct: 221 GCVDITPDSETQLEKAVGSIGPVSVLIDASHISFQLYGSGIYYEQQCSSSPASLDHGVLA 280
Query: 266 VGYTRNS----WILKNWWSHHWGD-NGYMYL-KRGNNRCGIANYAVYALI 309
VGY ++ W++KN W WG GY+ + K NN CGIA A Y ++
Sbjct: 281 VGYGADNGQEYWMVKNSWGEGWGKLGGYIKMAKNKNNNCGIATQASYPIV 330
>gi|354502591|ref|XP_003513367.1| PREDICTED: cathepsin L1-like isoform 1 [Cricetulus griseus]
Length = 330
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 106/317 (33%), Positives = 167/317 (52%), Gaps = 22/317 (6%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSK---KKLHWQSNHKKIHTHNQEAQQGLHGYTLRE 59
N EW +K+K YRK +++ ++ W+ N K I HN+E QG + +T+
Sbjct: 26 NAEW--------QKWKTKYRKTYINNEEVERRALWEENMKLIQLHNKEYHQGKNTFTMAM 77
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRT-LVRSPESNESVLIPDHLDWREKGFITPDWNQED 118
N D P + +T + T + + P + +P +DWR+ G++TP +Q
Sbjct: 78 NAFGDQRPLELAQTLTAFQSQEAKETNIFQEPLLGD---VPKSVDWRKHGYVTPVKDQGS 134
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
C +C+AFS +++GQ+F+ T ++ LS Q +VDCS N GC GG + Y++ G
Sbjct: 135 CVSCWAFSAVGSLEGQMFRKTGKLVPLSEQNLVDCSRSQHNNGCHGGLFTSAFQYIKDNG 194
Query: 179 GLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
GL E YPY+ + C++ + +I+ + V+ P +E AL +ATVGPI++ I+
Sbjct: 195 GLDTSESYPYEAQDGPCRYDPKHSAANITGFVVV-PSNEEALMKAVATVGPISIGISVRL 253
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-K 292
+ Y SG Y D C + Y NH++LLVGY S W++KN W WG +GY+ + K
Sbjct: 254 RSLLFYKSGFYYDPDCYNHYPNHSVLLVGYGEESDGQKYWLVKNSWGEEWGMDGYIKIAK 313
Query: 293 RGNNRCGIANYAVYALI 309
NN C IA A Y +
Sbjct: 314 DRNNHCSIATIAAYPTV 330
>gi|432114312|gb|ELK36240.1| Aryl hydrocarbon receptor nuclear translocator [Myotis davidii]
Length = 897
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 131/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S
Sbjct: 684 PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLMKKTGKLLNLSPQNLVDC--VSE 741
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G+ C + + +P +E
Sbjct: 742 NDGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYKEIPEGNEK 801
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
ALK +A VGPI+V+I+AS +FQ Y+ G+Y DE C SD +NHA+L VGY + WI
Sbjct: 802 ALKKAVARVGPISVAIDASLSSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWI 861
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 862 IKNSWGENWGNKGYILMARNKNNACGIANLASF 894
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 110/304 (36%), Positives = 163/304 (53%), Gaps = 12/304 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y+ + + ++L W+ N K + HN E GLH Y L NHL D+ I
Sbjct: 32 KKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSMNHLGDMTSEEVISL 91
Query: 74 MT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M+ R+ + R T R + + +PD +DWREKG +T Q CG+C+AFS A+
Sbjct: 92 MSSLRIPNQWNRNTTYRLSSNQK---LPDSVDWREKGCVTEVKYQGSCGSCWAFSAVGAL 148
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
+ Q+ T ++ LS Q +VDCS N GC GG + + YV G+ + YPYK
Sbjct: 149 EAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVIDNNGIDSDVSYPYKA 208
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C++ + S ++ LP E ALK +A GP++V I+A +F LY SG+Y
Sbjct: 209 TDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVGIDAKTPSFFLYKSGVYY 268
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D +CT VNH +L++GY ++ W++KN W H+GD GY+ + R N CGIAN+
Sbjct: 269 DPSCTQK-VNHGVLVIGYGNLDGQDYWLVKNSWGLHFGDKGYVRIARNRGNHCGIANFPS 327
Query: 306 YALI 309
Y I
Sbjct: 328 YPEI 331
>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 329
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 165/317 (52%), Gaps = 20/317 (6%)
Query: 2 TNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENH 61
T+ EW + + Y K Y + ++ W+ N I THN ++ + HGYTL N
Sbjct: 23 TSNEWELW----KATYGKSYLTLEEEKYRRDTWEENSLLIKTHNTDSDK--HGYTLEMNS 76
Query: 62 LSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQED 118
DL E + L ++ R+ L S S L +P LDWR+K +T NQ
Sbjct: 77 FGDLTS----AEFSSL-YNGYRQNLETSGSVFSSSLRNAMPSSLDWRDKKVVTDVKNQGK 131
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
CG+C+AFS +++G T + LS QQ++DCS+ GN GC GG++R+ Y++ AG
Sbjct: 132 CGSCWAFSTTGSLEGLHALKTGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYIKDAG 191
Query: 179 GLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
G EE YPY K C+F + + +P DE +L L VGPI+V+++A
Sbjct: 192 GDDTEESYPYTAKNESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGL 251
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKR 293
TFQ Y GIY D C++ ++NH + L+GY +S W++KN W WG +GY L R
Sbjct: 252 KTFQFYKKGIYSDYLCSNTHLNHGVTLIGYGESSDGSPYWLVKNSWGKDWGIDGYFMLAR 311
Query: 294 -GNNRCGIANYAVYALI 309
N CG+A A Y ++
Sbjct: 312 YVGNMCGVATDASYPIL 328
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 166/307 (54%), Gaps = 13/307 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K Y+ + + + + N KI HNQ G + + N +D+ + + M
Sbjct: 33 EHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADMLHHEFRETMN 92
Query: 76 RLTHSRIRRTLVRSPE-------SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
++ + P S V +P +DWREKG +T +Q CG+C+AFS
Sbjct: 93 GFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGHCGSCWAFSST 152
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F+ T + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YPY
Sbjct: 153 GALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 212
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+G C F + ++ ++ +P +E + +AT+GP++V+I+AS +FQ Y+ GI
Sbjct: 213 EGIDDSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVSVAIDASHESFQFYSEGI 272
Query: 249 YDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
Y++ C S ++H +L+VGY ++ W++KN W WGD G++ + R +N+CGIA+
Sbjct: 273 YNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIKMARNEDNQCGIAS 332
Query: 303 YAVYALI 309
+ Y L+
Sbjct: 333 ASSYPLV 339
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 166/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ L S+ +R + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 TSSLRVPSQWQRNITY--KSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 210 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYE 269
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 270 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328
Query: 307 ALI 309
I
Sbjct: 329 PEI 331
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 159/301 (52%), Gaps = 11/301 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM-- 74
+ + Y ++ ++ + SN + I+ HN G H YTL N DL + +
Sbjct: 28 HNRQYASAQEEALRQEIYLSNLELINEHNA---AGRHSYTLGMNEFGDLAHHEFAAKYLG 84
Query: 75 TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
R ++ S V +PD +DWR G +TP NQ CG+C++FS +++GQ
Sbjct: 85 VRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQ 144
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
+ T + LS Q +VDCS GN GC GG + + Y+ GG+ E YPY
Sbjct: 145 HARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTATTGT 204
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
CKF NI ++S+ + E L+ +ATVGP++V+I+AS FQ Y +G+Y+++ C
Sbjct: 205 CKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVYNEKKC 264
Query: 255 TSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYAL 308
++ ++H +L VGY ++ W++KN W WG GY+++ R +N+CGIA A Y L
Sbjct: 265 STTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADNQCGIATSASYPL 324
Query: 309 I 309
+
Sbjct: 325 V 325
>gi|84660244|emb|CAI43319.1| silicatein alpha [Lubomirskia baicalensis]
gi|85677148|emb|CAI46306.1| silicatein alpha [Lubomirskia baicalensis]
gi|220675708|emb|CAP69653.1| silcatein [Lubomirskia baicalensis]
Length = 326
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 104/297 (35%), Positives = 163/297 (54%), Gaps = 9/297 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-MTRL 77
K Y + + ++ W SN K + HN A GYTL NHL DL Y+++ +T
Sbjct: 33 KRYASELEELERHAIWLSNKKYVEEHNARADA--FGYTLAMNHLGDLSAEEYVEQYLTNA 90
Query: 78 THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
S R + ++ + + V + +DWR KG +T Q CGA YAF+ A++G
Sbjct: 91 RGSNEHREM-KAFLAPKGVQYAESIDWRTKGAVTSVKYQGQCGASYAFAATGALEGASAL 149
Query: 138 STSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKF 197
+ + LS Q ++DCS+ GN GC+GG YV GG+ E Y +KGKQS C++
Sbjct: 150 ANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVIDNGGIDTESSYSFKGKQSSCQY 209
Query: 198 KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD 257
+ + E L +ATVGP+AV+++A+ + F+ Y SG++D +C+S
Sbjct: 210 NNKTSGASATGVVSIGYGSESDLLAAVATVGPVAVAVDANTNAFRFYQSGVFDSSSCSST 269
Query: 258 YVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NHAML+ GY ++ W++KN WS +WGD+GY+ + R N+CGIA+ A+Y ++
Sbjct: 270 KLNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDSGYILMVRNKYNQCGIASDALYPML 326
>gi|354502589|ref|XP_003513366.1| PREDICTED: testin-2-like [Cricetulus griseus]
Length = 333
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 170/305 (55%), Gaps = 13/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y + + ++ + W+ N K I HN E QG H +T+ N DL+ + K
Sbjct: 33 KTKHGKTYDENEENLRRAV-WEKNFKMIELHNWEYLQGKHDFTMAMNAFGDLNNTEFRKT 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M RI+R R + + + +P ++WRE+G++TP +Q C + +AFS A++G
Sbjct: 92 MAGFRRQRIKRR--RIFQDHLFLSVPKQVNWREQGYVTPVKSQGHCASSWAFSATGALEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+FK T ++ LS Q ++DC + C+GG +++ YV+ GGL EE YPY+G
Sbjct: 150 QMFKKTRKLNALSEQNLLDCMEFNVTRSCSGGFMQSAFQYVRDNGGLATEESYPYQGHAM 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+++ N ++ + +P +E AL +A VGPI+V+I+A +FQ Y SGIY +
Sbjct: 210 ECRYQAKNSAANVKDFVQIPGHEE-ALMKAVANVGPISVAIDARHSSFQFYESGIYYEPK 268
Query: 254 CTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
C + NHA+L+VGY + W++KN W WG GYM + + NN CGIA +A
Sbjct: 269 CKRVHQNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGIKGYMKIAKDWNNHCGIATHA 328
Query: 305 VYALI 309
Y ++
Sbjct: 329 TYPIV 333
>gi|1834307|dbj|BAA09820.1| cysteine proteinase [Spirometra erinaceieuropaei]
gi|1834309|dbj|BAA09821.1| cysteine proteinase [Spirometra erinaceieuropaei]
Length = 336
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 102/301 (33%), Positives = 167/301 (55%), Gaps = 11/301 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE--- 73
+KK+Y + +K + +N I HNQ Q L Y +R N SDL P + +
Sbjct: 39 FKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTPGEFAERYLC 98
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+ + +++RR S E+ +PD ++WRE+G +T NQ CG+C++FS AI+G
Sbjct: 99 LRGIVLTKLRRKEAVSVPLKEN--LPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEG 156
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
I T + LS QQ++DCS GN GC GG + Y Q G+ E DY Y +
Sbjct: 157 AIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRY-GVEAEVDYRYTERDG 215
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
+C++++ +V +++ ++ LP DE L+ +AT+GPI+V I+A+ F Y+ G++ +
Sbjct: 216 VCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYSHGVFVSKT 275
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C+ ++H +L+VGY + W++KN W WG++GY+ + R NN CGIA+ A Y
Sbjct: 276 CSPYAIDHGVLVVGYGAENGDAYWLVKNSWGSSWGEDGYLKMARNRNNMCGIASMASYPT 335
Query: 309 I 309
+
Sbjct: 336 V 336
>gi|208972996|dbj|BAG74347.1| silicatein-G2 [Ephydatia fluviatilis]
Length = 326
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 105/300 (35%), Positives = 161/300 (53%), Gaps = 11/300 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ + K Y + + W +N ++I HN EA+ HG+TL N SDL + +
Sbjct: 28 KETHGKAYSTTDEEQNRLSVWLANKRRIDHHNVEAEN--HGFTLAMNSFSDLTDEEFAER 85
Query: 74 MTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
L H + TL R E+ + V D +DWR KG + NQ CGA YAF+ +++
Sbjct: 86 F--LNHQQGNYTLRRVAMFEAPQGVKYADSVDWRTKGAVNSVKNQGQCGASYAFAAVASL 143
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+G + ++ LS Q V+DCS+ N GC GG L YV GG+ E Y YK +
Sbjct: 144 EGMNALANEKMVALSEQNVIDCSVPYSNRGCNGGDTYAALKYVVDNGGIDTESTYAYKER 203
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
QS C+F I S + E L +AT+GP+AV+++A+ + F+ Y SGI+
Sbjct: 204 QSSCQFNSKYIGATASGVVAISSSSESELMAAVATMGPVAVAVDANTYAFRYYQSGIFSS 263
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
AC+S +NHAM++ GY +S W++KN W +WG+ GY+ + R N+CGIA+ A++
Sbjct: 264 SACSSTKLNHAMVVTGYGTSSGKDYWLVKNSWGSNWGNGGYIMMARNKYNQCGIASDALF 323
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 169/302 (55%), Gaps = 12/302 (3%)
Query: 17 YKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
+K Y +K D+K++L+ +Q N + I N++ + G + + N D+ +
Sbjct: 22 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 81
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M + + R ++ + E+ + +DWR K +TP +QE CG+C+AFS A++
Sbjct: 82 VMK--GYKKGSRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCGSCWAFSATGALE 139
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F E+ LS QQ+VDCS GN GC GG + + +Y++ GG+ E YPY+ +
Sbjct: 140 GQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED 199
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+F N + I + SV E AL+ ++ VGPI+V+I+AS +FQ Y+SG+Y ++
Sbjct: 200 RSCRFD-ANSIGAICTGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQ 258
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+ +++H +L VGY T++ W++KN W WGD GY+ + R +N CGIA+ Y
Sbjct: 259 NCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYP 318
Query: 308 LI 309
+
Sbjct: 319 TV 320
>gi|28932706|gb|AAO60047.1| midgut cysteine proteinase 4 [Rhipicephalus appendiculatus]
Length = 345
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 104/301 (34%), Positives = 167/301 (55%), Gaps = 18/301 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKK----IHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
K++K Y K S++ ++ + ++ + T +++ + G Y++ NH +D+ P +
Sbjct: 40 KFRKIYNKTYGTSEETVYREQVFRRTFNFLRTVDEKFKNGTLLYSVAVNHFADMTPDEVV 99
Query: 72 KEMTRL---THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
T + ++ + +P ++ P+ ++WRE GF+TP NQ CG+C+AFS
Sbjct: 100 ANYTGYKPPSAQQLAEIPLYAPLFGDT---PEFIEWRENGFVTPVKNQGQCGSCWAFSST 156
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ+FK T + LS Q ++DC+ GN GC GG + YVQ AGGL E YP
Sbjct: 157 GALEGQVFKRTRRLISLSEQNLMDCAGQRYGNNGCNGGQMPGAFQYVQDAGGLDTEARYP 216
Query: 188 YK-GKQSICKFKRPNIV--VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
Y+ G C+F V ++ + +PP++E L+ +A VGPI+++INASP TF Y
Sbjct: 217 YRQGTNFQCQFSNSFEARRVSVNGHTRVPPRNERVLQDAVANVGPISIAINASPQTFMFY 276
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGI 300
+GIY + C +NHA+LLVGY WI+KN W WG+ GY+ + R N CG+
Sbjct: 277 KNGIYGEPNCDPRGLNHAVLLVGYGEERGVPYWIVKNSWGPGWGEGGYIKILRNRNVCGM 336
Query: 301 A 301
+
Sbjct: 337 S 337
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 169/302 (55%), Gaps = 12/302 (3%)
Query: 17 YKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
+K Y +K D+K++L+ +Q N + I N++ + G + + N D+ +
Sbjct: 23 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 82
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M + + R ++ + E+ + +DWR K +TP +QE CG+C+AFS A++
Sbjct: 83 VMK--GYKKGSRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCGSCWAFSATGALE 140
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F E+ LS QQ+VDCS GN GC GG + + +Y++ GG+ E YPY+ +
Sbjct: 141 GQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED 200
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+F N + I + SV E AL+ ++ VGPI+V+I+AS +FQ Y+SG+Y ++
Sbjct: 201 RSCRFD-ANSIGAICTGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQ 259
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+ +++H +L VGY T++ W++KN W WGD GY+ + R +N CGIA+ Y
Sbjct: 260 NCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYP 319
Query: 308 LI 309
+
Sbjct: 320 TV 321
>gi|198432215|ref|XP_002130162.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 331
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/306 (35%), Positives = 170/306 (55%), Gaps = 18/306 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y K YR + + K++ W N K + HN EA +G H Y + N +DL + + MT
Sbjct: 31 YGKVYRAEE-ELKRQYIWLENLKYVTQHNLEADEGKHTYKVDTNQFADLSNDEWRELMT- 88
Query: 77 LTHSRIRRTLVRSPESN-------ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
S++ R + N + V+ P ++DWR++G++TP +Q+ CG+C+AFS
Sbjct: 89 ---SQVTRPTNQMSFCNMTFMTVGDHVIAPKNVDWRKEGYVTPVKDQKQCGSCWAFSTTG 145
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+++GQ FK T ++ LS Q +VDCS+ GN GC GG + Y+ GG+ E YPY
Sbjct: 146 SLEGQHFKKTGKLVSLSEQNLVDCSMKEGNHGCQGGLMDLGFEYIFDNGGIDTESSYPYM 205
Query: 190 GK-QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K + C +KR N ++ + E AL +A VGPI+V+I+A +FQ+Y SG+
Sbjct: 206 AKNEPQCMYKRSNSGATLTGCVDIKRGSESALMKAVADVGPISVAIDAGHKSFQMYKSGV 265
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y + +C+S ++H +L VG+ ++ W++KN W WG GY+ + R +N CGIA
Sbjct: 266 YYEPSCSSVKLDHGVLAVGFGADNGEDFWLVKNSWGPIWGMEGYIMMSRNRDNNCGIATQ 325
Query: 304 AVYALI 309
A Y L+
Sbjct: 326 ASYPLV 331
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/308 (31%), Positives = 170/308 (55%), Gaps = 16/308 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ K+Y + + + + N K+ HN+ QG + L N +D+ ++ +
Sbjct: 33 QHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLN 92
Query: 76 RLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + +++ + N++V +PD +DWR+KG +T +Q CG+C++FS
Sbjct: 93 GF--NKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSA 150
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ F+ T ++ LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 151 TGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 210
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + C +K N + + +E LK +ATVGP++++I+AS TFQLY+ G
Sbjct: 211 YLAEDEKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVSIAIDASHETFQLYSDG 270
Query: 248 IYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+Y D C+S ++H +L+VGY ++ W++KN W WG NGY+ + R +N CG+A
Sbjct: 271 VYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKNSWGPSWGLNGYIKMARNQDNMCGVA 330
Query: 302 NYAVYALI 309
+ A Y L+
Sbjct: 331 SQASYPLV 338
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 173/313 (55%), Gaps = 14/313 (4%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW+ + + K+YR + + + + N KKI HN + + G Y ++ NHL
Sbjct: 11 QEWLAF----KAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLG 66
Query: 64 DLHPRHYIKEMT--RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGA 121
DL + M + T + R + P SNE+ +P +DWR++G +TP +Q CG+
Sbjct: 67 DLMVHEFKALMNGFKKTPNAERNGKIYVP-SNEN--LPKSVDWRQRGAVTPVKDQGHCGS 123
Query: 122 CYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
C++FS +++GQ+F T + LS Q +VDCS GN GC GG + YV+ G+
Sbjct: 124 CWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGID 183
Query: 182 KEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
E YPY+ +++ C+FK + + + E L+ +ATVGPI+V I+AS +F
Sbjct: 184 TEASYPYEARENNCRFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESF 243
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGN-N 296
Q Y+ G+Y ++ C+ ++H +L VGY T N W++KN W WG++GY+ + R + N
Sbjct: 244 QFYSEGVYKEQYCSPSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNHKN 303
Query: 297 RCGIANYAVYALI 309
CGIA+ A Y ++
Sbjct: 304 HCGIASMASYPVV 316
>gi|21245114|ref|NP_640355.1| cathepsin Q precursor [Rattus norvegicus]
gi|12585197|sp|Q9QZE3.1|CATQ_RAT RecName: Full=Cathepsin Q; Flags: Precursor
gi|6010771|gb|AAF01247.1|AF187323_1 cathepsin Q [Rattus norvegicus]
gi|149039733|gb|EDL93849.1| rCG24173 [Rattus norvegicus]
Length = 343
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 112/312 (35%), Positives = 166/312 (53%), Gaps = 21/312 (6%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY---IK 72
KY+K Y + + K++ W+ N KKI HN+E G + YT+ N +D+ + I
Sbjct: 35 KYEKLYSPEE-EVLKRVVWEENVKKIELHNRENSLGKNTYTMEINDFADMTDEEFKDMII 93
Query: 73 EMTRLTHSRIRRTLVRS-----PES-NESVLIPDHLDWREKGFITPDWNQEDCGACYAFS 126
H+ +R R+ P S N +P +DWR +G++T Q C +C+AF
Sbjct: 94 GFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRKQGGCSSCWAFP 153
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
+ AI+GQ+FK T ++ LS+Q ++DCS GN GC G+ N YV GGL E Y
Sbjct: 154 VTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQYVLHNGGLEAEATY 213
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY+ K+ +C++ N I+ + VL P+ E L +AT GPIA ++ +F+ Y
Sbjct: 214 PYERKEGVCRYNPKNSSAKITGFVVL-PESEDVLMDAVATKGPIATGVHVISSSFRFYQK 272
Query: 247 GIYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNR 297
G+Y + C+S YVNHA+L+VGY N W++KN W WG GYM + K NN
Sbjct: 273 GVYHEPKCSS-YVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGLRGYMKIAKDRNNH 331
Query: 298 CGIANYAVYALI 309
C IA+ A Y +
Sbjct: 332 CAIASLAQYPTV 343
>gi|344271939|ref|XP_003407794.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 335
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 110/307 (35%), Positives = 166/307 (54%), Gaps = 15/307 (4%)
Query: 16 KYKKDYRKKATDSKKKLH---WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
++K Y+K +++ L W+ N K I HNQE QG HG+T+ N D + +
Sbjct: 31 QWKSTYKKVYAANEEGLTRAVWEKNMKMIERHNQEHSQGKHGFTMAMNAFGDKTNEEFRQ 90
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M + ++ + IP ++W ++G++TP +Q C +C+AFS A++
Sbjct: 91 LMNGFQSQKHKKGKLFHFHEPVFGHIPTSVNWTQRGYVTPVKDQGSCHSCWAFSATGALE 150
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+F+ T ++ LS Q +VDCS N GC+GG + YV+ GGL EE YPY K+
Sbjct: 151 GQMFRKTGKLVSLSEQNLVDCSRPESNNGCSGGLMDKAFQYVKNNGGLDSEESYPYTAKE 210
Query: 193 SI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
S C +K + + + +PPQ E AL +A+VGPI+V+++AS +F+ Y SGIY D
Sbjct: 211 SRNCLYKPEFSAANNTGFVNIPPQ-EKALMNAVASVGPISVAVDASLKSFRFYKSGIYFD 269
Query: 252 EACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMYL-KRGNNRCGIAN 302
AC VNH +L+VGY W++KN W WG +GY+ + K NN CGIA
Sbjct: 270 PACRLA-VNHGVLVVGYGFEGTDPDKNKYWLVKNSWGKSWGADGYIKIAKDRNNHCGIAR 328
Query: 303 YAVYALI 309
A Y +
Sbjct: 329 AASYPTV 335
>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
Length = 398
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 105/288 (36%), Positives = 162/288 (56%), Gaps = 16/288 (5%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-----IKEMTRLTHSRIRRTLVR 88
+ N + I HN++ Q+G + + NHL+DL Y ++ + R T +R
Sbjct: 115 FTKNLEYIKQHNEKFQRGEVTFEMGVNHLTDLPFDEYKKLNGFRKNNDDSRPRNGSTFLR 174
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
V IPD +DWR ++T +Q CG+C+AFS A++GQ + T ++ LS Q
Sbjct: 175 P----HFVQIPDTVDWRNSSYVTVVKDQGQCGSCWAFSATGALEGQHMRKTHQLVSLSEQ 230
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDIS 207
+VDCS GN GC GG + N Y++ G+ EE YPYKG + C F+R + +
Sbjct: 231 NLVDCSRKYGNNGCNGGLMDNAFEYIKDNHGIDTEESYPYKGVEGKKCHFRRKFVGAEDY 290
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
++ LP DE ALKV +AT+GPI+V+I+A +FQ Y GIY + C+ + ++H +L+VG
Sbjct: 291 GYTDLPEGDEEALKVAVATIGPISVAIDAGHISFQNYRKGIYTENECSPEDLDHGVLVVG 350
Query: 268 YTRNS-----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
Y + WI+KN W WG++GY+ + R N+CGIA+ A Y ++
Sbjct: 351 YGTDENAGDYWIVKNSWGTRWGEHGYIRMARNKRNQCGIASKASYPIV 398
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y K ++ L W+ N + I HN E G H ++L N D+ + + M
Sbjct: 36 HKKSYAPKEEGWRRVL-WEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNG 94
Query: 77 L-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+IR + +P + ES P +DWR+KG++TP +Q CG+C+AFS A++GQ
Sbjct: 95 YKNQKKIRGSTFLAPNNFES---PKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQH 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
+++T ++ LS Q +VDCS GN GC GG + YV+ GG+ E+ YPY K
Sbjct: 152 YRNTGKMISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQE 211
Query: 196 KFKRPNI-------VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
PN VD++S S E L +A+VGP++V+++A +FQ Y SGI
Sbjct: 212 CHYDPNYNSANDTGFVDVTSGS------EKDLMNAVASVGPVSVAVDAGHQSFQFYKSGI 265
Query: 249 YDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCG 299
Y + C+S+ ++H +L+VGY + WI+KN WS WG++GY+Y+ K +N CG
Sbjct: 266 YYEPECSSEDLDHGVLVVGYGFEGEDEDGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCG 325
Query: 300 IANYAVYALI 309
IA A Y L+
Sbjct: 326 IATAASYPLV 335
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 169/310 (54%), Gaps = 27/310 (8%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y K ++ L W+ N + I HN E G H ++L N D+ + + M
Sbjct: 36 HKKSYAPKEEGWRRVL-WEKNLRMIEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNG 94
Query: 77 L-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+IR + +P + ES P +DWR+KG++TP +Q CG+C+AFS A++GQ
Sbjct: 95 YKNQKKIRGSTFLAPNNFES---PKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQH 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
+++T ++ LS Q +VDCS GN GC GG + YV+ GG+ E+ YPY K
Sbjct: 152 YRNTGKMISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQE 211
Query: 196 KFKRPNI-------VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
PN VD++S + E L +A+VGP++V+++A +FQ Y SGI
Sbjct: 212 CHYDPNYNSANDTGFVDVTS------ESEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGI 265
Query: 249 YDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCG 299
Y + C+S+ ++H +L+VGY + WI+KN WS WG++GY+Y+ K +N CG
Sbjct: 266 YYEPECSSEDLDHGVLVVGYGFEGEDEDGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCG 325
Query: 300 IANYAVYALI 309
IA A Y L+
Sbjct: 326 IATAASYPLV 335
>gi|346574373|gb|AEO36958.1| silicatein-alpha 1 [Baikalospongia fungiformis]
Length = 324
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 104/297 (35%), Positives = 162/297 (54%), Gaps = 9/297 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-MTRL 77
K Y + + ++ W SN K + HN A GYTL NHL DL Y ++ +T
Sbjct: 31 KRYASELEELERHAIWLSNKKYVEEHNARADA--FGYTLAMNHLGDLSAEEYAEQYLTNA 88
Query: 78 THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
S R + ++ + + V + +DWR KG +T Q CGA YAF+ A++G
Sbjct: 89 RGSNEHREM-KAFLAPKGVQYAESIDWRTKGAVTSFQYQGQCGASYAFAATGALEGASAL 147
Query: 138 STSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKF 197
+ + LS Q ++DCS+ GN GC+GG YV GG+ E Y +KGKQS C++
Sbjct: 148 ANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVIDNGGIDTESSYSFKGKQSSCQY 207
Query: 198 KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD 257
+ + E L +ATVGP+AV+++A+ + F+ Y SG++D +C+S
Sbjct: 208 NNKTSGASATGVVSIGYGSESDLLAAVATVGPVAVAVDANTNAFRFYQSGVFDSSSCSST 267
Query: 258 YVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
+NHAML+ GY ++ W++KN WS +WGD+GY+ + R N+CGIA+ A+Y ++
Sbjct: 268 KLNHAMLVTGYGSYNGKDYWLIKNSWSKNWGDSGYILMVRNKYNQCGIASDALYPML 324
>gi|15128493|dbj|BAB62718.1| plerocercoid growth factor/cysteine protease [Spirometra
erinaceieuropaei]
gi|15130639|dbj|BAB62799.1| plerocercoid growth factor-2/cysteine protease [Spirometra
erinaceieuropaei]
Length = 336
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 102/301 (33%), Positives = 166/301 (55%), Gaps = 11/301 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE--- 73
+KK+Y + +K + +N I HNQ Q L Y +R N SDL P + +
Sbjct: 39 FKKEYFSSEEELHRKRAFFNNLDFIIRHNQRYYQQLESYAVRLNDFSDLTPGEFAERYLC 98
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+ + +++RR S E+ +PD ++WRE+G +T NQ CG+C++FS AI+G
Sbjct: 99 LRGIVLTKLRRKEAVSVPLKEN--LPDSVNWRERGAVTSVKNQGQCGSCWSFSANGAIEG 156
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
I T + LS QQ++DCS GN GC GG + Y Q G+ E DY Y +
Sbjct: 157 AIQIKTGALRSLSEQQLMDCSWDYGNQGCNGGLMPQAFQYAQRY-GVEAEVDYRYTERDG 215
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
+C++++ +V +++ ++ LP DE L+ +AT+GPI+V I+A+ F Y+ G++ +
Sbjct: 216 VCRYRQDLVVANVTGYAELPEGDEGGLQRAVATIGPISVGIDAADPGFMSYSHGVFVSKT 275
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C+ ++H +L+VGY + W++KN W WG+ GY+ + R NN CGIA+ A Y
Sbjct: 276 CSPYAIDHGVLVVGYGAENGEAYWLVKNSWGSSWGEGGYVKMARNRNNMCGIASMASYPT 335
Query: 309 I 309
+
Sbjct: 336 V 336
>gi|301609084|ref|XP_002934126.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 340
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 114/305 (37%), Positives = 168/305 (55%), Gaps = 16/305 (5%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K ++K Y+ + ++ W+ K I HN E GLH Y + NHL D+ M
Sbjct: 35 KTHQKTYKDTEEERTRRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATM 94
Query: 75 TRLT---HSRIRRTLVRSPESNE--SVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
T T HS + VR NE P +DWR + +TP +Q C +CYAFS
Sbjct: 95 TCYTSSDHSLANMSHVR----NEILEAQPPASIDWRTQNCVTPVRHQGKCCSCYAFSTVG 150
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++ Q K TS++ S Q++VDCS GN GC G L + Y++ G+M+E YPY
Sbjct: 151 ALECQWKKKTSKLVTFSPQELVDCSESEGNKGCKWGYLSKSFEYMK-KYGVMEESAYPYT 209
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
GK+ CK K+P+ ++ + L +E ALK + T+GP++V+I+AS F++Y SG+Y
Sbjct: 210 GKEGQCKRKKPSNTGVVNQFYRLHAGNETALKNVVGTIGPVSVAIDASRQGFRMYKSGVY 269
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
D CT++ VNHA+L+VGY ++ W +KN W +GD GY+ + R N CGIA A
Sbjct: 270 YDPCCTTE-VNHAVLVVGYGTDNGKAYWRVKNSWGIGFGDKGYIKMARNRKNHCGIAKEA 328
Query: 305 VYALI 309
VY ++
Sbjct: 329 VYPIL 333
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 105/297 (35%), Positives = 162/297 (54%), Gaps = 13/297 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-MTRL 77
K Y + D ++ + W N + + HN E H Y L NH +DL + + M
Sbjct: 36 KTYTGEEEDLRRAI-WNDNLEIVKKHNAEN----HSYKLDMNHFADLTVTEFKQRFMGYR 90
Query: 78 THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
S P SN V +P +DWR+KGF+T NQ CG+C+AFS +++GQ F+
Sbjct: 91 AASNSTGGSTFLPLSN--VQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFR 148
Query: 138 STSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKF 197
T ++ LS Q +VDCS GN GC GG + Y++ G+ E+ YPY + C F
Sbjct: 149 KTGKLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQCHF 208
Query: 198 KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD 257
K ++ ++ ++ + E L+ +ATVGPI+V+I+A +FQLY +G+Y + C+S
Sbjct: 209 KPGSVGATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSST 268
Query: 258 YVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++H +L VGY ++ W++KN W WG NGY+ + R +N+CGIA A Y L+
Sbjct: 269 QLDHGVLAVGYGAEDGKDYWLVKNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPLV 325
>gi|119389039|pdb|2C0Y|A Chain A, The Crystal Structure Of A Cys25ala Mutant Of Human
Procathepsin S
Length = 315
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 166/303 (54%), Gaps = 10/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 16 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 75
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + +SN + ++PD +DWREKG +T Q CGA +AFS A++
Sbjct: 76 MSSLRVPSQWQRNITY--KSNPNRILPDSVDWREKGCVTEVKYQGSCGAAWAFSAVGALE 133
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 134 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 193
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 194 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYE 253
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 254 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 312
Query: 307 ALI 309
I
Sbjct: 313 PEI 315
>gi|94448670|emb|CAI91573.1| silicatein a4 [Lubomirskia baicalensis]
gi|312386085|gb|ADQ74587.1| silicatein alpha 4 [Lubomirskia baicalensis]
Length = 326
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 162/303 (53%), Gaps = 17/303 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +++K Y + ++ W SN K I HN + GYTL NH DL Y
Sbjct: 28 KGQHQKSYMSGLEELERHSIWLSNKKYIEEHNAHSDD--FGYTLAMNHFGDLSTEEY--N 83
Query: 74 MTRLTH-----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
LTH + R R+P+ + V D +DWR KG +T Q CGA YAF+
Sbjct: 84 AMYLTHDPGNYTHHGRKAFRTPKGVQYV---DSIDWRTKGAVTSVKYQGQCGASYAFAAT 140
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G S + LS Q ++DCS+ GN GC+GG + YV GG+ E Y +
Sbjct: 141 GALEGASALSNDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVIDNGGIDTESSYSF 200
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+GKQS C++ N + + E L +ATVGP+AV+++A+ + F+ Y SG+
Sbjct: 201 QGKQSSCQYSSKNSGASATGVISIASGSETDLFAAVATVGPVAVAVDANTNAFRFYQSGV 260
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
+D +C++ +NHAML+ GY ++ W++KN WS +WGDNGY+ + R N+CGIA
Sbjct: 261 FDSSSCSNTKLNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGYIMMVRNKYNQCGIATD 320
Query: 304 AVY 306
A+Y
Sbjct: 321 ALY 323
>gi|195123821|ref|XP_002006400.1| GI18587 [Drosophila mojavensis]
gi|193911468|gb|EDW10335.1| GI18587 [Drosophila mojavensis]
Length = 366
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 103/300 (34%), Positives = 154/300 (51%), Gaps = 10/300 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A ++ + + + N + G Y L N +DL ++K++T L
Sbjct: 68 KTYASAADRQLRERIFGARKNLVDATNAAFKGGAKTYELAVNAFADLTKAEFLKQLTGLR 127
Query: 79 HSRIRRTLVR----SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
S + +P +PD DWREKG +TP Q DCG+C++F+ AI+G
Sbjct: 128 KSSSGEQNAKMHRLAPNLAAKEKLPDSFDWREKGGVTPVKFQGDCGSCWSFAATGAIEGH 187
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNL-GCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+F+ T ++ LS Q +VDC L GC GG N+V+ G+ YPY K+
Sbjct: 188 VFRKTGKLPNLSEQNLVDCGPRDLGLDGCDGGYQEYAFNFVKEQDGIAVGSKYPYVDKKD 247
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CK+ I+ ++V+PP+DE A+K +AT GP+A S+ + LY GIY DE
Sbjct: 248 TCKYTSSLSGAQITGFAVIPPKDEQAMKTVIATQGPLACSVYGL-ESLLLYKRGIYADEE 306
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
C + VNH++L+VGY ++ WI+KN W WG++GY L RG N CGIA Y ++
Sbjct: 307 CNNGEVNHSVLVVGYGSENGQDFWIVKNSWDKIWGEDGYFRLPRGKNFCGIATECSYPIV 366
>gi|426365294|ref|XP_004049712.1| PREDICTED: LOW QUALITY PROTEIN: putative cathepsin L-like protein
6-like [Gorilla gorilla gorilla]
Length = 333
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 105/281 (37%), Positives = 151/281 (53%), Gaps = 15/281 (5%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI 98
K I HNQE QG H +T+ N D+ + + M + + R+ + E +L+
Sbjct: 58 KMIEQHNQEYSQGKHSFTMAMNAFGDMTNEEFRQVMNGFQYQKHRK----GKQFQERLLL 113
Query: 99 --PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
P +DWREKG++TP +Q CG+C+AFS A++GQ+F T ++ L+ Q +VDCS
Sbjct: 114 ESPTSVDWREKGYMTPVKDQGQCGSCWAFSATGALEGQVFWKTGKLISLNEQNLVDCSGP 173
Query: 157 SGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQD 216
GN GC GG + N YVQ GL E YPY+GK C + P + V P
Sbjct: 174 QGNEGCNGGFMDNPFRYVQENRGLDSEASYPYEGKVKTCGYN-PKYSAANDTGFVDIPSR 232
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS---- 272
E L +ATVGPI+V++ AS +FQ Y GIY + C + ++HAML+VGY+
Sbjct: 233 EKDLAKAVATVGPISVAVGASHVSFQFYKKGIYFEPRCDPEGLDHAMLVVGYSYEGADWD 292
Query: 273 ---WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W +WG +GY+ + K N CGI A Y +
Sbjct: 293 NKYWLVKNSWGKNWGMDGYIKMAKDRRNNCGITTAASYPTV 333
>gi|19698257|dbj|BAB86771.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 164/303 (54%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL-HPRHYIK 72
+ K+ K Y ++ +K W +NH+KI HNQ A QG+H Y N SD+ H
Sbjct: 26 KAKFGKSYPSLEEEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDMDHEEFRQT 85
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+T++ + R P +V + +DWR G ++P NQ CG+C++FS A++
Sbjct: 86 VLTKMDPPKNNRG-ASEPFRAPNVGLAASVDWRTSGCVSPIKNQGQCGSCWSFSATGALE 144
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
Q + LS QQ+VDCS GN GC GG + YVQ GG+ E YPY+ +
Sbjct: 145 SQTCLRRGYLPSLSEQQLVDCSGPYGNYGCNGGWPDHAFQYVQANGGIDSESYYPYQARV 204
Query: 193 SICKFKRPNIVVDISSW-SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + S + V P E AL+ +A VGP++++I+AS +Q Y SG+++D
Sbjct: 205 GTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLSIAIDAS--GWQSYQSGVFND 262
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVY 306
+C S +HA+LLVGY ++ W++KN W WG+ GY+ + R NN+CGIAN+A Y
Sbjct: 263 PSC-SQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMARNANNQCGIANHASY 321
Query: 307 ALI 309
L+
Sbjct: 322 PLV 324
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 171/321 (53%), Gaps = 16/321 (4%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ N+EW+ + +K YK D ++ + K+ + HK I HN + Y L+ N
Sbjct: 29 LVNQEWMTFKMEHKKVYKSDVEERF---RMKIFMDNKHK-IAKHNSNYEMKKVSYKLKMN 84
Query: 61 HLSDLHPRHYIKEMTRLTHS-----RIRRTLV-RSPESNESVLIPDHLDWREKGFITPDW 114
D+ ++ + S R R V S +V++P +DWR++G +TP
Sbjct: 85 KYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEGAVTPVK 144
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C++FS A++GQ F+ T + LS Q ++DCS GN GC GG + Y+
Sbjct: 145 DQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 204
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GL E YPY+ + C++ N + +P DE LK +AT+GP++V+I
Sbjct: 205 KDNKGLDTEASYPYEAENDKCRYNPANSGAIDVGYIDIPTGDEKLLKAAVATIGPVSVAI 264
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM 289
+AS +FQ Y+ G+Y + C+S+ ++H +L++GY N W++KN W WG+NGY+
Sbjct: 265 DASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYI 324
Query: 290 YLKRGN-NRCGIANYAVYALI 309
+ R N CGIA+ A Y L+
Sbjct: 325 KMARNKLNHCGIASSASYPLV 345
>gi|27532972|ref|NP_083912.2| cathepsin Q precursor [Mus musculus]
gi|27960482|gb|AAO27845.1|AF456461_1 cathepsin Q [Mus musculus]
gi|16445011|gb|AAK00505.1| cathepsin Q precursor [Mus musculus]
gi|71050990|gb|AAH99415.1| Cathepsin Q [Mus musculus]
gi|148709365|gb|EDL41311.1| cathepsin Q, isoform CRA_a [Mus musculus]
Length = 343
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 171/324 (52%), Gaps = 28/324 (8%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KEW+ F +K Y + + ++ W+ N K+I HN+E G + YT+ N +
Sbjct: 30 KEWMGSF---EKLYSPE-----EEVLRRAIWEENVKRIKLHNRENSLGKNTYTMGLNGFA 81
Query: 64 DLHPRHYIKEM--TRLTHSRIRRTL----VRSPESNE---SVLIPDHLDWREKGFITPDW 114
D+ ++ + L R++L + SP +P +DWR +G++T
Sbjct: 82 DMTDEEFMNIVIGATLPVDNTRKSLWKRALGSPFPKSWYWKDALPKFVDWRNEGYVTRVR 141
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
NQ +C +C+AF + AI+GQ+FK T ++ LS+Q +VDCS GN GC G+ N YV
Sbjct: 142 NQRNCNSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLVDCSRPQGNRGCRWGNTYNGFQYV 201
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
GGL + YPY+GK+ +C++ N I+ + VL P+ E L +AT GPIA I
Sbjct: 202 LHNGGLEAQATYPYEGKEGLCRYNPKNSAAKITGFVVL-PESEDVLMDAVATKGPIATGI 260
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDN 286
+ +F+ Y G+Y + CTS VNHA+L++GY N W++KN W WG +
Sbjct: 261 HVVSSSFRFYDGGVYYEPNCTSS-VNHAVLIIGYGYVGNETDGNNYWLIKNSWGRRWGLS 319
Query: 287 GYMYL-KRGNNRCGIANYAVYALI 309
GYM + K NN C IA+ A Y +
Sbjct: 320 GYMMIAKDRNNHCAIASLAQYPTV 343
>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
Length = 344
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 173/312 (55%), Gaps = 18/312 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN---HLSDLHPRHYIK 72
+++ +Y+ + D+ + + + I HNQ+ + GL Y L N D+ ++K
Sbjct: 33 QHRLNYKSEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVK 92
Query: 73 EMTRLTHSR-------IRRTLVRSPE--SNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
M + ++ VR + S +V +P+ +DWR+ G +T +Q CG+C+
Sbjct: 93 TMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCW 152
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
+FS A++GQ F+ + + LS Q ++DCS GN GC GG + N Y++ GG+ E
Sbjct: 153 SFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTE 212
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
+ YPY+G C++ N + + +P DE L +ATVGP++V+I+AS FQL
Sbjct: 213 QAYPYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQL 272
Query: 244 YASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNR 297
Y+SG+Y++E C+S ++H +L+VGY + W++KN W WG+ GY+ + R NNR
Sbjct: 273 YSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNR 332
Query: 298 CGIANYAVYALI 309
CGIA+ A Y L+
Sbjct: 333 CGIASSASYPLV 344
>gi|344257450|gb|EGW13554.1| Testin-2 [Cricetulus griseus]
Length = 401
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 160/285 (56%), Gaps = 12/285 (4%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W+ N K I HN E QG H +T+ N DL+ + K M RI+R + + +
Sbjct: 120 WEKNFKMIELHNWEYLQGKHDFTMAMNAFGDLNNTEFRKTMAGFRRQRIKRRRIF--QDH 177
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ +P ++WRE+G++TP +Q C + +AFS A++GQ+FK T ++ LS Q ++DC
Sbjct: 178 LFLSVPKQVNWREQGYVTPVKSQGHCASSWAFSATGALEGQMFKKTRKLNALSEQNLLDC 237
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
+ C+GG +++ YV+ GGL EE YPY+G C+++ N ++ + +P
Sbjct: 238 MEFNVTRSCSGGFMQSAFQYVRDNGGLATEESYPYQGHAMECRYQAKNSAANVKDFVQIP 297
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----- 268
+E AL +A VGPI+V+I+A +FQ Y SGIY + C + NHA+L+VGY
Sbjct: 298 GHEE-ALMKAVANVGPISVAIDARHSSFQFYESGIYYEPKCKRVHQNHAVLVVGYGFEGE 356
Query: 269 ---TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN W WG GYM + + NN CGIA +A Y ++
Sbjct: 357 ESDGNSYWLVKNSWGEEWGIKGYMKIAKDWNNHCGIATHATYPIV 401
>gi|147903593|ref|NP_001080822.1| cathepsin S precursor [Xenopus laevis]
gi|33417128|gb|AAH56059.1| Ctss-a protein [Xenopus laevis]
Length = 333
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 161/304 (52%), Gaps = 9/304 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + K+Y + D ++++ W+ N ++ HN E G+H Y L NHL+D+ +
Sbjct: 31 KNTHSKEYEDETEDLQRRITWEKNLDFVNMHNLEYSMGMHTYELGMNHLADMTSEEMKSK 90
Query: 74 MTRLT---HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+T L HS + + D +DWR+KG ++ NQ CG+C+AFS A
Sbjct: 91 LTGLILPPHSERKAKFSSQRNGTFGGKVRDSIDWRDKGCVSDVKNQGGCGSCWAFSAVGA 150
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ+ T ++ LS Q +VDC+ GN GC+GG + + YV G+ + YPY
Sbjct: 151 LEGQLMLKTGKLVSLSPQNLVDCASKYGNKGCSGGFMTSAFQYVIDNNGIDSDSYYPYHA 210
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C ++ ++ + P E LK L T+GPI+V+I+ + TF LY SG+Y
Sbjct: 211 MDEKCHYELAGKASSCVKYTEIVPGTEDNLKQALGTIGPISVAIDGTRPTFFLYKSGVYS 270
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D +C+ + VNH +L +GY ++ W+LKN W ++GD G++ + R N CG+A+Y
Sbjct: 271 DPSCSQE-VNHGVLAIGYGTLNGQDFWLLKNSWGTYYGDKGFVRIARNKGNLCGVASYTS 329
Query: 306 YALI 309
Y I
Sbjct: 330 YPEI 333
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 166/306 (54%), Gaps = 18/306 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT- 75
+ K Y +K ++ +++ W+ + + I HN E G H + L NH D+ + + M
Sbjct: 36 HGKSYEQK-EETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNG 94
Query: 76 ---RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ TH +++ + P E +P H+DWR++G++TP +Q CG+C+AFS A++
Sbjct: 95 YKYKQTHKKLQGSHFLEPNFLE---VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F+ T ++ LS Q +V+CS GN GC GG + YV+ GG+ E+ YPY G
Sbjct: 152 GQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTD 211
Query: 193 SI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + + + + +P E AL +A VGP++V+I+A +FQ Y SGIY +
Sbjct: 212 DTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFE 271
Query: 252 EACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIAN 302
C+S ++H +L+VGY + WI+KN WS G NGY+ + K +N CGIA
Sbjct: 272 AECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKLGQNGYILMAKDKDNHCGIAT 331
Query: 303 YAVYAL 308
A Y L
Sbjct: 332 AASYPL 337
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 104/311 (33%), Positives = 168/311 (54%), Gaps = 19/311 (6%)
Query: 17 YKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
+K ++RK DS ++ + N I HNQ G Y L N +D+ + +
Sbjct: 32 FKLEHRKNYADSTEETFRMKIFNENKHHIAKHNQRYATGEVSYKLALNKYADMLHHEFRE 91
Query: 73 EMTRLTHSRIRRTLVRSPE--------SNESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
M ++ + + L + E S E V +P +DWR KG +T +Q CG+C+A
Sbjct: 92 TMNGFNYT-LHKQLRSTDESFTGVTFISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSCWA 150
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
FS AI+GQ F+ + + LS Q +VDCS GN GC GG + N YV+ GG+ E+
Sbjct: 151 FSSTGAIEGQHFRKSGTLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYVKDNGGIDTEK 210
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
Y Y+G C F + +I ++ +P +E L +AT+GP++V+I+AS +FQ Y
Sbjct: 211 SYAYEGIDDSCHFDKNSIGATDRGFADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFY 270
Query: 245 ASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRC 298
+ G+YD+ C+++ ++H +L+VGY + W++KN W WGD G++ + R N+C
Sbjct: 271 SEGVYDEPNCSAENLDHGVLVVGYGTEKDGSDYWLVKNSWGTTWGDKGFIKMSRNKENQC 330
Query: 299 GIANYAVYALI 309
GIA+ + Y L+
Sbjct: 331 GIASASSYPLV 341
>gi|197205894|gb|ACH48000.1| silicatein A2 [Latrunculia oparinae]
Length = 329
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 166/312 (53%), Gaps = 14/312 (4%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW + QK Y+ D + + W SN K I HNQ + + G+TL NH +
Sbjct: 26 QEWNMWKGMHQKSYQNDLEEL----DRHTVWLSNKKYIEAHNQNSH--VFGFTLAMNHFA 79
Query: 64 DLHPRHYIKEMTRLTH-SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGAC 122
DL + + ++ +TH S + E N+ PD +DWR K +T +Q CGA
Sbjct: 80 DLTDQEWTEKF--VTHVSDTAGNYTKYYEPNQFKSYPDTVDWRTKDAVTKVKDQSQCGAS 137
Query: 123 YAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMK 182
YAFS A++G +T + LS Q ++DCS+ GN GC GG++ Y+ GL
Sbjct: 138 YAFSAVGALEGANALATGSLSVLSEQNIIDCSVPYGNHGCKGGNMLYAFKYIIANDGLDV 197
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
+ YP++GKQ C + + IS + E L +A VGP++V+I+ S + F+
Sbjct: 198 AKSYPFQGKQQSCVYDDQDTGGKISGMVRIKQGSESDLIGAVANVGPVSVAIDGSSNAFR 257
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NR 297
YASG+YD C+S +NHAM++ GY ++ W++KN W +WG +GY+ + RG N+
Sbjct: 258 FYASGVYDSSRCSSSKLNHAMVVTGYGTYGGKDYWLVKNSWGTNWGQSGYIMMARGKYNQ 317
Query: 298 CGIANYAVYALI 309
CGIA+ A Y +
Sbjct: 318 CGIASDACYPTL 329
>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 167/309 (54%), Gaps = 11/309 (3%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
F + +K+ Y ++K+ L + +N K+ HN+ Q+G Y + N+ +D
Sbjct: 64 FFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMGVNNFTDKTEYELR 123
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
K + RI + + S+E +PD +DWR G +TP NQ CG+C+AFS AI
Sbjct: 124 KLRGYRSACRIAKPKGSTFISSEHAKLPDRVDWRRNGAVTPVKNQGQCGSCWAFSSTGAI 183
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY--- 188
+GQ ++ T+ + LS QQ++DCS GN GC GG + YV+ G+ E YPY
Sbjct: 184 EGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDLAFQYVRDNEGIDSEISYPYISG 243
Query: 189 KGKQSI-CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
G +++ C F NI+ ++ + + DE AL + T+GP++V+INA +F +Y SG
Sbjct: 244 DGDENVRCLFNFTNIMAQVTGYINIHEGDERALMNAVTTIGPVSVAINAGLSSFSMYKSG 303
Query: 248 IYDDEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGI 300
IY D C S+ ++H +LLVGY + W++KN W WGD GY+ LK N C +
Sbjct: 304 IYSDPECASASEDLDHGVLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILKDSKNMCSV 363
Query: 301 ANYAVYALI 309
A+ A Y L+
Sbjct: 364 ASAASYPLV 372
>gi|328909405|gb|AEB61370.1| cathepsin S-like protein, partial [Equus caballus]
Length = 281
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 108/283 (38%), Positives = 152/283 (53%), Gaps = 10/283 (3%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRTLVRSPES 92
W+ N K + HN E G+H Y L NHL D+ M+ L S+ +R +
Sbjct: 2 WERNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTSLMSSLRVPSQWQRNVTYKSNP 61
Query: 93 NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
NE +PD LDWREKG +T Q CGAC+AFS A++ Q+ T + LS Q +VD
Sbjct: 62 NEK--LPDSLDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVD 119
Query: 153 CSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
CS N GC GG + Y+ G+ + YPYK C++ N S ++
Sbjct: 120 CSTEKYSNKGCNGGFMTAAFQYIIDNNGIDSDASYPYKAMDGKCRYDSKNRAATCSKYTE 179
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
LP E LK +A GP++V+I+AS +F LY SG+Y D +CT + VNH +L+VGY
Sbjct: 180 LPFGSEDDLKEAVANKGPVSVAIDASHPSFFLYKSGVYYDPSCTQN-VNHGVLVVGYGNL 238
Query: 269 -TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
++ W++KN W ++GD GY+ + R + N CGIANY Y I
Sbjct: 239 NGKDYWLVKNSWGINFGDKGYIRMARNSGNHCGIANYCSYPEI 281
>gi|195382039|ref|XP_002049740.1| GJ20585 [Drosophila virilis]
gi|194144537|gb|EDW60933.1| GJ20585 [Drosophila virilis]
Length = 333
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 164/306 (53%), Gaps = 17/306 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM- 74
+Y+K Y + + +KL + N K I HN G Y + N +D+ P+ + M
Sbjct: 33 EYEKRYESEDEELLRKLIFYDNKKAIDKHNIRYALGKEAYEMGVNQFTDMLPKEFGSLML 92
Query: 75 --TRLTHSRIRRTLVRS-PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
LT + ++ S PE+ E IP +DWR KG +T +Q CG+C+AFS +
Sbjct: 93 TSINLTDATSDIDIIYSAPENTE---IPSSIDWRVKGAVTSVKDQGKCGSCWAFSAVGTL 149
Query: 132 QGQIFKSTSEIEELSIQQVVDCS--IISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+GQ F T ++ LS Q ++DCS N GC GG L YV+ GG+ E YPY
Sbjct: 150 EGQQFLKTRQLMSLSTQNLLDCSSRYPYSNKGCNGGLPLQALMYVRDNGGIDIESSYPYD 209
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+Q C+F R N+ +S+ L DE L V A GPI+V I+A TF Y SG+Y
Sbjct: 210 SRQLSCRFDRHNVGASVSAIVRLKQDDESNLAVATAIKGPISVLIHAG-QTFMQYRSGVY 268
Query: 250 DDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
D +C Y NHA+L+VGY +S W++KN W WG++GY+ + R NN+C IA+Y
Sbjct: 269 KDNSCNK-YFNHAVLVVGYGHDSREGDYWLVKNSWGSKWGESGYIRMARNRNNQCRIASY 327
Query: 304 AVYALI 309
A++ L+
Sbjct: 328 AIFPLV 333
>gi|157128512|ref|XP_001661463.1| cathepsin l [Aedes aegypti]
gi|91992510|gb|ABE72971.1| cathepsin L [Aedes aegypti]
gi|108872552|gb|EAT36777.1| AAEL011167-PA [Aedes aegypti]
Length = 327
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 105/311 (33%), Positives = 165/311 (53%), Gaps = 14/311 (4%)
Query: 5 EWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSD 64
EW KKY + + +++ ++ N +I +HN+ + G + + N SD
Sbjct: 25 EWTSFKSRHGKKYNR-----TEEHRRRGNYAFNKARIDSHNKRHEHGGASFRMGVNKYSD 79
Query: 65 LHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
+ + + M +S + R ++ ++ +DWR KG +TP +Q CG+CYA
Sbjct: 80 MDADEFAQTMNGFKYSGVPSQAPR--QARQATTTVTSIDWRTKGAVTPVKDQGRCGSCYA 137
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
FS A++G F T ++ LS Q +VDC+ GN GC GGS+ + Y++ G+
Sbjct: 138 FSALGALEGATFTKTGKLVNLSEQNIVDCTSTYGNYGCNGGSMTSVFKYIKTNNGVDTGA 197
Query: 185 DYPYKGK-QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
YPYK + C F P V + VL P +E AL+ +A +GP++V+I+AS +FQ
Sbjct: 198 FYPYKAAVAATCGFN-PAYVGATDTGYVLLPANETALQTAVANIGPVSVAIDASNPSFQQ 256
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRC 298
Y SGIY + C+S +NH +L+VGY T N W +KN W WG+ GY+ + R NN C
Sbjct: 257 YKSGIYYEPLCSSSKLNHGVLVVGYGTENGTDYWQVKNSWGTTWGEKGYIKMARNKNNHC 316
Query: 299 GIANYAVYALI 309
GIA++A Y +
Sbjct: 317 GIASFASYPTV 327
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 165/307 (53%), Gaps = 13/307 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K Y+ + + + + N KI HNQ G + + N +D+ + + M
Sbjct: 34 EHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMN 93
Query: 76 RLT---HSRIRRTLVRSPE----SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
H ++R + S E V +P +DWR KG +T +Q CG+C+AFS
Sbjct: 94 GFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSST 153
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F+ T + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YPY
Sbjct: 154 GALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 213
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+G C F + I ++ +P DE L +AT+GP++V+I+AS +FQ Y++G+
Sbjct: 214 EGIDDSCHFNKGTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGV 273
Query: 249 YDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
YD+ C ++H +L+VGY ++ W++KN W WGD G++ + R +N+CGIA
Sbjct: 274 YDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGIAT 333
Query: 303 YAVYALI 309
+ Y L+
Sbjct: 334 ASSYPLV 340
>gi|119573900|gb|EAW53515.1| cathepsin K (pycnodysostosis), isoform CRA_a [Homo sapiens]
Length = 288
Score = 189 bits (481), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 141/235 (60%), Gaps = 10/235 (4%)
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L+HSR TL PE PD +D+R+KG++TP NQ CG+C+AFS A++GQ+
Sbjct: 56 LSHSRSNDTLY-IPEWEGRA--PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLK 112
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K T ++ LS Q +VDC +S N GC GG + N YVQ G+ E+ YPY G++ C
Sbjct: 113 KKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCM 170
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y DE+C S
Sbjct: 171 YNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS 230
Query: 257 DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
D +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 231 DNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 285
>gi|208972988|dbj|BAG74343.1| silicatein-M2 [Ephydatia fluviatilis]
Length = 326
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 163/303 (53%), Gaps = 21/303 (6%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-MTRL 77
K Y + + ++ W SN K + HN A GYTL N+L DL Y ++ +T
Sbjct: 33 KRYASELEELERHTIWLSNKKYVEEHNARADA--FGYTLAMNNLGDLSAEEYAEQYLTNA 90
Query: 78 THSRIRRTLVRSPESNESVLIP------DHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
S +R + E+ L P + +DWR KG +T Q CGA YAF+ A+
Sbjct: 91 RGSNEQRKM-------ETFLAPKGVQYAESIDWRTKGAVTSVKYQGQCGASYAFAATGAL 143
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+G + + LS Q ++DCS+ GN GC+GG YV GG+ E Y +KGK
Sbjct: 144 EGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTAFKYVIDNGGIDTESSYSFKGK 203
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
QS C++ + + E+ L +ATVGP+AV+I+A+ + F+ Y SG++D
Sbjct: 204 QSSCQYNNKTSGASATGVVSIAYGSENDLLAAVATVGPVAVAIDANTNAFRFYQSGVFDS 263
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+C+S +NHAML+ GY ++ W++KN WS +WGD+GY+ + R N+CGIA+ A+Y
Sbjct: 264 SSCSSTKLNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDSGYILMVRNKYNQCGIASDALY 323
Query: 307 ALI 309
++
Sbjct: 324 PML 326
>gi|354504284|ref|XP_003514207.1| PREDICTED: cathepsin R-like [Cricetulus griseus]
Length = 334
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 167/308 (54%), Gaps = 18/308 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y+K Y K+ + ++ + W+ N K I HN+E G +G+ + N D+ + K
Sbjct: 33 KETYEKSYSKENEELRRSV-WEKNMKMIKHHNEENSLGKNGFIMEINEFGDMTVEEFRKT 91
Query: 74 MTRL---THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
M + TH+ + +R E+ ++P ++WR+KG++T Q C +C+AF++ A
Sbjct: 92 MNYIPVRTHTEGKS--IRKREAGG--VLPKSVNWRKKGYVTSVKKQAYCNSCWAFAVNGA 147
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
I+GQ+FK T + LS+Q +VDCS GN GC G YV GG+ E YPY+G
Sbjct: 148 IEGQMFKKTGNLTRLSVQNLVDCSKPHGNNGCDWGDPYIAYEYVLHNGGVEAEATYPYEG 207
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K+ C++ +I+ + L P+ E +L +AT+GPI+ I+ + F Y GI+
Sbjct: 208 KEGPCRYNPKYSAANITGFVSL-PKSEESLMAAVATIGPISAGIDIASDFFMFYKKGIFY 266
Query: 251 DEACTSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIA 301
D C +D VNH +L+VGY N W++KN + WG GYM + K NN C IA
Sbjct: 267 DPKCHNDTVNHVVLVVGYGFEGNETDGNNYWLVKNSYGKKWGLRGYMKIAKDQNNHCAIA 326
Query: 302 NYAVYALI 309
+YA Y ++
Sbjct: 327 SYAHYPIV 334
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 106/304 (34%), Positives = 165/304 (54%), Gaps = 15/304 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y K ++ L W+ N + I HN + G H Y L N D+ + + M
Sbjct: 36 HKKSYLPKEEGWRRVL-WEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNG 94
Query: 77 LTHSR-IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ + I+ + +P + E+ P +DWREKG++TP +Q CG+C+AFS A++GQ
Sbjct: 95 YKNQKMIKGSTFLAPNNFEA---PKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQH 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI- 194
++ ++ LS Q +VDCS GN GC GG + YV+ GG+ E+ YPY K
Sbjct: 152 YRKAGKLISLSEQNLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQE 211
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C + + + + +P E L +A+VGP++V+++A +FQ Y SGIY D C
Sbjct: 212 CHYDPNYNSANDTGFVDVPSGSEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPEC 271
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S+ ++H +L+VGY + WI+KN WS WG+NGY+ + K +N CGIA A
Sbjct: 272 SSEDLDHGVLVVGYGFEGEDVDGKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAAS 331
Query: 306 YALI 309
Y L+
Sbjct: 332 YPLV 335
>gi|94733563|emb|CAK11015.1| novel protein similar to vertebrate cathepsin L (CTSL) [Danio
rerio]
Length = 334
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 171/310 (55%), Gaps = 19/310 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KK++ Y +++ D +K W++N +KI +N + GL + + N DL Y
Sbjct: 30 KKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEY--- 86
Query: 74 MTRLTHSRIRRTLVRSPE--------SNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
RL S+I+ T R + N L ++D+R KG++T +Q CG+C++F
Sbjct: 87 -KRLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSF 145
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S AI+GQ++K T + LS QQ+VDCS G GC+G + N +YV L +
Sbjct: 146 STTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYV-INNALESSDT 204
Query: 186 YPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C +++ + IS + +P +E AL +ATVGP++V+I+A +F Y
Sbjct: 205 YPYTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFY 264
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR-GNNRCG 299
+SGIY + C + +NHA+L+VGY WI+KN W WG+ GYM + R G N CG
Sbjct: 265 SSGIYKESNCNPNNLNHAVLVVGYGSEEGTDYWIIKNSWGTGWGEGGYMRMIRNGKNTCG 324
Query: 300 IANYAVYALI 309
IA+YA+Y +I
Sbjct: 325 IASYALYPII 334
>gi|354502593|ref|XP_003513368.1| PREDICTED: cathepsin L1-like isoform 2 [Cricetulus griseus]
Length = 330
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 99/290 (34%), Positives = 157/290 (54%), Gaps = 11/290 (3%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRT 85
+ +K+ W++N K I HN++ +G HG+ L N DL + + MT +
Sbjct: 45 EGQKRAVWENNRKMIELHNEDYTKGKHGFHLEMNAFGDLTNTEFRQLMTGFQSMGTTEMN 104
Query: 86 LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEEL 145
+ + P + +P +DWR+ G++TP +Q C +C+AFS +++GQ+F+ T ++ L
Sbjct: 105 VFQEPRLGD---VPKSVDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKTGKLVPL 161
Query: 146 SIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVD 205
S Q +VDCS N GC GG + Y++ GGL E YPY+ + C++ + +
Sbjct: 162 SEQNLVDCSRSQHNNGCHGGLFTSAFQYIKDNGGLDTSESYPYEAQDGPCRYDPKHSAAN 221
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
I+ + V+ P +E AL +ATVGPI++ I+ + Y SG Y D C + Y NH++LL
Sbjct: 222 ITGFVVV-PSNEEALMKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYNHYPNHSVLL 280
Query: 266 VGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
VGY S W++KN W WG +GY+ + K NN C IA A Y +
Sbjct: 281 VGYGEESDGQKYWLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYPTV 330
>gi|68399197|ref|XP_695425.1| PREDICTED: cathepsin L [Danio rerio]
Length = 349
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 171/310 (55%), Gaps = 19/310 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+KK++ Y +++ D +K W++N +KI +N + GL + + N DL Y
Sbjct: 45 KKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEY--- 101
Query: 74 MTRLTHSRIRRTLVRSPE--------SNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
RL S+I+ T R + N L ++D+R KG++T +Q CG+C++F
Sbjct: 102 -KRLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSF 160
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S AI+GQ++K T + LS QQ+VDCS G GC+G + N +YV L +
Sbjct: 161 STTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYV-INNALESSDT 219
Query: 186 YPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C +++ + IS + +P +E AL +ATVGP++V+I+A +F Y
Sbjct: 220 YPYTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFY 279
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR-GNNRCG 299
+SGIY + C + +NHA+L+VGY WI+KN W WG+ GYM + R G N CG
Sbjct: 280 SSGIYKESNCNPNNLNHAVLVVGYGSEEGTDYWIIKNSWGTGWGEGGYMRMIRNGKNTCG 339
Query: 300 IANYAVYALI 309
IA+YA+Y +I
Sbjct: 340 IASYALYPII 349
>gi|19698255|dbj|BAB86770.1| cathepsin L-like [Engraulis japonicus]
Length = 324
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 163/303 (53%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL-HPRHYIK 72
+ K+ K Y ++ +K W +NH+KI HNQ A QG+H Y N SD+ H
Sbjct: 26 KAKFGKSYPSLEKEAHRKGLWLANHQKIQAHNQLADQGVHSYRQGLNQFSDMDHEEFRQT 85
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+T++ + R P +V + +DWR G ++P NQ CG+C++FS A++
Sbjct: 86 VLTKMDPPKNNRG-ASEPFRALNVGLAASVDWRTSGCVSPIKNQGQCGSCWSFSATGALE 144
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
Q + LS QQ+VDCS GN GC GG Y+Q GG+ E YPY+ +
Sbjct: 145 SQTCLRRGYLPSLSEQQLVDCSGSYGNYGCNGGWPDQAFQYIQANGGIDSESYYPYQARV 204
Query: 193 SICKFKRPNIVVDISSW-SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + S + V P E AL+ +A VGP++++I+AS +Q Y SG+++D
Sbjct: 205 GTCHYNSAYSAATCSGYQDVTPVGSESALQYYVANVGPLSIAIDAS--GWQSYQSGVFND 262
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVY 306
+C S +HA+LLVGY ++ W++KN W WG+ GY+ + R NN+CGIAN+A Y
Sbjct: 263 PSC-SQTADHAVLLVGYGTYNGQDYWLVKNSWGTWWGEQGYIMMTRNANNQCGIANHASY 321
Query: 307 ALI 309
L+
Sbjct: 322 PLV 324
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 105/299 (35%), Positives = 165/299 (55%), Gaps = 17/299 (5%)
Query: 24 KATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIR 83
+ ++++K +++N KKI HN +QG +T+ N SD+ + + M +
Sbjct: 1 ETEENQRKEVFRNNIKKIQMHNYLHEQGKSPFTMGINQFSDMDEKEFSTIMNGFRMNN-- 58
Query: 84 RTLVR--------SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
RT VR SP V +P +DWR+KG++TP NQ CG+C+AFS A++GQ
Sbjct: 59 RTKVRDHLHSHYISPAI--PVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQH 116
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS Q +VDCS GN GC GG + Y++ G E YPY+ +C
Sbjct: 117 FRKTGKLVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMC 176
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+FKR + ++ LP +E +K +A VGP++V+I+AS +F Y G+Y ++ C+
Sbjct: 177 RFKRECVGATCRGYTDLPWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKECS 236
Query: 256 SDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++H +L+VGY W++KN W WGD GY+ + R +N CGIA+ A Y L+
Sbjct: 237 PYQLDHGVLVVGYGTEQGLDYWLVKNSWGTTWGDQGYIKMARNMHNHCGIASMACYPLV 295
>gi|91092022|ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
gi|270001246|gb|EEZ97693.1| cathepsin L precursor [Tribolium castaneum]
Length = 343
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 170/322 (52%), Gaps = 19/322 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW+ + Y K Y ++ ++ + N KI NQE +G + + N
Sbjct: 28 LLQEEWMAF----KLTYNKSYASPEEENFRREIFIENRHKIARFNQEYGRGQWSFVQQLN 83
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN-----ESVLIPDHLDWREKGFITPDWN 115
+ +D+ + + + + R + P+S+ +V+ PD++DWRE G +TP N
Sbjct: 84 NFADMLHHEFHRTLNGFNRTLSARVGI--PQSSTFIPSANVIFPDYVDWREVGAVTPVKN 141
Query: 116 QEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQ 175
Q C C+AFS A A++G F+ T + ELS Q ++DCS GN GC+GG + YV+
Sbjct: 142 QGSCAGCWAFSAAGALEGHNFRKTGRLVELSPQNLIDCSTNYGNDGCSGGLMNPAYEYVR 201
Query: 176 FAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSIN 235
G+ E+ YPY+ + C+F+ + + + + DE L+ +AT+GP++ +++
Sbjct: 202 TNPGIDTEDSYPYEARNGPCRFRPETVGAYCTGYVDIAEGDEQGLEAAIATLGPVSAAMD 261
Query: 236 ASPHTFQLYASGIYDDEACTS--DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGY 288
A +FQ Y+ GIY D C + D VNHA+L+VGY + W++KN + WG GY
Sbjct: 262 AGRQSFQFYSDGIYYDPQCGNRPDDVNHAVLVVGYGTEPNGQKYWLVKNSYGPQWGIGGY 321
Query: 289 MYL-KRGNNRCGIANYAVYALI 309
+ L K NN CGIA A Y L+
Sbjct: 322 VKLAKDANNHCGIAIQASYPLV 343
>gi|410990004|ref|XP_004001240.1| PREDICTED: cathepsin L1-like isoform 2 [Felis catus]
Length = 334
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/290 (35%), Positives = 160/290 (55%), Gaps = 13/290 (4%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ W N K I HN+E Q H +T+ N D+ + + M L + ++ V
Sbjct: 48 RRAVWGENMKMIVQHNREYNQKEHSFTMAMNGFGDMTNEEFRQVMNGLQIQKHKKGKVFQ 107
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
P IP ++WR+KG++TP NQ C + +AFS AI+GQ+F T ++ LS Q
Sbjct: 108 PPLFAE--IPSSVNWRKKGYVTPTKNQGQCASGWAFSATGAIEGQMFGKTHKLVSLSEQN 165
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ-SICKFKRPNIVVDISS 208
++DC+ GN GC+GGS+ N YV+ GGL EE YPY + CK+ + DI+
Sbjct: 166 LLDCAWSEGNGGCSGGSMGNAFQYVKDNGGLDSEESYPYHAQDLQSCKYNPESSAADITG 225
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ + P+ E+ L ++A VGP++ +I+AS TF+ Y GIY D C+ + ++H +L+VGY
Sbjct: 226 FLTI-PKTEYKLMRSVAIVGPVSAAIDASLDTFRFYDQGIYYDSNCSHEILHHGVLIVGY 284
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+ WI+KN W WG +GY+ + K +N CGIA+ A + +
Sbjct: 285 GFEGAELDNKKYWIVKNSWGEAWGIDGYILMAKDRDNHCGIASLASFPTV 334
>gi|261824891|pdb|3H6S|A Chain A, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824892|pdb|3H6S|B Chain B, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824893|pdb|3H6S|C Chain C, Strucure Of Clitocypin - Cathepsin V Complex
gi|261824894|pdb|3H6S|D Chain D, Strucure Of Clitocypin - Cathepsin V Complex
gi|310942696|pdb|3KFQ|A Chain A, Unreduced Cathepsin V In Complex With Stefin A
gi|310942697|pdb|3KFQ|B Chain B, Unreduced Cathepsin V In Complex With Stefin A
Length = 221
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 135/218 (61%), Gaps = 9/218 (4%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DWR+KG++TP NQ+ CG+ +AFS A++GQ+F+ T ++ LS Q +VDCS
Sbjct: 1 LPKSVDWRKKGYVTPVKNQKQCGSXWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 60
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC GG + YV+ GGL EE YPY ICK++ N V + ++V+ P E
Sbjct: 61 GNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVAQDTGFTVVAPGKE 120
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY------TRN 271
AL +ATVGPI+V+++A +FQ Y SGIY + C+S ++H +L+VGY + N
Sbjct: 121 KALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSDN 180
Query: 272 S--WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVY 306
S W++KN W WG NGY+ + K NN CGIA A Y
Sbjct: 181 SKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 218
>gi|301628908|ref|XP_002943589.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 307
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 108/297 (36%), Positives = 158/297 (53%), Gaps = 10/297 (3%)
Query: 21 YRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHS 80
Y + + ++ W+ K I HN E GLH Y + NHL D+ MT T S
Sbjct: 13 YNSQEEERARRTIWEETLKFISVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATMTGYTGS 72
Query: 81 RIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQED-CGACYAFSIASAIQGQIFKS 138
+ P+ L P +DWR + +TP +Q C +CYAFS A++ Q K
Sbjct: 73 GDSLANMSHVPKEILEALAPPSIDWRTQNCVTPVRDQGSFCRSCYAFSAVGALECQWKKK 132
Query: 139 TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK 198
T + S Q++VDCS GN GC GG + Y++ G +M+E YPY G++ +C+ K
Sbjct: 133 TVRLVTFSPQELVDCSDGEGNHGCNGGKIEKAFKYMKKYG-VMEESAYPYTGQKGLCRKK 191
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
+P + + + LP +E L T+ T+GP++VSINAS F + SG+Y + C +
Sbjct: 192 QPGNIGVVKAIHDLPSGNETLLMNTVGTIGPVSVSINASSEKFHQFKSGVYYNPDCLPNK 251
Query: 259 VNHAMLLVGYTRNS----WILKNWWSHHWGDNGY--MYLKRGNNRCGIANYAVYALI 309
VNHA+L+VGY + + W++KN W +G+NGY M RGNN CGIA VYA +
Sbjct: 252 VNHAVLVVGYGKENGMDYWLVKNSWGVQFGENGYIKMARNRGNN-CGIATRPVYATV 307
>gi|256561153|gb|ACU86976.1| silicatein-like protein [Aulosaccus sp. GV-2009]
Length = 352
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 161/305 (52%), Gaps = 13/305 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y + + L W SN K + HN A Q + GYTL NHL D+ + Y T
Sbjct: 50 KHTKSYESSLQELEHHLVWLSNKKYVDYHN--ANQHIFGYTLALNHLGDVTEKQYQDTYT 107
Query: 76 RLTH-SRIRRTLVR---SPESNESVLIPDHLDWREKGFITPDWNQ--EDCGACYAFSIAS 129
+ S I+R +R +P + + PD +DWR KG + NQ C +CYAFS
Sbjct: 108 CYSAASSIQRASIRVHETPPNFNASSYPDSMDWRTKGAVGSVKNQWYGQCDSCYAFSAVG 167
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++G + LS Q ++DCSI GN GC+GG+ N+ YV G+ K+ Y YK
Sbjct: 168 ALEGATALAKGYFVSLSEQNIIDCSIPYGNYGCSGGNHYNSFMYVIANDGIDKQSSYTYK 227
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+Q C + S + DE +L + ++GPIAV+++A + F+ Y+SG++
Sbjct: 228 GRQDSCSYSDNYRGSSQSGIVQIKSGDESSLLAAVYSMGPIAVAVDARSNAFKYYSSGVF 287
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYA 304
D CTS + H+MLL GY +N W+LKN W WG +GY+ + R G N+CGIA A
Sbjct: 288 DSSRCTSTNLAHSMLLTGYGAYQGKNYWLLKNSWGSQWGMSGYIMMTRNGYNQCGIATDA 347
Query: 305 VYALI 309
Y +
Sbjct: 348 SYPTL 352
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 111/315 (35%), Positives = 171/315 (54%), Gaps = 27/315 (8%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHP----RHYI 71
++ K Y ++ + ++ + N +I+ HN+ + GL Y + NHL DL R Y
Sbjct: 34 EHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYT 93
Query: 72 KEMTRLTHSR--------------IRRTLVRS-PESNESVLIPDHLDWREKGFITPDWNQ 116
M +L S ++ + + P + + V +P +DWR+KG +TP NQ
Sbjct: 94 VNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQ 153
Query: 117 EDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQF 176
+CG+C++FS A++ Q FK T+++ LS QQ+VDCS GN GC GG + Y++
Sbjct: 154 RNCGSCWSFSATGALEAQWFKKTNKLISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKE 213
Query: 177 AGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINA 236
GG+ E+ YPY K C +K N +S ++ P+ E+ L +++VGPI+++
Sbjct: 214 NGGIDTEQSYPYTAKDGRCAYKPGNKAATVSQ-VIMVPRGENQLAAKVSSVGPISIAAEV 272
Query: 237 SPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL- 291
S H FQ Y SG+YD+ C +NHAML VGY +N W++KN W WGD GY+ +
Sbjct: 273 S-HKFQFYHSGVYDEPQCGHS-LNHAMLAVGYGSMGGKNFWLVKNSWGTGWGDQGYIRMA 330
Query: 292 KRGNNRCGIANYAVY 306
K NN+CGIA A Y
Sbjct: 331 KDKNNQCGIALMASY 345
>gi|295321664|pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
gi|295321665|pdb|3H7D|E Chain E, The Crystal Structure Of The Cathepsin K Variant M5 In
Compl Chondroitin-4-Sulfate
Length = 215
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 132/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+REKG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S
Sbjct: 2 PDSVDYREKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSE 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G++ C + + +P +E
Sbjct: 60 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 119
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WI 274
ALK +A VGP++V+I+AS +FQ Y+ G+Y DE+C SD +NHA+L VGY + WI
Sbjct: 120 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGESKGNKHWI 179
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W +WG GY+ + R NN CGIAN A +
Sbjct: 180 IKNSWGENWGMGGYIKMARNKNNACGIANLASF 212
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 166/302 (54%), Gaps = 11/302 (3%)
Query: 17 YKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
+K Y +K D+K++L+ +Q N + I N++ + G + + N D+ +
Sbjct: 22 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 81
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M + + R ++ + E + +DWR K +TP +QE CG+C+AFS A++
Sbjct: 82 VMK--GYKKGSRGEPKAVFTAEGRPMARDVDWRTKALVTPVKDQEQCGSCWAFSATGALE 139
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F E+ LS QQ+VDCS GN GC GG + + +Y++ GG+ E YPY+ +
Sbjct: 140 GQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED 199
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+F +I + + E AL+ ++ VGPI+V+I+AS +FQ Y+SG+Y ++
Sbjct: 200 RSCRFDANSIGAICTGSVEIVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQ 259
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+ +++H +L VGY T++ W++KN W WGD GY+ + R +N CGIA+ Y
Sbjct: 260 NCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYP 319
Query: 308 LI 309
+
Sbjct: 320 TV 321
>gi|344257451|gb|EGW13555.1| Cathepsin L1 [Cricetulus griseus]
Length = 474
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 99/296 (33%), Positives = 158/296 (53%), Gaps = 11/296 (3%)
Query: 21 YRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHS 80
Y + + +++ W+ N K I HN+E QG + +T+ N D P + +T
Sbjct: 183 YNFQNEEVERRALWEENMKLIQLHNKEYHQGKNTFTMAMNAFGDQRPLELAQTLTAFQSQ 242
Query: 81 RIRRT-LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKST 139
+ T + + P + +P +DWR+ G++TP +Q C +C+AFS +++GQ+F+ T
Sbjct: 243 EAKETNIFQEPLLGD---VPKSVDWRKHGYVTPVKDQGSCVSCWAFSAVGSLEGQMFRKT 299
Query: 140 SEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKR 199
++ LS Q +VDCS N GC GG + Y++ GGL E YPY+ + C++
Sbjct: 300 GKLVPLSEQNLVDCSRSQHNNGCHGGLFTSAFQYIKDNGGLDTSESYPYEAQDGPCRYDP 359
Query: 200 PNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYV 259
+ +I+ + V+ P +E AL +ATVGPI++ I+ + Y SG Y D C + Y
Sbjct: 360 KHSAANITGFVVV-PSNEEALMKAVATVGPISIGISVRLRSLLFYKSGFYYDPDCYNHYP 418
Query: 260 NHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
NH++LLVGY S W++KN W WG +GY+ + K NN C IA A Y +
Sbjct: 419 NHSVLLVGYGEESDGQKYWLVKNSWGEEWGMDGYIKIAKDRNNHCSIATIAAYPTV 474
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 86/155 (55%), Gaps = 4/155 (2%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRT 85
+ +K+ W++N K I HN++ +G HG+ L N DL + + MT +
Sbjct: 10 EGQKRAVWENNRKMIELHNEDYTKGKHGFHLEMNAFGDLTNTEFRQLMTGFQSMGTTEMN 69
Query: 86 LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEEL 145
+ + P + +P +DWR+ G++TP +Q CGAC+AFS ++ GQ+F T ++ L
Sbjct: 70 VFQEPRLGD---VPKSVDWRKHGYVTPVKDQGSCGACWAFSAVGSLVGQMFWKTGKLVPL 126
Query: 146 SIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
S Q +VDCS GN+GC GG ++N YV GGL
Sbjct: 127 SEQNLVDCSWSHGNIGCHGGLMQNAFQYVMDNGGL 161
>gi|307141900|gb|ADN34745.1| putative cysteine peptidase [Echinococcus granulosus]
Length = 218
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 89/215 (41%), Positives = 129/215 (60%), Gaps = 7/215 (3%)
Query: 96 VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI 155
+L+PD +DWR+KG +TP +Q DCG+C+AFS A++GQ+ + ++ LS QQ+VDCS
Sbjct: 5 MLVPDSIDWRKKGLVTPIKDQGDCGSCWAFSATGALEGQLKRKKGKLISLSEQQLVDCST 64
Query: 156 ISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
GN GC GG + + Y G E DYPY CKF +V +S + +P +
Sbjct: 65 DMGNEGCNGGYMNDAFRY-WMQNGAESESDYPYTAMDGKCKFNSSKVVTKVSKFVKVPKK 123
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS--- 272
E LK+++A VGP++V+I+A+ F LY GIY D C+ Y++HA+L+VGY +
Sbjct: 124 REDQLKLSVAQVGPVSVAIDAASSGFMLYKKGIYQDNTCSQQYLDHAVLVVGYDADMAGQ 183
Query: 273 --WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
WI+KN W WG GY+++ R N CGIA A
Sbjct: 184 KYWIVKNSWGEDWGQRGYIWMARDKGNMCGIATMA 218
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 161/309 (52%), Gaps = 31/309 (10%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y + ++L ++ NH+ I HN + + + L NH DL + Y
Sbjct: 85 KAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKE---FDFYLGMNHFGDLTNKEY--- 138
Query: 74 MTRLTHSRIRRTLVRSPESNESVL------------IPDHLDWREKGFITPDWNQEDCGA 121
R R R PE+ S +PD +DWR++GF+TP NQ CG+
Sbjct: 139 -------RERYLGYRRPENTPSKASYIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGS 191
Query: 122 CYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
C+AFS +++GQ FKST ++ LS Q +VDCS GN GC GG + YV+ G+
Sbjct: 192 CWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGID 251
Query: 182 KEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
E+ YPY G C FK +I + + + DE AL+ + GP++V+I+AS F
Sbjct: 252 TEDSYPYVGTDGSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASSMLF 311
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-N 295
Q Y G+Y+ C++ ++H +L+VGY ++ W++KN W WG GY+ + R
Sbjct: 312 QFYRGGVYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYIEMSRNKG 371
Query: 296 NRCGIANYA 304
N+CGIA+ A
Sbjct: 372 NQCGIASKA 380
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 172/314 (54%), Gaps = 21/314 (6%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
F + ++K+ Y ++++ + +N K+ HN Q+G Y + N +D + +
Sbjct: 62 FFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTD-KTDYEL 120
Query: 72 KEMT--RLTHSRIR---RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFS 126
K++ ++T IR T +RS E +P +DWR +G +T NQ CG+C+AFS
Sbjct: 121 KKLRGYKVTSGAIRHKGSTFIRS----EHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFS 176
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
AI+GQ ++ T+ + LS QQ+VDCS GN GC+GG + + YV+ G+ E Y
Sbjct: 177 TTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISY 236
Query: 187 PYKGKQSI----CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
PY C F NI+ ++ + + DE AL +AT GP++V+INA +F
Sbjct: 237 PYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFS 296
Query: 243 LYASGIYDDEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN- 295
+Y SGIY D C T D ++H +L+VGY R+ W++KN W WG+ GY+ + +G+
Sbjct: 297 MYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKGSH 356
Query: 296 NRCGIANYAVYALI 309
N CG+A+ A Y L+
Sbjct: 357 NMCGVASAASYPLV 370
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 169/321 (52%), Gaps = 16/321 (4%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ N+EW+ + +K YK D ++ + K+ + HK I HN + Y L+ N
Sbjct: 23 LVNQEWMTFKMEHKKAYKSDVEERF---RMKIFMDNKHK-IAKHNSNYEMKKVSYKLKMN 78
Query: 61 HLSDLHPRHYIKEMTRLTHS------RIRRTLVRSPESNESVLIPDHLDWREKGFITPDW 114
D+ ++ + S R + S +V +P +DWR++G +TP
Sbjct: 79 KYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASFIEPANVALPKKVDWRKEGAVTPVK 138
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C++FS A++GQ F+ T + LS Q ++DCS GN GC GG + Y+
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GL E YPY+ + C++ N + +P +E LK +AT+GP++V+I
Sbjct: 199 KDNKGLDTEASYPYEAENDKCRYNPANSGAIDVGYIDIPTGNEKLLKAAVATIGPVSVAI 258
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM 289
+AS +FQ Y+ G+Y + C+S+ ++H +L++GY N W++KN W WG+NGY+
Sbjct: 259 DASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNNGYI 318
Query: 290 YLKRGN-NRCGIANYAVYALI 309
+ R N CGIA+ A Y L+
Sbjct: 319 KMARNKLNHCGIASSASYPLV 339
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 167/321 (52%), Gaps = 16/321 (4%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ N+EW + K YK D ++ + K+ + HK I HN + Y L+ N
Sbjct: 23 LVNQEWTTFKMEHNKVYKNDVEERF---RMKIFMDNKHK-IAKHNGNYEMKKVSYKLKMN 78
Query: 61 HLSDLHPRHYIKEMTRLTHS------RIRRTLVRSPESNESVLIPDHLDWREKGFITPDW 114
D+ ++ + S R + S +V++P +DWRE G +TP
Sbjct: 79 KYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAASFIEPANVVLPKTVDWREHGAVTPVK 138
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C++FS A++GQ F+ T + LS Q ++DCS GN GC GG + Y+
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GL E YPY+ + C++ N + +P +E LK +AT+GP++V+I
Sbjct: 199 KDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAI 258
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYM 289
+AS +FQ Y+ G+Y + C+S+ ++H +L VGY ++ W++KN W WGDNGY+
Sbjct: 259 DASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYI 318
Query: 290 YLKRGN-NRCGIANYAVYALI 309
+ R N CGIA+ A Y L+
Sbjct: 319 KMARNKLNHCGIASTASYPLV 339
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 161/312 (51%), Gaps = 17/312 (5%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
+ +K ++RK D ++ + N KI HNQ G + + N +D+ +
Sbjct: 28 QTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADMLHHEF 87
Query: 71 IKEMTRLTHSRIRRTLVRSPE-------SNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
M ++ ++ P S E V IP +DWR KG +T +Q CG+C+
Sbjct: 88 HTTMNGFNYTLHKQLRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSCW 147
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS A++GQ F+ + LS Q +VDCS GN GC GG + N Y++ GG+ E
Sbjct: 148 AFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTE 207
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
+ YPY+G C F + I +P DE + +AT+GP++V+I+AS +FQ
Sbjct: 208 KSYPYEGIDDSCHFNKATIGATDRGSVDIPQGDEKKMAEAVATIGPVSVAIDASHESFQF 267
Query: 244 YASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKR-GNNR 297
Y+ GIY++ C ++H +L+VGY + W++KN W WGD G++ + R +N+
Sbjct: 268 YSEGIYNEPQCDPQNLDHGVLVVGYGTDESGQDYWLVKNSWGTTWGDKGFIKMARNADNQ 327
Query: 298 CGIANYAVYALI 309
CGIA+ + Y L+
Sbjct: 328 CGIASASSYPLV 339
>gi|75765285|pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abe854
gi|75765286|pdb|1U9W|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abi491
gi|75765287|pdb|1U9X|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor Nvp-Abj688
gi|160286063|pdb|2R6N|A Chain A, Crystal Structure Of A Pyrrolopyrimidine Inhibitor In
Complex With Human Cathepsin K
Length = 217
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 132/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S
Sbjct: 4 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSE 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G++ C + + +P +E
Sbjct: 62 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 121
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
ALK +A VGP++V+I+AS +FQ Y+ G+Y DE+C SD +NHA+L VGY WI
Sbjct: 122 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWI 181
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 182 IKNSWGENWGNKGYILMARNKNNACGIANLASF 214
>gi|2914594|pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent
Vinyl Sulfone Inhibitor
gi|28374044|pdb|1NL6|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374045|pdb|1NL6|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374047|pdb|1NLJ|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|28374048|pdb|1NLJ|B Chain B, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Azepanone Inhibitor
gi|47168617|pdb|1Q6K|A Chain A, Cathepsin K Complexed With T-butyl(1s)-1-cyclohexyl-2-
Oxoethylcarbamate
gi|55670045|pdb|1TU6|A Chain A, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|55670046|pdb|1TU6|B Chain B, Cathepsin K Complexed With A Ketoamide Inhibitor
gi|62738654|pdb|1YK7|A Chain A, Cathepsin K Complexed With A Cyanopyrrolidine Inhibitor
gi|73535690|pdb|1YK8|A Chain A, Cathepsin K Complexed With A Cyanamide-Based Inhibitor
gi|73535721|pdb|1YT7|A Chain A, Cathepsin K Complexed With A Constrained Ketoamide
Inhibitor
gi|93278849|pdb|2BDL|A Chain A, Cathepsin K Complexed With A Pyrrolidine Ketoamide-Based
Inhibitor
gi|114793438|pdb|2ATO|A Chain A, Crystal Structure Of Human Cathepsin K In Complex With
Myocrisin
gi|114793448|pdb|2AUX|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|114793451|pdb|2AUZ|A Chain A, Cathepsin K Complexed With A Semicarbazone Inhibitor
gi|126030469|pdb|2FTD|A Chain A, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|126030470|pdb|2FTD|B Chain B, Crystal Structure Of Cathepsin K Complexed With 7-Methyl-
Substituted Azepan-3-One Compound
gi|157830076|pdb|1ATK|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With The Covalent Inhibitor E-64
gi|157830085|pdb|1AU0|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Symmetric Diacylaminomethyl
Ketone Inhibitor
gi|157830086|pdb|1AU2|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Propanone Inhibitor
gi|157830087|pdb|1AU3|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830088|pdb|1AU4|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin
K In Complex With A Covalent Pyrrolidinone Inhibitor
gi|157830146|pdb|1AYU|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Symmetric Biscarbohydrazide
Inhibitor
gi|157830147|pdb|1AYV|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Thiazolhydrazide Inhibitor
gi|157830148|pdb|1AYW|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent
Benzyloxybenzoylcarbohydrazide Inhibitor
gi|157830300|pdb|1BGO|A Chain A, Crystal Structure Of Cysteine Protease Human Cathepsin K
In Complex With A Covalent Peptidomimetic Inhibitor
gi|197305045|pdb|3C9E|A Chain A, Crystal Structure Of The Cathepsin K : Chondroitin Sulfate
Complex.
gi|290560385|pdb|3KW9|A Chain A, X-Ray Structure Of Cathepsin K Covalently Bound To A
Triazine Ligand
gi|290560386|pdb|3KWZ|A Chain A, Cathepsin K In Complex With A Non-Selective 2-Cyano-
Pyrimidine Inhibitor
gi|290560387|pdb|3KX1|A Chain A, Cathepsin K In Complex With A Selective 2-Cyano-Pyrimidine
Inhibitor
gi|293651910|pdb|3KWB|X Chain X, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|293651911|pdb|3KWB|Y Chain Y, Structure Of Catk Covalently Bound To A Dioxo-Triazine
Inhibitor
gi|308198615|pdb|3O1G|A Chain A, Cathepsin K Covalently Bound To A 2-Cyano Pyrimidine
Inhibitor With A Benzyl P3 Group.
gi|327200584|pdb|3O0U|A Chain A, Cathepsin K Covalently Bound To A Cyano-Pyrimidine
Inhibitor With Improved Selectivity Over Herg
gi|394986262|pdb|4DMX|A Chain A, Cathepsin K Inhibitor
gi|394986263|pdb|4DMY|A Chain A, Cathepsin K Inhibitor
gi|394986264|pdb|4DMY|B Chain B, Cathepsin K Inhibitor
Length = 215
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 132/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S
Sbjct: 2 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSE 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G++ C + + +P +E
Sbjct: 60 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 119
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
ALK +A VGP++V+I+AS +FQ Y+ G+Y DE+C SD +NHA+L VGY WI
Sbjct: 120 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWI 179
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 180 IKNSWGENWGNKGYILMARNKNNACGIANLASF 212
>gi|50513589|pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized
Norleucine Aldehyde
Length = 214
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 132/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S
Sbjct: 1 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSE 58
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G++ C + + +P +E
Sbjct: 59 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 118
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
ALK +A VGP++V+I+AS +FQ Y+ G+Y DE+C SD +NHA+L VGY WI
Sbjct: 119 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWI 178
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 179 IKNSWGENWGNKGYILMARNKNNACGIANLASF 211
>gi|228244|prf||1801240B Cys protease 2
Length = 323
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 161/301 (53%), Gaps = 6/301 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY + Y DS +++ ++ N K I N++ + G + L N D+ +
Sbjct: 24 KGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAV 83
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M R V P+ E+ +DWR KG +TP +Q CG+C+AFS +++G
Sbjct: 84 MKGNIPRRSAPVSVFYPK-KETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEG 142
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q F T + L+ QQ+VDCS G GC GG + + +Y++ G+ E YPY+ +
Sbjct: 143 QHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEASYPYEARDG 202
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+F ++ S + + E L+ + +GPI+V+I+A+ +FQ Y+SG+Y + +
Sbjct: 203 SCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPS 262
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C+ Y++HA+L VGY ++ W++KN W+ WGD GY+ + R NN CGIA A Y L
Sbjct: 263 CSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPL 322
Query: 309 I 309
+
Sbjct: 323 V 323
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 172/321 (53%), Gaps = 21/321 (6%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+EW + +K Y+ D T+ + +L + N KI HNQ +G + L N
Sbjct: 27 EEWHTFKLEHRKNYQDD-----TEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKY 81
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDW 114
+DL + + M ++ + + L + +S + V +P +DWR KG +T
Sbjct: 82 ADLLHHEFRQLMNGFNYT-LHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 140
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C+AFS A++GQ F+ + + LS Q +VDCS GN GC GG + N Y+
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GG+ E+ YPY+ C F + I ++ +P DE + +ATVGP+AV+I
Sbjct: 201 KDNGGIDTEKSYPYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAI 260
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM 289
+AS +FQ Y+ G+Y++ C + ++H +L+VGY + W++KN W WGD G++
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFI 320
Query: 290 -YLKRGNNRCGIANYAVYALI 309
L+ +N+CGIA+ + Y L+
Sbjct: 321 KMLRNKDNQCGIASASSYPLV 341
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 171/321 (53%), Gaps = 21/321 (6%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+EW + +K Y+ D T+ + +L + N KI HNQ +G + L N
Sbjct: 27 EEWHTFKLEHRKNYQDD-----TEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKY 81
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDW 114
+DL + + M ++ + + L + ES + V +P +DWR KG +T
Sbjct: 82 ADLLHHEFRQLMNGFNYT-LHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 140
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C+AFS A++GQ F+ + + LS Q +VDCS GN GC GG + N Y+
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GG+ E+ YPY+ C F + I ++ +P DE + +ATVGP+AV+I
Sbjct: 201 KDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAI 260
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM 289
+AS +FQ Y+ G+Y++ C + ++H +L+VG+ + W++KN W WGD G++
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 320
Query: 290 -YLKRGNNRCGIANYAVYALI 309
L+ N+CGIA+ + Y L+
Sbjct: 321 KMLRNKENQCGIASASSYPLV 341
>gi|205689966|sp|A6NFJ7.3|CATL6_HUMAN RecName: Full=Putative cathepsin L-like protein 6
Length = 277
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 106/281 (37%), Positives = 153/281 (54%), Gaps = 17/281 (6%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--I 98
I HNQE QG H +T+ N D+ + + M + + R+ + E +L I
Sbjct: 2 IEQHNQEYSQGKHSFTMAMNAFGDMTNEEFRQVMNGFQYQKHRK----GKQFQERLLPEI 57
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP +Q CG+C+AFS A++GQ+F T ++ L+ Q +VDCS G
Sbjct: 58 PTSVDWREKGYMTPVKDQGQCGSCWAFSATGALEGQMFWKTGKLISLNEQNLVDCSGPQG 117
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ GGL E YPY+GK C++ + + + +P Q E
Sbjct: 118 NEGCNGGFMDNPFRYVQENGGLDSEASYPYEGKVKTCRYNPKYSAANDTGFVDIPSQ-EK 176
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
L +ATVGPI+V+ AS +FQ Y GIY + C + ++HAMLLVGY+
Sbjct: 177 DLAKAVATVGPISVAAGASHVSFQFYKKGIYFEPRCDPEGLDHAMLLVGYSYEGADSDNN 236
Query: 273 --WILKN-WWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN +WG +GY+ + K N CGIA A Y +
Sbjct: 237 KYWLVKNSSEGKNWGMDGYIKMAKDRRNNCGIATAASYPTV 277
>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 323
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 161/301 (53%), Gaps = 6/301 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY + Y DS +++ ++ N K I N++ + G + L N D+ +
Sbjct: 24 KGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAV 83
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M R V P+ E+ +DWR KG +TP +Q CG+C+AFS +++G
Sbjct: 84 MKGNIPRRSAPVSVFYPK-KETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEG 142
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q F T + L+ QQ+VDCS G GC GG + + +Y++ G+ E YPY+ +
Sbjct: 143 QHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDG 202
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+F ++ S + + E L+ + +GPI+V+I+A+ +FQ Y+SG+Y + +
Sbjct: 203 SCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPS 262
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C+ Y++HA+L VGY ++ W++KN W+ WGD GY+ + R NN CGIA A Y L
Sbjct: 263 CSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPL 322
Query: 309 I 309
+
Sbjct: 323 V 323
>gi|45384464|ref|NP_990302.1| cathepsin K precursor [Gallus gallus]
gi|25089842|sp|Q90686.1|CATK_CHICK RecName: Full=Cathepsin K; AltName: Full=JTAP-1; Flags: Precursor
gi|1017831|gb|AAC59739.1| JTAP-1 [Gallus gallus]
Length = 334
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 98/271 (36%), Positives = 146/271 (53%), Gaps = 20/271 (7%)
Query: 48 AQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPD------- 100
A+ G H + L N+L D+ ++ MT L R R P N ++ +PD
Sbjct: 69 ARLGKHSFQLAMNYLGDMTSEEVVRTMTGLRVPRSR------PRPNGTLYVPDWSSRAPA 122
Query: 101 HLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNL 160
+DWR KG++TP +Q CG+C+AFS A++GQ+ + T ++ LS Q +V C +S N
Sbjct: 123 AVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYC--VSNNN 180
Query: 161 GCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHAL 220
GC GG + N YV+ G+ E+ YPY G+ C + + +P +E AL
Sbjct: 181 GCGGGYMTNAFEYVRLNRGIDSEDAYPYIGQDESCMYSPTGKAAKCRGYREIPEDNEKAL 240
Query: 221 KVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILK 276
K +A +GP++V I+AS +FQ Y+ G+Y D C + +NHA+L VGY WI+K
Sbjct: 241 KRAVARIGPVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYGAQKGTKHWIIK 300
Query: 277 NWWSHHWGDNGYMYLKRGNNR-CGIANYAVY 306
N W WG+ GY+ L R + CGIAN A +
Sbjct: 301 NSWGTEWGNKGYVLLARNMKQTCGIANLASF 331
>gi|340370384|ref|XP_003383726.1| PREDICTED: silicatein-like [Amphimedon queenslandica]
Length = 337
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 168/323 (52%), Gaps = 27/323 (8%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW++ ++K+ K Y +SK+ W N I HNQ+A +HG+TL+ NH
Sbjct: 25 EEWLLW----KEKHGKVYPHGEEESKRLNIWLENKNYIEEHNQKAH--VHGFTLKMNHFG 78
Query: 64 DLHPRHYIK------------EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFIT 111
DL Y + ++ L H+ + R + N +P+ +DWR K +T
Sbjct: 79 DLTIEEYRQSRYATCYHSPDDDILSLPHNATVKMYSRPDDPN----LPEEVDWRTKNAVT 134
Query: 112 PDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTL 171
+Q CG+CYAFS A++G + ++ LS Q +VDCSI GN GC GG++ +
Sbjct: 135 GVKDQGQCGSCYAFSAVGALEGAQALAHDKLVHLSEQNIVDCSIPYGNKGCNGGNMYESF 194
Query: 172 NYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIA 231
Y+ G+ +E+ Y Y G+Q CKF R I +P E L+ LAT GP++
Sbjct: 195 RYIIDNDGIDREDGYKYTGRQGQCKFDRKAIGGRQVGIIHIPTGSEAELQSALATAGPVS 254
Query: 232 VSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNG 287
V+I+ S + F+ Y G++D+ C++ + HA L++GY + W++KN W HWG G
Sbjct: 255 VAIDGSSNAFRFYEKGVFDEPNCSTTKLTHAGLIIGYGKKKGKPYWLVKNSWGPHWGMKG 314
Query: 288 YMYLKRGN-NRCGIANYAVYALI 309
Y+ + R N+CGIA A + +
Sbjct: 315 YIMMARNKANQCGIATAASFPTL 337
>gi|410921048|ref|XP_003973995.1| PREDICTED: digestive cysteine proteinase 2-like [Takifugu rubripes]
Length = 290
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 104/294 (35%), Positives = 161/294 (54%), Gaps = 18/294 (6%)
Query: 29 KKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI--------KEMTRLTHS 80
++K+ W+ N ++I +NQ G +T+ N DL + + ++ + +
Sbjct: 2 QRKVIWEGNKQQIEDNNQGFLMGSKHFTMAMNKYGDLTAQEFQVLQGAMIDAQLAKRGKT 61
Query: 81 RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
R L S + + ++I D+R+ G++T +Q CG+C+AFS AI+GQIFK T
Sbjct: 62 VASRKLRNSAKKFDGMVI----DYRQMGYVTEVKDQGYCGSCWAFSTTGAIEGQIFKKTG 117
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
++ LS Q +VDCS G GC+G + N +YV GL YPY C +
Sbjct: 118 QLMSLSEQNLVDCSKSYGTYGCSGAWMANAYDYV-VNNGLESTITYPYTSDTQPCYYDSR 176
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
V I + +P DE AL +AT+GPI V+I+AS +F Y+SGIY++ C + ++
Sbjct: 177 LAVAHIKDYRFIPKGDEQALADAVATIGPITVAIDASHSSFLFYSSGIYEESNCNPNNLS 236
Query: 261 HAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
HA+LLVGY ++ W++KN W WG+ GYM L R G N CGIA+YA+Y ++
Sbjct: 237 HAVLLVGYGSEGGQDYWLIKNSWGPSWGEGGYMRLIRDGKNPCGIASYALYPIL 290
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 159/304 (52%), Gaps = 12/304 (3%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
+ +K ++ KK +D ++L WQ N K I HN + + G+TL N DL H
Sbjct: 23 EDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDK--FGFTLGMNKFGDLES-HE 79
Query: 71 IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
EM + R + ++ + +DWR KG +T NQ CG+C+AFS +
Sbjct: 80 FAEMFNGYMMQARSNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFSTTGS 139
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ F T ++ LS Q +VDCS GN GC GG + Y++ GG+ E YPY+
Sbjct: 140 LEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEASYPYQA 199
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C+FK ++ + + + +DE+AL + +GP++V+I+AS +FQLY SG+Y
Sbjct: 200 HDERCRFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQLYRSGVYY 259
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
+ C+ ++H +L +GY W++KN W WG GY+ + R NN CGIA A
Sbjct: 260 ERECSQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEGYIMMSRNRNNNCGIATEAS 319
Query: 306 YALI 309
Y +
Sbjct: 320 YPTV 323
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 174/314 (55%), Gaps = 21/314 (6%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
F + ++K+ Y ++++ + +N K+ HN Q+G Y + N +D + +
Sbjct: 62 FFKIQFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTD-KTDYEL 120
Query: 72 KEMT--RLTHSRIR---RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFS 126
K++ ++T IR T +RS E +P +DWR +G +T NQ CG+C+AFS
Sbjct: 121 KKLRGYKVTSGAIRHKGSTFIRS----EHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFS 176
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
AI+GQ ++ T+ + LS QQ+VDCS GN GC+GG + + YV+ G+ E Y
Sbjct: 177 TTGAIEGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISY 236
Query: 187 PYKG----KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
PY + + C F NI+ ++ + + DE AL +AT GP++V+INA +F
Sbjct: 237 PYVSGDGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFS 296
Query: 243 LYASGIYDDEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN- 295
+Y SGIY D C T D ++H +L+VGY R+ W++KN W WG+ GY+ + +G+
Sbjct: 297 MYKSGIYSDTDCEGTLDALDHGVLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKGSH 356
Query: 296 NRCGIANYAVYALI 309
N CG+A+ A Y L+
Sbjct: 357 NMCGVASAASYPLV 370
>gi|359385046|emb|CBY80148.1| silicatein red variant [Tethya aurantium]
Length = 329
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 109/307 (35%), Positives = 162/307 (52%), Gaps = 17/307 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
+K++ K Y + +K L W SN K I HN A G+TL NHL D+ Y +
Sbjct: 29 KKQHGKSYNTNLEELEKHLVWLSNKKYIELHN--ANAATFGFTLAMNHLGDMTDHEYREN 86
Query: 73 ----EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ T + R P + P+ +DWR KG +T NQ DCGA YAFS
Sbjct: 87 YLTYQSTNSKSGNYTKVFRREPW----MTFPEAIDWRTKGAVTGIKNQGDCGASYAFSAM 142
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G +T ++ LS Q ++DCS+ GN GC GG++ YV G+ E+ Y +
Sbjct: 143 GALEGINALATGKLTPLSEQNIIDCSVPYGNHGCKGGNMYVAFLYVVANEGVDTEDKYQF 202
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+GKQS C ++ +S + E L+ +A VGP++V+I+A+ + F+ Y SG+
Sbjct: 203 RGKQSSCNYQVQYCGASMSGSVQIKSGSESDLEAAVANVGPVSVAIDAASNAFRFYYSGV 262
Query: 249 YDDEACTSDY--VNHAMLLVGY-TRNS--WILKNWWSHHWGDNGYMYLKRGN-NRCGIAN 302
YD C+S Y +NHAM++ GY NS W+ KN W +WG+ GY+ + R N+CGIA
Sbjct: 263 YDSSRCSSGYDSLNHAMVITGYGISNSEYWLAKNSWGENWGEQGYVRMARNKYNQCGIAT 322
Query: 303 YAVYALI 309
A Y +
Sbjct: 323 DASYPTL 329
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 104/287 (36%), Positives = 161/287 (56%), Gaps = 11/287 (3%)
Query: 31 KLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSP 90
K+H + N KI HN EA G H Y ++ NH DL ++ + + + +L S
Sbjct: 49 KIHME-NSLKISRHNAEAINGKHSYYMKMNHYGDLLHHEFVAMVNGYEYVN-KTSLGGSF 106
Query: 91 ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
+++V +P H+DWRE G +TP NQ CG+C+AFS +++GQ F+ T ++ LS Q +
Sbjct: 107 IPSKNVKLPTHVDWREDGAVTPVKNQGQCGSCWAFSSTGSLEGQTFRKTGKLIPLSEQNL 166
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK-RPNIVVDISSW 209
VDCS GN GC GG + Y++ G+ E YPY+G C + DI
Sbjct: 167 VDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSYPYEGVGGRCHYDPSKKGSSDIGFV 226
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYT 269
V +E LK +A+VGP++V+I+AS +FQ Y+ G+Y + C+ + ++H +L+VGY
Sbjct: 227 DVKKGSEEELLK-AVASVGPVSVAIDASHMSFQFYSHGVYFESKCSPENLDHGVLVVGYG 285
Query: 270 RNS------WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN WS +WGD GY+ + R N CGIA+ A Y ++
Sbjct: 286 TDENSGEDYWLVKNSWSENWGDQGYIKMARNKKNMCGIASSASYPVV 332
>gi|27960490|gb|AAO27848.1|AF456464_1 cathepsin 3 [Mus musculus]
Length = 316
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 167/305 (54%), Gaps = 18/305 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + KEM
Sbjct: 19 KYGKTYSLE-EEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKEMI 77
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+A S+ AI+GQ+
Sbjct: 78 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPVRTQIACNSCWAISVTGAIEGQM 135
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDC + G+ GC GS+ + L Y+ GGL E YPY+ KQ C
Sbjct: 136 FRKTGKLIPLSVQNLVDC--VDGS-GCHAGSVLDALKYLMEKGGLESEATYPYEDKQGSC 192
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + + P +E L +A++GPI+V I+A +F Y GIY + C
Sbjct: 193 RYNPENSTASITGFEFI-PNNEVDLMNAVASLGPISVIIDAWHESFLFYKRGIYYEPNCN 251
Query: 256 SDY--VNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
+ + HA+LLVGY R WI+KN WG+ GYM + K N CGIA+
Sbjct: 252 NSLFGLRHAVLLVGYGFIGRESEGRKYWIIKNSLGTKWGNKGYMKIAKDQGNHCGIASLP 311
Query: 305 VYALI 309
+Y +
Sbjct: 312 LYPRV 316
>gi|197258086|gb|ACH56227.1| cathepsin S-like cysteine proteinase [Radopholus similis]
Length = 314
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 106/296 (35%), Positives = 164/296 (55%), Gaps = 11/296 (3%)
Query: 23 KKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRI 82
+++ DS++ + + HN ++G + L ENHL+ L R Y + + R
Sbjct: 21 QQSLDSERLGALTRSRALVTEHNAAFERGEVSFRLAENHLAHLTEREYKQRLGLRATERP 80
Query: 83 RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
+V+ + +PD +DWR+KG +T +Q CG+C+AFS ++ G K+T ++
Sbjct: 81 SNKVVKLSAIVGNQSVPDAVDWRKKGLVTEVKDQGQCGSCWAFSTTGSLGGAHAKATGKL 140
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPN 201
LS Q +VDCS S N G + +Y++ GG+ E YPY+G +Q CK+ + N
Sbjct: 141 VSLSEQNLVDCS--SENSVHEHGLMDVAFDYIEENGGIDTERSYPYRGYEQYRCKYSKRN 198
Query: 202 IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN- 260
+ ++S+ LP DE LK+ +AT GPI+V+I+AS +FQLY SG+Y D+ C + N
Sbjct: 199 VGATMASYVDLPSGDEQELKIAVATQGPISVAIDASSDSFQLYESGVYKDKQCGNRRSNL 258
Query: 261 -HAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
H +LLVGY + WI+KN WS WG+ GY+ + R N N CGIA A Y +
Sbjct: 259 DHGVLLVGYGTDPKHGDYWIVKNSWSAAWGEKGYIRMARNNRNMCGIATMASYPQV 314
>gi|12837902|dbj|BAB23995.1| unnamed protein product [Mus musculus]
Length = 332
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 166/305 (54%), Gaps = 18/305 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + KEM
Sbjct: 35 KYGKTYSLE-EEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKEMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+A S+ AI+GQ+
Sbjct: 94 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPARTQIACNSCWAISVTGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDC + G+ GC GS+ ++ Y+ GGL E YPY+ KQ C
Sbjct: 152 FRKTGQLIPLSVQNLVDC--VDGS-GCHAGSVLDSFKYLMEKGGLESEATYPYEDKQGSC 208
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + + P +E L +A++GPI+V I+A +F Y GIY + C
Sbjct: 209 RYNPENSTASITGFEFI-PNNEVDLMSAVASLGPISVVIDAWHESFLFYKRGIYYEPNCN 267
Query: 256 SDY--VNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
+ + HA+LLVGY R WI+KN WG GYM + K N CGIA+
Sbjct: 268 NSLFALRHAVLLVGYGFIGRESEGRKYWIIKNSLGTKWGYKGYMKIAKDQGNHCGIASLP 327
Query: 305 VYALI 309
V+ +
Sbjct: 328 VFPRV 332
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 162/306 (52%), Gaps = 14/306 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ K Y +S +K + +N+K I HN G +T+ N +D+ + + M
Sbjct: 47 EHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKSFTVGVNEFADMTVHEFAQMMN 106
Query: 76 RL--THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
L +R+ + SP N +P +DWR KG ++ NQ CG+C+AFS +++G
Sbjct: 107 GLKPDSTRVSGSTYLSP--NIDAPLPVEVDWRTKGLVSEVKNQGSCGSCWAFSTTGSLEG 164
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q + T + +LS Q +VDCS GN GC GG + N Y++ G+ EE YPY G+
Sbjct: 165 QHMRKTGTMVDLSEQNLVDCSTSYGNDGCNGGLMTNAFKYIKDNKGIDTEEAYPYAGRDG 224
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CKFK+ + ++ + +P +E L+ LATVGP++V+I+A+ +F LY SG+YD+
Sbjct: 225 DCKFKKNKVGATVTGFVEIPAGNEKKLQEALATVGPVSVAIDANHQSFMLYKSGVYDEPE 284
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG------NNRCGIANY 303
C S ++H +L VGY ++ +I+KN W WG+ GY+ CGI
Sbjct: 285 CDSAQLDHGVLAVGYGSIHGKDYYIVKNSWGTTWGEQGYIRFSTTAVPDAIGGICGILLD 344
Query: 304 AVYALI 309
A Y +I
Sbjct: 345 ASYPVI 350
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 166/321 (51%), Gaps = 16/321 (4%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ N+EW + K YK D ++ + K+ + HK I HN + Y L+ N
Sbjct: 23 LVNQEWTTFKMEHNKVYKNDIEERF---RMKIFMDNKHK-IAKHNGNYEMKKVSYKLKMN 78
Query: 61 HLSDLHPRHYIKEMTRLTHS------RIRRTLVRSPESNESVLIPDHLDWREKGFITPDW 114
D+ ++ + S R + S +V++P +DWRE G +TP
Sbjct: 79 KYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGASFIEPANVVLPKTVDWREHGAVTPVK 138
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C++FS A++GQ F+ T + LS Q ++DCS GN GC GG + Y+
Sbjct: 139 DQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYI 198
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GL E YPY+ + C++ N + +P +E LK +AT+GP++V+I
Sbjct: 199 KDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAI 258
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM 289
+AS +FQ Y+ G+Y + C+S+ ++H +L VGY + W++KN W WGDNGY+
Sbjct: 259 DASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYI 318
Query: 290 YLKRGN-NRCGIANYAVYALI 309
+ R N CGIA+ A Y L+
Sbjct: 319 KMARNKLNHCGIASTASYPLV 339
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 166/322 (51%), Gaps = 21/322 (6%)
Query: 1 MTNKEWIIIFIFPQKKYK----KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYT 56
+ + EW I + K YK + YR K + N +KI HN++ + Y
Sbjct: 24 ILDAEWFIFKLHHNKVYKSPVEEGYRMKI--------YMDNKRKIAEHNRKYELNEVTYK 75
Query: 57 LRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPE--SNESVLIPDHLDWREKGFITPDW 114
L N D+ ++ + S S +V +PD +DW ++G +T
Sbjct: 76 LGMNKYGDMLHHEFVNTLNGFNKSVTAGIETEGVTFISPANVKLPDEVDWTKQGAVTAVK 135
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C+AFS A++GQ F+ST + LS Q ++DCS GN GC GG + Y+
Sbjct: 136 DQGHCGSCWAFSSTGALEGQHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFQYI 195
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GL E+ YPY+ + C++ N + +P DE LK +AT+GPI+V+I
Sbjct: 196 KDNKGLDTEKTYPYEAENDRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPISVAI 255
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------WILKNWWSHHWGDNGY 288
+AS +FQLY+ G+Y D C+++ ++H +L+VGY + W++KN W WG GY
Sbjct: 256 DASHESFQLYSEGVYYDPDCSAENLDHGVLIVGYGTDETSGHDYWLVKNSWGKTWGQKGY 315
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R NN CGIA+ A Y L+
Sbjct: 316 IKMARNKNNHCGIASSASYPLV 337
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/298 (34%), Positives = 163/298 (54%), Gaps = 14/298 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y + ++K+ + N + I HN GL Y N +DL + +T
Sbjct: 31 KHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQFTDLTIDEFKAYLT 90
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
HS+ TL P + +P LDWR +G++T +Q DCG+C+AFS+ + +G
Sbjct: 91 L--HSK--PTLNTVPYVRTGLQVPTTLDWRSQGYVTGVKDQGDCGSCWAFSVVGSTEGAY 146
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
+KST ++ LS QQ++DC+ + N GC GG L T YVQ GL+ E YPY G+ C
Sbjct: 147 YKSTGKLVSLSEQQLIDCT-TNVNDGCDGGYLEETFPYVQQT-GLVSESSYPYTGRDGNC 204
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+ ++V +S + +L E L + +VGP++V+++A+ YASG+Y+ C+
Sbjct: 205 RISESDVVTKVSKYVLLG--GEADLLEAVGSVGPVSVAMDAT--YIYSYASGVYESSLCS 260
Query: 256 SDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+NH +L+VGY ++ W++KN W + WG+ GY+ L RG N CGIA VY +I
Sbjct: 261 LYSLNHGVLVVGYGTQDGKDYWLIKNSWGNTWGEQGYLKLLRGTNECGIAEDDVYPII 318
>gi|19424144|ref|NP_081182.2| cathepsin 3 precursor [Mus musculus]
gi|339715188|ref|NP_473433.2| cathepsin 3 precursor [Mus musculus]
gi|15418824|gb|AAK58450.1| cathepsin-3 precursor [Mus musculus]
gi|68534882|gb|AAH99388.1| Cts3 protein [Mus musculus]
gi|148669361|gb|EDL01308.1| mCG114648, isoform CRA_a [Mus musculus]
Length = 332
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 166/305 (54%), Gaps = 18/305 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + KEM
Sbjct: 35 KYGKTYSLE-EEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKEMI 93
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+A S+ AI+GQ+
Sbjct: 94 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPVRTQIACNSCWAISVTGAIEGQM 151
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDC + G+ GC GS+ ++ Y+ GGL E YPY+ KQ C
Sbjct: 152 FRKTGQLIPLSVQNLVDC--VDGS-GCHAGSVLDSFKYLMEKGGLESEATYPYEDKQGSC 208
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + + P +E L +A++GPI+V I+A +F Y GIY + C
Sbjct: 209 RYNPENSTASITGFEFI-PNNEVDLMSAVASLGPISVVIDAWHESFLFYKRGIYYEPNCN 267
Query: 256 SDY--VNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
+ + HA+LLVGY R WI+KN WG GYM + K N CGIA+
Sbjct: 268 NSLFALRHAVLLVGYGFIGRESEGRKYWIIKNSLGTKWGYKGYMKIAKDQGNHCGIASLP 327
Query: 305 VYALI 309
V+ +
Sbjct: 328 VFPRV 332
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 168/323 (52%), Gaps = 25/323 (7%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+EW + +K Y+ D T+ + +L + N KI HNQ +G + L N
Sbjct: 27 EEWHTFKLEHRKNYQDD-----TEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKY 81
Query: 63 SDLHPRHYIKEMT----------RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITP 112
+DL + + M R T + SP V +P +DWR KG +T
Sbjct: 82 ADLLHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAH---VTLPKSVDWRSKGAVTA 138
Query: 113 DWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLN 172
+Q CG+C+AFS A++GQ F+ + + LS Q +VDCS GN GC GG + N
Sbjct: 139 VKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFR 198
Query: 173 YVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
Y++ GG+ E+ YPY+ C F + I ++ +P DE + +ATVGP++V
Sbjct: 199 YIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSV 258
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNG 287
+I+AS +FQ Y+ G+Y++ C + ++H +L+VG+ + W++KN W WGD G
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKG 318
Query: 288 YM-YLKRGNNRCGIANYAVYALI 309
++ L+ +N+CGIA+ + Y L+
Sbjct: 319 FIKMLRNKDNQCGIASASSYPLV 341
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 167/308 (54%), Gaps = 15/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y+ + + + + N KI HNQ +G + L N +DL + + M
Sbjct: 35 EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQLMN 94
Query: 76 RLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + + L + ES + V +P +DWR KG +T +Q CG+C+AFS
Sbjct: 95 GFNYT-LHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSS 153
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ F+ + + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 154 TGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 213
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ C F + + ++ +P DE + +ATVGP++V+I+AS +FQ Y+ G
Sbjct: 214 YEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEG 273
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM-YLKRGNNRCGIA 301
+Y++ C + ++H +L+VG+ + W++KN W WGD G++ L+ N+CGIA
Sbjct: 274 VYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIA 333
Query: 302 NYAVYALI 309
+ + Y L+
Sbjct: 334 SASSYPLV 341
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 103/321 (32%), Positives = 171/321 (53%), Gaps = 21/321 (6%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+EW + +K Y+ D T+ + +L + N KI HNQ +G + L N
Sbjct: 27 EEWHTFKLEHRKNYQDD-----TEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKY 81
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDW 114
+DL + + M ++ + + L + ES + V +P +DWR KG +T
Sbjct: 82 ADLLHHEFRQLMNGFNYT-LHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 140
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
+Q CG+C+AFS A++GQ F+ + + LS Q +VDCS GN GC GG + N Y+
Sbjct: 141 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 200
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
+ GG+ E+ YPY+ C F + I ++ +P DE + +ATVGP++V+I
Sbjct: 201 KDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM 289
+AS +FQ Y+ G+Y++ C + ++H +L+VG+ + W++KN W WGD G++
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFI 320
Query: 290 -YLKRGNNRCGIANYAVYALI 309
L+ N+CGIA+ + Y L+
Sbjct: 321 KMLRNKENQCGIASASSYPLV 341
>gi|148669362|gb|EDL01309.1| mCG114648, isoform CRA_b [Mus musculus]
Length = 333
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 166/305 (54%), Gaps = 18/305 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y + + +K+ W+ N KKI HN E G HG+T+ N D+ + KEM
Sbjct: 36 KYGKTYSLE-EEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKEMI 94
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+ +++ +S + SV +P ++W+++G++TP Q C +C+A S+ AI+GQ+
Sbjct: 95 EIPVPTVKKG--KSVQKRLSVNLPKFINWKKRGYVTPVRTQIACNSCWAISVTGAIEGQM 152
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F+ T ++ LS+Q +VDC + G+ GC GS+ ++ Y+ GGL E YPY+ KQ C
Sbjct: 153 FRKTGQLIPLSVQNLVDC--VDGS-GCHAGSVLDSFKYLMEKGGLESEATYPYEDKQGSC 209
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N I+ + + P +E L +A++GPI+V I+A +F Y GIY + C
Sbjct: 210 RYNPENSTASITGFEFI-PNNEVDLMSAVASLGPISVVIDAWHESFLFYKRGIYYEPNCN 268
Query: 256 SDY--VNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYA 304
+ + HA+LLVGY R WI+KN WG GYM + K N CGIA+
Sbjct: 269 NSLFALRHAVLLVGYGFIGRESEGRKYWIIKNSLGTKWGYKGYMKIAKDQGNHCGIASLP 328
Query: 305 VYALI 309
V+ +
Sbjct: 329 VFPRV 333
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 167/308 (54%), Gaps = 15/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y+ + + + + N KI HNQ +G + L N +DL + + M
Sbjct: 69 EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQLMN 128
Query: 76 RLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + + L + ES + V +P +DWR KG +T +Q CG+C+AFS
Sbjct: 129 GFNYT-LHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSS 187
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ F+ + + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 188 TGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 247
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ C F + + ++ +P DE + +ATVGP++V+I+AS +FQ Y+ G
Sbjct: 248 YEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEG 307
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM-YLKRGNNRCGIA 301
+Y++ C + ++H +L+VG+ + W++KN W WGD G++ L+ N+CGIA
Sbjct: 308 VYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIA 367
Query: 302 NYAVYALI 309
+ + Y L+
Sbjct: 368 SASSYPLV 375
>gi|93279455|pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor
Length = 215
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 130/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S
Sbjct: 2 PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSE 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G+ C + + +P +E
Sbjct: 60 NDGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEK 119
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
ALK +A VGP++V+I+AS +FQ Y+ G+Y DE C+SD +NHA+L VGY WI
Sbjct: 120 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNLNHAVLAVGYGIQKGNKHWI 179
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W WG+ GY+ + R NN CGIAN A +
Sbjct: 180 IKNSWGESWGNKGYILMARNKNNACGIANLASF 212
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 167/308 (54%), Gaps = 15/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y+ + + + + N KI HNQ +G + L N +DL + + M
Sbjct: 65 EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQLMN 124
Query: 76 RLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + + L + ES + V +P +DWR KG +T +Q CG+C+AFS
Sbjct: 125 GFNYT-LHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSS 183
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ F+ + + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 184 TGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 243
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ C F + + ++ +P DE + +ATVGP++V+I+AS +FQ Y+ G
Sbjct: 244 YEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEG 303
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM-YLKRGNNRCGIA 301
+Y++ C + ++H +L+VG+ + W++KN W WGD G++ L+ N+CGIA
Sbjct: 304 VYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIA 363
Query: 302 NYAVYALI 309
+ + Y L+
Sbjct: 364 SASSYPLV 371
>gi|405966500|gb|EKC31778.1| Cathepsin L [Crassostrea gigas]
Length = 271
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 142/239 (59%), Gaps = 7/239 (2%)
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+++ +R + L SP SN L PD +DWR++G++T NQ CG+C++FS +++GQ
Sbjct: 35 KMSANRTKGDLYMSP-SNIGDL-PDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQH 92
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
FK++ ++ LS Q +VDCS GN GC GG + N Y++ G+ EE YPY K C
Sbjct: 93 FKASKKLVSLSEQNLVDCSQREGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKNGFC 152
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
FK+ N+ + + +P E L+ +ATVGPI+V+I+A +FQLY G+Y + AC+
Sbjct: 153 HFKKENVGATDTGYVDIPHMQEDKLQEAVATVGPISVAIDAGHKSFQLYREGVYSEPACS 212
Query: 256 SDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
S ++H +L VGY S W++KN W WG GY+ + R +N CGIA A Y +
Sbjct: 213 SSKLDHGVLAVGYGTESGDDYWLVKNSWGTSWGMQGYVMMARNKHNMCGIATQASYPKV 271
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 158/289 (54%), Gaps = 26/289 (8%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y K Y + K+ +++N IHTHNQ+ + Y+L+ NH DL + ++
Sbjct: 126 YGKSYATEEETQKRYAIFKNNLAYIHTHNQQG----YSYSLKMNHFGDLSREEFRRKYLG 181
Query: 77 LTHSR--------IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
SR + L++ S+ +P +DWREKG +TP +Q DCG+C+AFS
Sbjct: 182 YNKSRNLKSNNLGVATELLKVSPSD----VPSAVDWREKGCVTPVKDQRDCGSCWAFSAT 237
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G T E+ LS Q++VDCS+ GN GC+GG + + YV +GGL EE YPY
Sbjct: 238 GALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPY 297
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ CK + VV IS + +P + E A+K LA P++++I A FQ Y G+
Sbjct: 298 LARDGECK-RACKKVVTISGFKDVPRKSETAMKAALAH-SPVSIAIEADQLPFQFYHEGV 355
Query: 249 YDDEACTSDYVNHAMLLVGY------TRNSWILKNWWSHHWGDNGYMYL 291
+ D +C +D ++H +LLVGY ++ WI+KN W WG +GYMY+
Sbjct: 356 F-DASCGTD-LDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYM 402
>gi|405966497|gb|EKC31775.1| Cathepsin L1 [Crassostrea gigas]
Length = 305
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 160/284 (56%), Gaps = 18/284 (6%)
Query: 3 NKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL 62
+ EW + +++Y+K Y +++++ W++N I+ HN E ++G H YTL N
Sbjct: 25 DTEWALY----KQEYRKQYLTADEETERRDIWEANLDYINQHNDEFKRGEHSYTLGLNEF 80
Query: 63 SDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI-----PDHLDWREKGFITPDWNQE 117
+DL ++ L IR S + + +++ P +DWR++G++ P NQ
Sbjct: 81 ADLSHEEFL----HLYGGGIRPRDSGSSDPDTDIVVDTSGLPSEVDWRKEGWVGPVGNQF 136
Query: 118 DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFA 177
CG+C+AF+ A++GQ+ T ++ LS+QQ++DCS GN GC GG + Y+
Sbjct: 137 ACGSCWAFTATGALEGQVRNKTGKLIVLSVQQMMDCSEKWGNHGCEGGLMDAAFKYIHDV 196
Query: 178 GGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GG+ YPYK + CKF +V + + L P+ E +L V +ATVGPI+ +++AS
Sbjct: 197 GGIESNASYPYKPAEEKCKFNESAVVAKVKGYKDL-PKSEESLMVAVATVGPISAALDAS 255
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKN 277
+FQLY SG+YDD C+S V+H++++VGY + WI KN
Sbjct: 256 HSSFQLYKSGVYDDPNCSSGQVDHSLVVVGYGLMDGKKYWIAKN 299
>gi|261289781|ref|XP_002611752.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
gi|229297124|gb|EEN67762.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
Length = 327
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 170/315 (53%), Gaps = 11/315 (3%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ N EW + +K Y + Y + + ++L ++ N K I HN+EA +GLH + L N
Sbjct: 18 LMNPEWEVF----KKAYNRVYAAEE-EYARRLIFEDNLKTIQMHNEEADRGLHTFRLGVN 72
Query: 61 HLSDLHPRHYIKEMTR--LTHSRIRRTLVRSPESNESVL--IPDHLDWREKGFITPDWNQ 116
+D+ + +++ + L + ++ + L +PD +DWR+KG++TP NQ
Sbjct: 73 QYADMTHKEFLENVIGGCLLDTNTSKSTADHVHEYDPTLTDVPDTVDWRDKGYVTPVKNQ 132
Query: 117 EDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQF 176
CG+C+AFS +++GQ FK+T+++ LS Q ++DCS GN GC GG + Y++
Sbjct: 133 AQCGSCWAFSTTGSLEGQHFKATNKLVSLSEQNLMDCSRKEGNQGCQGGLMDQAFKYIKT 192
Query: 177 AGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINA 236
GG+ EE YPYK K C ++ +SS++ + +DE AL+ +ATVGPI+V+I+A
Sbjct: 193 NGGIDTEECYPYKAKNEQCNYQASCSGATLSSYTDVKSKDEDALQQAVATVGPISVAIDA 252
Query: 237 SPHTFQLYASGIYDD-EACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRG- 294
+FQLY SG S + ++ Y + + W WG+ GY+ + R
Sbjct: 253 GHSSFQLYHSGKPPSLRRAYSSEIKRSVTKCTYCAQLLLREGRWGESWGEKGYIKMSRNR 312
Query: 295 NNRCGIANYAVYALI 309
+N CGIA A Y +
Sbjct: 313 HNNCGIATQASYPTV 327
>gi|146386731|pdb|1VSN|A Chain A, Crystal Structure Of A Potent Small Molecule Inhibitor
Bound To Cathepsin K
Length = 215
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 91/213 (42%), Positives = 132/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K+T + L+ Q +VDC +S
Sbjct: 2 PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKATGALLNLAPQNLVDC--VSE 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G+ C + + +P +E
Sbjct: 60 NDGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEA 119
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WI 274
ALK +A VGP++V+I+AS +FQ Y++G+Y DE C+SD +NHA+L VGY + WI
Sbjct: 120 ALKRAVAAVGPVSVAIDASLTSFQFYSAGVYYDENCSSDALNHAVLAVGYGIQAGNKHWI 179
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W WG+ GY+ + R NN CGIAN A +
Sbjct: 180 IKNSWGESWGNAGYILMARNKNNACGIANLASF 212
>gi|349604730|gb|AEQ00199.1| Cathepsin K-like protein, partial [Equus caballus]
Length = 219
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 130/213 (61%), Gaps = 7/213 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S
Sbjct: 6 PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSE 63
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G+ C + + +P +E
Sbjct: 64 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPQGNEK 123
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
ALK +A VGP++V+I+AS +FQ Y+ G+Y DE C SD +NHA+L VGY WI
Sbjct: 124 ALKRAVARVGPVSVAIDASLTSFQFYSRGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWI 183
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 184 IKNSWGENWGNKGYILMARNKNNACGIANMASF 216
>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
Length = 333
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 156/297 (52%), Gaps = 6/297 (2%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A ++ + ++ N +I HN G + + N +D+H +++
Sbjct: 37 KTYANAAEEAYRAKVFKENAIRIAKHNDRFASGEVTFKVGYNQYADMHTHEVTEKLNGYR 96
Query: 79 HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS 138
+ + SN+S +DWR KG +TP +Q CG+C++FS +++GQ+F
Sbjct: 97 SGLKQASAFVHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLFLK 156
Query: 139 TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK 198
+ LS Q +VDCS GN GC GG + + YV+ GG+ EE YPY + C +K
Sbjct: 157 NKNLVSLSEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSYGGIDTEESYPYTAEDGTCLYK 216
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
N + + + + E AL+ + VGP++V+I+AS +FQ+Y SGIY + AC+SD
Sbjct: 217 AANNAGVNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPACSSDS 276
Query: 259 VNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++H +L VGY + WI+KN W WG+ GY+ + R N CGIA A Y L+
Sbjct: 277 LDHGVLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 333
>gi|225579644|gb|ACN93991.1| cathepsin L [Dicentrarchus labrax]
Length = 316
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/276 (36%), Positives = 149/276 (53%), Gaps = 15/276 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR---TL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M R+ +L
Sbjct: 43 RRMVWEKNLKKIELHNLEHSMGKHTYRLGMNHFGDMTHEEFRQLMNGYKLKAARKFSGSL 102
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWR+ G++TP +Q CG+C+AFS A++GQ F+ + ++ LS
Sbjct: 103 FMEPNFLEA---PRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKSGKLVSLS 159
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI-VVD 205
Q +VDCS GN GC GG + YV+ GL E+ YPY G PN +
Sbjct: 160 EQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSAN 219
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P EHAL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 220 DTGFVDIPSGKEHALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLV 279
Query: 266 VGY--------TRNSWILKNWWSHHWGDNGYMYLKR 293
VGY + WI+ N WS WGD GY+YL +
Sbjct: 280 VGYGFEGQDVDGKKYWIVNNSWSEKWGDEGYIYLAK 315
>gi|33242884|gb|AAQ01146.1| cathepsin [Petromyzon marinus]
Length = 333
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 165/303 (54%), Gaps = 13/303 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY----IK 72
Y K Y + D+ ++ ++ N K++ HN A +G + L N SDL Y +
Sbjct: 34 YGKHYGSEQEDAHRRDVFEQNLKRVLQHNLLADEGNVSFHLGINKYSDLELHEYHEKVVG 93
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
L + RR S ++ +P+ +DWR KG++TP Q CG+ +AFS +++
Sbjct: 94 RFWNLRNGTRRRGAPFPLRSMDN--LPEQVDWRLKGYVTPVKEQGLCGSSWAFSATGSLE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F +T + LS QQ+VDC+ N GC GG L Y+ G+ E YPY+
Sbjct: 152 GQHFAATGNLTSLSEQQLVDCTKSYYNNGCNGGRSERALQYIIDNNGIDSELSYPYEHAD 211
Query: 193 SICKFKRPNIVVDISSWS-VLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C+FK N+ SS+ V P +E L+ +A+VGPIA+++NA TF+ Y SG++++
Sbjct: 212 GKCRFKPANVATKCSSYQFVEPSSNEEVLRQAVASVGPIAIAMNADLDTFKHYKSGLFNE 271
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+C NHAML+VGY S WI+KN W WG+ GY+Y+ R +N+CGIA+ +Y
Sbjct: 272 PSCDKS-PNHAMLVVGYGSLSGNDFWIVKNSWGEDWGEKGYIYMIRNKDNQCGIASIGIY 330
Query: 307 ALI 309
+I
Sbjct: 331 PII 333
>gi|208972990|dbj|BAG74344.1| silicatein-M3 [Ephydatia fluviatilis]
Length = 326
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 158/301 (52%), Gaps = 7/301 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +++K Y + + ++ W SN K I HN + + GYTL NH DL +
Sbjct: 28 KGQHQKSYMSELEELERHAIWLSNKKYIEEHNARSNE--FGYTLAINHFGDLTTEEHNAM 85
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+S ++ E V D +DWR KG +T Q CGA YAF+ A++G
Sbjct: 86 YLTYGTGNYTHHGQKSFQAPEGVQYADSIDWRTKGAVTSVKYQGQCGASYAFAATGALEG 145
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+ + LS Q ++DCS+ GN GC+GG + YV GG+ E Y ++GKQS
Sbjct: 146 ASALANDKQVILSEQNIIDCSVPYGNHGCSGGDTYTAMKYVIDNGGIDTESSYSFQGKQS 205
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ N + + E L +ATVGP+AV+++A+ + F+ Y SG++D +
Sbjct: 206 SCQYSSKNSGASATGVISITSGSETDLLAAVATVGPVAVAVDANTNAFRFYQSGVFDSSS 265
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYAL 308
C++ NHAML+ GY ++ W++KN WS +WGDNGY+ + R N+C IA A+Y
Sbjct: 266 CSNTKPNHAMLVTGYGSYNGKDYWLVKNSWSKNWGDNGYILMVRNKYNQCAIATDALYPT 325
Query: 309 I 309
+
Sbjct: 326 L 326
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 177/323 (54%), Gaps = 20/323 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ N+EWI + ++KK Y+ +A + + + N +I HN + + Y L+ N
Sbjct: 23 LVNQEWINF----KMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKIN 78
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTL--VRSPESNE-----SVLIPDHLDWREKGFITPD 113
D+ H K M + I TL R P +V +P +DWR+ G +T
Sbjct: 79 KYGDM-LNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEPCNVELPKMVDWRKCGAVTEV 137
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
+Q CG+C+AFS +++GQ F+ T + LS Q ++DCS GN GC GG + +Y
Sbjct: 138 KDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSY 197
Query: 174 VQFAGGLMKEEDYPYKGKQSICKF-KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
++ GL E+ YPY+G+ C++ KR + D+ + +P DE LK +ATVGP++V
Sbjct: 198 IKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASDV-GFVDIPVGDEQKLKAAVATVGPVSV 256
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNG 287
+I+AS +FQ Y+ GIY + C+S ++H +L+VGY R+ WI+KN W WG+ G
Sbjct: 257 AIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDEEGRDYWIVKNSWGESWGEKG 316
Query: 288 YMYLKRG-NNRCGIANYAVYALI 309
Y+ + R +N CGIA+ A Y ++
Sbjct: 317 YIKMARNIDNHCGIASSASYPIV 339
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 161/301 (53%), Gaps = 12/301 (3%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
+F + K+ K Y ++++ + N I+ HN EA +G+H +T+ N +DL
Sbjct: 29 LFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNE 88
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESVLI---PDHLDWREKGFITPDWNQEDCGACYAF 125
Y R + R T + E E L +DWR+KG +TP NQ CG+C++F
Sbjct: 89 EY-----RQLYLRPYPTELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSF 143
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S +++G +T + LS QQ+VDCS GN GC GG + N Y+ GGL E+D
Sbjct: 144 STTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQD 203
Query: 186 YPYKGKQSIC-KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + +C K K V IS + +P +E L + GP++V+I A +FQ+Y
Sbjct: 204 YPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEK-GPVSVAIEADQQSFQMY 262
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
+SG++ T+ ++H +L+VGYT + WI+KN W WGD GY+ +KRG + GI A
Sbjct: 263 SSGVFSGPCGTN--LDHGVLVVGYTSDYWIVKNSWGASWGDQGYIMMKRGVSSAGICGIA 320
Query: 305 V 305
+
Sbjct: 321 M 321
>gi|259089092|ref|NP_001158584.1| Cathepsin L precursor [Oncorhynchus mykiss]
gi|225705034|gb|ACO08363.1| Cathepsin L precursor [Oncorhynchus mykiss]
Length = 330
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 157/292 (53%), Gaps = 9/292 (3%)
Query: 24 KATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIR 83
K + +++ W++N + I HN EA+ G H +TL N D+ + Y +T ++
Sbjct: 42 KVDEGFRRMIWETNQRLIRQHNLEAEIGKHTFTLGMNQFGDMMNKEYNALLTANDAEEVK 101
Query: 84 RTLVRSPESNESVLI---PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
+L P S + + P+ D R+ G++TP NQ CG+CYAF+ A +GQ+FK T
Sbjct: 102 -SLDGIPLSKWNCSLSAAPETWDRRQYGYVTPVKNQGSCGSCYAFAAVGAPEGQLFKQTG 160
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
+ LS Q +VDCS N GC GG +YV G+M E YPY + C+++
Sbjct: 161 TLLPLSEQNLVDCSGDYHNNGCDGGLAMRCFSYVS-DHGIMSERKYPYTAEVGPCQYQNA 219
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
++ +P DE + TL VGPIAVS+NA+ +F+ Y G+ + C++ N
Sbjct: 220 TKEAWCKGFNRVPSLDEKVFRDTLYEVGPIAVSVNATHPSFKFYKDGVLYEPDCSTR-TN 278
Query: 261 HAMLLVGYTR---NSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
HA+L VGY + WI+KN W WG +GY+ + RG NRCGIA VY ++
Sbjct: 279 HAVLAVGYGSSYLDYWIVKNSWGTGWGRDGYILMARGYNRCGIARRPVYPIM 330
>gi|315364648|pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A
Ketoamide Warhead
Length = 213
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 91/212 (42%), Positives = 131/212 (61%), Gaps = 7/212 (3%)
Query: 100 DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGN 159
D +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC +S N
Sbjct: 1 DSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSEN 58
Query: 160 LGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHA 219
GC GG + N YVQ G+ E+ YPY G++ C + + +P +E A
Sbjct: 59 DGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKA 118
Query: 220 LKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWIL 275
LK +A VGP++V+I+AS +FQ Y+ G+Y DE+C SD +NHA+L VGY WI+
Sbjct: 119 LKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWII 178
Query: 276 KNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 179 KNSWGENWGNKGYILMARNKNNACGIANLASF 210
>gi|392876478|gb|AFM87071.1| cathepsin L1 [Callorhinchus milii]
Length = 331
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 164/303 (54%), Gaps = 19/303 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K Y ++ +++ WQ+N ++I THN E ++ + + NH DL E R
Sbjct: 33 HRKQYANLVEETNRRIIWQANLQEIKTHNMEYKRRRVSFKMGMNHFGDLSS----SEFQR 88
Query: 77 LTH--SRIRRTLVRSPESNESVL------IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ H S +R+ V + + L I +DWR+KG++TP + Q CG+CY F+
Sbjct: 89 IYHHQSFKQRSFVYIIQEVKPFLPFNNSFIVRSVDWRKKGYVTPVYYQGTCGSCYGFAAT 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++ IFK ++ +LS Q +VDCS GNLGC+GGS+ +V G+ + YPY
Sbjct: 149 GSLEAMIFKKYGKLIKLSEQSIVDCSSYYGNLGCSGGSIIRAYKFV-MENGIQSADTYPY 207
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C+ + + V +S + + +E AL T++TVGP+AV ++ ++Q Y SGI
Sbjct: 208 TAKAGTCRHNKSSTVATMSHY-IKVHSNEEALAQTVSTVGPVAVCVHTKTRSWQFYKSGI 266
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
D C + +H ML+VG+ WI+KN W WG GY+++ K NN CGI++Y
Sbjct: 267 LYDPLCKNYTYDHGMLVVGFGIQKDEYYWIVKNSWGSRWGMKGYIWIAKDRNNHCGISSY 326
Query: 304 AVY 306
A Y
Sbjct: 327 ATY 329
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 106/299 (35%), Positives = 163/299 (54%), Gaps = 12/299 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
K+ K Y+ + ++ + ++ N + I HN +QGL Y N +D+ +
Sbjct: 31 KHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEEF---RA 87
Query: 76 RLTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
LT S ++ + E + L +PD +DWR KG +T +Q +CG+C+AFS+ + +
Sbjct: 88 FLTLSSSKKPHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAA 147
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
++ ++ LS QQ+VDCS N GC GG L T YV+ + GL E YPYKG
Sbjct: 148 YYRKAGKLVSLSEQQLVDCS-TDINAGCNGGYLDETFTYVK-SKGLEAESTYPYKGTDGS 205
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
CK+ +V +S L +DE+AL + VGP++V+I+A+ Y SGIY+D+ C
Sbjct: 206 CKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDAT--YLSSYESGIYEDDWC 263
Query: 255 TSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ +NH +L+VGY T N WI+KN W +G++GY L RG N CG+A VY +I
Sbjct: 264 SPSELNHGVLVVGYGTSNGKKYWIVKNSWGGSFGESGYFRLLRGKNECGVAEDTVYPII 322
>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
Length = 333
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 156/299 (52%), Gaps = 6/299 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y ++ + ++ N +I HN G + + N +D+H +++
Sbjct: 35 HAKTYANAVEEAYRAKVFKENAIRIAKHNDRFASGEVTFKVGYNQYADMHTHEVTEKLNG 94
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
+ + SN+S +DWR KG +TP +Q CG+C++FS +++GQ+F
Sbjct: 95 YRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLF 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ LS Q +VDCS GN GC GG + + YV+ GG+ EE YPY + C
Sbjct: 155 LKNKNLVSLSEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSNGGIDTEESYPYTAEDGTCL 214
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+K N + + + + E AL+ + VGP++V+I+AS +FQ+Y SGIY + AC+S
Sbjct: 215 YKAANNAGVNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYTSGIYYEPACSS 274
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
D ++H +L VGY + WI+KN W WG+ GY+ + R N CGIA A Y L+
Sbjct: 275 DSLDHGVLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 333
>gi|432853333|ref|XP_004067655.1| PREDICTED: cathepsin L2-like [Oryzias latipes]
Length = 352
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 104/301 (34%), Positives = 161/301 (53%), Gaps = 27/301 (8%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY------------IKEM 74
D +++ W+ NH+ I+ HNQ G +++ N DL + Y +K
Sbjct: 61 DMQRRAIWEENHRMINDHNQRFLTGKRPFSMGMNKYGDLTGQEYRVLQGAVMNAQFVKRG 120
Query: 75 TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
++ R+R +S +D+R G++T +Q CG+C+AFS AI+GQ
Sbjct: 121 KEVSARRLRYNARKSEAG--------FVDYRNMGYVTEVKDQGYCGSCWAFSTTGAIEGQ 172
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS- 193
I K T ++ LS Q +VDCS G GC+G + + +YV + GL + YPY +
Sbjct: 173 IVKKTGQLLSLSEQNLVDCSRPYGTHGCSGAWMASAYDYV-LSNGLQTTDSYPYTSVDTQ 231
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C + V I + +P DE AL +AT+GPI V+I+A +F Y+SGIYD+
Sbjct: 232 PCFYDSRLAVAHIKDYRFIPQGDEQALADAVATIGPITVAIDADHASFLFYSSGIYDEPN 291
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYAL 308
C + ++HA+LLVGY ++ WI+KN W WG+ GYM + R G+N CGIA+YA+Y +
Sbjct: 292 CDPNRLSHAVLLVGYGSEEGQDYWIIKNSWGSSWGEGGYMRIIRNGSNTCGIASYALYPI 351
Query: 309 I 309
+
Sbjct: 352 L 352
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 153/303 (50%), Gaps = 19/303 (6%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K YR D+ ++ W N ++ HN + L N +DL +
Sbjct: 35 KYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSS----FQLEVNEFADLTAEEFSSIYN 90
Query: 76 RLTHSRIRR-----TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
R R T+ R IPD +DWR KG +TP NQ+ CG+C+AFS +
Sbjct: 91 GYGKGRNRENHENTTIYRYTGG----AIPDSVDWRTKGLVTPVKNQKQCGSCWAFSTTGS 146
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++G K T ++ LS Q +VDC + GC GG + Y++ G+ EE YPYK
Sbjct: 147 LEGAHAKKTGKLVSLSEQNLVDCD--KKDHGCQGGLMTTAFKYIEENKGIDTEESYPYKA 204
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K C+FK+ +I + + D ALK +A +GPI+V+++AS +FQLY SGIYD
Sbjct: 205 KNGRCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSSFQLYKSGIYD 264
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
+ C+S ++H +L+VGY + W++KN W +WG GY + N CGI A Y
Sbjct: 265 PKICSSRKLDHGVLVVGYGKEDGEEYWLVKNSWGKNWGMEGYFKIASKKNLCGICTSACY 324
Query: 307 ALI 309
++
Sbjct: 325 PVV 327
>gi|288764227|emb|CAQ03434.1| silcatein 4 [Spongilla lacustris]
Length = 331
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 162/303 (53%), Gaps = 17/303 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +++K Y + + ++ W SN K I HN A+ L GYTL N L DL Y
Sbjct: 33 KGQHQKSYTSELEELERHAIWSSNKKYIEEHN--ARSDLFGYTLAMNSLGDLSMEEY--N 88
Query: 74 MTRLTHSRIRRT-----LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
LTH T + R+P V + +DWR KG +T Q CGA YAF+
Sbjct: 89 QMYLTHKTGDHTHHGLKVFRAPRG---VQYAESIDWRTKGAVTSIKYQAQCGASYAFAAT 145
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G + + LS Q ++DCS+ GN GC+GG L YV GG+ E Y +
Sbjct: 146 GALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTALKYVIDNGGIDTESSYSF 205
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ KQS C++ N + + E L +ATVGP+A++++A+ + F+ Y SG+
Sbjct: 206 QAKQSSCQYNSTNSGASATGVVRIASGSESDLLAAVATVGPVAIAVDANTNAFRFYKSGV 265
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
+D +C++ +NHAML+ GY ++ W++KN + +WGD+GY+ + R N+CGIA+
Sbjct: 266 FDSSSCSNTVLNHAMLVTGYGSYNGKDYWLVKNSFGKNWGDSGYILMVRNKYNQCGIASD 325
Query: 304 AVY 306
++Y
Sbjct: 326 SLY 328
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 170/306 (55%), Gaps = 15/306 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + + Y + +K +++N + I HN + G +++ N+ +DL +
Sbjct: 37 KSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF--- 93
Query: 74 MTRLTHSRIRR----TLVRSPESNESV-LIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R T + RR +L S ++ V +P +DW KG +TP NQ+ CG+C+AFS
Sbjct: 94 --RATFNGYRRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFSAV 151
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
++++GQ T ++ LS Q +VDCS G++GC+GG + YV G+ E YPY
Sbjct: 152 ASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASYPY 211
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C+FKR +I I S+ + DE AL+ +A++GPI+V+I+AS +FQ Y+SG+
Sbjct: 212 KAIDESCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAIDASQPSFQFYSSGV 271
Query: 249 YDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y++ C+++ ++H + VGY T N W +KN W WG GY+++ R N+CGIA
Sbjct: 272 YNEPDCSTEILDHGVTAVGYGTLNGVPYWKVKNSWGTSWGQKGYIFMSRNKQNQCGIATK 331
Query: 304 AVYALI 309
A Y ++
Sbjct: 332 ASYPVV 337
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 108/307 (35%), Positives = 154/307 (50%), Gaps = 22/307 (7%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
KK+ YR+K D +K H +++N + I N G Y L N +DL + +
Sbjct: 60 KKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAG---GKKKYVLGTNQFADLTSKEF 116
Query: 71 IKEMTRL-----THSRIRRTLVRSPESNESVLIPD-HLDWREKGFITPDWNQEDCGACYA 124
T L S ++ N + L D +DWR++G +TP NQ CG C+A
Sbjct: 117 AAMYTGLRKPAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWA 176
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
FS A++G I +T + LS QQ++DC GN GC GG + N YV GG+ E+
Sbjct: 177 FSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTED 236
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY Q C+ +P IS + LP DE+AL +A P++V ++ FQ Y
Sbjct: 237 AYPYSAVQGTCQNVQP--AATISGFQDLPSGDENALANAVANQ-PVSVGVDGGSSPFQFY 293
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGNNRCG 299
GIYD + C +D +NHA+ +GY + WILKN W WG+NG+M L+ G CG
Sbjct: 294 QGGIYDGDGCGTD-MNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGACG 352
Query: 300 IANYAVY 306
I+ A Y
Sbjct: 353 ISTMASY 359
>gi|387915178|gb|AFK11198.1| cathepsin L1 [Callorhinchus milii]
Length = 331
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 165/303 (54%), Gaps = 19/303 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K Y ++ +++ WQ+N ++I THN E ++ + + NH DL E R
Sbjct: 33 HRKQYANLVEETNRRIIWQANLQEIKTHNMEYKRRRVSFKMGMNHFGDLSS----SEFQR 88
Query: 77 LTH--SRIRRTLVRSPESNESVL------IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ H S +R+ V + + L I +DWR+KG++TP + Q CG+CY F+
Sbjct: 89 IYHHQSFKQRSFVYIIQEVKPFLPFNNSFIVRSVDWRKKGYVTPVYYQGTCGSCYGFAAT 148
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++ IFK ++ +LS Q +VDCS GNLGC+GGS+ +V G+ + YPY
Sbjct: 149 GSLEAMIFKKYGKLIKLSEQSIVDCSSYYGNLGCSGGSIIRAYKFV-MENGIQSADTYPY 207
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C+ + + V +S + + +E AL T++TVGP+AV ++ ++Q Y SGI
Sbjct: 208 TAKAGTCRHNKSSTVATMSHY-IKVHSNEEALAQTVSTVGPVAVCVHTKTRSWQFYKSGI 266
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANY 303
D C + +H +L+VG+ + WI+KN W WG GY+++ K NN CGI++Y
Sbjct: 267 LYDPLCKNYTYDHGILVVGFGIQKDEHYWIVKNSWGSRWGMKGYIWIAKDRNNHCGISSY 326
Query: 304 AVY 306
A Y
Sbjct: 327 ATY 329
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 166/308 (53%), Gaps = 15/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y+ + + + + N KI HNQ G + + N +D+ + M
Sbjct: 35 EHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMN 94
Query: 76 RLTHSRIRRTLVRSPES--------NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + + L + ES E V +P +DWR KG +T +Q CG+C+AFS
Sbjct: 95 GFNYT-LHKQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSS 153
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ ++ + + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 154 TGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 213
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ C F + +I + +P +E + +AT+GP+AV+I+AS +FQ Y+ G
Sbjct: 214 YEAIDDSCHFNKGSIGATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEG 273
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM-YLKRGNNRCGIA 301
+Y++ AC + ++H +L+VG+ + W++KN W WGD G++ L+ N+CGIA
Sbjct: 274 VYNEPACDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIA 333
Query: 302 NYAVYALI 309
+ + Y L+
Sbjct: 334 SASSYPLV 341
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 169/326 (51%), Gaps = 21/326 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ +EW + +KKY + ++ + K++ Q+ HK I HNQ G + LR N
Sbjct: 23 LVKEEWTAFKLQHRKKYDSETEERI---RMKIYVQNKHK-IAKHNQRYDLGQEKFRLRVN 78
Query: 61 HLSDLHPRHYIKEMTRLTHS-----RIRRTLVRSPESN------ESVLIPDHLDWREKGF 109
+DL ++ + S ++ R ++ E +V +P +DWR KG
Sbjct: 79 KYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEEPVTWIEPANVDVPTAMDWRTKGA 138
Query: 110 ITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRN 169
+T +Q CG+C++FS A++GQ F+ T ++ LS Q +VDCS GN GC GG +
Sbjct: 139 VTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNGCNGGMMDF 198
Query: 170 TLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGP 229
Y++ G+ E+ YPY+ C + + + +P +E AL LATVGP
Sbjct: 199 AFQYIKDNKGIDTEKSYPYEAIDDECHYNPKAVGATDKGFVDIPQGNEKALMKALATVGP 258
Query: 230 IAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWG 284
++V+I+AS +FQ Y+ G+Y + C S+ ++H +L VGY + W++KN W WG
Sbjct: 259 VSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWG 318
Query: 285 DNGYMYLKRG-NNRCGIANYAVYALI 309
D GY+ + R +N CGIA A Y L+
Sbjct: 319 DQGYVKMARNRDNHCGIATTASYPLV 344
>gi|17224950|gb|AAL37181.1|AF320084_1 cathepsin L-like protease [Ancylostoma caninum]
Length = 214
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 132/214 (61%), Gaps = 6/214 (2%)
Query: 102 LDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
+DWR+KG +T NQ CG+C+AFS A++GQ +++ ++ LS Q +VDCS GN G
Sbjct: 1 VDWRDKGLVTEVKNQGMCGSCWAFSATGALEGQHARASGQMVSLSEQNLVDCSTKYGNHG 60
Query: 162 CAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALK 221
C GG + Y++ G+ EE YPY G+ C FK+ +I + + LP DE ALK
Sbjct: 61 CNGGLMDLAFEYIKDNHGIDTEESYPYVGRDMKCHFKKKDIGAVDNGYVDLPEGDEEALK 120
Query: 222 VTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILK 276
+ +AT GPI+++I+A TFQLY G+Y DE C+S+ ++H +LLVGY + W++K
Sbjct: 121 IAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEAGDYWLVK 180
Query: 277 NWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
N W WG+ GY+ + R NN CG+A A Y L+
Sbjct: 181 NSWGTGWGEKGYIRIARNRNNHCGVATKASYPLV 214
>gi|313241067|emb|CBY33367.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 162/310 (52%), Gaps = 14/310 (4%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KEW Q K K Y+ + + + +W + + + HN +A G +T+ N +
Sbjct: 25 KEW-------QAKVSKAYQSEGHEMRGFQNWLVSRQFVMEHNIKAFTGQESFTVEMNQFA 77
Query: 64 DLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
DL ++ E RL + + + S+ + PD +DWR +G++TP +Q CG+C+
Sbjct: 78 DLSDEEWV-EYNRLRNGAKANSDCKPMPSSSAEANPDSVDWRNEGYVTPIKDQGQCGSCW 136
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS + +G FK T ++ LS QQ+VDCS G+ GC GG + Y+ G+ E
Sbjct: 137 AFSTTGSTEGAHFKKTGKLVTLSEQQLVDCSTKEGDHGCNGGLMDFGFTYIIENDGITTE 196
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
YPYK + CK +S + E L+ +ATVGPI+V+I+A +F+L
Sbjct: 197 SAYPYKAQDGSCK-SGMTAAATLSECYDVAQGSEADLETAVATVGPISVAIDAHLLSFRL 255
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL-KRGNNRC 298
Y GIY D C+S ++H +L VGY + N WI+KN W+ WG+ GY+++ K N C
Sbjct: 256 YKQGIYHDRLCSSTRLDHGVLAVGYKNDPSGNYWIVKNSWNTTWGNEGYIWMAKDKKNTC 315
Query: 299 GIANYAVYAL 308
GIA A Y +
Sbjct: 316 GIATAASYPV 325
>gi|351712164|gb|EHB15083.1| Cathepsin L1 [Heterocephalus glaber]
Length = 278
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 147/279 (52%), Gaps = 16/279 (5%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI 98
+ + HN E +G HG+T+ N D+ + + M H + ++ E +L+
Sbjct: 2 RMLELHNGEYSEGKHGFTMAMNAFGDMTSEEFKQVMNGFQHQKHKKGKTY----QEPLLL 57
Query: 99 P--DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
+DWREKG++TP NQ CG C+AFS +++GQ+F+ T ++ LS Q +VDCS
Sbjct: 58 QLLKSVDWREKGYVTPVKNQGQCGTCWAFSATGSLEGQMFQKTGQLVSLSEQNLVDCSRP 117
Query: 157 SGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQD 216
GN GC GG + YV+ GL E+ YPY+GK CK+K P + + V Q
Sbjct: 118 QGNQGCNGGLMDFAFEYVKENKGLESEKFYPYEGKDGSCKYK-PELSAANDTGFVDISQR 176
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS---- 272
E AL +A GPI+V+++A +FQ Y GIY D C+S +NH +L++GY
Sbjct: 177 EKALMKAVAEEGPISVAVDAGLTSFQFYKDGIYFDPECSSKDLNHGVLVLGYGYEEVNSE 236
Query: 273 ----WILKNWWSHHWGDNGYMYLKRGNNR-CGIANYAVY 306
W++KN WG GYM + N+ CGIA A Y
Sbjct: 237 KNEYWLVKNSSGPEWGAKGYMKIAGNRNKHCGIATAASY 275
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 165/308 (53%), Gaps = 15/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y+ + + + + N KI HNQ G + + N +D+ + M
Sbjct: 35 EHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMN 94
Query: 76 RLTHSRIRRTLVRSPES--------NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + + L + ES E V +P +DWR KG +T +Q CG+C+AFS
Sbjct: 95 GFNYT-LHKQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSS 153
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ ++ + + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 154 TGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 213
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ C F + I + +P +E + +AT+GP+AV+I+AS +FQ Y+ G
Sbjct: 214 YEAIDDSCHFNKGTIGATDRGFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEG 273
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM-YLKRGNNRCGIA 301
+Y++ AC + ++H +L+VG+ + W++KN W WGD G++ L+ N+CGIA
Sbjct: 274 VYNEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIA 333
Query: 302 NYAVYALI 309
+ + Y L+
Sbjct: 334 SASSYPLV 341
>gi|313221001|emb|CBY31833.1| unnamed protein product [Oikopleura dioica]
gi|313229611|emb|CBY18426.1| unnamed protein product [Oikopleura dioica]
Length = 362
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 174/331 (52%), Gaps = 26/331 (7%)
Query: 1 MTNK-EWIIIFIFPQKK--YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTL 57
M++K E +I F Q K ++K+Y + ++ W N I HN + G +TL
Sbjct: 26 MSDKYEQRLINEFKQWKDAFEKEYESIEQEIERMGTWMKNMLHIEEHNFQHSLGKKTFTL 85
Query: 58 RENHLSD-------------LHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDW 104
N D LH +++ L V + ES + +DW
Sbjct: 86 GMNKYGDQSSEEFAATYNGFLHAEGQTRKLFGLHEDAFYLDWVDADESK----LDKSVDW 141
Query: 105 REKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAG 164
REKG +T +Q CG+C++FS A++GQ+ + ++ +LS Q +VDCS GN GC G
Sbjct: 142 REKGAVTEVKDQGQCGSCWSFSATGALEGQMAQVFGKLPDLSEQNLVDCSRPEGNQGCNG 201
Query: 165 GSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVT 223
G + YV+ GL E+ YPY+G C++ + + D + + ++P +E ALK
Sbjct: 202 GLMDAAFQYVKDQDGLDGEDWYPYEGVDNKECRYDKSHREADDTGFKMIPEGNEKALKHA 261
Query: 224 LATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWW 279
LA VGP++V+I+AS +FQ Y SG+Y + C+ + ++H +L VGY +++KN W
Sbjct: 262 LAKVGPVSVAIDASNPSFQFYQSGVYYEPNCSPENLDHGVLAVGYGTEDGEHYYLVKNSW 321
Query: 280 SHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
S WGDNGY+ + R N CGIA+YAVY ++
Sbjct: 322 SEAWGDNGYIKMARNKENHCGIASYAVYPIV 352
>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 325
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 105/316 (33%), Positives = 170/316 (53%), Gaps = 21/316 (6%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
+EW+ + K K Y+ + + +Q N +KI HN++ G + +
Sbjct: 21 EEWVQFKV----KNNKSYKSYVEEQTRFRIFQENLRKIENHNEKYNNGESTFKFGVTKFT 76
Query: 64 DLHPRHYIKEMTRLTHSRIRRTL---VRSPESNESVLIPDHLDWREKGFITPDWNQEDCG 120
DL + ++ + ++R RT + +P + +P DWR+KG +T +Q CG
Sbjct: 77 DLTEKEFLDLLVLSKNARPNRTHATHLLAPLRD----LPSAFDWRDKGAVTEVKDQGMCG 132
Query: 121 ACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
+C+ FS +++ F T + LS Q +VDC+ + GC GG + L Y++ GG+
Sbjct: 133 SCWTFSTTGSVEAAHFLKTGNLVSLSEQNLVDCAKDTC-YGCGGGWMDKALEYIE-KGGI 190
Query: 181 MKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
M E+DYPY+G C+F + IS+++ + DE LK +A GPI+V+I+AS T
Sbjct: 191 MSEKDYPYEGVDDNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAIDASA-T 249
Query: 241 FQLYASGIYDDEACTSDY--VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG 294
FQLY SGI DD C++++ +NH +L+VGY ++ WI+KN W +WG +GY+ + R
Sbjct: 250 FQLYVSGILDDTECSNEFDSLNHGVLVVGYGTENGKDYWIIKNSWGVNWGMDGYIRMSRN 309
Query: 295 -NNRCGIANYAVYALI 309
NN+CGI VY I
Sbjct: 310 KNNQCGITTDGVYPNI 325
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 108/308 (35%), Positives = 154/308 (50%), Gaps = 23/308 (7%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
KK+ YR+K D +K H +++N + I N G Y L N +DL + +
Sbjct: 60 KKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAG---GKKKYVLGTNQFADLTSKEF 116
Query: 71 IKEMTRLTHSRIRRTLVR------SPESNESVLIPD-HLDWREKGFITPDWNQEDCGACY 123
T L + + S N + L D +DWR++G +TP NQ CG C+
Sbjct: 117 AAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCW 176
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS A++G I +T + LS QQ++DC GN GC GG + N YV GG+ E
Sbjct: 177 AFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINNGGVTTE 236
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
+ YPY Q C+ +P IS + LP DE+AL +A P++V ++ FQ
Sbjct: 237 DAYPYSAVQGTCQNVQP--AATISGFQDLPSGDENALANAVANQ-PVSVGVDGGSSPFQF 293
Query: 244 YASGIYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGNNRC 298
Y GIYD + C +D +NHA+ +GY + WILKN W WG+NG+M L+ G C
Sbjct: 294 YQGGIYDGDGCGTD-MNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGAC 352
Query: 299 GIANYAVY 306
GI+ A Y
Sbjct: 353 GISTMASY 360
>gi|348504496|ref|XP_003439797.1| PREDICTED: digestive cysteine proteinase 2-like [Oreochromis
niloticus]
Length = 352
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/293 (34%), Positives = 161/293 (54%), Gaps = 11/293 (3%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-IKEMTRLTHSRIRR- 84
D ++++ W+ N + I +N+ G+ +++ N DL + Y + + ++ ++R
Sbjct: 61 DMERRVIWEENKQLIEDNNRRFLMGMKSFSMAMNKYGDLTQQEYRVLQGAKMNAQFVKRG 120
Query: 85 --TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
R ++ L +D+R GF+T +Q CG+C+AFS AI+ Q++K T ++
Sbjct: 121 KEVSARRLRNSARHLDGFAVDYRSMGFVTEVKDQGFCGSCWAFSTTGAIEAQLYKKTGQL 180
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPN 201
LS Q +VDCS G GC+G + N +YV + GL YPY + C +
Sbjct: 181 ISLSEQNLVDCSKSFGTYGCSGAWMANAYDYV-VSNGLESSNTYPYTSVDTQPCFYDSSL 239
Query: 202 IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNH 261
V I + +P DE A+ LAT+GPI V+I+A +F Y+SGIYD+ C + +NH
Sbjct: 240 AVAHIRDYRFIPRGDEQAMADALATIGPITVTIDADHASFLFYSSGIYDEPNCNPNNLNH 299
Query: 262 AMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
A+LLVGY ++ WI+KN W WG+ GYM + R G N CG+A+YA+Y ++
Sbjct: 300 AVLLVGYGSQEGQDYWIIKNSWGTGWGEGGYMRIVRNGQNACGLASYALYPIL 352
>gi|288764225|emb|CAQ03433.1| silcatein 2 [Spongilla lacustris]
gi|296168749|emb|CAQ54052.1| silicatein alpha 4 [Spongilla lacustris]
Length = 326
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 163/303 (53%), Gaps = 17/303 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY--- 70
+ +++K Y + + ++ W SN K I HN A+ L GYTL N L DL Y
Sbjct: 28 KGQHQKSYTSELEELERHAIWSSNKKYIEEHN--ARSDLFGYTLAMNSLGDLSMEEYNEI 85
Query: 71 --IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
E TH ++ + ++P+ V D +DWR KG +T Q CGA YAF+
Sbjct: 86 YLTHETGNYTHHGLK--MFQAPKG---VQYADSIDWRAKGAVTSIKYQAQCGASYAFAAT 140
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G + + LS Q ++DCS+ GN GC+GG L YV GG+ E +
Sbjct: 141 GALEGASALANDKQVTLSEQNIIDCSVPYGNHGCSGGDTYTALKYVIDNGGIDTESSCSF 200
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ KQS C++ N + + E L +ATVGP+A++++A+ + F+ Y SG+
Sbjct: 201 QAKQSSCQYNSKNSGASATGVINIAFGSESDLLAAVATVGPVAIAVDANTNAFRFYQSGV 260
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
+D +C++ +NHAML+ GY ++ W++KN W +WGD+GY+ + R N+CGIA+
Sbjct: 261 FDSSSCSNTKLNHAMLVTGYGSYNGKDYWLVKNSWGKNWGDSGYILMVRNKYNQCGIASD 320
Query: 304 AVY 306
++Y
Sbjct: 321 SLY 323
>gi|33242865|gb|AAQ01137.1| cathepsin [Branchiostoma lanceolatum]
Length = 328
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 175/320 (54%), Gaps = 20/320 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ N +W + +K Y + Y + + ++L ++ N K I HN+EA +GLH + L N
Sbjct: 18 LMNPQWEVF----KKAYNRVYAAEE-EFARRLIFEDNLKTIQMHNEEADRGLHTFRLGVN 72
Query: 61 HLSDLHPRHYIKEMTR--LTHSRIRRTLVRSPESNESVL--IPDHLDWREKGFITPDWNQ 116
+D+ + +++ + L + ++ + L +PD +DWR+KG++TP NQ
Sbjct: 73 QYADMTHKEFLENVIGGCLLDTNTSKSTADHVHEYDPTLTDLPDTVDWRDKGYVTPVKNQ 132
Query: 117 EDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQF 176
E CG+C+AFS +++GQ FKST ++ LS Q +VDCS G G G + Y++
Sbjct: 133 EQCGSCWAFSTTGSLEGQHFKSTQKLVSLSEQNLVDCSRKRGT-GLPGRLMDQGFKYIKD 191
Query: 177 AGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE--HALKVTLATVGPIAVSI 234
GG+ EE YPYK K C ++ ++ + QDE AL+ +ATVGPI+V+I
Sbjct: 192 NGGIDTEECYPYKAKNEKCNYQAS---CSGATLTAKRRQDEGRGALQQAVATVGPISVAI 248
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMY 290
+A +FQLY SG+Y C+ ++H +L VGY ++ W++KN W WG+ GY+
Sbjct: 249 DAGHSSFQLYQSGVYHKFFCSETKMDHGVLAVGYGTEEGKDYWLVKNSWGASWGEKGYIK 308
Query: 291 LKRG-NNRCGIANYAVYALI 309
+ R +N GIA A Y +
Sbjct: 309 MSRNRHNNWGIATSASYPTV 328
>gi|443722452|gb|ELU11310.1| hypothetical protein CAPTEDRAFT_132308 [Capitella teleta]
Length = 235
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 85/222 (38%), Positives = 141/222 (63%), Gaps = 6/222 (2%)
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
E++ +P +DWREKG++TP NQ CG+C+AFS +++GQ+F+ T + +S Q +VDC
Sbjct: 14 ENLQVPKTVDWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDC 73
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN+GC+GG + N Y++ G+ E+ YPY+ C++K+ + V S + +P
Sbjct: 74 SRDEGNMGCSGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSVTTDSGFVDIP 133
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYT---- 269
DE AL+ +A+VGP++V+I+AS +FQ Y +G+Y + C+S ++H +L+V
Sbjct: 134 HGDETALRTAVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVVGYGVEN 193
Query: 270 -RNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
++ W++KN W WG+ GY+ + R N+CGIA+ A Y L+
Sbjct: 194 GQDYWLVKNSWGASWGEAGYIKMARNHGNQCGIASQASYPLL 235
>gi|28971813|dbj|BAC65418.1| cathepsin L [Pandalus borealis]
Length = 318
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 166/301 (55%), Gaps = 13/301 (4%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
+ +K + K T K+ L+ +++N K + HN+ +QGL + L+ N D+ +
Sbjct: 19 ENFKLTHAKVYTHGKEDLYRRSIFENNQKVVEEHNERFRQGLVTFDLKMNRFGDMTTEEF 78
Query: 71 IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+ +MT L +++ RT+ + V D +DWR+KG +TP +Q CG+C+AFS A
Sbjct: 79 VSQMTGL--NKVERTVGKVFAHYPEVERADTVDWRDKGAVTPVKDQGQCGSCWAFSTTGA 136
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++G F ++ LS Q +VDCS + N GC GG ++ +Y++ G+ E YPY+
Sbjct: 137 LEGAHFLKHGDLVSLSEQNLVDCS--TENSGCNGGVVQWAYDYIKSNNGIDTESSYPYEA 194
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C+F ++ ++ ++ +P DE + GP++V I+A ++FQLY+SG+Y
Sbjct: 195 QDLTCRFDAAHVGATVTGYADIPYADEVTQASAVHDDGPVSVCIDAGHNSFQLYSSGVYY 254
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
+ C +NHA+L VGY + W++KN W WG +GYM L R +N CG+A +
Sbjct: 255 EPNCNPSSINHAVLPVGYGTEEGSDYWLIKNSWGTGWGLSGYMKLTRNKSNHCGVATQSC 314
Query: 306 Y 306
Y
Sbjct: 315 Y 315
>gi|291224868|ref|XP_002732424.1| PREDICTED: cathepsin L-like [Saccoglossus kowalevskii]
Length = 823
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 82/196 (41%), Positives = 124/196 (63%), Gaps = 5/196 (2%)
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
CG+C+AFS +++GQ FK T ++ +LS QQ+VDCS GN GC GG + Y++ A
Sbjct: 628 CGSCWAFSTTGSLEGQTFKKTGKLPDLSEQQLVDCSTQFGNHGCNGGLMDLAFEYIKAAP 687
Query: 179 GLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
G+ E DYPY K C F + +V + + +P DE+ALK +AT+GPI+V+I+A
Sbjct: 688 GIEGEMDYPYLAKDGRCMFDQSKVVATDTGYVDIPSMDENALKEAVATIGPISVAIDAGH 747
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG 294
+FQ+Y SG+Y++ C+S+ ++H +L VGY ++ W++KN W WG GY+ + R
Sbjct: 748 PSFQMYKSGVYNEPGCSSERLDHGVLAVGYGTEDGQDYWLVKNSWGDSWGQAGYIMMSRN 807
Query: 295 -NNRCGIANYAVYALI 309
NN+CGIA A Y L+
Sbjct: 808 MNNQCGIATQASYPLV 823
>gi|38146075|gb|AAR11477.1| cathepsin L [Litopenaeus vannamei]
Length = 297
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 142/261 (54%), Gaps = 7/261 (2%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
++ N + I HN + G +TL+ N D+ + M + RR +
Sbjct: 39 FEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEEIVATMNGFLGAPTRRPAAVLKADD 98
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
E+ +P+ +DWR KG +TP +Q+ CG+C+AFS +++GQ F ++ LS Q +VDC
Sbjct: 99 ET--LPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDC 156
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLP 213
S GN+GC GG + Y++ G+ E+ YPY+ + C+F N+ + + +
Sbjct: 157 SDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDVE 216
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS- 272
E ALK +AT+GPI+V I+AS TF Y +G+Y D+ C+S ++H +L VGY +
Sbjct: 217 HGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDEN 276
Query: 273 ----WILKNWWSHHWGDNGYM 289
W++KN W+ WGD GY+
Sbjct: 277 GGDFWLVKNSWNTSWGDKGYI 297
>gi|313235898|emb|CBY11285.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 162/310 (52%), Gaps = 14/310 (4%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KEW Q K K Y+ + + + +W + + + HN +A G +T+ N +
Sbjct: 25 KEW-------QVKVSKAYQSEGHEMRGFQNWLVSRQFVMEHNIKAFTGQESFTVEMNQFA 77
Query: 64 DLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
DL ++ E RL + + + S+ + PD +DWR +G++TP +Q CG+C+
Sbjct: 78 DLSDEEWV-EYNRLRNGAKANSDCKPMPSSSAQANPDAVDWRPQGYVTPIKDQGQCGSCW 136
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
AFS + +G FK T ++ LS QQ+VDCS G+ GC GG + Y+ G+ E
Sbjct: 137 AFSTTGSTEGAHFKKTGKLVMLSEQQLVDCSTKEGDHGCNGGLMDFGFTYIIENDGITTE 196
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
YPYK + CK +S + E L+ +ATVGPI+V+I+A +F+L
Sbjct: 197 SAYPYKAQDGSCK-SGMTAAATLSECYDVAQGSEADLETAVATVGPISVAIDAHLLSFRL 255
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYL-KRGNNRC 298
Y GIY D C+S ++H +L VGY + N WI+KN W+ WG+ GY+++ K N C
Sbjct: 256 YKQGIYHDRLCSSTRLDHGVLAVGYKNDPSGNYWIVKNSWNTTWGNEGYIWMAKDKKNTC 315
Query: 299 GIANYAVYAL 308
GIA A Y +
Sbjct: 316 GIATAASYPV 325
>gi|256535831|gb|ACU82390.1| cathepsin L 2 [Pheronema raphanus]
Length = 320
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 164/304 (53%), Gaps = 15/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY K Y K DS +K+ W++N + I HN + + Y L N ++D+ + + ++
Sbjct: 24 KAKYGKQYMSKDEDSMRKIIWENNLQYIEKHNSHNK---YAYKLAANKMADMTNKEFRQQ 80
Query: 74 MTRLTHSRIRRTLVRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
L + I + S++ IP +DWR KG++T +Q+ CG+C+AFS A ++
Sbjct: 81 Y--LGYIPIEEGPIEYKHLYNSIITDIPTSVDWRTKGYVTKVKDQKQCGSCWAFSAAGSL 138
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F + LS QQ+VDCS GN GC GG + Y + G M E Y Y G+
Sbjct: 139 EGQYFTGQTNYF-LSEQQLVDCSRKEGNEGCDGGDMVLAFKYYEKYGA-MNESVYSYLGR 196
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+ CKF V ++ + +P D ++LK A GPI+V+++AS ++FQ+Y+SGIYD
Sbjct: 197 DAKCKFDENKAVAKVTGYKKIPKMDCNSLKNAAAVTGPISVAMDASHNSFQIYSSGIYDP 256
Query: 252 EAC--TSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
+ C + ++H +L+VGY S W++KN W WG GY + G + CGI A
Sbjct: 257 KRCGKRNRQLDHGVLVVGYGTESGEDFWLIKNSWGEDWGMQGYFKIAVGKDECGICTSAS 316
Query: 306 YALI 309
+ +
Sbjct: 317 FPTV 320
>gi|21617827|sp|P09648.1|CATL1_CHICK RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain
Length = 218
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 91/217 (41%), Positives = 131/217 (60%), Gaps = 6/217 (2%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP +Q CG+C+AFS A++GQ F++ ++ LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDE 217
N GC GG + YVQ GG+ EE YPY K C++K + + + +P E
Sbjct: 62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSW 273
AL +A+VGP++V+I+A +FQ Y SGIY + C+S+ ++H +L+VGY + W
Sbjct: 122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGGKKYW 181
Query: 274 ILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
I+KN W WGD GY+Y+ K N CGIA A Y L+
Sbjct: 182 IVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218
>gi|1705639|sp|Q10991.1|CATL1_SHEEP RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
Length = 217
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 131/218 (60%), Gaps = 7/218 (3%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DW +KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q +VD S
Sbjct: 1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC GG + N Y++ GGL EE YPY+ + C +K P + V PQ E
Sbjct: 61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYK-PEYSAAKDTGFVDIPQRE 119
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNS- 272
AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VGY T N
Sbjct: 120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKF 179
Query: 273 WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN W WG+ GY+ + K NN CGIA A Y +
Sbjct: 180 WIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>gi|432117576|gb|ELK37815.1| Cathepsin L1 [Myotis davidii]
Length = 299
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 106/302 (35%), Positives = 155/302 (51%), Gaps = 35/302 (11%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVRSPESNESVL 97
K I HN E QG +TL N D+ + M + + ++ + + P E
Sbjct: 2 KVIELHNWEHSQGRRNFTLAMNAFGDMTNEEFRLVMNGFQNQKHKKGDMFQEPALAE--- 58
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
IP +DWR+KG +TP +Q CG+C+AFS A++GQ+F+ T ++ LS Q +VDCS
Sbjct: 59 IPPSVDWRKKGCVTPVKDQGGCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 118
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC+GG + N YV+ GL EE YPY G CK+K P + V +DE
Sbjct: 119 GNEGCSGGLMDNAFQYVKDNEGLDTEESYPYYGTDDTCKYK-PEFSAANDTGFVDIHKDE 177
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYAS---------------------GIYDDEACTS 256
+L +A+VGPI+V+++AS +FQ Y GIY D C+S
Sbjct: 178 RSLMKAVASVGPISVALDASLESFQFYEKGKVTVSSYLEIFTPAMTSVFLGIYYDPDCSS 237
Query: 257 DYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
+ +NH +L+VGY WI+KN W WG +GY+ + + +N CGIA+ A Y
Sbjct: 238 EDLNHGVLVVGYGFEGVEMDNNKYWIVKNSWGTKWGMDGYIKMAKDLDNHCGIASMASYP 297
Query: 308 LI 309
+
Sbjct: 298 TV 299
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 161/305 (52%), Gaps = 12/305 (3%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
++K Y K +S + + N KI HNQ+ + Y L N D+ +I +
Sbjct: 35 HRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNG 94
Query: 77 LTHS------RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
S RR + +V IP +DWR G +TP +Q CG+C++FS A
Sbjct: 95 FNKSVSAQLRAQRRPIGSRFIEPANVEIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGA 154
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++GQ ++ T ++ LS Q ++DCS GN GC GG + Y++ GL E YPY+
Sbjct: 155 LEGQHYRITGKLVSLSEQNLIDCSGRYGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEA 214
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C++ N S + +P +E LK +AT+GP++V+I+AS +FQ Y G+Y
Sbjct: 215 ENDKCRYNPRNNGATDSGYVDIPEGNEKKLKAAVATIGPVSVAIDASAESFQFYREGVYY 274
Query: 251 DEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ C+S+ ++H +L+VGY ++ W++KN W WGD GY+ + R +N CGIA+ A
Sbjct: 275 EPRCSSENLDHGVLVVGYGTDDNDQDYWLVKNSWGVTWGDEGYIKMARNKDNHCGIASSA 334
Query: 305 VYALI 309
Y L+
Sbjct: 335 SYPLV 339
>gi|340374922|ref|XP_003385986.1| PREDICTED: cathepsin L-like [Amphimedon queenslandica]
Length = 325
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 162/313 (51%), Gaps = 15/313 (4%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+T++EW I ++K Y+ A + ++ W SN+K I HN + + L N
Sbjct: 19 LTDEEWEQWKI----THRKTYQNVAVEESRRQVWSSNYKLIQEHNMKESS----FKLALN 70
Query: 61 HLSDLHPRHYI-KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDC 119
H +DL Y K ++ +R ++ + +DWR++G +T NQ C
Sbjct: 71 HFADLTNDEYTEKYLSSFDLEGFKRNMMNESLHANNAKADTAVDWRDRGAVTSVKNQGQC 130
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G+C+AFS A++ F + + LS QQ+VDCS GN GC GG + YV+ G
Sbjct: 131 GSCWAFSATGALEAMHFLNKGTLVSLSEQQLVDCSYRYGNAGCQGGLMDRAFAYVRNKGY 190
Query: 180 LMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
+ EE YPY G CK + + + DE +L +++VG ++V+I+AS +
Sbjct: 191 ICSEESYPYVGYMWACKSSSCSPAASCRGYKNIASGDEESLTNAVSSVGTVSVAIDASRY 250
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG 294
+FQ Y+SG++ D C+S +NH +L+VGY +I+KN WS WGD GY+ + R
Sbjct: 251 SFQFYSSGVFYDSDCSSSRLNHGVLVVGYGTYYDGTEYYIVKNSWSSDWGDKGYVLMARN 310
Query: 295 -NNRCGIANYAVY 306
+N CGIA+ A Y
Sbjct: 311 KDNNCGIASAASY 323
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 98/308 (31%), Positives = 166/308 (53%), Gaps = 15/308 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+++K+Y + + + + N KI HNQ G Y L N +D+ + + M
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170
Query: 76 RLTHSRIRRTLVRSPES--------NESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + + L + ES E V +P +DWR+KG +T +Q CG+C+AFS
Sbjct: 171 GFNYT-LHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSS 229
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ ++ + + LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 230 TGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYP 289
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ C F + I + +P +E L +AT+GP++V+I+AS +FQ Y+ G
Sbjct: 290 YEALDDSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEG 349
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYM-YLKRGNNRCGIA 301
+Y + AC + ++H +L+VG+ + W++KN W WGD G++ L+ +N+CGIA
Sbjct: 350 VYVEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIA 409
Query: 302 NYAVYALI 309
+ + Y L+
Sbjct: 410 SASSYPLV 417
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/306 (33%), Positives = 170/306 (55%), Gaps = 15/306 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + + Y + +K +++N + I HN + G +++ N+ +DL +
Sbjct: 37 KSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF--- 93
Query: 74 MTRLTHSRIRR----TLVRSPESNESV-LIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R T + RR +L S ++ V +P +DW KG +TP NQ+ CG+C+AFS
Sbjct: 94 --RATFNGYRRLAAVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFSAV 151
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
++++GQ T ++ LS Q +VDCS G++GC+GG + YV G+ E YPY
Sbjct: 152 ASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASYPY 211
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C+FKR ++ I S+ + DE AL+ +A++GPI+V+I+A+ +FQ Y+SG+
Sbjct: 212 KAIDESCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAIDAAQPSFQFYSSGV 271
Query: 249 YDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y++ C+++ ++H + VGY T N W +KN W WG GY+++ R N+CGIA
Sbjct: 272 YNEPDCSTEILDHGVTAVGYGTLNGAPYWKVKNSWGTSWGRKGYIFMSRNKQNQCGIATK 331
Query: 304 AVYALI 309
A Y ++
Sbjct: 332 ASYPVV 337
>gi|258588539|pdb|3HWN|A Chain A, Cathepsin L With Az13010160
gi|258588540|pdb|3HWN|B Chain B, Cathepsin L With Az13010160
gi|258588541|pdb|3HWN|C Chain C, Cathepsin L With Az13010160
gi|258588542|pdb|3HWN|D Chain D, Cathepsin L With Az13010160
Length = 258
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 99/260 (38%), Positives = 144/260 (55%), Gaps = 14/260 (5%)
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRTLV-RSPESNESVLIPDHLDWREKGFITPDWNQED 118
N D+ + + M + + R+ V + P E+ P +DWREKG++TP NQ
Sbjct: 3 NAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA---PRSVDWREKGYVTPVKNQGQ 59
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
CG+C+AFS A++GQ+F+ T + LS Q +VDCS GN GC GG + YVQ G
Sbjct: 60 CGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG 119
Query: 179 GLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
GL EE YPY+ + CK+ V + + + +P Q E AL +ATVGPI+V+I+A
Sbjct: 120 GLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPKQ-EKALMKAVATVGPISVAIDAGH 178
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS--------WILKNWWSHHWGDNGYMY 290
+F Y GIY + C+S+ ++H +L+VGY S W++KN W WG GY+
Sbjct: 179 ESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVK 238
Query: 291 L-KRGNNRCGIANYAVYALI 309
+ K N CGIA+ A Y +
Sbjct: 239 MAKDRRNHCGIASAASYPTV 258
>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
Length = 330
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 165/303 (54%), Gaps = 11/303 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ +
Sbjct: 32 KKTYGKQYKEKNEEAVRRLIWEKNLKFVMIHNLEHSMGMHSYDLGMNHLGDMTSEEVMSL 91
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M+ L S+ +R + + +SN + ++PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 92 MSSLRVPSQWQRNI--TYKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 150 AQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 209
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ S ++ E LK +A GP++V ++A +F LY SG+Y +
Sbjct: 210 VK-CQYDSKYRAATCSKYTDFXYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYE 268
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+CT + VNH +L+VGY + W++KN W ++G+ GY+ + R N CGIA++ +
Sbjct: 269 PSCTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSF 327
Query: 307 ALI 309
I
Sbjct: 328 PEI 330
>gi|198457180|ref|XP_001360577.2| GA18475 [Drosophila pseudoobscura pseudoobscura]
gi|198135890|gb|EAL25152.2| GA18475 [Drosophila pseudoobscura pseudoobscura]
Length = 372
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 152/304 (50%), Gaps = 17/304 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K+Y A + + + + + N +G Y L N SDL ++ ++T L
Sbjct: 73 KNYLSAADKALHEGVFAARKNLVDAGNDAFAKGASSYQLAVNAFSDLTKSEFLSQLTGLR 132
Query: 79 HSR-------IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
S R L P IP+ DWR+KG +T Q CG+C+AF+ AI
Sbjct: 133 KSSQGASKATANRKLASVPAGAS---IPESFDWRQKGGVTSVKFQGTCGSCWAFATTGAI 189
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNL-GCAGGSLRNTLNYV-QFAGGLMKEEDYPYK 189
+G IF+ T + LS Q +VDC + L GC GG + ++ + G+ K + YPY
Sbjct: 190 EGHIFRKTGTLPNLSEQNLVDCGTLEFGLSGCDGGFQEYAMAFINEEQKGVSKADGYPYI 249
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ CK+ + I+ ++ +PP+DE +K +AT+GP+A S+N Q Y SGIY
Sbjct: 250 DNKDTCKYSKNLSGAQITGFATIPPKDEALMKKVIATLGPLACSLNGLETLLQ-YKSGIY 308
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
DE C NH++L+VGY ++ WI+KN W WG+ GY L RGNN CGIA
Sbjct: 309 SDEKCNEGEPNHSILVVGYGSEKGQDYWIVKNSWDKVWGEEGYFRLPRGNNFCGIALECT 368
Query: 306 YALI 309
Y ++
Sbjct: 369 YPIV 372
>gi|28932704|gb|AAO60046.1| midgut cysteine proteinase 3 [Rhipicephalus appendiculatus]
Length = 334
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 109/321 (33%), Positives = 170/321 (52%), Gaps = 20/321 (6%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRE 59
+ EW K+Y D T+ +L + N KI HN++ + Y L
Sbjct: 22 LIGAEWSAFKSLHGKEYDSD-----TEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAM 76
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRTLVRS------PESNESVLIPDHLDWREKGFITPD 113
N DL ++ TR R R R PE E + +P +DWR+KG +TP
Sbjct: 77 NEFGDLLHHEFVS--TRNGFKRNYRDTPREGSFFIEPEGFEDLHLPKTVDWRKKGAVTPV 134
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
NQ CG+C+AFS +++GQ F+ ++ LS Q +VDC GN GC GG + N Y
Sbjct: 135 KNQGQCGSCWAFSTTGSLEGQHFRKMRKLVSLSEQNLVDCMQKLGNNGCGGGLMDNAFKY 194
Query: 174 VQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVS 233
++ G+ E YPY +C FK+ + + + +P +DE++ +A VGP++V+
Sbjct: 195 IKANKGIDTELSYPYNATDGVCHFKKSGVGATATGFEDIPARDENSWDA-VAPVGPVSVA 253
Query: 234 INASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYM 289
I+AS +FQ Y+ G+ D+ C+SD ++H +L+VGY ++ W++KN W WGD GY+
Sbjct: 254 IDASHESFQFYSEGVLDEPECSSDQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYI 313
Query: 290 YLKRG-NNRCGIANYAVYALI 309
Y+ R +N+CGIA+ A Y L+
Sbjct: 314 YMTRNKDNQCGIASSASYPLV 334
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 104/302 (34%), Positives = 155/302 (51%), Gaps = 8/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y +S + + K I HN+ +G Y L+ N+ SDL +
Sbjct: 24 KTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLAT 83
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T +T R R L P+S + + +DWR KG +TP +Q CG+C+AFS +A++G
Sbjct: 84 KTGMT--RRRHPLSVLPKSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAVAALEG 141
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
F T ++ LS Q +VDCS GN GC GG Y+ G+ E YPYK
Sbjct: 142 AHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSYPYKAIDD 201
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ NI +SS+ DE AL+ + GP++V I+A +F Y G+Y +
Sbjct: 202 NCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSYGGGVYYEPN 261
Query: 254 CTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C S Y NHA+ VGY ++ WI+KN W WG++GY+ + R +N C IA Y+VY
Sbjct: 262 CDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDNNCAIATYSVYP 321
Query: 308 LI 309
++
Sbjct: 322 VV 323
>gi|195150387|ref|XP_002016136.1| GL11434 [Drosophila persimilis]
gi|194109983|gb|EDW32026.1| GL11434 [Drosophila persimilis]
Length = 372
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 152/304 (50%), Gaps = 17/304 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K+Y A + + + + + N +G Y L N SDL ++ ++T L
Sbjct: 73 KNYLSAADKALHEGVFAARKNLVDAGNDAFAKGASSYQLAVNAFSDLTKSEFLSQLTGLR 132
Query: 79 HSR-------IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
S R L P IP+ DWR+KG +T Q CG+C+AF+ AI
Sbjct: 133 KSSQGASKATANRKLASVPAGAS---IPESFDWRQKGGVTSVKFQGTCGSCWAFATTGAI 189
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNL-GCAGGSLRNTLNYV-QFAGGLMKEEDYPYK 189
+G IF+ T + LS Q +VDC + L GC GG + ++ + G+ K + YPY
Sbjct: 190 EGHIFRKTGTLPNLSEQNLVDCGTLEFGLSGCDGGFQEYAMAFINEEQKGVSKADGYPYI 249
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ CK+ + I+ ++ +PP+DE +K +AT+GP+A S+N Q Y SGIY
Sbjct: 250 DNKDTCKYSKNLSGAQITGFATIPPKDETLMKKVIATLGPLACSLNGLETLLQ-YKSGIY 308
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
DE C NH++L+VGY ++ WI+KN W WG+ GY L RGNN CGIA
Sbjct: 309 SDEKCNEGEPNHSVLVVGYGSEKGQDYWIVKNSWDKVWGEEGYFRLPRGNNFCGIALECT 368
Query: 306 YALI 309
Y ++
Sbjct: 369 YPIV 372
>gi|390363592|ref|XP_790934.3| PREDICTED: counting factor associated protein D-like
[Strongylocentrotus purpuratus]
Length = 560
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 160/303 (52%), Gaps = 13/303 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++KY K Y+ ++K H+ N + IH+ N+ GY L NH++D + +
Sbjct: 257 KQKYDKTYKTDVEHVQRKGHFTKNVRMIHSINRANL----GYVLDINHMADQSHQELKRM 312
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
RL +R L +PDH+DW +G ++P +Q CG+C++F A I+G
Sbjct: 313 RGRLRQTRPNNGLPYDGSDISDDAVPDHIDWNVRGAVSPVKDQAVCGSCWSFGSAETIEG 372
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYKGKQ 192
+F + + LS Q ++DC+ +GN GC GG ++ GG+ EE Y PY G+
Sbjct: 373 AVFMQSGKRVRLSQQMLMDCTWAAGNNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQN 432
Query: 193 SICKF-KRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C + K V I + + ++ LK LAT GPIAV I+A+ +F Y+ G Y D
Sbjct: 433 GMCHYGKSTPAVASIKKYYNVTSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYD 492
Query: 252 EAC--TSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
+C T D ++HA+L VGY +S W++KN WS HWG+NGY+ + +N CG+A A
Sbjct: 493 ASCGNTVDDLDHAVLAVGYGTDSSGQDYWLIKNSWSTHWGNNGYVAISMKDNNCGVATAA 552
Query: 305 VYA 307
Y
Sbjct: 553 TYV 555
>gi|156717488|ref|NP_001096284.1| uncharacterized protein LOC100124852 precursor [Xenopus (Silurana)
tropicalis]
gi|134026063|gb|AAI35549.1| LOC100124852 protein [Xenopus (Silurana) tropicalis]
Length = 333
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 158/303 (52%), Gaps = 12/303 (3%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K ++K Y+ + ++ W+ K I HN E GLH Y + NHL D+ M
Sbjct: 35 KTHQKTYKNAEEERARRTIWEETLKFISAHNLEYSLGLHTYEVGMNHLGDMTGEEVAATM 94
Query: 75 TRLTHSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
T T SR TL E+ + +L P +DWR KG +TP NQ C YAF+ A+
Sbjct: 95 TGYTGSR--NTLANITEAPKEILEAQPPASIDWRTKGCVTPVKNQGSCRCDYAFAAVGAL 152
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+ Q T + S QQ+VDCS GN GC GG + + Y++ GLM+E YPY+GK
Sbjct: 153 ECQWKIKTGSLFTFSPQQLVDCSYTEGNNGCYGGYIMYSFTYMKKY-GLMQEPAYPYEGK 211
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+ C K+P+ + + +P + +AL + VGP++V I+A F++Y SG+Y D
Sbjct: 212 EGKCTKKKPSNTGVVKQFYRIPSGNGNALMKAVGRVGPVSVWIDAGQQGFRMYKSGVYYD 271
Query: 252 EACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNR-CGIANYAVY 306
CT+ + NH +L+VGY W++KN W +G GY+ + R ++ CGI AVY
Sbjct: 272 PQCTT-HTNHVVLIVGYGTAKGSKYWLVKNSWGKGYGHKGYIKMARNYDKDCGITLRAVY 330
Query: 307 ALI 309
+
Sbjct: 331 PTV 333
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 105/286 (36%), Positives = 158/286 (55%), Gaps = 15/286 (5%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK-----EMTRLTHSRIRRTLVR 88
+Q N I N E + GYTL +D+ + + M T +++R+ L R
Sbjct: 189 FQRNLAHIEKFNAE-KAASRGYTLGITQFADMSTAEFRQTYLGLRMNASTIAKLRK-LQR 246
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
+++ L P+ +DWR+KG ++P +Q CG+C+AFS + AI+GQ F E+ LS Q
Sbjct: 247 EVVADDRDL-PEAVDWRDKGAVSPVKDQGQCGSCWAFSTSGAIEGQHFLKNGELLSLSEQ 305
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
Q+VDCS + + GC GG + YV+F GGL E YPYKG C + + I+
Sbjct: 306 QMVDCSWL--DFGCNGGQPMLAMEYVRFNGGLELETAYPYKGVGGSCHSDKKSAAAKITG 363
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ + E AL+ +A VGPI+V ++AS FQ Y SGIY+ E+C+S ++HA+L VGY
Sbjct: 364 FWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYKSGIYNPESCSSIGLDHAVLAVGY 423
Query: 269 TRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN W+ WG+ GY L R N+CGIA +Y +
Sbjct: 424 GTSDDGDYWLVKNSWNTSWGEKGYFKLPRNKGNKCGIATTPIYPTV 469
>gi|58332124|ref|NP_001011214.1| cathepsin S, gene 2 precursor [Xenopus (Silurana) tropicalis]
gi|56556518|gb|AAH87770.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 332
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 164/301 (54%), Gaps = 9/301 (2%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K ++K Y+ + ++ W+ K I HN E GLH Y + NHL D+ M
Sbjct: 35 KTHQKSYKDTEEERTRRTIWEETLKFITVHNLEYSLGLHTYEVGMNHLGDMTGEEVAATM 94
Query: 75 TRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T T S + P+ + P +DWR + +TP +Q CG CYAFS A++
Sbjct: 95 TGYTGSGGSLANMTEVPKEIQEAQPPASIDWRTQACVTPVRDQGPCGCCYAFSAVGAMEC 154
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q + + LS Q++VDCS GN GC GG L + Y+ G+M+E Y Y G+++
Sbjct: 155 QAKRKRGMLFTLSPQELVDCSDTEGNNGCKGGRLMSAFTYM-MKHGVMEENAYRYTGQEA 213
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C+ ++ + + +++ + DE+ L + TVGP++V+I+++ F+LY SG+Y D
Sbjct: 214 DCR-RQVHTGLKVTAIHNVAAGDENVLMHAVGTVGPVSVNIDSNRKGFRLYKSGVYYDPY 272
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
CT++ ++H++L+VGY ++ W++KN W +GD GY+ + R NN CGIA AV+
Sbjct: 273 CTTN-LDHSVLVVGYGTDNGNDYWLVKNSWGAGYGDKGYIKMARNRNNHCGIAQEAVFPT 331
Query: 309 I 309
I
Sbjct: 332 I 332
>gi|33242867|gb|AAQ01138.1| cathepsin [Branchiostoma lanceolatum]
Length = 327
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 108/317 (34%), Positives = 169/317 (53%), Gaps = 14/317 (4%)
Query: 2 TNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENH 61
T+ EW + K+Y DS + +Q N K + HN+EA G H + ++ N
Sbjct: 16 TDNEWEAFKLLHGKQYSV-----YEDSARHAIFQENSKSVKQHNEEAAMGKHTFFMKMNK 70
Query: 62 LSDLHPRHY---IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQED 118
L D+ + + + ++ + ES + + D +DWR++G +T NQE
Sbjct: 71 LGDMTNEEFQMLVSGSGLMQTNKTEQAEGGVFESMPGLKVNDTVDWRQQGAVTKVKNQEQ 130
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
CG+C+AFS +++GQ F + + LS Q ++DCS GN GC GG + Y++ G
Sbjct: 131 CGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLMDCSRKGGNKGCKGGLMDQAFKYIKTNG 190
Query: 179 GLMKEEDYPYKGK-QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
G+ EE YPYKGK + C +K ISS+ DE AL AT+GPI+ I+AS
Sbjct: 191 GIDTEECYPYKGKDEKECDYKSSCSGATISSFVDFKAGDEEALMQAAATIGPISFGIDAS 250
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR 293
+FQLY G+Y ++ C+S ++H +L+VGY ++ W++KN W WG GY+ + R
Sbjct: 251 HPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGTHGNKDYWLVKNSWGAEWGMEGYIMMSR 310
Query: 294 G-NNRCGIANYAVYALI 309
+N+CGIA A Y ++
Sbjct: 311 NKDNQCGIATQASYPVV 327
>gi|410968392|ref|XP_003990691.1| PREDICTED: cathepsin S, partial [Felis catus]
Length = 310
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 102/270 (37%), Positives = 146/270 (54%), Gaps = 9/270 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K Y K Y++K + ++L W+ N K + HN E G+H Y L NHL D+ I
Sbjct: 44 KKTYGKQYKEKNEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISL 103
Query: 74 MTRL-THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
M L S+ +R + SN+ +PD +DWREKG +T Q CGAC+AFS A++
Sbjct: 104 MGCLRVPSQWQRNVTYKSNSNQK--LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALE 161
Query: 133 GQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T + LS Q +VDCS GN GC GG + Y+ G+ E YPYK
Sbjct: 162 AQLKLKTGNLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAM 221
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ N S ++ LP E LK T+A GP++V+I+AS +F LY SG+Y +
Sbjct: 222 DGKCQYDSKNRAATCSKYTELPFGSEEDLKETVANKGPVSVAIDASHSSFFLYRSGVYYE 281
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKN 277
ACT VNH +L+VGY ++ W++KN
Sbjct: 282 PACTQT-VNHGVLVVGYGNLNGKDYWLVKN 310
>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
Length = 347
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 156/305 (51%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ KY K Y ++++ +Q + I HN EA G+H Y + N +DL + +
Sbjct: 35 KDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTYLVGVNEFADLTREEFRQH 94
Query: 74 -MTRLTHSRIRRTLVRSP--------ESNESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
+TRL +R V + + +S +DWR++G +TP NQ CG
Sbjct: 95 HVTRLPFDDDKRDPVTATLHLDEHAVHAADSNGDSSGIDWRKRGAVTPVRNQGQCGNPAI 154
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
F+ A++G S+ + ELS QQV+DCS G GC+GGSL + Y+ GGL
Sbjct: 155 FAAVEAVEGMHAISSGNLVELSTQQVIDCS---GTPGCSGGSLVSFFKYIARNGGLDSAA 211
Query: 185 DYPYKGKQSIC-KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
DYP G C K K V + +SV+PP++E L + + P+AV+I A +FQ+
Sbjct: 212 DYPTSGAGGQCNKAKEARHVAKVGGYSVVPPRNETKLAAAVFKM-PVAVAIEADTPSFQM 270
Query: 244 YASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANY 303
Y SG+Y T ++HA+L+VGYT WI+KN W WGD GY+ +KRG GI
Sbjct: 271 YTSGVYSGPCGTQ--LDHAVLVVGYTDEYWIVKNSWGASWGDQGYIMMKRGVGAAGICGI 328
Query: 304 AVYAL 308
+ A+
Sbjct: 329 TLDAM 333
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 155/299 (51%), Gaps = 7/299 (2%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+Y + Y + + + N + I HN++ G Y L N D+ M
Sbjct: 28 QYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEEINAVMN 87
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
L + R + +++ +P +DWR KG +TP +Q+ CG+C+AFS +++GQ
Sbjct: 88 GLLPASESRGVAVLGGRDDT--LPAEVDWRTKGAVTPVKDQKACGSCWAFSATGSLEGQH 145
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F ++ LS Q +VDCS G+ GC GG + Y++ GG+ E YPY+ C
Sbjct: 146 FLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASYPYEATDGKC 205
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N ++ + + E AL+ +AT+GPI+V+I+AS TF Y G+Y D+ C+
Sbjct: 206 QYNPANSGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFHFYHKGVYYDKECS 265
Query: 256 SDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
S ++H +L VGY W++KN W+ WG++G++ + R NN CGIA A Y L+
Sbjct: 266 STSLDHGVLAVGYGTQDGTDYWLVKNSWNITWGNHGFIEMSRNRNNNCGIATQASYPLV 324
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 166/301 (55%), Gaps = 16/301 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ K+Y + + + + N K+ H++ QG + L N +D+ ++ +
Sbjct: 33 QHSKNYDSETEERFRMKIFMENDHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLN 92
Query: 76 RLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + +++ + N++V +PD +DWR+KG +T +Q CG+C++FS
Sbjct: 93 GF--NKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSG 150
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+ +++GQ F+ T ++ LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 151 SGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNTGCNGGLMDNAFRYIKDNGGIDTEQSYP 210
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + C +K N + + +E LK +ATVGP++++I+AS TFQLY+ G
Sbjct: 211 YLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVSIAIDASYETFQLYSDG 270
Query: 248 IYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+Y D C+S ++H +L+VGY ++ W++KN W G NGY+ + R +N CG+A
Sbjct: 271 VYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLNGYIKMARNQDNMCGVA 330
Query: 302 N 302
+
Sbjct: 331 S 331
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 164/303 (54%), Gaps = 14/303 (4%)
Query: 17 YKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
+K Y +K D+K++L+ +Q N + + N++ + G + + N D+ +
Sbjct: 15 FKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEFNA 74
Query: 73 EMTRLTH-SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
M SR T V + E + +DWR KG +TP +Q CG+C+AFS ++
Sbjct: 75 VMKGYKKGSRGEPTTVFTAEGRP---MAADVDWRTKGAVTPVKDQGQCGSCWAFSATGSL 131
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
+GQ F +E+ LS Q++VDCS GN GC GG + + +Y++ GG+ E YPY+ +
Sbjct: 132 EGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAQ 191
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C+F +I + + V E AL ++ +GPI+V+I+AS +FQ Y+SG+Y +
Sbjct: 192 DRSCRFDANSIGATCTGF-VEVQHTEEALHEAVSDIGPISVAIDASHFSFQFYSSGVYYE 250
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
+ C+ ++H +L VGY T + W++KN W WGD GY+ + R +N CGIA+ Y
Sbjct: 251 KKCSPTNLDHGVLAVGYGTESTEDYWLVKNSWGSGWGDAGYIKMSRNRDNNCGIASEPSY 310
Query: 307 ALI 309
+
Sbjct: 311 PTV 313
>gi|3243102|gb|AAC23951.1| silicatein alpha [Tethya aurantium]
Length = 330
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 106/304 (34%), Positives = 160/304 (52%), Gaps = 14/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K++ K Y + +K L W SN K I HN A G+TL NHL D+ Y +
Sbjct: 33 KKQHDKSYSTNLEELEKHLVWLSNKKYIELHNANADT--FGFTLAMNHLGDMTDHEYKER 90
Query: 74 MTRLTHSR---IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
T+S+ + R P + P+ +DWR KG +T +Q DCGA YAFS A
Sbjct: 91 YLTYTNSKSGNYTKVFKREPW----MAYPETVDWRTKGAVTGIKSQGDCGASYAFSAMGA 146
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++G +T ++ LS Q ++DCS+ GN GC GG++ YV G+ YP++G
Sbjct: 147 LEGINALATGKLTYLSEQNIIDCSVPYGNHGCKGGNMYVAFLYVVANEGVDDGGSYPFRG 206
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
KQS C ++ +S + E L+ +A VGP+AV+I+ + F+ Y SG+YD
Sbjct: 207 KQSSCTYQEQYRGASMSGSVQINSGSESDLEAAVANVGPVAVAIDGESNAFRFYYSGVYD 266
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
C+S +NHAM++ GY + W+ KN W +WG+ GY+ + R N+CGIA+ A
Sbjct: 267 SSRCSSSSLNHAMVITGYGISNNQEYWLAKNSWGENWGELGYVKMARNKYNQCGIASDAS 326
Query: 306 YALI 309
Y +
Sbjct: 327 YPTL 330
>gi|432882407|ref|XP_004074015.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 330
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 165/298 (55%), Gaps = 5/298 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++K+ K Y + + ++ W+ N + HNQ+A + H +T+ NHLSD+ ++
Sbjct: 30 KQKHDKSYSNQTEMNFRRAVWEKNLHVVMKHNQQATEEKHSFTVGLNHLSDMTAEEINEK 89
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
+ + + + V P +DWR+ G + P NQ CG+C+AFS A++G
Sbjct: 90 LNGFKMEEHADYRNHTSKLSSGVSTPASVDWRKSGLVGPVTNQGLCGSCWAFSSLGALEG 149
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q+ K + LS Q +VDCS + GNLGC GG + + +YV G+ EE YPY+ +
Sbjct: 150 QLRKQRGVLVPLSPQNLVDCSTVDGNLGCGGGYITKSYSYVIRNKGVDSEEFYPYEHQNR 209
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ SS+ +LP DE AL+ +A VGP+AV++NA +F Y G+YDD
Sbjct: 210 KCRYSVTGKAGYCSSFHILPRGDEGALQAAVAAVGPVAVAVNAMLPSFHSYRGGLYDDLQ 269
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C S VNHA+L+VGY + W++KN W WG+ G++ + R +N CGIA +AVY
Sbjct: 270 CNSMLVNHAVLVVGYGSEAGQDYWLVKNSWGSGWGEEGFIRIARNKSNLCGIATFAVY 327
>gi|167427531|gb|ABZ80402.1| cathepsin L6, partial [Fasciola hepatica]
Length = 306
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 105/298 (35%), Positives = 156/298 (52%), Gaps = 16/298 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL Y L N +DL
Sbjct: 5 KRMYNKEY-NGADDEHRRNIWEQNVKHIEEHNLRHDLGLVTYKLGLNQFTDLTFEEFKAK 63
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ + S + + E N+ +P +DWR+ G++T +Q CG+C+AFS
Sbjct: 64 YLMEMSPESES-LSDGISYEAEGND---VPASIDWRQYGYVTEVKDQGGCGSCWAFSTTG 119
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VDCS I GN GC GG +R Y++ GL E YPYK
Sbjct: 120 AIEGQYVKKFQTRVSFSEQQLVDCSTIPGNHGCRGGGMRRAYEYLK-KNGLEPESSYPYK 178
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C++K + +++ ++ +E LK + GP +V+++ P F +Y SGIY
Sbjct: 179 AVEGQCQYKSDLALAKVTNSQLVRSGNETQLKNLIGAEGPASVAVDVKPD-FSMYRSGIY 237
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+ C+S +NHA+L VGY WI+KN W WG+ GY+ + R NN CGIA+
Sbjct: 238 QSQTCSSRRMNHAVLAVGYGTEGGMDYWIVKNSWGPRWGEAGYIRMARNRNNMCGIAS 295
>gi|344250850|gb|EGW06954.1| Cathepsin R [Cricetulus griseus]
Length = 279
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/283 (34%), Positives = 152/283 (53%), Gaps = 17/283 (6%)
Query: 39 KKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL---THSRIRRTLVRSPESNES 95
K I HN+E G +G+ + N D+ + K M + TH+ + +R E+
Sbjct: 2 KMIKHHNEENSLGKNGFIMEINEFGDMTVEEFRKTMNYIPVRTHTEGKS--IRKREAGG- 58
Query: 96 VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI 155
++P ++WR+KG++T Q C +C+AF++ AI+GQ+FK T + LS+Q +VDCS
Sbjct: 59 -VLPKSVNWRKKGYVTSVKKQAYCNSCWAFAVNGAIEGQMFKKTGNLTRLSVQNLVDCSK 117
Query: 156 ISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
GN GC G YV GG+ E YPY+GK+ C++ +I+ + L P+
Sbjct: 118 PHGNNGCDWGDPYIAYEYVLHNGGVEAEATYPYEGKEGPCRYNPKYSAANITGFVSL-PK 176
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY------- 268
E +L +AT+GPI+ I+ + F Y GI+ D C +D VNH +L+VGY
Sbjct: 177 SEESLMAAVATIGPISAGIDIASDFFMFYKKGIFYDPKCHNDTVNHVVLVVGYGFEGNET 236
Query: 269 -TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
N W++KN + WG GYM + K NN C IA+YA Y ++
Sbjct: 237 DGNNYWLVKNSYGKKWGLRGYMKIAKDQNNHCAIASYAHYPIV 279
>gi|194741328|ref|XP_001953141.1| GF17358 [Drosophila ananassae]
gi|190626200|gb|EDV41724.1| GF17358 [Drosophila ananassae]
Length = 329
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 102/310 (32%), Positives = 162/310 (52%), Gaps = 12/310 (3%)
Query: 5 EWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSD 64
+W +F +KKY+ + D+ + ++ N + I HN++ + G + + N +D
Sbjct: 27 DWSYFKLFNEKKYQDE----TVDAVRFEIFKENRQYIAQHNRKWENGEVSFRIDINEYAD 82
Query: 65 LHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
L H EM R + +P+++DWR G +TP NQ+ C +C+A
Sbjct: 83 L-LNHEFNEMRNGNDQNGRSIKGSTFIPPMGFTLPENIDWRSLGAVTPVKNQKYCDSCWA 141
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
F+ A A++GQ F+ T + LS Q ++DCS+ N GC G ++ GG+ E
Sbjct: 142 FAAAGALEGQHFRHTGNLVSLSEQNLLDCSV--SNNGCKSGLASRAFLDIKKEGGIATES 199
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
Y Y+ Q +C+ + + L +E L + +ATVGPIAVS+++S F LY
Sbjct: 200 SYSYEATQKLCRLNNATFGASDTGFVQLEFGNETQLAIAVATVGPIAVSLDSSSILFHLY 259
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTR----NSWILKNWWSHHWGDNGYMYLKR-GNNRCG 299
SGIYD+ C+ + HA L+VGY + W++KN W HWG+NGY+ ++R N+CG
Sbjct: 260 HSGIYDNPLCSDTNLIHAALVVGYGTEKGIDYWLVKNSWGEHWGENGYIKMRRNAGNQCG 319
Query: 300 IANYAVYALI 309
IA AVY L+
Sbjct: 320 IATKAVYPLL 329
>gi|208972994|dbj|BAG74346.1| silicatein-G1 [Ephydatia fluviatilis]
Length = 347
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 156/303 (51%), Gaps = 17/303 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT--- 75
+ Y K D +K W++N I HN++A++ HG+ L+ N D+ + +
Sbjct: 44 RRYLDKGEDQQKFSIWKANMDYIEKHNKDAEK--HGFVLKMNKFGDMTRNEFFSDYQCVQ 101
Query: 76 ------RLTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ HS R+ + L +P+ +DWR+ G +T +Q CGA YAFS
Sbjct: 102 SIGPEWKDVHSMKSAYGSRASQYKAVNLSLPESIDWRDAGIVTGVKDQLRCGASYAFSTV 161
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G + +LS Q VVDCS GN GC GS+ NT Y+ GG+ YPY
Sbjct: 162 GALEGMNALGKGSLVKLSEQNVVDCSGPYGNHGCTCGSVINTYLYIIDNGGIDTASSYPY 221
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K KQ CKF N+ + + ++ E L +AT GP+AV ++A+ + FQ Y+ GI
Sbjct: 222 KSKQYYCKFSTSNVGAKATGFVLISSGSESDLMSAVATAGPVAVHVDANSYAFQFYSDGI 281
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
D C+S ++H +L+VGY ++ W++KN W +WG NGY + R N+CGIA
Sbjct: 282 LDVVYCSSTNLSHTVLVVGYGTYKNKDYWLIKNSWGPNWGINGYAMMARNRYNQCGIATA 341
Query: 304 AVY 306
A +
Sbjct: 342 ASF 344
>gi|340370388|ref|XP_003383728.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 398
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 130/219 (59%), Gaps = 7/219 (3%)
Query: 96 VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI 155
V PD +DWREKG +TP +Q CG+C+AFS +++GQ F +T + LS QQ+VDCS+
Sbjct: 181 VKAPDTVDWREKGAVTPIKDQGQCGSCWAFSAIGSLEGQHFINTGNLVSLSEQQLVDCSL 240
Query: 156 ISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
N GC GG L Y++ G E DYPY K C++ V ++ ++ LP
Sbjct: 241 --KNDGCNGGMLSTAFKYIESVAGEESETDYPYTAKNGTCQYDPSKAVAKVTGYTALPSG 298
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRN 271
DE +L + + GPI+V I+AS +FQLY+ G+Y +++C+ ++H +L+VGY T +
Sbjct: 299 DEDSLNDAVTSKGPISVCIDASHKSFQLYSEGVYYEKSCSYFLLDHCVLVVGYGTEDTAD 358
Query: 272 SWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W WG GY+ + R N CGIA A Y L+
Sbjct: 359 YWLVKNSWGTSWGMKGYIRMSRNRKNNCGIATNAAYPLV 397
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 81/155 (52%), Gaps = 17/155 (10%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
KY K Y K T+ ++++ W+SN K + HN + + G+T+ N +DL + K
Sbjct: 34 KYNKVYETKETELERQIIWESNKKFVENHNANSDK--FGFTVAMNEFADLDADAFSKLKK 91
Query: 76 RLTHSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+H + +N VL +P+ +DWR+KG +TP +Q CG + + I +
Sbjct: 92 IPSHP--------AQANNNKVLLTGGNVPNSIDWRKKGAVTPVSSQGQCGV-WPWPIVGS 142
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGG 165
++ Q F T + LS+QQ++DC+ I+ NL AG
Sbjct: 143 VESQYFIKTGTLVPLSVQQILDCANIT-NLDEAGA 176
>gi|10441624|gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CAL1 [Diabrotica virgifera
virgifera]
Length = 322
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 167/308 (54%), Gaps = 26/308 (8%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY----- 70
++ K Y+ + + +Q+N K I+ HN + +QGL GYT+ N +D+ P +
Sbjct: 27 QHGKVYKNPIEERVRFSVFQANLKTINEHNAKYEQGLVGYTMAVNQFADMTPEEFKAKLG 86
Query: 71 --IKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
K M ++ SR + N + +PD +DWR+KG + +Q CG+C+AFS
Sbjct: 87 MQAKNMPKIKKSRHVK--------NVNAEVPDSVDWRQKGAVLGVKDQGQCGSCWAFSAT 138
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGC-AGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ + + E LS Q+++DCS+ GN C GG + +V+ G++ E YP
Sbjct: 139 GSLEGQNYIVNGKSEPLSEQELLDCSVEYGNGDCDEGGLMTLAFEFVE-ENGIVSEASYP 197
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y+ Q C+ V+ I ++ + P +E AL+ + TVGPI+ +I A P Q ++SG
Sbjct: 198 YEAIQGDCRTTNDKAVLHIQGYNEVYPSEE-ALRQAVGTVGPISAAIWAEP--IQFFSSG 254
Query: 248 IYDDEACTS--DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
IYDD C + +Y++H +L+VGY + WI+KN W WG+ GY LKR CG+A
Sbjct: 255 IYDDPNCLNYVEYLDHGILVVGYGEENGTPYWIVKNSWGATWGEEGYFRLKRNIALCGLA 314
Query: 302 NYAVYALI 309
A Y ++
Sbjct: 315 QMASYPVL 322
>gi|289741839|gb|ADD19667.1| cysteine proteinase cathepsin L [Glossina morsitans morsitans]
Length = 365
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/273 (35%), Positives = 148/273 (54%), Gaps = 11/273 (4%)
Query: 47 EAQQGLH-GYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR----SPESNESVLIPDH 101
EA+ LH GY L N +DL ++ ++T S V+ + + N + +PD
Sbjct: 94 EAENQLHAGYELALNAFADLTKEEFLSQLTGNHKSPQAEAKVKNRRLALKLNTTAKLPDS 153
Query: 102 LDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNL- 160
DWRE G +TP Q CG+C+AF++ A++G F+ + ++ LS Q +VDC + L
Sbjct: 154 FDWREHGAVTPVKFQGKCGSCWAFAVTGALEGHSFRKSGKLINLSEQNLVDCGEKAYGLD 213
Query: 161 GCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHAL 220
GC GG ++ G+ Y Y K++ C +++ ++ +SV+PP DE +
Sbjct: 214 GCDGGYQEYGFEFISRQNGVAHGAKYLYVDKKNTCSYRKTFKAAELKGFSVIPPNDEETM 273
Query: 221 KVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILK 276
K +AT+GP+A SINA T LY GIY DE C D NH++L+VGY ++ WI+K
Sbjct: 274 KKVVATLGPLACSINA-LETLLLYKKGIYADEECNKDEPNHSVLVVGYGTEDDQDYWIVK 332
Query: 277 NWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
N W + WG+ GY L RG N C IA+ Y ++
Sbjct: 333 NSWDNVWGEEGYFRLPRGKNFCKIASECSYPVL 365
>gi|197205900|gb|ACH48003.1| cathepsin [Latrunculia oparinae]
Length = 351
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/294 (34%), Positives = 155/294 (52%), Gaps = 24/294 (8%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM------------TRLTHSRIR- 83
N I +HN A + G+TL N D+ Y + M +R T +
Sbjct: 61 NTAYIPSHNSYADK--FGFTLAMNKYGDMTSEEYSQSMRCVEKFLLPSSPSRPTQPPVSG 118
Query: 84 -RTL--VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
RT + +P + E+V P+ +DWR KG ++ +Q C +CYAF+ +A++G +T
Sbjct: 119 IRTCHSLMAPNTPETVC-PEEVDWRTKGAVSAVKDQGRCKSCYAFATTAALEGMHALATG 177
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
+ LS Q V+DCS+ GN GC+GGS T+ Y GG+ YPY G+Q +CKF
Sbjct: 178 RLVPLSEQNVIDCSVPYGNRGCSGGSRMATIMYAVDNGGIDGTSSYPYLGRQYLCKFTEE 237
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
+I + + E LK +A VGP+ V++++ +FQ YASGIY + +C+ +
Sbjct: 238 SIATGCTGMVRIKRGKEQDLKKAVAVVGPVTVAVDSRHTSFQFYASGIYSEPSCSRTKLT 297
Query: 261 HAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
H ++++GY S W+LKN W WG++GY+ + R N+CGIA A+Y I
Sbjct: 298 HTLIIIGYGSKSGHDYWLLKNSWGTSWGEDGYIMMSRNYANQCGIATKAMYTTI 351
>gi|195984441|gb|ACG63793.1| silicatein A1 [Latrunculia oparinae]
Length = 329
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/299 (35%), Positives = 157/299 (52%), Gaps = 10/299 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ Y D K L SN K I HN A + GY+L NH DL + ++
Sbjct: 32 KNRHGMSYESDLHDLDKHLVRLSNKKFIELHN--ANSHIFGYSLAMNHFGDLTDLEWNEK 89
Query: 74 M-TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
T + S T V + +S P+ +DWR KG +T +Q CGA YAFS +A++
Sbjct: 90 YGTYSSSSAGNYTKVFKADPYQS--YPESVDWRTKGAVTSVKDQSQCGASYAFSAMAALE 147
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
G +T + LS Q ++DCS+ GN GC GG++ YV G+ YP+ GKQ
Sbjct: 148 GANALATDTLVNLSEQNLIDCSVPYGNHGCKGGNMLYAFKYVIANEGVDTANSYPFYGKQ 207
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
S C + V IS + E L +A VGP+AV+I+ S + F+ Y+SG+YD
Sbjct: 208 SSCVYNEKYAAVKISGMVRISQGSESDLLGAVANVGPVAVAIDGSSNAFRFYSSGVYDSS 267
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
C+S +NHA+++ GY S W++KN W +WG+ GY+ + RG N+CGIA+ A Y
Sbjct: 268 RCSSSKLNHAIVVTGYGSYSGKKYWLVKNSWGKNWGNYGYIMMARGKYNQCGIASDASY 326
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 324
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 163/300 (54%), Gaps = 12/300 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y+ + ++ + SN +I HNQ +GL Y + N +DL P +++
Sbjct: 30 FSKSYQNVVEEKRRFNIFLSNLLRIEEHNQNFSRGLSTYEMGVNKFADLTPEEFMERFRP 89
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L ++ + L + N +P +DW ++G +T +Q CG+C+AFS +++ F
Sbjct: 90 LRKTK-PKFLSEQAKFNFDGDLPAEVDWTKQGAVTEVKSQGSCGSCWAFSTTGSVESHNF 148
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
T ++ LS QQ+VDC + N GCAGG + L Y++ A G+M E+DYPY+ + + C+
Sbjct: 149 IKTGKLISLSEQQLVDC--VKNNSGCAGGWMDIALEYIE-ADGIMSEDDYPYEERNTTCR 205
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC-- 254
F V I S+ + DE L+ +A GP++V+I + FQLYA GI +D C
Sbjct: 206 FNNSKAAVQIKSYKAIKKNDEIDLQKAVALEGPVSVAIEVTI-AFQLYARGILNDPQCKN 264
Query: 255 TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
T + HA+L+ GY ++ WI+KN W +G +GY+ + R +N+CGIA A Y ++
Sbjct: 265 TEGDLTHAVLVTGYGSQDGKDYWIVKNSWGAEYGMDGYLRMSRNADNQCGIATRASYPVL 324
>gi|246148|gb|AAB21516.1| Cyclic Protein-2 [Rattus sp.]
Length = 247
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/221 (41%), Positives = 133/221 (60%), Gaps = 10/221 (4%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
IP +DWREKG +TP NQ CG+C+AFS + ++GQ+F T ++ LS Q +VDCS
Sbjct: 27 IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQ 86
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC GG + Y++ GGL EE YPY+ K CK++ V + + + V PQ E
Sbjct: 87 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGF-VDIPQQE 145
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----- 272
AL +ATVGPI+V+++AS + Q Y+SGIY + C+S ++H +L+VGY
Sbjct: 146 KALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNK 205
Query: 273 ---WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG +GY+ + K NN CG+A A Y ++
Sbjct: 206 DKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 246
>gi|15593255|gb|AAL02223.1|AF410883_1 cysteine protease CP19 precursor [Frankliniella occidentalis]
Length = 334
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 156/300 (52%), Gaps = 7/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y ++ + ++ N +I HN G + + N +D+H +++
Sbjct: 35 HAKTYANAVEEAYRAKVFKENAIRIAKHNDLFASGEVTFKVGYNQYADMHTHEVTEKLNG 94
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
+ + SN+S +DWR KG TP +Q CG+C++FS +++GQ+F
Sbjct: 95 YRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAATPIKDQGQCGSCWSFSATGSLEGQLF 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSIC 195
+ LS Q +VDCS GN GC GG + + YV+ GG+ EE YPY C
Sbjct: 155 LKNKNLVSLSEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSNGGIDTEESYPYTAVDGDSC 214
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
++ N + + + + E AL+ + VGP++V+I+AS +FQ+Y+SGIY + AC+
Sbjct: 215 LYRAANNAGVNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNWSFQMYSSGIYYESACS 274
Query: 256 SDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
SDY++H +L VGY + WI+KN W WG+ GY+ + R N CGIA A Y L+
Sbjct: 275 SDYLDHGVLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 334
>gi|167427523|gb|ABZ80398.1| cathepsin L3, partial [Fasciola hepatica]
Length = 306
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 107/305 (35%), Positives = 159/305 (52%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN +GL Y L N +DL
Sbjct: 5 KRMYNKEY-NGADDEHRRNIWEKNVKHIEEHNLRHDRGLVTYKLGLNQFTDLTFEEFKAK 63
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ ++ S + + E N+ +P +DWRE G++T +Q CG+C+AFS
Sbjct: 64 YLMEMSLVSES-LSDGISYEAEGND---VPASVDWREYGYVTEVKDQGQCGSCWAFSAVG 119
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ + S QQ+VDC+ GN GC GG + N Y++ + GL DYPY+
Sbjct: 120 AIEGQYLRKFQNQTLFSEQQLVDCTRRFGNHGCGGGWMENAYKYLKNS-GLETASDYPYQ 178
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G + C++++ V ++ + DE L + GP AV+++A F +Y SGI+
Sbjct: 179 GWEYQCQYRKELGVAKVTGAYTVHSGDEMKLMQMVGREGPAAVAVDAQS-DFYMYESGIF 237
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ CTS V HA+L VGY S WILKN W WG++GYM R NN C IA+ A
Sbjct: 238 QSQTCTSRSVTHAVLAVGYGTESGTDYWILKNSWGKWWGEDGYMRFARNRNNMCAIASVA 297
Query: 305 VYALI 309
++
Sbjct: 298 SVPMV 302
>gi|391333248|ref|XP_003741031.1| PREDICTED: uncharacterized protein LOC100898636 [Metaseiulus
occidentalis]
Length = 642
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 163/294 (55%), Gaps = 19/294 (6%)
Query: 25 ATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR 84
A DS ++ ++ N I+ HN Y + + +D P +EM + I
Sbjct: 353 AEDSMRRRIFEKNVAMINGHNLLHDLKRVSYRMGLSRFTDSTP----EEMRAMRCLNINV 408
Query: 85 TLVRSP------ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS 138
++ ++ ES + + +DWR++G++TP NQ +CG+C+AFS A++GQ FK+
Sbjct: 409 SMTTGGPHEEVFDAIESSDLSEAIDWRQQGYVTPVKNQGNCGSCWAFSATGAVEGQHFKA 468
Query: 139 TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK 198
T +E LS Q +VDC + + GC GG Y++ GG+ E+ YPY+ C+F+
Sbjct: 469 TGRLESLSEQNLVDC--VKESKGCDGGFFEQAFQYIKDNGGINTEDSYPYEAFDGSCRFR 526
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
+I +S + +P E L+ ++T+GPI+V+I+ S +FQ Y G+Y + +C+S
Sbjct: 527 EDSIGATVSGYQTIPKGSEADLQKAVSTIGPISVAIDVSNPSFQNYREGVYYEPSCSSSN 586
Query: 259 VNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR--GNNRCGIANYAVY 306
++HA+L+VGY + W++KN W +G+ GY+ + R GNN CGIA+ A Y
Sbjct: 587 LDHAVLVVGYGSDGGEDYWLVKNSWGTSFGEQGYVRMARNKGNN-CGIASAAAY 639
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 160/288 (55%), Gaps = 8/288 (2%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTL 86
+S ++ ++ N I+ HN Y + + L+D P ++ + L + +T
Sbjct: 35 ESMRRRIFEKNVAMINAHNLLHDLKQVSYRMGLSRLTDATPAE-VQALKCLNFTLPNKTS 93
Query: 87 VRSPESN-ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEEL 145
+S + +P+ +DW ++G++TP +Q CGAC+ F+ AI+GQ FK+T + L
Sbjct: 94 RKSTLGTLQRQDLPEAVDWTQQGYVTPVKDQGKCGACWTFAATGAIEGQHFKATGNLVSL 153
Query: 146 SIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVD 205
S Q ++DC + + GC+GG +Y++ +GG+ EE YPY+ C+F++ ++
Sbjct: 154 SEQNILDCVKTATSNGCSGGLFVEAFDYLKNSGGIDAEESYPYEASGGTCRFRQDSVAAT 213
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+S + + +E L+ +AT+GPI+V I++ FQ Y GIY + CT ++++HA+L+
Sbjct: 214 VSGYQAISAGNEAELQEAVATIGPISVGIDSGHPGFQHYTGGIYYEPECT-EHLSHAVLV 272
Query: 266 VGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
VGY T N W++KN W +G GY+ + R NN CGIA A Y +
Sbjct: 273 VGYGTENGEDYWLVKNSWGASYGLQGYIKMARNRNNNCGIATGAAYPI 320
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 161/312 (51%), Gaps = 26/312 (8%)
Query: 11 IFPQKKYK-----------KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRE 59
IF QK+Y+ K Y S+ + +Q N + NQ+ + G +
Sbjct: 22 IFSQKQYQTAFQNWMVKHQKSYTNDEFGSRYSV-FQDNMDIVAKWNQKGSNTILGLNVMA 80
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDC 119
+ ++ + Y+ +T+ + +TLV +P +DWR G +T NQ C
Sbjct: 81 DLTNEEFKKLYLGTKANVTYKK--KTLVGVSG------LPASVDWRANGAVTAVKNQGQC 132
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
G CYAFS +++G ++ ++ LS QQ++DCS GN GC GG + N+ Y+ GG
Sbjct: 133 GGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGG 192
Query: 180 LMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
L E YPY G+ CKF + NI I+ + + E L+ +A P++V+I+AS
Sbjct: 193 LDTEASYPYTGEVGKCKFNKKNIGATITGYKNVESGSESDLQTAVAAQ-PVSVAIDASQS 251
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG- 294
+FQLYASG+Y + C+S ++H +L VGY S WI+KN W WG+NG++ + R
Sbjct: 252 SFQLYASGVYYEPECSSTQLDHGVLAVGYGSQSGQDYWIVKNSWGADWGENGFILMARNK 311
Query: 295 NNRCGIANYAVY 306
+N CGIA A +
Sbjct: 312 DNNCGIATMASF 323
>gi|211909242|gb|ACJ12894.1| cathepsin L1D [Fasciola hepatica]
Length = 326
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 157/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCGGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY+ + C++ R V ++ + L +E LK + + GP AV+++ F +Y
Sbjct: 194 SYPYRAVEGQCRYNRQLGVAKVTGYYTLHSGNEAGLKSLVGSEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
SGIY + C+ +NHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 RSGIYQSQTCSPLGLNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|195729975|gb|ACG50798.1| cathepsin L1 [Fascioloides magna]
Length = 327
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 157/305 (51%), Gaps = 15/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++KY K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRKYNKEY-NGADDEHRRNIWEQNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ E++ + + + N+ +P+ +DWR+ G++T +Q CG+C+AFS
Sbjct: 84 YLFEISPKSELLSHSGISYQAKGND---VPESIDWRDYGYVTEVKDQGQCGSCWAFSSTG 140
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ K S QQ+VDC+ GN GC GG + Y++ GL E YPY+
Sbjct: 141 AMEGQYIKKFRTTVSFSEQQLVDCTRNYGNSGCNGGWMERAFEYLR-RNGLETESSYPYR 199
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
C+++ V ++ + +E +L + GP+AV+++ F +Y SGIY
Sbjct: 200 AVDDHCRYESQLGVAKVTGYYTEHSGNEVSLMNMVGGEGPVAVAVDVQS-DFSMYKSGIY 258
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
E C++ YVNHA+L VGY S WILKN W WGD GY+ R NN CGIA+YA
Sbjct: 259 QSETCSTYYVNHAVLAVGYGTESGTDYWILKNSWGSWWGDQGYIRFARNRNNMCGIASYA 318
Query: 305 VYALI 309
++
Sbjct: 319 SVPMV 323
>gi|21263041|gb|AAM44832.1|AF510856_1 cathepsin L2 [Fasciola gigantica]
Length = 326
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 156/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN+GC GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNMGCMGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
+ GIY C+S +VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 SGGIYQSRTCSSLHVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 94/301 (31%), Positives = 165/301 (54%), Gaps = 16/301 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
++ K+Y + + + + N K+ H++ QG + L N +D+ ++ +
Sbjct: 33 QHSKNYDSETEERFRMKIFMENAHKVAKHSKLFSQGFVKFKLGLNKYADMLHHEFVSTLN 92
Query: 76 RLTHSRIRRTLVRSPESNESVL--------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
++ + +++ + N++V +PD +DWR+KG +T +Q CG+C++FS
Sbjct: 93 GF--NKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSG 150
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+ +++GQ F+ T ++ LS Q +VDCS GN GC GG + N Y++ GG+ E+ YP
Sbjct: 151 SGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYP 210
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + C +K N + + +E LK +ATVGPI+++I+AS TFQLY+ G
Sbjct: 211 YLAEDEKCHYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPISIAIDASYETFQLYSDG 270
Query: 248 IYDDEACTSDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
+Y D C S ++H +L+VGY ++ W++KN W G NGY+ + R +N CG+A
Sbjct: 271 VYSDPECISQELDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLNGYIKMARNQDNMCGVA 330
Query: 302 N 302
+
Sbjct: 331 S 331
>gi|333830589|gb|AEG20937.1| ctsk [Danio rerio]
Length = 284
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 140/276 (50%), Gaps = 7/276 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+K++Y +S ++ W+ N HN+E + G+H Y L NH D+ +++
Sbjct: 11 HKREYNGLNEESIRRTIWEKNMLFFEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKVMG 70
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L R + +P +D+R+ G++T NQ CG+C+AFS A++GQ+
Sbjct: 71 LQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLM 130
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+ ++ +LS Q +VDC ++ N GC GG + N YV G+ EE YPY G C
Sbjct: 131 KTKGQLVDLSPQNLVDC--VTENDGCGGGYMTNAFRYVSNNQGIDSEESYPYVGTDQQCA 188
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + + +P +E AL +A VGP++V I+A TF Y SG+Y D C
Sbjct: 189 YNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNCNK 248
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNG 287
+ VNHA+L VGY + WI+KN W WG G
Sbjct: 249 EDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKG 284
>gi|146386358|gb|ABQ23967.1| cathepsin L [Oryctolagus cuniculus]
Length = 246
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/238 (39%), Positives = 135/238 (56%), Gaps = 9/238 (3%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVRSPES 92
W+ N + I HN E QG HG+T+ L + + + M + + ++ + R P
Sbjct: 13 WEKNMRMIELHNGEYSQGKHGFTMGSERLRHMTNEEFRQVMNGFQNQKHKKGKMFRDP-- 70
Query: 93 NESVLI--PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
+L+ P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q +
Sbjct: 71 ---LLLQYPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNL 127
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWS 210
VDCS GN GC GG + YV+ GL EE YPY+G CK+K P V +
Sbjct: 128 VDCSHPQGNQGCNGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYK-PECSVANDTGF 186
Query: 211 VLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
V P E AL LATVGPI+ +I+A +FQ Y SGIY D C+S ++H +L+VGY
Sbjct: 187 VDIPGHEKALLRALATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGVLVVGY 244
>gi|211909240|gb|ACJ12893.1| cathepsin L1D [Fasciola hepatica]
Length = 326
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 108/306 (35%), Positives = 158/306 (51%), Gaps = 18/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ E+ R S I + SN +V PD +DWRE G++T +Q +CG+C+AFS
Sbjct: 84 YLTEIPRA--SDILSHGIPYEASNRAV--PDKIDWRESGYVTGVKDQGNCGSCWAFSTTG 139
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E YPY
Sbjct: 140 TMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCGGGLMENAYEYLKQF--GLETESSYPY 197
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C++ R V ++ + L +E LK + + GP AV+++ F +Y SGI
Sbjct: 198 RAVEGQCRYNRQLGVAKVTGYYTLHSGNEAGLKSLVGSEGPAAVAVDVESD-FMMYRSGI 256
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y + C+ +NHA+L VGY WI+KN W WG+ GY+ + R N CGIA+
Sbjct: 257 YQSQTCSPLGLNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASL 316
Query: 304 AVYALI 309
A ++
Sbjct: 317 ASLPMV 322
>gi|60649669|gb|AAH90560.1| LOC594890 protein, partial [Xenopus (Silurana) tropicalis]
Length = 355
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 103/302 (34%), Positives = 162/302 (53%), Gaps = 10/302 (3%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI-KE 73
+ +KK Y+ + + ++L W+ K I HN E GLH Y + NHL D+ K+
Sbjct: 57 QTHKKIYKNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMVAEEMTDKQ 116
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M + T V S S P+ +DWR K +T +Q C A +AFS A++
Sbjct: 117 MNFIPQVIANITDVPVEISKSSP--PESIDWRNKNCVTSVKDQGSCIASWAFSSIGALEC 174
Query: 134 QIFKS-TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
Q K T ++E LS+Q ++DCS GN GC GG + ++ Y+ G+ E +YPY+GK
Sbjct: 175 QNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGGWVVSSFRYI-IDNGIELESNYPYQGKD 233
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + +S+ LP DE LK + +GP++V+I+AS TF++Y +G+Y D
Sbjct: 234 GKCSYTPVKKASVCTSYRQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYYDP 293
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+S +H++L+VGY W++KN W +GD GY+ + R +N CGIAN+ +
Sbjct: 294 NCSSSTPDHSVLVVGYGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGIANFGCFP 353
Query: 308 LI 309
++
Sbjct: 354 VV 355
>gi|358334193|dbj|GAA43174.2| cysteine proteinase 3, partial [Clonorchis sinensis]
Length = 374
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 163/316 (51%), Gaps = 30/316 (9%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHP--- 67
+K++ ++ +K TDS+++++ + + ++ HN+ ++G + N SD P
Sbjct: 68 EKFRVEFNRKYTDSQEQINRLNVFCQSFMRVREHNKAYEEGRVTFKRGINEFSDRFPDER 127
Query: 68 RHYIK---EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
+H +++ + S R+ +P+S +DWR G +TP Q DCGAC+A
Sbjct: 128 QHACGGRINISKHSGSTFRKVAAPAPQS---------IDWRRNGAVTPVRRQGDCGACWA 178
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
F+ AI+G+ F +E S QQ+VDC GC GG YV+ GGL E
Sbjct: 179 FAATGAIEGRYFIFEKRLETFSPQQLVDCIQGDTTNGCNGGYPSEAFEYVENVGGLELER 238
Query: 185 DYPYKGKQS-----ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
DYPY + C + + V ++S +LP DE AL ++ GPIA+ +AS
Sbjct: 239 DYPYVSVATGLPNPFCGYDQTKQQVKLTSHVILPSGDEEALLQAVSIYGPIAILFDASHP 298
Query: 240 TFQLYASGIYDDEAC--TSDYVNHAMLLVGYTRN----SWILKNWWSHHWGDNGYMYLKR 293
+F+ Y S IY +E C T D V HAML+VGY W++KN W WG+ GYM ++R
Sbjct: 299 SFKDYESDIYSEENCGTTLDDVTHAMLVVGYGEELGEPYWLVKNSWGDKWGEKGYMRVRR 358
Query: 294 GNNRCGIANYAVYALI 309
G N C +A ++ Y L+
Sbjct: 359 GVNMCAVAGFSSYPLM 374
>gi|74152091|dbj|BAE32077.1| unnamed protein product [Mus musculus]
Length = 245
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 100/247 (40%), Positives = 139/247 (56%), Gaps = 14/247 (5%)
Query: 74 MTRLTHSRIRR----TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ R+ RI R T+ SN + +PD +DWREKG +T Q CGAC+AFS
Sbjct: 2 LCRMGALRIPRQSPKTVTFRSYSNRT--LPDTVDWREKGCVTEVKYQGSCGACWAFSAVG 59
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ + YP
Sbjct: 60 ALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYP 119
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
YK C + N S + LP DE ALK +AT GP++V I+AS +F Y SG
Sbjct: 120 YKATDEKCHYNSKNRAATCSGYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSG 179
Query: 248 IYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIAN 302
+YDD +CT + VNH +L+VGY ++ W++KN W ++GD GY+ + R N N CGIA+
Sbjct: 180 VYDDPSCTGN-VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIAS 238
Query: 303 YAVYALI 309
Y Y I
Sbjct: 239 YCSYPEI 245
>gi|4574304|gb|AAD23996.1|AF112566_1 cathepsin [Fasciola gigantica]
Length = 326
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 156/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTELKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN+GC+GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNMGCSGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
+ GIY C+S VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 SGGIYQSRTCSSLRVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|313754424|pdb|3OF8|A Chain A, Structural Basis For Reversible And Irreversible
Inhibition Of Human Cathepsin L By Their Respective
Dipeptidyl Glyoxal And Diazomethylketone Inhibitors
gi|313754425|pdb|3OF9|A Chain A, Structural Basis For Irreversible Inhibition Of Human
Cathepsin L By A Diazomethylketone Inhibitor
Length = 221
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 3 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 62
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + +P Q E
Sbjct: 63 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EK 121
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 122 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 181
Query: 273 --WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 182 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221
>gi|15593249|gb|AAL02221.1|AF410881_1 cysteine protease CP10 precursor [Frankliniella occidentalis]
Length = 334
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 97/300 (32%), Positives = 156/300 (52%), Gaps = 7/300 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+ K Y ++ + ++ N +I HN G + + + +D+H +++
Sbjct: 35 HAKTYANTVEEAYRAKVFKENAIRIAKHNDLFASGEVTFKVGYSQYADMHTHEVTEKLNG 94
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
+ + SN+S +DWR KG +TP +Q CG+C++FS +++GQ+F
Sbjct: 95 YRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCWSFSATGSLEGQLF 154
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSIC 195
+ LS Q +VDCS GN GC GG + + YV+ GG+ EE YPY C
Sbjct: 155 LKNKNLVSLSEQNLVDCSWDFGNEGCNGGLMDSAFEYVESNGGIDTEESYPYTAVDGDSC 214
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+K N + + + + E AL+ + GP++V+I+AS +FQ+Y+SGIY + AC+
Sbjct: 215 LYKAANNAGVNTGYKDVQAKSESALRDAVEKAGPVSVAIDASNWSFQMYSSGIYYESACS 274
Query: 256 SDYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
SDY++H +L VGY + WI+KN W WG+ GY+ + R N CGIA A Y L+
Sbjct: 275 SDYLDHGVLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIKMARNKKNNCGIATEASYPLV 334
>gi|405977172|gb|EKC41635.1| Cathepsin L [Crassostrea gigas]
Length = 224
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 136/225 (60%), Gaps = 9/225 (4%)
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
P SN + +P+ ++W ++G++TP +Q C +C+AF+ ++GQ F+ T ++ LS Q
Sbjct: 4 PPSN--LRLPNAVNWTKEGYVTPVKDQGFCLSCWAFAATGGLEGQHFRKTKKLVRLSEQN 61
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
+VDCS NLGC G+ N LNY+ GGL KE YPY GK C+F+ + +
Sbjct: 62 LVDCS--KENLGCTAGTPENALNYIARNGGLDKEVSYPYVGKNGRCRFRPTEVGANCQGI 119
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P DE +L+ + ++GPI VS++AS +F+ Y +G+YDD+AC+ + H L+VGY
Sbjct: 120 VHVPAGDELSLQKAVGSIGPIVVSLDASKRSFKQYRNGVYDDKACSKKLITHYALIVGYG 179
Query: 269 ---TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W++KN W WG +GYM L R +N CGIAN A+Y +
Sbjct: 180 EFQNKKYWLIKNSWGTSWGMDGYMMLARNQDNICGIANQAIYPSV 224
>gi|333830587|gb|AEG20936.1| ctsk [Danio rerio]
Length = 284
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 141/276 (51%), Gaps = 7/276 (2%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+K++Y +S ++ W+ N I HN+E + G+H Y L NH D+ +++
Sbjct: 11 HKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEEVAEKVMG 70
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
L R + +P +D+R+ G++T NQ CG+C+AFS A++GQ+
Sbjct: 71 LQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLM 130
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
K+ ++ +LS Q +VDC ++ + GC GG + N YV G+ EE YPY G C
Sbjct: 131 KTKGQLVDLSPQNLVDC--VTEDDGCGGGYMTNAFRYVSNNQGIDSEESYPYVGTDQQCA 188
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS 256
+ + + +P +E AL +A VGP++V I+A TF Y SG+Y D C
Sbjct: 189 YNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNCNK 248
Query: 257 DYVNHAMLLVGY-----TRNSWILKNWWSHHWGDNG 287
+ VNHA+L VGY + WI+KN W WG G
Sbjct: 249 EDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKG 284
>gi|20136379|gb|AAM11647.1|AF490984_1 cathepsin L, partial [Fasciola hepatica]
Length = 311
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 159/306 (51%), Gaps = 18/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 10 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 68
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+R S I V +N +V PD +DWRE G++T +Q +CG+C+AFS
Sbjct: 69 YLTEMSRA--SDILSHGVPYEANNRAV--PDKIDWRESGYVTEVKDQGNCGSCWAFSTTG 124
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
++GQ K+ S QQ+VDCS GN GC+GG + N Y+ QF GL E YPY
Sbjct: 125 TMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF--GLETESSYPY 182
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ C++ + V ++ + + E LK + GP AV+++ F +Y SGI
Sbjct: 183 TAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMYRSGI 241
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y + C+ VNHA+L VGY WI+KN W +WG+ GY+ + R N CGIA+
Sbjct: 242 YQSQTCSPLRVNHAVLAVGYGTQDGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIASL 301
Query: 304 AVYALI 309
A A++
Sbjct: 302 ASVAMV 307
>gi|261824899|pdb|3H89|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824900|pdb|3H89|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824901|pdb|3H89|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824902|pdb|3H89|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824903|pdb|3H89|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824904|pdb|3H89|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 4)
gi|261824905|pdb|3H8B|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824906|pdb|3H8B|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824907|pdb|3H8B|C Chain C, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824908|pdb|3H8B|D Chain D, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824909|pdb|3H8B|E Chain E, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|261824910|pdb|3H8B|F Chain F, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors(Compound 9)
gi|317455049|pdb|2XU3|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455050|pdb|2XU4|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455051|pdb|2XU5|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009432|pdb|2YJ2|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009433|pdb|2YJ8|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009434|pdb|2YJ9|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009435|pdb|2YJB|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|358009436|pdb|2YJC|A Chain A, Cathepsin L With A Nitrile Inhibitor
Length = 220
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + +P Q E
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EK 120
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 180
Query: 273 --WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|390355503|ref|XP_001200245.2| PREDICTED: counting factor associated protein D-like
[Strongylocentrotus purpuratus]
Length = 509
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 99/292 (33%), Positives = 154/292 (52%), Gaps = 12/292 (4%)
Query: 23 KKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRI 82
K + ++K H+ N + IH+ N+ GY L NH++D + + RL +R
Sbjct: 216 KSESHVQRKGHFTKNVRMIHSINRANL----GYVLDINHMADQSHQELKRMRGRLRQTRP 271
Query: 83 RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
L +PDH+DW G ++P +Q CG+C++F A I+G +F + +
Sbjct: 272 NNGLPYDGSDVSDDAVPDHIDWNVLGAVSPVKDQAVCGSCWSFGSAETIEGAVFMQSGKR 331
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYKGKQSICKFKRPN 201
LS Q ++DC+ +GN GC GG ++ GG+ EE Y PY G+ +C + +
Sbjct: 332 VRLSQQMLMDCTWAAGNNGCDGGEEWRVYEWLMKNGGIPLEETYGPYLGQNGMCHYDKSK 391
Query: 202 IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC--TSDYV 259
V I + + ++ LK LAT GPIAV I+A+ +F Y+ G Y D +C T D +
Sbjct: 392 AVASIKKYYNVTSGNQKDLKKALATKGPIAVGIDAAVPSFSFYSYGTYYDASCGNTVDDL 451
Query: 260 NHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
+HA+L VGY +S W++KN WS HWG+NGY+ + +N CG+A A Y
Sbjct: 452 DHAVLAVGYGTDSSGQDYWLIKNSWSTHWGNNGYVAISMKDNNCGVATAATY 503
>gi|340370382|ref|XP_003383725.1| PREDICTED: silicatein-like [Amphimedon queenslandica]
Length = 369
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 155/315 (49%), Gaps = 24/315 (7%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y + +K + W SN I HN A + GYTL N DL Y
Sbjct: 54 KAKHTKSYESDIHELEKYVTWVSNSALIDAHN--ALRSSFGYTLAMNQFGDLTSSDYYYG 111
Query: 74 MTRLTH-----------------SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQ 116
L + L P+ S +P +DWR G + +Q
Sbjct: 112 TKCLYQYNYGGSQNSSATEADQDNLFNMKLFEWPKGLSSSSLPSEVDWRTMGAVGDVKDQ 171
Query: 117 EDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQF 176
C +CYAFS+ AI+G ++ + LS Q ++DCS+I GN+GC+GGS + YV
Sbjct: 172 GRCKSCYAFSVTGAIEGMEALASGKFVSLSEQNIIDCSVIYGNMGCSGGSREISFLYVID 231
Query: 177 AGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINA 236
G+ E+ YPY Q +C FK I + + + E L +A GP+ V ++
Sbjct: 232 KNGINTEQSYPYIESQYLCYFKESAIGGRATGMVRIASKSETDLMAAVAISGPVTVGVDH 291
Query: 237 SPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLK 292
+FQ Y+SGI+D+ +C+S + HAM++VGY ++ W++KN W +WG +GY+ +
Sbjct: 292 MHSSFQFYSSGIFDEPSCSSTSLTHAMVIVGYGSYNGKDYWLVKNSWGANWGMSGYIMMV 351
Query: 293 RGN-NRCGIANYAVY 306
RG N+CGIA A+Y
Sbjct: 352 RGKYNQCGIATRAIY 366
>gi|241913450|pdb|3HHA|A Chain A, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913451|pdb|3HHA|B Chain B, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913452|pdb|3HHA|C Chain C, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|241913453|pdb|3HHA|D Chain D, Crystal Structure Of Cathepsin L In Complex With
Az12878478
gi|317455045|pdb|2XU1|A Chain A, Cathepsin L With A Nitrile Inhibitor
gi|317455046|pdb|2XU1|B Chain B, Cathepsin L With A Nitrile Inhibitor
gi|317455047|pdb|2XU1|C Chain C, Cathepsin L With A Nitrile Inhibitor
gi|317455048|pdb|2XU1|D Chain D, Cathepsin L With A Nitrile Inhibitor
Length = 220
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + +P Q E
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPKQ-EK 120
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 180
Query: 273 --WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|126294322|ref|XP_001373208.1| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 329
Score = 177 bits (449), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 169/303 (55%), Gaps = 19/303 (6%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y K+Y +K +S ++ W+ N K I+ HN+ ++G Y L N D+ ++ ++
Sbjct: 36 YGKNYSEKE-ESFRRGVWEKNLKFINDHNRLHKEGKITYYLGMNAFGDMTINETVRMLSI 94
Query: 77 LTH-SRIRR-TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
T +RIRR T +++ + +P +DWRE GF+TP Q C +C+AFS +++GQ
Sbjct: 95 ATAPARIRRDTKLKTSTFPD---LPTSVDWREHGFMTPVRAQWTCASCWAFSAVGSLEGQ 151
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
+F+ + + E+S Q ++DC C GGS+ Y++ G++ E+ YPYK K++
Sbjct: 152 LFRKSGKPLEMSKQMLIDC---DRRQSCRGGSVILAFKYMK-KKGIVSEQCYPYKDKKNQ 207
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
+ V I + VLP +E AL +A VGP++V+++A +F Y G+Y + C
Sbjct: 208 SCLNKTCPVTQIKKYVVLPSGNEQALMRAVANVGPVSVTVHAI-RSFFYYRGGVYTEPKC 266
Query: 255 TSDYVNHAMLLVGY-------TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVY 306
Y+NH++L+VGY + WILKN W WG+ GYM + K NN+CGIA AVY
Sbjct: 267 KKKYINHSVLVVGYGYEKEKENKKYWILKNSWGARWGEKGYMRFQKDNNNQCGIATMAVY 326
Query: 307 ALI 309
++
Sbjct: 327 PVL 329
>gi|7271891|gb|AAF44676.1|AF239265_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 101/302 (33%), Positives = 159/302 (52%), Gaps = 10/302 (3%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-IK 72
++ Y K+Y D ++ + W+ N K I HN GL YTL N +D+ + K
Sbjct: 25 KRIYNKEYNGADDDHRRNI-WEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+T + H + E+N+ +P +DWRE G++T +Q CG+C+AFS A++
Sbjct: 84 YLTEMPHRSDILSHGIPYEANKRA-VPASIDWRESGYVTEVKDQGQCGSCWAFSTTGAME 142
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ K+ S QQ+VDCS GN GC GG + N Y++ GL E YPY+ +
Sbjct: 143 GQYMKNQRTSISFSEQQLVDCSDDFGNFGCNGGLMENACEYLKRF-GLETESSYPYRAVE 201
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C++ + V ++ + ++ DE L+ + GP AV+++ F +Y SGIY +
Sbjct: 202 GPCRYNKQLGVAKVTGYYMVHSGDEVELQNLVGIEGPAAVALDVDSD-FMMYRSGIYQSQ 260
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C+ +++NH +L VGY S WI+KN W WG+NGY+ + R N CGIA+ A
Sbjct: 261 TCSPEFLNHGVLAVGYGTQSGTDYWIVKNSWGPWWGENGYIRMVRNRGNMCGIASLASVP 320
Query: 308 LI 309
++
Sbjct: 321 MV 322
>gi|350606375|ref|NP_001076821.2| uncharacterized protein LOC594890 precursor [Xenopus (Silurana)
tropicalis]
Length = 333
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 163/304 (53%), Gaps = 14/304 (4%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
+ +KK Y+ + + ++L W+ K I HN E GLH Y + NHL D+ +EM
Sbjct: 35 QTHKKIYKNEGEELARRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDM----VAEEM 90
Query: 75 TRLTHSRIRRTLVR---SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
T + I + + P P+ +DWR K +T +Q C A +AFS A+
Sbjct: 91 TDKQMNFIPQVIANITDVPVEISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGAL 150
Query: 132 QGQIFKS-TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
+ Q K T ++E LS+Q ++DCS GN GC GG + ++ Y+ G+ E +YPY+G
Sbjct: 151 ECQNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGGWVVSSFRYI-IDNGIELESNYPYQG 209
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K C + +S+ LP DE LK + +GP++V+I+AS TF++Y +G+Y
Sbjct: 210 KDGKCSYTPVKKASVCTSYRQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYY 269
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D C+S +H++L+VGY W++KN W +GD GY+ + R +N CGIAN+
Sbjct: 270 DPNCSSSTPDHSVLVVGYGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGIANFGC 329
Query: 306 YALI 309
+ ++
Sbjct: 330 FPVV 333
>gi|332030000|gb|EGI69825.1| Cathepsin L [Acromyrmex echinatior]
Length = 328
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 168/312 (53%), Gaps = 10/312 (3%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ + EW FIF + +KK Y+ + + + N +KI HN++ + Y L N
Sbjct: 24 ILDAEW---FIF-KTHHKKIYKSSVEEGYRMKIFLDNKRKIAEHNRKYELNEVPYKLGMN 79
Query: 61 HLSDLHPRHYIKEMTRLTHSRI--RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQED 118
D+ ++ + S ++ + + S +V +P +DWR+ G +T +Q
Sbjct: 80 KYGDMLHHEFVNTLNGFNKSEKAQKQFMGATFISPANVELPKEVDWRKHGAVTEVKDQGH 139
Query: 119 CGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAG 178
CG+C+AFS +++GQ F+ T + LS Q ++DCS GN GC GG + N YV+
Sbjct: 140 CGSCWAFSTTGSLEGQHFRQTGILVSLSEQNLIDCSGNYGNEGCNGGLMDNAFKYVRDNK 199
Query: 179 GLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
GL E+ YPY+ + C++ N + + +P +EH LK +AT+GP++V+I+AS
Sbjct: 200 GLDTEKSYPYEAENDKCRYNPRNSGAIDTGFVDIPRGNEHKLKAAVATIGPVSVAIDASH 259
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRG-NNR 297
+FQLY+ G+Y D C SD ++H +L+VGY +S ++W G Y+ + R +N
Sbjct: 260 ESFQLYSEGVYFDPECDSDNLDHGVLIVGYGTDSKTGHDYW---LGKKSYIKMARNKDNH 316
Query: 298 CGIANYAVYALI 309
CGIA+ A Y L+
Sbjct: 317 CGIASSASYPLV 328
>gi|310975575|gb|ADP55136.1| truncated cathepsin L-like protein [Miichthys miiuy]
Length = 246
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 138/244 (56%), Gaps = 13/244 (5%)
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
R + + +L P E+ P +DWR+ G++TP +Q CG+C+AFS A++GQ
Sbjct: 6 RKAERKFKGSLFMEPNFLEA---PRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQH 62
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSI 194
F+ T ++ LS Q +VDCS GN GC GG + YV+ GL E+ YPY G
Sbjct: 63 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDAYPYLGTGDQP 122
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C + + + + +P EHAL +A VGP++V+I+AS +FQ Y SGIY ++ C
Sbjct: 123 CHYDPNYNSANDTGFIDVPSGKEHALMKAVAAVGPVSVAIDASHESFQFYQSGIYYEKDC 182
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAV 305
+S+ ++H +L+VGY + WI+KN WS WGD GY+Y+ K N CGIA A
Sbjct: 183 SSEELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAAS 242
Query: 306 YALI 309
Y L+
Sbjct: 243 YPLV 246
>gi|358255491|dbj|GAA57187.1| cathepsin L [Clonorchis sinensis]
Length = 368
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 164/303 (54%), Gaps = 28/303 (9%)
Query: 25 ATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR--------HYIKEMT 75
A +S K+L+ + N + HN ++G + L N +D P+ H ++
Sbjct: 76 AEESSKRLNVFCENFLYVRRHNNAYEEGTESFKLGINQFADRLPKERENICGGHIPANLS 135
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+R R+ P+S +DWR+KG +T Q CG+C+AF+ A+A++G
Sbjct: 136 SHGGARFRKIAAPPPKS---------IDWRKKGAVTSIRKQGRCGSCWAFAAAAAVEGHT 186
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI- 194
+ +++E LS QQ++DCS+ GN GC GG + Y++ +GGL ++ DYPY ++I
Sbjct: 187 YIHNNQLETLSTQQLIDCSLEYGNGGCTGGDSVTSFKYLKESGGLERDRDYPYVSDKTIR 246
Query: 195 ----CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
CKF +++ + VLP DE A+ + GP+A+S+++ +F+ Y IY
Sbjct: 247 PNPECKFDWTKCAAEVTGFVVLPYHDEDAILQAVGFYGPVAISVDSRLQSFKDYKGDIYS 306
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
D C + +H+M++VGY + WI+KN W HWG+ GY+ L+RG N CG+A+ + Y
Sbjct: 307 DPLCGKN-SDHSMVVVGYGEENGTPYWIIKNSWGEHWGEKGYLRLRRGVNMCGVASVSTY 365
Query: 307 ALI 309
L+
Sbjct: 366 PLV 368
>gi|195455845|ref|XP_002074891.1| GK22909 [Drosophila willistoni]
gi|194170976|gb|EDW85877.1| GK22909 [Drosophila willistoni]
Length = 370
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 94/279 (33%), Positives = 149/279 (53%), Gaps = 11/279 (3%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL--- 97
+ N+ +G Y L N +DL ++ ++T S + V + + V
Sbjct: 93 VDAGNEAFSKGTSTYKLAVNAFADLTNAEFLSQLTGRRKSNQGESKVAASRQSAHVQPGG 152
Query: 98 -IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
+PD DWR++G +T Q CG+C+AF+ AI+G +F+ T ++ LS Q +VDC +
Sbjct: 153 NVPDAFDWRQQGGVTSVKYQGTCGSCWAFATTGAIEGHVFRKTGKLPNLSEQNLVDCGSL 212
Query: 157 SGNL-GCAGGSLRNTLNYV-QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPP 214
L GC GG + ++ + G+ K + YPY + CK+ I+ ++ +PP
Sbjct: 213 DFGLNGCDGGYQEYAMAFINEKQRGISKSDQYPYIDNKETCKYTNSLSGAQITGFASIPP 272
Query: 215 QDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TR 270
+DE +K +AT+GP+A S+N + LY SGIY DE C D NH++L+VGY +
Sbjct: 273 KDEALMKKVIATLGPLACSLNGL-ESLLLYKSGIYADEKCNDDEPNHSVLVVGYGSEKGQ 331
Query: 271 NSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ WI+KN W +WG++GY L RG N CGIA Y ++
Sbjct: 332 DYWIIKNSWDKNWGEDGYFRLPRGKNFCGIALECSYPIV 370
>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 159/305 (52%), Gaps = 18/305 (5%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F+ Q+KY K Y+ + S ++ ++ N KI HN++ QQ L Y L N SDL
Sbjct: 24 FVTFQQKYGKVYQNDSELSVREEIFKENLAKIEEHNKQFQQNLVSYELGLNQFSDLTEAE 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVL----IPDHLDWREKGFITPDWNQEDCGACYAF 125
+ LT S + L + E S P ++W EKG +TP NQ +CG+C+ F
Sbjct: 84 F---QALLTMSPLTDQLTKQMEKYNSEFDIKTAPVSVNWAEKGVVTPVKNQGNCGSCWTF 140
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
+ I+ ++ T + LS QQ++DC+ + N GC GG L L YV+ A GL E++
Sbjct: 141 TTTGTIESRLALKTGSLVSLSEQQLLDCNRV--NAGCDGGVLSYALQYVESA-GLTTEDE 197
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPYK C + ++++ + E L +A GP+AV++NA Q Y+
Sbjct: 198 YPYKAWNGTCNSTHKPVAAYTKGYTLIYTRSESDLMKAVAE-GPVAVALNA--DLLQYYS 254
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
GI++ AC+S VNH L+VGY N+ WI+KN W WG+NGY + +G N CGI
Sbjct: 255 KGIFNPSACSST-VNHGGLVVGYEENATLPYWIIKNSWGATWGENGYFRMAKGYNLCGIT 313
Query: 302 NYAVY 306
+ +Y
Sbjct: 314 SQPIY 318
>gi|256535829|gb|ACU82389.1| cathepsin L 1 [Pheronema raphanus]
Length = 328
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 166/301 (55%), Gaps = 16/301 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y ++ ++L W+ N + T N+ +GL YTL N +D+ + +++ M
Sbjct: 36 HKKVYYTLIEENFRRLIWEDN---LSTFNEMNSRGL-SYTLGTNEFADMTSKEFVEIMNG 91
Query: 77 LTHS-RIRRTL-VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
RI + V ++ S+ + D +DWR KG +TP NQ CG+C+AFS +++GQ
Sbjct: 92 YKPELRIDKLEDVNEVKNYSSIKLSDSVDWRSKGAVTPVKNQGQCGSCWAFSSTGSLEGQ 151
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE---DYPYKGK 191
F + ++ S ++VDCS GN GC GG + N Y + + KEE DYPY K
Sbjct: 152 YFINNDKLLSFSESELVDCSRRYGNNGCKGGLMDNAFRYWE----VYKEELESDYPYVAK 207
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C++ + V ISS+ +P + +L+ + T+GPI+V+++AS +FQLY SG+Y +
Sbjct: 208 DGPCRYSQDKGVTTISSYKNVPHFSQISLQDAVRTIGPISVAMDASHKSFQLYHSGVYSE 267
Query: 252 EACTSDYVNHAMLLVGYTRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
C+ ++H +L+VGY +S W++KN W WG +GY + NN CG+ Y +
Sbjct: 268 SECSQTKLDHGVLVVGYGTSSEPFWLVKNSWGAGWGMDGYFEIAMRNNMCGLETEPSYPI 327
Query: 309 I 309
+
Sbjct: 328 L 328
>gi|146147376|gb|ABQ01982.1| cathepsin [Fasciola gigantica]
Length = 326
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 155/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN G YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHYLGFVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN+GC GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNMGCMGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
+ GIY C+S VNHA+L VGY S WI+KN W WG+ GY+ + R N CG
Sbjct: 253 SGGIYQSRTCSSLRVNHAVLAVGYGTQSGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|134025544|gb|AAI35768.1| LOC594890 protein [Xenopus (Silurana) tropicalis]
Length = 333
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 163/304 (53%), Gaps = 14/304 (4%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
+ +KK Y+ + + ++L W+ K I HN E GLH Y + NHL D+ +EM
Sbjct: 35 QTHKKIYKNEGEELVRRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDM----VAEEM 90
Query: 75 TRLTHSRIRRTLVR---SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
T + I + + P P+ +DWR K +T +Q C A +AFS A+
Sbjct: 91 TDKQMNFIPQVIANITDVPVEISKSSPPESIDWRNKNCVTSVKDQGSCIASWAFSSIGAL 150
Query: 132 QGQIFKS-TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
+ Q K T ++E LS+Q ++DCS GN GC GG + ++ Y+ G+ E +YPY+G
Sbjct: 151 ECQNMKRRTGKLESLSVQNLLDCSQTYGNNGCKGGWVVSSFRYI-IDNGIELESNYPYQG 209
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K C + +S+ LP DE LK + +GP++V+I+AS TF++Y +G+Y
Sbjct: 210 KDGKCSYTPVKKASVCTSYRQLPYGDEATLKQVVGLMGPVSVAIDASRKTFRMYKNGVYY 269
Query: 251 DEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
D C+S +H++L+VGY W++KN W +GD GY+ + R +N CGIAN+
Sbjct: 270 DPNCSSSTPDHSVLVVGYGAEDGVEYWLVKNSWGTSFGDEGYIKMARNHHNNCGIANFGC 329
Query: 306 YALI 309
+ ++
Sbjct: 330 FPVV 333
>gi|3641698|dbj|BAA33398.1| preprocathepsin L [Bos taurus]
Length = 301
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 97/258 (37%), Positives = 141/258 (54%), Gaps = 14/258 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-TLVR 88
++ W+ N K I HNQE +G HG+ + N D+ + + M + + ++ L
Sbjct: 48 RRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFH 107
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P V +P +DW +KG++TP NQ CG+C+AFS A++GQ+F+ T ++ LS Q
Sbjct: 108 EPLL---VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDIS 207
+VDCS GN GC GG + N Y++ GGL EE YPY + C +K P
Sbjct: 165 NLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYK-PECSAAND 223
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ V PQ E AL +ATVGPI+V+I+A +FQ Y SGIY D C+S ++H +L+VG
Sbjct: 224 TGFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVG 283
Query: 268 Y--------TRNSWILKN 277
Y WI+KN
Sbjct: 284 YGFEGTDSNNNKFWIVKN 301
>gi|31558997|gb|AAP49831.1| cathepsin L [Fasciola hepatica]
Length = 326
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 154/305 (50%), Gaps = 26/305 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y +Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNNEY-NGADDQHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM+R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMSRASDILSHG--------VPYETNNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC+GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ + V ++ + +P E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNKQLGVAKVTGYYTVPSGSEVELKNLVGAEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
SGIY + C+ VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 RSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCG 312
Query: 300 IANYA 304
IA+ A
Sbjct: 313 IASLA 317
>gi|332374780|gb|AEE62531.1| unknown [Dendroctonus ponderosae]
Length = 544
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 103/311 (33%), Positives = 173/311 (55%), Gaps = 18/311 (5%)
Query: 12 FPQKKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHP 67
F K+ K +R+ T+ ++L ++ N + IH+HN++ G++L NHL+D
Sbjct: 238 FEFNKFTKKHRRIYTNQNERLLRMEIFRQNVRFIHSHNRKNV----GFSLSVNHLAD-KT 292
Query: 68 RHYIKEMTRLTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFS 126
+K + T+S+ + P ++E +PD DWR G +TP +Q CG+C++F
Sbjct: 293 ETELKALRGKTYSKEYNGGLPFPYTSEDFSNLPDQWDWRLYGAVTPVKDQSVCGSCWSFG 352
Query: 127 IASAIQGQIF-KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
++G F K+ + LS Q ++DCS GN GC GG +++ GG+ EED
Sbjct: 353 TIGTVEGAFFLKNGGHLVRLSQQALIDCSWGFGNNGCDGGEDFRAYQWIKKHGGIPTEED 412
Query: 186 Y-PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
Y PY G+ C + V +SSW+ + DE+AL++ + GPI+V+I+AS TF Y
Sbjct: 413 YGPYLGQDGYCHADKLPKVAKLSSWTNVTTNDENALRLAIFKHGPISVAIDASQRTFSFY 472
Query: 245 ASGIYDDEACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRC 298
++G+Y +E C + D ++HA+L VGY + W++KN WS++WG++GY+ + NN C
Sbjct: 473 SNGVYFEEKCHNKVDELDHAVLAVGYGSINGNHYWLVKNSWSNYWGNDGYVLMSSKNNNC 532
Query: 299 GIANYAVYALI 309
G+ + Y +
Sbjct: 533 GVMSAPTYVTL 543
>gi|297287735|ref|XP_002803218.1| PREDICTED: putative cathepsin L-like protein 6-like [Macaca
mulatta]
Length = 270
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 128/221 (57%), Gaps = 10/221 (4%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
IP +DWREKG++TP NQ CG+C+AFS A++GQ+F T ++ LS Q +VDCS
Sbjct: 51 IPTSVDWREKGYVTPVKNQGMCGSCWAFSATGALEGQMFWKTGKLISLSEQNLVDCSWPQ 110
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN G GG + N+ YVQ GGL E YPY+GK C++ P V + V P E
Sbjct: 111 GNEGYNGGFMDNSFRYVQENGGLDSEASYPYEGKVKTCRY-NPKYSVANDTGFVDIPSRE 169
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----- 272
L +ATVGPI+V+++AS +FQ Y GIY + C + ++HAML VGY
Sbjct: 170 KDLAKAVATVGPISVAVDASHFSFQFYKKGIYFEPRCDPEGLDHAMLTVGYGYEGADSDN 229
Query: 273 ---WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W +WG +GY+ + K N CGIA A Y +
Sbjct: 230 NKYWLVKNSWGKNWGMDGYIKMAKDRRNNCGIATAASYPTV 270
>gi|24654434|ref|NP_725686.1| CG4847, isoform D [Drosophila melanogaster]
gi|21645235|gb|AAM70880.1| CG4847, isoform D [Drosophila melanogaster]
gi|255653098|gb|ACU24747.1| RH39096p [Drosophila melanogaster]
Length = 420
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 150/305 (49%), Gaps = 19/305 (6%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A + + + S + N QG+H + N +DL ++ ++T L
Sbjct: 121 KTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLSQLTGLK 180
Query: 79 HSRIRRT-------LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
S + LV P + IPD DWRE G +TP Q CG+C+AF+ AI
Sbjct: 181 RSPEAKARAAASLKLVNLP----AKPIPDAFDWREHGGVTPVKFQGTCGSCWAFATTGAI 236
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
+G F+ T + LS Q +VDC + G GC GG ++ + G+ +E YPY
Sbjct: 237 EGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQKGVSQEGAYPY 296
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ CK+ + ++ +PP+DE LK +AT+GP+A S+N T + YA GI
Sbjct: 297 IDNKGTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGL-ETLKNYAGGI 355
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
Y+D+ C NH++L+VGY ++ WI+KN W WG+ GY L RG N C IA
Sbjct: 356 YNDDECNKGEPNHSILVVGYGSEKGQDYWIVKNSWDDTWGEKGYFRLPRGKNYCFIAEEC 415
Query: 305 VYALI 309
Y ++
Sbjct: 416 SYPVV 420
>gi|19922450|ref|NP_611221.1| CG4847, isoform A [Drosophila melanogaster]
gi|24654437|ref|NP_725687.1| CG4847, isoform B [Drosophila melanogaster]
gi|24654439|ref|NP_725688.1| CG4847, isoform C [Drosophila melanogaster]
gi|45552699|ref|NP_995874.1| CG4847, isoform E [Drosophila melanogaster]
gi|7302775|gb|AAF57850.1| CG4847, isoform A [Drosophila melanogaster]
gi|15010382|gb|AAK77239.1| GH01592p [Drosophila melanogaster]
gi|21645236|gb|AAM70881.1| CG4847, isoform B [Drosophila melanogaster]
gi|21645237|gb|AAM70882.1| CG4847, isoform C [Drosophila melanogaster]
gi|45445496|gb|AAS64820.1| CG4847, isoform E [Drosophila melanogaster]
gi|220944958|gb|ACL85022.1| CG4847-PA [synthetic construct]
gi|220954732|gb|ACL89909.1| CG4847-PA [synthetic construct]
Length = 390
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 150/305 (49%), Gaps = 19/305 (6%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A + + + S + N QG+H + N +DL ++ ++T L
Sbjct: 91 KTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLSQLTGLK 150
Query: 79 HSRIRRT-------LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
S + LV P + IPD DWRE G +TP Q CG+C+AF+ AI
Sbjct: 151 RSPEAKARAAASLKLVNLP----AKPIPDAFDWREHGGVTPVKFQGTCGSCWAFATTGAI 206
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
+G F+ T + LS Q +VDC + G GC GG ++ + G+ +E YPY
Sbjct: 207 EGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQKGVSQEGAYPY 266
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ CK+ + ++ +PP+DE LK +AT+GP+A S+N T + YA GI
Sbjct: 267 IDNKGTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGL-ETLKNYAGGI 325
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
Y+D+ C NH++L+VGY ++ WI+KN W WG+ GY L RG N C IA
Sbjct: 326 YNDDECNKGEPNHSILVVGYGSEKGQDYWIVKNSWDDTWGEKGYFRLPRGKNYCFIAEEC 385
Query: 305 VYALI 309
Y ++
Sbjct: 386 SYPVV 390
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 153/299 (51%), Gaps = 18/299 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
Q+ + K Y + K+ +++N IH HN + + Y L+ N DL + +
Sbjct: 93 QRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQG----YSYVLKMNKFGDLTLEEFRQR 148
Query: 74 MTRLTHSRIR---RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+R R + + ES E IP H+DWR++G +T +Q DCG+C+AFS A
Sbjct: 149 YLGYKKPDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGA 208
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++G T ++ LS QQ+VDCS GN GC GG + YV GG+ E+YPY
Sbjct: 209 MEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYMR 268
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
K +CK + V I+ + +P + E ++K LA P++V+I A+ FQ Y GI+D
Sbjct: 269 KDGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGIFD 328
Query: 251 DEACTSDYVNHAMLLVGYTRNS------WILKNWWSHHWGDNGYMYL---KRGNNRCGI 300
T+ ++H +LLVGY+ + WI+KN W WG GYM + K +CG+
Sbjct: 329 APCGTN--LDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGV 385
>gi|358334194|dbj|GAA34712.2| cathepsin L [Clonorchis sinensis]
Length = 401
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 103/291 (35%), Positives = 151/291 (51%), Gaps = 18/291 (6%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI--- 71
K YK++Y +K+ + N +I+ HN QG YT+ N SD +
Sbjct: 73 KTYKREYAGPVEQAKRFRIFTENFIRINQHNVRYIQGDTFYTMGINRFSDRVSWTILSQI 132
Query: 72 ----KEMTRLTHSRIRRTLVRSPESNESVLI--PDHLDWREKGFITPDWNQEDCGACYAF 125
+E RL R R R+ ++ P +DWR G +TP +Q CG+C+AF
Sbjct: 133 FQTKEEFGRLLGFRGLRNTSRANSKYITIAAEPPASIDWRSTGAVTPVKDQGQCGSCWAF 192
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S AI+GQ F +T ++ LS QQ+VDCS GN GC+GG + N YV+ G+ E
Sbjct: 193 SATGAIEGQHFMATKQLVSLSEQQLVDCSSHFGNFGCSGGWMDNAFKYVKHTHGITTETK 252
Query: 186 YPYKGKQS-----ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
YPY ++ C+F I ++ LP +E ALK + GPI+V+I+AS +
Sbjct: 253 YPYISGETGTPNPRCEFHGQAIAATVTGIVDLPRSNEFALKQAVGLHGPISVAIHASLES 312
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNG 287
F Y SG+Y DE C+SD ++HA+L+VGY + W++KN W WG+ G
Sbjct: 313 FMGYKSGVYSDEECSSDQLDHAVLVVGYGEENGIPYWLIKNSWGFDWGEMG 363
>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
Length = 333
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 163/307 (53%), Gaps = 9/307 (2%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + ++ + Y + +K + SN + I HN+E G + + N+ +D+
Sbjct: 29 FNLFKTRFGRSYANFEEEIFRKRVFASNLEFIFNHNREFFAGNKNFNVAVNNFTDMSNTE 88
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDW-REKGFITPDWNQEDCGACYAF-SI 127
+ L HS ++ S E +P +DW + K +TP NQE CG+C+AF S
Sbjct: 89 FRARFNGLRHSGVQSAPAIHSASAEG--LPATVDWTKVKNVVTPIKNQEQCGSCWAFFSA 146
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
++++GQ T ++ LS Q +VDCS GN+GC GG + YV G+ E YP
Sbjct: 147 VASMEGQHGLKTGKLVSLSEQNLVDCSAAEGNMGCEGGLMDQAFQYVIANKGIDTEMSYP 206
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
YK +FK+ ++ I S+ + E +L+ +ATVGPI+V I+AS +FQ Y+SG
Sbjct: 207 YKAIDESWEFKKNSVGATIKSYVDVKTGSESSLQSAVATVGPISVGIDASQLSFQFYSSG 266
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+Y++ AC++ ++H + VGY + W +KN W WG +GY+++ R N+CGIA
Sbjct: 267 VYEEPACSTTILDHGVTAVGYGALNGTPYWKVKNSWGTSWGMSGYIFMSRNKQNQCGIAT 326
Query: 303 YAVYALI 309
A + ++
Sbjct: 327 AASWPVV 333
>gi|148235365|ref|NP_001083441.1| uncharacterized protein LOC398927 precursor [Xenopus laevis]
gi|38014481|gb|AAH60424.1| MGC68723 protein [Xenopus laevis]
Length = 333
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 158/301 (52%), Gaps = 8/301 (2%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K +KK Y+ + + ++L W+ K I HN E GLH Y + NHL D+ +
Sbjct: 35 KTHKKLYKNEGEELVRRLIWEDTLKFIMLHNLEYSMGLHTYEVGMNHLGDMTAEEMTDKQ 94
Query: 75 TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
S I T E ++S P+ +DWR K +T +Q C + +AFS A++ Q
Sbjct: 95 MNFRPSEIANTTGLPVEISKST-PPESIDWRNKNCVTSVKDQGSCMSSWAFSSIGAMECQ 153
Query: 135 -IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+ K T ++E LS+Q ++DCS GN GC GG ++ Y+ G+ E YPY+GK
Sbjct: 154 NMRKRTGKLESLSVQNLLDCSQNYGNNGCKGGWAVSSFRYI-IDNGIELESIYPYQGKDG 212
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C + +S+ LP +E LK + +GP++V+I S TF++Y SG+Y D
Sbjct: 213 KCSYTPVKKAPRCTSYRQLPYGNEATLKQVVGLMGPVSVAIEGSRKTFRMYKSGVYYDPN 272
Query: 254 CTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C V+H++L+VGY W++KN W +GD GY+ + R +N CGIA++ + +
Sbjct: 273 CGGSTVDHSVLVVGYGAEDGVEYWLVKNSWGTSFGDEGYIKMARNRHNNCGIASFGCFPV 332
Query: 309 I 309
I
Sbjct: 333 I 333
>gi|26245875|gb|AAN77413.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 287
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/281 (34%), Positives = 154/281 (54%), Gaps = 12/281 (4%)
Query: 36 SNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNES 95
SN +I HNQ +GL Y + N +DL P +++ L ++ + L + N
Sbjct: 12 SNLLRIEEHNQNFSRGLSTYEMGVNKFADLTPEEFMERFRPLRKTK-PKFLSEQAKFNFD 70
Query: 96 VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI 155
+P +DW ++G +T +Q CG+C+AFS +++ F T ++ LS QQ+VDC
Sbjct: 71 GDLPAEVDWTKQGAVTEVKSQGSCGSCWAFSTTGSVESHNFIKTGKLISLSEQQLVDC-- 128
Query: 156 ISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
+ N GCAGG + L Y++ A G+M E+DYPY+ + + C+F V I S+ +
Sbjct: 129 VKNNSGCAGGWMDIALEYIE-ADGIMSEDDYPYEERNTTCRFNNSKAAVQIKSYKAIKKN 187
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC--TSDYVNHAMLLVGY----T 269
DE L+ +A GP+ V+I + FQLYA GI +D C T + HA+L+ GY
Sbjct: 188 DEIDLQKAVALEGPVPVAIEVTI-AFQLYARGILNDPQCKNTEGDLTHAVLVTGYGSQDG 246
Query: 270 RNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
++ WI+KN W +G +GY+ + R +N+CGIA A Y ++
Sbjct: 247 KDYWIVKNSWGAEYGMDGYLRMSRNADNQCGIATRASYPVL 287
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 168/319 (52%), Gaps = 38/319 (11%)
Query: 11 IFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHL-------- 62
+F QK+Y+ ++ +W H+K +T+++ + YT+ ++++
Sbjct: 22 VFSQKQYQTAFQ----------NWMVKHQKSYTNDEFGSR----YTIFQDNMDFVTKWNQ 67
Query: 63 --SD-LHPRHYIKEMTRLTHSRI---RRTLVRSPESNESVL----IPDHLDWREKGFITP 112
SD + + + ++T + RI +T V+ P V P +DWR G +T
Sbjct: 68 KGSDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIGVTDVSKAPASVDWRANGAVTA 127
Query: 113 DWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLN 172
NQ CG CY+FS +++G ++ ++ LS QQ++DCS GN GC GG + N+
Sbjct: 128 VKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGSEGNNGCDGGLMTNSFE 187
Query: 173 YVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
Y+ GGL E YPY+G CKF + NI I+ + + E L+ +A P++V
Sbjct: 188 YIIAVGGLDTEASYPYEGVVGKCKFNKANIGATITGYKNVKSGSESDLQTAVAAQ-PVSV 246
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGY 288
+I+AS ++FQLY+SG+Y + AC+S ++H +L VGY S WI+KN W WG+ G+
Sbjct: 247 AIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGSQSGQDYWIVKNSWGADWGEKGF 306
Query: 289 MYLKRGN-NRCGIANYAVY 306
+ + R N CGIA A Y
Sbjct: 307 ILMARNKHNNCGIATMASY 325
>gi|7271889|gb|AAF44675.1|AF239264_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYGCMGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FTMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
+ GIY C+S VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 SGGIYQSRTCSSLRVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|91085671|ref|XP_971698.1| PREDICTED: similar to cathepsin L-like protein; cysteine proteinase
[Tribolium castaneum]
gi|270011034|gb|EFA07482.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 160/313 (51%), Gaps = 24/313 (7%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y ++ +K +Q ++I HN+ +GL YT+ N +D+ P +E
Sbjct: 31 KKTHEKSYLNAKEEAFRKQIFQKKLERIEAHNERFNKGLETYTMGINMFTDMTP----EE 86
Query: 74 MTRLTHSRIRRTLVRSP----------ESNESVLIPDHLDWREKGFITPDWNQEDCGACY 123
M TH I +V P N SV P DWR+KG +T NQ CG+C+
Sbjct: 87 MRPYTHGLIEPAVVPKPLVEIKSRADLGLNHSVQYPASFDWRDKGMVTGVKNQGGCGSCW 146
Query: 124 AFSIASAIQGQIFKSTSEIEELSI--QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
AFS AI+ Q+ + ++S+ QQ+VDC + GC GG + + Y+ GG+
Sbjct: 147 AFSSTGAIESQVKIAKGANTDISVSEQQLVDCDTAAD--GCGGGWMTDAFTYIAQTGGID 204
Query: 182 KEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
E YPYKG C F + + ++ L DE+ L +++ GP++V+ +A F
Sbjct: 205 SESSYPYKGVDESCHFMSDKVAAKLKGYAYLTGPDENMLADMVSSKGPVSVAFDAEGD-F 263
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NN 296
Y+ G+Y + C ++ HA+L+VGY ++ W++KN W WG++GY + R N
Sbjct: 264 GSYSGGVYYNPNCATNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGEHGYFKIARNKGN 323
Query: 297 RCGIANYAVYALI 309
CGIA+ A Y ++
Sbjct: 324 HCGIASKASYPVL 336
>gi|42794048|dbj|BAD11762.1| cahepsin L-like cysteine protease [Brugia malayi]
Length = 371
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 163/315 (51%), Gaps = 25/315 (7%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
++KY K + + ++ + + N K+I HN+ ++ Y L NHL+D+ P + K
Sbjct: 58 KRKYNKRDEEINLEHRRFMTYLKNVKEIEKHNERYERNEETYELAINHLADMLPEEFRKL 117
Query: 73 ---EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ ++T + +R + +P +DWR G +T +Q CG+C+ FS
Sbjct: 118 HGFQSRKITSKNNFKNTIRMKINGP---LPKSIDWRTSGAVTKVKDQGYCGSCWTFSAVG 174
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F T ++ ELS+Q ++DCS + GN GC GG + YV G+ E+ YPY
Sbjct: 175 ALEGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKNDGIDTEKSYPY 234
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+G Q+ C++ + +LP DE L+ +AT+GPI+V+++A F Y GI
Sbjct: 235 QGYQNTCRYSNSTRGTTAYAGKLLPEGDELQLQAAIATIGPISVAVDAKLMKF--YRRGI 292
Query: 249 YDDEACTSDYVNHAMLLVGY----------TRNS---WILKNWWSHHWGDNGYMYLKRGN 295
+ CT+ + HA+L VGY T+ S W+LKN WS WG GY+ L R
Sbjct: 293 FSTSKCTTR-MGHALLAVGYGTEEVKLQNGTKKSVDYWLLKNSWSKRWGIGGYLKLARNQ 351
Query: 296 -NRCGIANYAVYALI 309
N CGI YA Y L+
Sbjct: 352 ENMCGIGFYACYPLV 366
>gi|391328503|ref|XP_003738728.1| PREDICTED: digestive cysteine proteinase 3-like [Metaseiulus
occidentalis]
Length = 506
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 131/212 (61%), Gaps = 5/212 (2%)
Query: 103 DWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGC 162
D+ +G++TP +Q CG+C+AFS +++GQ FK+T ++ LS Q +VDCS GN GC
Sbjct: 295 DYWLEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGDEGNNGC 354
Query: 163 AGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKV 222
GG + Y++ GG+ EE YPY + C FK + ++ + + E AL+
Sbjct: 355 EGGLMDQGFTYIKNNGGIDTEESYPYNAEDGDCAFKSNAVGARVTGFVDIDSGSEKALQK 414
Query: 223 TLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNW 278
+ATVGP++V+I+AS +FQLY GIYD+ AC+S ++H +L VGY + W++KN
Sbjct: 415 AVATVGPVSVAIDASNDSFQLYKEGIYDEPACSSTQLDHGVLAVGYGSENGVDYWLVKNS 474
Query: 279 WSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W+ WG +GY+ + R +N+CGIA+ A Y +
Sbjct: 475 WNTVWGQDGYIKMARNKDNQCGIASQASYPTV 506
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/292 (31%), Positives = 156/292 (53%), Gaps = 20/292 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-IK 72
Q + K+Y + ++++ + N +I+ HN GL Y + N +D+ Y +
Sbjct: 35 QATFGKNYDAQQAAERQRI-FNENVDRINAHNLAHDLGLTSYRMGLNEFTDMSRSEYELV 93
Query: 73 EMTRLTHSRIRRT-----LVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
R + + RT L ++P+ +P +D+R+ G +TP NQ CG+C+AFS
Sbjct: 94 RGLRYSGEQTARTGHPFTLTQNPDRE----LPKKVDYRKSGHVTPVKNQGLCGSCWAFSA 149
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
+++GQ+ + LS Q ++DCS N GC GG + Y++ GG+ EE YP
Sbjct: 150 TGSLEGQLSIQNGTLVSLSEQNLLDCS--RENQGCDGGYMDKAFEYIKKNGGIDTEESYP 207
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y G++ C FK+ NI ++ +P +DE ALK+ +A +GPI+V I+AS +F+ Y G
Sbjct: 208 YTGRKGKCMFKKKNIGARVTGHVDVPAEDEQALKLAVAKIGPISVGIDASKDSFRFYKEG 267
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCG 299
IYD+ +C++ ++H +L+VGY S K++W GY+ + +CG
Sbjct: 268 IYDESSCSTSQLDHGVLVVGY--GSEKGKDYWLE-----GYVTPVKDQGQCG 312
>gi|156392785|ref|XP_001636228.1| predicted protein [Nematostella vectensis]
gi|156223329|gb|EDO44165.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 156/297 (52%), Gaps = 14/297 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
Y+K Y K+K ++ N + I + N++ GY+L+ NH++D+ + M
Sbjct: 217 YRKRYPSAHEHEKRKDIYRHNMRFIKSRNRQHL----GYSLKPNHMADMTDAE-VNRMKG 271
Query: 77 LTHSR---IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
L H I + P+ + V +P H+DWR+ G + +Q CG+CYAF++A A++G
Sbjct: 272 LLHEEPPLIGDSPFSIPDKDRGVPLPPHVDWRKAGAVNSVKSQGICGSCYAFAVAGALEG 331
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP-YKGKQ 192
F T +LS QQ+VDC+ GN GC GG + ++ GGL EE Y Y ++
Sbjct: 332 AHFIKTGLKLDLSEQQIVDCTWGFGNRGCKGGYPYRAMQWILKHGGLATEESYGRYLAQE 391
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C FK +I + + + + LK+ +A GP+++ +N P TF+ Y SGIY D
Sbjct: 392 GYCHFKNTSIGARLDKYMSIRQGNTSQLKLAVAFYGPVSILVNTQPKTFKFYGSGIYYDT 451
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
CT ++HA L VGY WI+KN WS WG+ GY+ + ++ CG+A AV
Sbjct: 452 QCTHA-LDHAALAVGYGEEKGVSYWIVKNSWSAMWGEEGYIKIAMKDDNCGVAQKAV 507
>gi|47199802|emb|CAF88807.1| unnamed protein product [Tetraodon nigroviridis]
Length = 261
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 93/260 (35%), Positives = 141/260 (54%), Gaps = 15/260 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHS---RIRRTL 86
+++ W+ N KKI HN E G H Y L NH D+ + + M H + R +L
Sbjct: 5 RRMVWEKNLKKIELHNLEHSMGQHSYRLGMNHFGDMTHEEFRQIMNGYKHKPQRKFRGSL 64
Query: 87 VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELS 146
P E+ P +DWR+KG++TP +Q CG+C+AFS A++GQ F+ T ++ LS
Sbjct: 65 FMEPNFLEA---PRAVDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRQTGKLVSLS 121
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG-KQSICKFKRPNIVVD 205
Q +VDCS GN GC GG + Y++ GGL E YPY C + N +
Sbjct: 122 EQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGGLDSEASYPYLATDDQPCHYDPSNNSAN 181
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
+ + +P E AL +A+VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+
Sbjct: 182 ETGFVDVPSGSERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLV 241
Query: 266 VGY--------TRNSWILKN 277
VGY + WI+KN
Sbjct: 242 VGYGFQGEDVDGKKFWIVKN 261
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 158/307 (51%), Gaps = 18/307 (5%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI----- 71
+KK Y+ + + + N +KI HN++ + Y L N D+ I
Sbjct: 70 HKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHELINTLNG 129
Query: 72 --KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
K +T I T + +V +P +DWR+KG +T +Q CG+C+AFS
Sbjct: 130 FNKSVTVSEEQLIGATFIEPA----NVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSSTG 185
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ F+ + + LS Q ++DCS GN GC GG + Y++ GL E+ YPY+
Sbjct: 186 ALEGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYE 245
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C++ N + +P DE LK +AT+GPI+V+I+AS +F Y+ G+Y
Sbjct: 246 AENDQCRYNPKNSGASDVGFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVY 305
Query: 250 DDEACTSDYVNHAMLLVGYTRNS------WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+ C+ ++H +L+VGY +S W++KN W WG+ GY+ + R N CGIA+
Sbjct: 306 YEPECSPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARNKENHCGIAS 365
Query: 303 YAVYALI 309
A Y L+
Sbjct: 366 SASYPLV 372
>gi|41152538|gb|AAR99518.1| cathepsin L protein [Fasciola hepatica]
Length = 326
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 156/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM+R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMSRASDILSHG--------VPYETNNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC+GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ V ++ + + E LK + + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNEQLGVAKVTGYYTVHSGSEVELKNLVGSEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
SGIY + C+ VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 RSGIYQSQTCSPLSVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|46948144|gb|AAT07054.1| cathepsin L-like cysteine proteinase [Brugia malayi]
Length = 368
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 163/315 (51%), Gaps = 25/315 (7%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK- 72
++KY K + + ++ + + N K+I HN+ ++ Y L NHL+D+ P + K
Sbjct: 55 KRKYNKRDEEINLEHRRFMTYLKNVKEIEKHNERYERNEETYELAINHLADILPEEFRKL 114
Query: 73 ---EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ ++T + +R + +P +DWR G +T +Q CG+C+ FS
Sbjct: 115 HGFQSRKITSKNNFKNTIRMKINGP---LPKSIDWRTSGAVTKVKDQGYCGSCWTFSAVG 171
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ F T ++ ELS+Q ++DCS + GN GC GG + YV G+ E+ YPY
Sbjct: 172 ALKGQHFLQTGKLVELSMQNLLDCSDDTYGNYGCDGGLMMEAFEYVVKNDGIDTEKSYPY 231
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+G Q+ C++ + +LP DE L+ +AT+GPI+V+++A F Y GI
Sbjct: 232 QGYQNTCRYSNSTRGTTAYAGKLLPEGDELQLQAAIATIGPISVAVDAKLMKF--YRRGI 289
Query: 249 YDDEACTSDYVNHAMLLVGY----------TRNS---WILKNWWSHHWGDNGYMYLKRGN 295
+ CT+ + HA+L VGY T+ S W+LKN WS WG GY+ L R
Sbjct: 290 FSTSKCTTR-MGHALLAVGYGTEEVKLQNGTKKSVDYWLLKNSWSKRWGIGGYLKLARNQ 348
Query: 296 -NRCGIANYAVYALI 309
N CGI YA Y L+
Sbjct: 349 ENMCGIGFYACYPLV 363
>gi|41055337|ref|NP_956720.1| cathepsin S, a [Danio rerio]
gi|32451845|gb|AAH54668.1| Cathepsin S, a [Danio rerio]
Length = 239
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 82/225 (36%), Positives = 134/225 (59%), Gaps = 6/225 (2%)
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
SP S ++ +P ++W E G ++P NQ CG+C+AFS +++ Q+ + T+ + LS Q
Sbjct: 17 SPPSLQT--LPQRVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQ 74
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
++DCS+ GN GC GG L YV G+ YPY+ K+ +C++ +
Sbjct: 75 NLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEHKEGVCRYSVSGRAGYCTG 134
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY 268
+ ++P +E AL+ +A +GP++V INA +F Y SGIY+D C+S +NHA+L+VGY
Sbjct: 135 FRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIYNDPKCSSALINHAVLVVGY 194
Query: 269 ----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
++ W++KN W WG+NGY+ + R N CGI+++ +Y I
Sbjct: 195 GSENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSFGIYPTI 239
>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
Length = 324
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 167/322 (51%), Gaps = 30/322 (9%)
Query: 7 IIIFIFPQ------KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ F FP+ +++ K+Y ++ + ++ WQSN K I +HN + + GYTL N
Sbjct: 14 VAAFDFPEEWVAWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHNSVSDK--FGYTLEMN 71
Query: 61 HLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI--------PDHLDWREKGFITP 112
DL + + +I + +N++ L +DWR+KG ++
Sbjct: 72 EFGDL---------SGVEFKQIYNGYIMQERANDTKLFTASPYMEPAASVDWRQKGVVSE 122
Query: 113 DWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLN 172
NQ CG+C++FS +++GQ + LS Q ++DCS GN GC GG + +
Sbjct: 123 VKNQGQCGSCWSFSATGSLEGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMDDAFR 182
Query: 173 YVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
YV G+ E YPY K C+F + N+ +S+ + E +L A +GPI+V
Sbjct: 183 YVISNHGVDTESSYPYTAKDGYCRFNQNNVGATETSYRDIARGSESSLTQASAQIGPISV 242
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGY 288
+I+AS +FQ Y +G+Y + +C+S ++H +L+VGY ++ +I+KN W WG +GY
Sbjct: 243 AIDASHRSFQFYKNGVYYEPSCSSSRLDHGVLVVGYGTEGGQDYFIVKNSWGTRWGMDGY 302
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R N CGIA+ A Y ++
Sbjct: 303 IMMSRNRRNNCGIASQASYPIV 324
>gi|281427380|ref|NP_001163996.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|281427798|ref|NP_001164001.1| cathepsin L-like proteinase precursor [Tribolium castaneum]
gi|270001241|gb|EEZ97688.1| cathepsin L precursor [Tribolium castaneum]
gi|270016928|gb|EFA13374.1| hypothetical protein TcasGA2_TC001950 [Tribolium castaneum]
Length = 328
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 155/307 (50%), Gaps = 25/307 (8%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y + ++K +Q N KI HN + +G YT N +D+ ++
Sbjct: 33 HKKQYSSPIEELRRKAIFQDNLVKIEEHNAKFAKGEVTYTKAVNQFADMTADEFM----- 87
Query: 77 LTHSRIRRTLVRSPESNESVLIP---------DHLDWREKGFITPDWNQEDCGACYAFSI 127
+ + R L P+ NE + IP +DWR K +T +Q CG+C++FS
Sbjct: 88 ---AYVNRGLATKPKMNEKLRIPFVKSGKPAAAEVDWRSKA-VTEVKDQGQCGSCWSFST 143
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ+ S + LS Q +VDCS GN GC GG + + +Y+ G+M E YP
Sbjct: 144 TGAVEGQLAISGKGLTSLSEQNLVDCSSQYGNAGCNGGWMDSAFDYIH-DNGIMSESAYP 202
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y C+F V + + +P DE AL+ +A GP+AV+++A+ QLY+ G
Sbjct: 203 YTAMDGNCRFDASQSVTSLQGYYDIPSGDESALQDAVANNGPVAVALDATEE-LQLYSGG 261
Query: 248 IYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+ D C++ +NH +L+VGY ++ WI+KN W WG+ GY R NN CGIA
Sbjct: 262 VLYDTTCSAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQGYWRQARNRNNNCGIAT 321
Query: 303 YAVYALI 309
A Y +
Sbjct: 322 AASYPAL 328
>gi|291463491|pdb|3IV2|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
gi|291463492|pdb|3IV2|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant
gi|291463519|pdb|3K24|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
Complex With Gln-Leu-Ala Peptide
gi|291463520|pdb|3K24|B Chain B, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant In
Complex With Gln-Leu-Ala Peptide
Length = 220
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+ +AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + +P Q E
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EK 120
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 180
Query: 273 --WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|157835400|pdb|2NQD|B Chain B, Crystal Structure Of Cysteine Protease Inhibitor,
Chagasin, In Complex With Human Cathepsin L
Length = 221
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+ +AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 3 PRSVDWREKGYVTPVKNQGQCGSAWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 62
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + +P Q E
Sbjct: 63 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EK 121
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 122 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 181
Query: 273 --WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 182 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 221
>gi|111036376|dbj|BAF02517.1| cathepsin L-like proteinase [Echinococcus multilocularis]
Length = 338
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 93/276 (33%), Positives = 151/276 (54%), Gaps = 8/276 (2%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-MTRLTHSRIRRTLVRSPESNESVL-I 98
I HN+ + GL Y+ N +DL + + + SR+ ++ +++ SV +
Sbjct: 64 IKGHNRRVKAGLESYSTALNQFADLEVSEFTERFLGTRPESRVAGKSGKTWKASTSVSDL 123
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +DWR+K +T NQ +CG+C+AFS A++G + K T ++ LS QQ+VDC++ +G
Sbjct: 124 PDMVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEGALAKKTGKLISLSEQQLVDCTLENG 183
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N Y++ + E YPY+ C++ V ++ +P +E
Sbjct: 184 NDGCNGGYMSNAFKYLE-GHSIEPESAYPYRATDGPCRYNESLGVGSVTDIGDIPEGNET 242
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WI 274
AL +ATVGPI+++I+AS F Y GIY C+S ++NH +L +GY + W+
Sbjct: 243 ALMEAVATVGPISIAIDASTLGFMFYHHGIYKSHWCSSKFLNHGVLAIGYGKLDGKPYWL 302
Query: 275 LKNWWSHHWGDNGY-MYLKRGNNRCGIANYAVYALI 309
+KN W WG GY M K +N CG+A+ A + +
Sbjct: 303 VKNSWGSRWGMKGYIMMAKDYHNMCGVASLADFPYV 338
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 100/287 (34%), Positives = 154/287 (53%), Gaps = 18/287 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
++ N + + HN A +G H + L N +DL Y R SR+RR+ S
Sbjct: 77 FKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEEYRTRFLR-DFSRLRRSASGKISSR 135
Query: 94 ----ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
E +PD +DWRE G + P NQ CG+C+AFS +A++G T ++ LS QQ
Sbjct: 136 YRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQ 195
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
+VDC+ + N GC GG + ++ GG+ EE YPY+G+ IC VV I S+
Sbjct: 196 LVDCT--TANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSY 253
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P +E +L+ +A P++V+++A+ FQLY SGI+ S NHA+ +VGY
Sbjct: 254 ENVPSHNEQSLQKAVANQ-PVSVTMDAAGRDFQLYRSGIFTGSCNIS--ANHALTVVGYG 310
Query: 269 ---TRNSWILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVYAL 308
++ WI+KN W +WG++GY+ +R N +CGI +A Y +
Sbjct: 311 TENDKDFWIVKNSWGKNWGESGYIRAERNIENPNGKCGITRFASYPV 357
>gi|283046734|ref|NP_001164314.1| cathepsin L precursor [Tribolium castaneum]
gi|270001247|gb|EEZ97694.1| cathepsin L precursor [Tribolium castaneum]
Length = 328
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 100/307 (32%), Positives = 154/307 (50%), Gaps = 25/307 (8%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK Y + K+ +Q N KI HN + +G Y+ N +D+ ++
Sbjct: 33 HKKQYSSPIEELKRMAIFQDNLVKIEEHNAKFAKGEVTYSKAVNQFADMTADEFM----- 87
Query: 77 LTHSRIRRTLVRSPESNESVLIP---------DHLDWREKGFITPDWNQEDCGACYAFSI 127
+ + R L P+ NE + +P +DWR ++ NQ CG+C++FS
Sbjct: 88 ---AYVNRGLATKPKKNEKLRLPFVQSDKPAAAEVDWRNSA-VSEVKNQGQCGSCWSFST 143
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++GQ+ S + LS Q +VDCS GN GC GG + + +Y+ G+M E YP
Sbjct: 144 TGAVEGQLAISGRGLTSLSEQNLVDCSSAYGNAGCNGGWMDSAFDYIH-DNGIMSESAYP 202
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + C+F V + + LP DE+ALK +A GPIAV+++A+ Q Y+ G
Sbjct: 203 YTASEGSCRFNPSESVTSLQGYYDLPSGDENALKSAVANNGPIAVALDATDE-LQFYSGG 261
Query: 248 IYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+ D C++ +NH +L+VGY ++ WI+KN W WG+ GY R NN CGIA
Sbjct: 262 VLYDTTCSAQALNHGVLVVGYGSEGGQDYWIVKNSWGSGWGEQGYWRQARNRNNNCGIAT 321
Query: 303 YAVYALI 309
A Y +
Sbjct: 322 AASYPAL 328
>gi|222820543|gb|ACM67633.1| cathepsin 1L [Fasciola hepatica]
Length = 326
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 157/305 (51%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN +GL Y L N +DL
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEQNVKHIEEHNLRHDRGLVTYKLGLNQFTDLTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ ++ S + + E N+ +P +DWR+ G++T NQ CG+C+AFS
Sbjct: 84 YLMEMSPVSES-LSDGISYEAEGND---VPASIDWRQYGYVTEVKNQGQCGSCWAFSAVG 139
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VDC+ GN GC GG + N Y++ + GL YPY+
Sbjct: 140 AIEGQYVKRFQNQTLFSEQQLVDCTRRFGNHGCGGGWMENAYKYLKNS-GLETASYYPYQ 198
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G + C++++ V ++ + DE L + GP AV+++A F +Y SGI+
Sbjct: 199 GWEYQCQYRKELGVAKVTGAYTVHSGDEMKLMQMVGREGPAAVAVDAQS-DFYMYESGIF 257
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ CTS V HA+L VGY S WILKN W WG++GYM R N C IA+ A
Sbjct: 258 QSQTCTSRSVTHAVLAVGYGTESGTDYWILKNSWGKWWGEDGYMRFARNRGNMCAIASVA 317
Query: 305 VYALI 309
++
Sbjct: 318 SVPMV 322
>gi|310751866|gb|ADP09371.1| cathepsin L-like proteinase [Fasciola hepatica]
Length = 326
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 157/306 (51%), Gaps = 18/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+R S I V +N +V PD +DWRE G++T +Q +CG+C+AFS
Sbjct: 84 YLTEMSRA--SDILSHGVPYEANNRAV--PDKIDWRESGYVTEVKDQGNCGSCWAFSTTG 139
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E YPY
Sbjct: 140 TMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYEYLKQF--GLETESSYPY 197
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C++ + V ++ + + E LK + GP AV+++ F +Y+ GI
Sbjct: 198 RAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMYSGGI 256
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y + C+ +NHA+L VGY WI+KN W WG+ GY+ + R N CGIA+
Sbjct: 257 YQSQTCSPLGLNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGIASL 316
Query: 304 AVYALI 309
A ++
Sbjct: 317 ASLLMV 322
>gi|169791725|pdb|2VHS|A Chain A, Cathsilicatein, A Chimera
gi|169791726|pdb|2VHS|B Chain B, Cathsilicatein, A Chimera
gi|169791727|pdb|2VHS|C Chain C, Cathsilicatein, A Chimera
gi|169791728|pdb|2VHS|D Chain D, Cathsilicatein, A Chimera
Length = 217
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 125/217 (57%), Gaps = 7/217 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CGA YAFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGASYAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ P V V P+ E
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY-NPKYSVANDVGFVDIPKQEK 120
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRNSW 273
AL +ATVGPI+V+I+A +F Y GIY C+S +NHAML+VGY + W
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFSSDCSSSSLNHAMLVVGYGFISNNQKYW 180
Query: 274 ILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 181 LVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 217
>gi|116488416|gb|AAB41670.2| secreted cathepsin L 1 [Fasciola hepatica]
Length = 326
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 155/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM+R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMSRASDILSHG--------VPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ + V ++ + + E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
SGIY + C+ VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 RSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|195488703|ref|XP_002092426.1| GE11675 [Drosophila yakuba]
gi|194178527|gb|EDW92138.1| GE11675 [Drosophila yakuba]
Length = 384
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 153/305 (50%), Gaps = 19/305 (6%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A + + + S + N QG++ + N +DL ++ ++T L
Sbjct: 85 KTYLSAADRALHETAFASTKNLVDAGNAAFAQGVNTFKQTVNAFADLTHSEFLSQLTGLK 144
Query: 79 HS-----RIRRTL--VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
S R +L V+ PE IPD DWRE G +TP Q CG+C+AF+ AI
Sbjct: 145 RSPEAKARAAASLKEVQLPEKP----IPDAFDWREHGGVTPVKFQGTCGSCWAFATTGAI 200
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
+G F+ T + LS Q +VDC ++ G GC GG ++ + G+ + YPY
Sbjct: 201 EGHTFRKTGSLPILSEQNLVDCGPVADFGLNGCDGGFQEAAFCFIDEVQKGVSQAGAYPY 260
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ CK+ + ++ +PP+DE +K +AT+GPIA S+N T + YA GI
Sbjct: 261 IDSKDTCKYDGSKSGASLQGFAAIPPKDEEQMKKVVATLGPIACSVNGL-ETLKNYAGGI 319
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
Y+D+ C NH++L+VGY ++ WI+KN W WG+ GY L RG N C IA+
Sbjct: 320 YNDDECNQGEPNHSILVVGYGSENGQDYWIVKNSWDDTWGEQGYFRLPRGQNYCFIADEC 379
Query: 305 VYALI 309
Y ++
Sbjct: 380 SYPVV 384
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 162/309 (52%), Gaps = 20/309 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
Q Y K Y + ++ +++N IHTHNQ+ + Y+L+ NH DL + ++
Sbjct: 121 QAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG----YSYSLKMNHFGDLSRDEFRRK 176
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHL----DWREKGFITPDWNQEDCGACYAFSIAS 129
SR ++ + ++P L DWR +G +TP +Q DCG+C+AFS
Sbjct: 177 YLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTG 236
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++G T ++ LS Q+++DCS GN C+GG + + YV +GG+ E+ YPY
Sbjct: 237 ALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYL 296
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C+ + VV I + +P + E A+K LA P++++I A FQ Y G++
Sbjct: 297 ARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVF 355
Query: 250 DDEACTSDYVNHAMLLVGY------TRNSWILKNWWSHHWGDNGYMYL---KRGNNRCGI 300
D +C +D ++H +LLVGY ++ WI+KN W WG +GYMY+ K +CG+
Sbjct: 356 -DASCGTD-LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGL 413
Query: 301 ANYAVYALI 309
A + ++
Sbjct: 414 LLDASFPVM 422
>gi|45822205|emb|CAE47499.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 317
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 102/279 (36%), Positives = 155/279 (55%), Gaps = 10/279 (3%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESV 96
N +KI HN Q G + L N +D+ + + + +R + ++ +
Sbjct: 43 NLQKIEQHNARYQNGEVSFYLGVNQFADMTSEEFKAMLDSQLIHKPKRDITSRFVADPQL 102
Query: 97 LIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
+P+ +DWREKG + P +QE CG+C+AFS A A++GQ F ++E LS QQ+VDCS
Sbjct: 103 TVPESIDWREKGAVNPVRDQEQCGSCWAFSAAGALEGQRFLKEGKLEVLSTQQLVDCSRD 162
Query: 157 SGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQ 215
N GC GG +Y++ GL E Y Y+G CK P I I+ +S + Q
Sbjct: 163 YKNEGCNGGWPHWAYDYIK-DNGLCLESKYKYQGYDGYYCKECIPAI-KKINGYSSI-NQ 219
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT-SDYVNHAMLLVGY----TR 270
E ALK + T GPIAV +NA+ +QLY+ GI + ++C + +NHA+L VGY +
Sbjct: 220 TEEALKEAVGTAGPIAVCVNAN-DDWQLYSGGILESQSCPGGESINHAVLAVGYGSENGK 278
Query: 271 NSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ W++KN W+ +WG+ GY+ + RG N+CGI A Y L+
Sbjct: 279 DFWLIKNSWNTYWGEEGYLRIVRGKNQCGINEVADYPLL 317
>gi|149030666|gb|EDL85703.1| cathepsin S [Rattus norvegicus]
Length = 291
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 148/303 (48%), Gaps = 59/303 (19%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K+Y+ + + ++L W+ N K I HN E G+H Y++ NH+ D+
Sbjct: 41 KKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDM-------- 92
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
CG+C+AFS A++G
Sbjct: 93 -------------------------------------------GSCGSCWAFSAVGALEG 109
Query: 134 QIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK 191
Q+ T ++ LS Q +VDCS GN GC GG + Y+ GG+ E YPYK
Sbjct: 110 QLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGGIDSEASYPYKAM 169
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + N S + LP DE ALK +AT GP++V I+AS +F LY SG+YDD
Sbjct: 170 DEKCHYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDASHSSFFLYQSGVYDD 229
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVY 306
+CT + VNH +L+VGY ++ W++KN W H+GD GY+ + R N N CGIA+Y Y
Sbjct: 230 PSCTEN-VNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCSY 288
Query: 307 ALI 309
I
Sbjct: 289 PEI 291
>gi|170292465|pdb|3BC3|A Chain A, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
L
gi|170292466|pdb|3BC3|B Chain B, Exploring Inhibitor Binding At The S Subsites Of Cathepsin
L
gi|261824911|pdb|3H8C|A Chain A, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
gi|261824912|pdb|3H8C|B Chain B, A Combined Crystallographic And Molecular Dynamics Study
Of Cathepsin-L Retro-Binding Inhibitors (Compound 14)
Length = 220
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+ +AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + +P Q E
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EK 120
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 180
Query: 273 --WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|330796919|ref|XP_003286511.1| hypothetical protein DICPUDRAFT_77394 [Dictyostelium purpureum]
gi|325083492|gb|EGC36943.1| hypothetical protein DICPUDRAFT_77394 [Dictyostelium purpureum]
Length = 325
Score = 173 bits (439), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 103/283 (36%), Positives = 158/283 (55%), Gaps = 14/283 (4%)
Query: 24 KATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRI 82
+ D KK+ ++ N I++ N+ Q + G N +DL Y + S I
Sbjct: 45 ETLDFKKRYEIFKYNMDFIYSWNKGNSQTILGL----NKYADLSNEEY---KSLFLGSNI 97
Query: 83 R-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSE 141
+ + +R S IP DWR KG +TP NQ C + YAFS +++ +
Sbjct: 98 KTQNYIRINSSRYD--IPTTFDWRLKGAVTPVKNQGFCNSGYAFSAIGSLESSNKIENGQ 155
Query: 142 IEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV--QFAGGLMKEEDYPYKGKQSICKFKR 199
+ LS Q ++DCS GN GC GG++ N+ NY+ G + KE YPY+ ++S C+FK
Sbjct: 156 LIRLSEQNLIDCSGSEGNRGCDGGTVVNSFNYLFKHQNGKIPKESSYPYEAQKSKCRFKD 215
Query: 200 PNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYV 259
I +++++ L DE ++ +AT GP++V+I+AS FQLY G+YDD C++ Y
Sbjct: 216 QFIGATLNNFANLI-SDESTIQNAVATKGPVSVAIDASSIFFQLYFGGVYDDLFCSNIYT 274
Query: 260 NHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIAN 302
NH +L+VGYT N WI+KN + +G +GY+Y+K+G N CGIAN
Sbjct: 275 NHFVLIVGYTENYWIVKNSYGEDYGIDGYIYMKKGKNLCGIAN 317
>gi|313103779|pdb|3KSE|A Chain A, Unreduced Cathepsin L In Complex With Stefin A
gi|313103780|pdb|3KSE|B Chain B, Unreduced Cathepsin L In Complex With Stefin A
gi|313103781|pdb|3KSE|C Chain C, Unreduced Cathepsin L In Complex With Stefin A
Length = 220
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+ +AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSXWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + +P Q E
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDAGFVDIPKQ-EK 120
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS------ 272
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY S
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDDN 180
Query: 273 --WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
W++KN W WG GY+ + K N CGIA+ A Y +
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 162/309 (52%), Gaps = 20/309 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
Q Y K Y + ++ +++N IHTHNQ+ + Y+L+ NH DL + ++
Sbjct: 120 QAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG----YSYSLKMNHFGDLSRDEFRRK 175
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHL----DWREKGFITPDWNQEDCGACYAFSIAS 129
SR ++ + ++P L DWR +G +TP +Q DCG+C+AFS
Sbjct: 176 YLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTG 235
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++G T ++ LS Q+++DCS GN C+GG + + YV +GG+ E+ YPY
Sbjct: 236 ALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYL 295
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C+ + VV I + +P + E A+K LA P++++I A FQ Y G++
Sbjct: 296 ARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVF 354
Query: 250 DDEACTSDYVNHAMLLVGY------TRNSWILKNWWSHHWGDNGYMYL---KRGNNRCGI 300
D +C +D ++H +LLVGY ++ WI+KN W WG +GYMY+ K +CG+
Sbjct: 355 -DASCGTD-LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGL 412
Query: 301 ANYAVYALI 309
A + ++
Sbjct: 413 LLDASFPVM 421
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 165/309 (53%), Gaps = 21/309 (6%)
Query: 15 KKYKKDYRKKATDSKKKLH----WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY 70
+++K + +K D +++ + + N + I N++ ++G Y L N SD+ +
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 71 IKEMTRLTH----SRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFS 126
M + + + +PES E +DWR KG +TP +Q CG+C+AFS
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDAAPESTE-------VDWRTKGAVTPVKDQGQCGSCWAFS 133
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISG-NLGCAGGSLRNTLNYVQFAGGLMKEED 185
I+GQ F T + LS QQ+VDC+ S N GC GG + + YV+ GG+ E
Sbjct: 134 TTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESS 193
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY+ + + C+F I + + + E ALK +GPI+V+I+AS +FQ Y
Sbjct: 194 YPYEARDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYY 253
Query: 246 SGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
+G+Y + +C+S ++HA+L VGY ++ W++KN W+ WG++GY+ + R NN CGI
Sbjct: 254 TGVYYEPSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGI 313
Query: 301 ANYAVYALI 309
A A Y +
Sbjct: 314 ATDACYPTV 322
>gi|66812702|ref|XP_640530.1| counting factor associated protein [Dictyostelium discoideum AX4]
gi|74897159|sp|Q54TR1.1|CFAD_DICDI RecName: Full=Counting factor associated protein D; Flags:
Precursor
gi|60468561|gb|EAL66564.1| counting factor associated protein [Dictyostelium discoideum AX4]
Length = 531
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 170/307 (55%), Gaps = 20/307 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +Y K+Y + ++ +++++ K I THN + Y L NH +DL KE
Sbjct: 229 KAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESS----YKLGMNHYADLSN----KE 280
Query: 74 MTRLTHSRIRRTLVRSPES---NESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
L ++ R V +S +ES+ IP +DWR + +TP +Q CG+C+ F
Sbjct: 281 FNTLVKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTG 340
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
+++G + E+ LS QQ+VDC+I++G+ GC GG + YV G L E +YPY
Sbjct: 341 SLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYL 400
Query: 190 GKQSICKFKRPNIV-VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ +C+ + V I+ + + E AL+ +AT GP+A++I+AS F+ Y SG+
Sbjct: 401 MQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGV 460
Query: 249 YDDEACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
Y++ AC + D ++H +L +GY ++ +++KN WS +WG +GY+Y+ R NN CG++
Sbjct: 461 YNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNNLCGVS 520
Query: 302 NYAVYAL 308
+ A Y +
Sbjct: 521 SQATYPI 527
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/291 (35%), Positives = 147/291 (50%), Gaps = 16/291 (5%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSP--- 90
+Q N I HN+E QGL Y + N + L + + + + + R
Sbjct: 55 FQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYGGAEVEQPKTRRAGKH 114
Query: 91 ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
E IP +DWREKG + NQ CG+C+AFS +A++G F ++ E+ LS QQ+
Sbjct: 115 ERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLSEQQL 174
Query: 151 VDCSIISGNLGCAGGSLRNTLNY-VQFAG-GLMKEEDYPYKGKQSICKFKRPNIVVDISS 208
VDCS GN GCAGG + N Y + G G E+DYPYKG CKF + IS
Sbjct: 175 VDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSADGVRATISG 234
Query: 209 WSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS-DYVNHAMLLVG 267
++ + +E L +A VGP++V+I+A Q Y G+++ A T +NH + VG
Sbjct: 235 YNDVKQGNETDLLDAVANVGPVSVAIHAGA-ALQFYLRGVFNGVAGTCFGPLNHGVTAVG 293
Query: 268 YTRNS---------WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
Y S WI+KN W WG+ G++ RG N CG+AN A Y L+
Sbjct: 294 YGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPLV 344
>gi|195123219|ref|XP_002006105.1| GI20850 [Drosophila mojavensis]
gi|193911173|gb|EDW10040.1| GI20850 [Drosophila mojavensis]
Length = 329
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 101/302 (33%), Positives = 164/302 (54%), Gaps = 13/302 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM- 74
+Y K Y + + +KL + N ++I HNQ + + Y + N +D+ + + M
Sbjct: 33 EYDKVYESEEEELLRKLIFIDNKRQIDRHNQRYRLNMESYEMGVNQFTDMLDKEFESLML 92
Query: 75 TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
+ + + L+ +P+ E+ +P +DWR +G +TP NQ CGAC++F+ ++G
Sbjct: 93 SSMNTTESDADLLYTPD--EAYALPADIDWRNRGAVTPVKNQGKCGACWSFAATGTLEGM 150
Query: 135 IFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
F T ++ LS Q +VDCS I N GC GG L YV+ GG+ E Y Y+ KQ
Sbjct: 151 HFLKTGKLVSLSEQNLVDCSTIRYFNRGCNGGMPFRALKYVRDNGGIDTEYSYTYEAKQL 210
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ +I ++ + + H L V +A+ GPI+V I+AS + F+ Y G+ +D
Sbjct: 211 SCRYDPLHIGAQVTDVVRVAAGEPH-LAVAVASKGPISVGIHAS-NNFRNYRDGVLNDRQ 268
Query: 254 CTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C NHA+L+VG+ R+ W++KN W WGD GY+ + R +N+CGIA+ AVY
Sbjct: 269 CNK-AANHAVLVVGFGRDPQGGDFWLVKNSWGASWGDGGYIRMSRNRSNQCGIASNAVYP 327
Query: 308 LI 309
L+
Sbjct: 328 LV 329
>gi|195154194|ref|XP_002018007.1| GL17477 [Drosophila persimilis]
gi|198460088|ref|XP_002138780.1| GA24205 [Drosophila pseudoobscura pseudoobscura]
gi|194113803|gb|EDW35846.1| GL17477 [Drosophila persimilis]
gi|198136897|gb|EDY69338.1| GA24205 [Drosophila pseudoobscura pseudoobscura]
Length = 353
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/314 (33%), Positives = 163/314 (51%), Gaps = 36/314 (11%)
Query: 18 KKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL 77
++ YR+ +KK L I N+ A G GY L N L+DL KE+ L
Sbjct: 54 ERAYRESIFAAKKSL--------IDLSNKNADGGFSGYRLYLNPLADLTK----KEIATL 101
Query: 78 THSRIRRT----------LVRSPESNESVLIPDHLDWREKGFITPDWNQE-DCGACYAFS 126
S++ + V +P L P+ DWREKG +TP Q CG+C++F+
Sbjct: 102 LGSKVSESGEKYTNGHINFVTAPNPASGNL-PESFDWREKGGVTPPGFQGMGCGSCWSFA 160
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A++G +F+ T + LS Q +VDC+ GN+GC GG Y++ G+ Y
Sbjct: 161 TTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIR-DHGITLANKY 219
Query: 187 PYKGKQSICK----FKRP--NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
PY + C+ +P +V I ++ + P DE +K +AT+GP+A S+NA+P +
Sbjct: 220 PYTQFEMQCRQNETAGQPPRESIVKIRDYATITPGDEQKMKEVVATLGPLACSMNAAPIS 279
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYM-YLKRGN 295
F+ Y GIY DE C D VNH++++VGY ++ WI+KN +S +WG+ G+M L+
Sbjct: 280 FEQYQGGIYADEECNQDEVNHSVVVVGYGSEGGQDYWIIKNSYSQNWGEGGFMRILRNAG 339
Query: 296 NRCGIANYAVYALI 309
CGIA+ Y ++
Sbjct: 340 GFCGIASECSYPIL 353
>gi|242020372|ref|XP_002430629.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515801|gb|EEB17891.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 346
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/329 (31%), Positives = 165/329 (50%), Gaps = 34/329 (10%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLS 63
KEW + + +Y K Y+ + + + N +I HN + +G Y + EN S
Sbjct: 29 KEWDLF----KAQYGKSYKTPEEEYYRMRIYMDNKLEIVEHNLKFLEGKVSYEMGENQFS 84
Query: 64 DLHPRHYIKEMTRL--THSRIRRTLVRSPE--------------SNESVLIPDHLDWREK 107
D+ E+ L +++I R + E NE V PD+++W E
Sbjct: 85 DMTS----DEVNDLYNGYNKINRMNAKPGEIPALPEMDGVPFVAKNEDV--PDYVNWVEA 138
Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSL 167
G +TP +Q CG+CYAF+ + ++ ++ +LS+Q V+DCS GN GC GG
Sbjct: 139 GAVTPIRDQGACGSCYAFASLATLESRLMIYNKTELQLSVQNVLDCSGEFGNFGCDGGLA 198
Query: 168 RNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATV 227
RN YV G+ E DYPY+ ++ C+F I + + DE ALK +AT
Sbjct: 199 RNVYEYVMDNEGVNNETDYPYEVREGKCRFSSKKFTAKIKDYVSVSYFDEDALKAAVAT- 257
Query: 228 GPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY------TRNSWILKNWWSH 281
GP++VS++AS F+ Y G+Y D+ C+S +NHA++ VGY ++ W+++N W
Sbjct: 258 GPVSVSMDASSPAFKKYKGGVYTDDKCSSMKLNHAVVAVGYGTDPDTKQDYWLVRNSWGT 317
Query: 282 HWGDNGYMYLKR-GNNRCGIANYAVYALI 309
WG+ GY + R +N CG+A V+ +
Sbjct: 318 AWGERGYFKIARNADNMCGLATRPVFPTL 346
>gi|74917819|sp|Q6YD92.1|SILIC_PETFI RecName: Full=Silicatein; Flags: Precursor
gi|37724090|gb|AAO23671.1| silicatein [Petrosia ficiformis]
Length = 339
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 102/307 (33%), Positives = 157/307 (51%), Gaps = 20/307 (6%)
Query: 21 YRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT-- 78
Y + + ++ + WQ N + I HN+ +Q GYTL N D+ + + M +
Sbjct: 35 YESEHEERRRHVVWQQNQEYIDQHNKYKEQ--FGYTLEMNKFGDMSNAEFAELMMCVQDY 92
Query: 79 --HSRIRRTL---------VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
H + +L VR ++ +V +P+ +DWR G +T +Q CG YAF+
Sbjct: 93 NHHGNLTESLLADNKFKGRVREYQAPATVSLPETVDWRTGGAVTHVKDQLRCGCSYAFAA 152
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
A++G + LS Q V+DCS+ GN GC+ + N YV GGL YP
Sbjct: 153 VGALEGAAALARGRTASLSEQNVLDCSVPYGNHGCSCEDVNNAFMYVIDNGGLDTTSSYP 212
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y +Q CKFK + + + DE +L+ LAT GP+AV I+AS +FQ Y G
Sbjct: 213 YVSRQYYCKFKSSGVGATATGIVTISSGDESSLESALATAGPVAVYIDASHSSFQFYKYG 272
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+ + C+ ++HAM+L+GY S W+LKN W +WG +GY+ + RG +N+CGIA
Sbjct: 273 VLNVPNCSRSKLSHAMILIGYGTTSSKKYWLLKNSWGPNWGISGYIKMSRGMSNQCGIAT 332
Query: 303 YAVYALI 309
YA + +
Sbjct: 333 YASFPTL 339
>gi|377823949|gb|AFB77219.1| cathepsin L1 [Fasciola gigantica]
Length = 326
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 153/310 (49%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYGCMGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
GIY + C+ VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 RGGIYQSQTCSPLGVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|410990006|ref|XP_004001241.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 328
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 150/289 (51%), Gaps = 16/289 (5%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
++ WQ N K I HN+E QG H + + N D+ + M L + V
Sbjct: 47 RRAVWQKNMKMIMQHNREYLQGKHSFLMAMNGFGDMTNEEFRHMMIGLKIQKNENGTVFK 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
+ +P ++ R+ TPD + C + +AFS A AI+GQIF+ + LS+Q
Sbjct: 107 VPFPAGISLP--MNRRQ----TPDISLGRCASGWAFSAAGAIEGQIFRKYGKRVSLSVQN 160
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
++DCS GN GC GG + N YV+ GL EE YPY + CK++ +++++
Sbjct: 161 LLDCSQAEGNEGCNGGLMSNAFQYVRNNRGLDTEESYPYVARDGPCKYRPEYSAANVTAF 220
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P ++E AL V + +G I+ +I+AS TF+ Y GIY D C+S+ +NH +L+VGY
Sbjct: 221 QTIPRREE-ALLVAMKNMGSISAAIDASLDTFRFYKGGIYYDPKCSSEDLNHGVLVVGYG 279
Query: 269 -------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+ W +KN W WG GY+ + R NN CGIA A + ++
Sbjct: 280 FQGKESDNQKYWFVKNSWGTDWGMGGYIKMARERNNNCGIATRASFPIV 328
>gi|194755357|ref|XP_001959958.1| GF13132 [Drosophila ananassae]
gi|190621256|gb|EDV36780.1| GF13132 [Drosophila ananassae]
Length = 392
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 147/300 (49%), Gaps = 10/300 (3%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT--- 75
K Y A ++ + ++ + N G + L N SDL ++ ++T
Sbjct: 94 KTYASAAEQQLRETAFSASKSLVDAGNAAFASGASTFKLAVNAFSDLTHSEFLSQLTGRK 153
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
R + + + P S + +P+ DWR+ G +TP NQ CG+C+AF+ I+G I
Sbjct: 154 RSSQGDAQAAASKQPPSVPAGAVPESFDWRQHGAVTPVKNQGTCGSCWAFATTGTIEGHI 213
Query: 136 FKSTSEIEELSIQQVVDCSIISGNL-GCAGGSLRNTLNYV-QFAGGLMKEEDYPYKGKQS 193
++T + LS Q +VDC L GC GG + ++ + G+ E Y Y KQ
Sbjct: 214 ARATGNLPVLSEQNLVDCGPQEFALVGCDGGYQGYAMAFIHENQKGVSNSESYAYLDKQD 273
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
CK+ I W+ +P DE LK + T+GP+A S+ + T Y SGIY DE
Sbjct: 274 TCKYNPSTSAAQIKGWAEIPVGDEELLKKVVGTLGPVACSLYGT-ETLLNYDSGIYSDEQ 332
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
C + NH++L+VGY ++ WI+KN WS WG++GY L RG N C IA Y ++
Sbjct: 333 CNGEDPNHSVLVVGYGSENGQDYWIVKNSWSAAWGEDGYFRLVRGKNFCNIAAECAYPVV 392
>gi|8547325|gb|AAF76330.1|AF271385_1 cathepsin L [Fasciola hepatica]
Length = 326
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 155/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y D ++ + W+ N K I HN GL Y L N +D+
Sbjct: 25 KRIYNKEYNGADDDHRRNI-WEQNVKHIQEHNLRHDLGLVTYKLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P +PD +DWRE G++T +Q CG+C+AF
Sbjct: 84 YLTEMPRASELLSHG--------IPYKANKRAVPDRIDWRESGYVTEVKDQGGCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S A++GQ K+ S QQ+VDCS GN GC GG + N Y++ GL E
Sbjct: 136 STTGAMEGQYMKNQRTSISFSEQQLVDCSRDFGNYGCNGGLMENAYEYLKRF-GLETESS 194
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY+ + C++ V ++ + + DE L+ + GP AV+++ F +Y
Sbjct: 195 YPYRAVEGQCRYNEQLGVAKVTGYYTVHSGDEVELQNLVGAEGPAAVALDVESD-FMMYR 253
Query: 246 SGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGY--MYLKRGNNRCG 299
SGIY + C+ D +NH +L VGY + WI+KN W WG++GY M KRG N CG
Sbjct: 254 SGIYQSQTCSPDRLNHGVLAVGYGIQDGTDYWIVKNSWGTWWGEDGYIRMVRKRG-NMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASVPMV 322
>gi|301628647|ref|XP_002943461.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 334
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 165/302 (54%), Gaps = 9/302 (2%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K ++K Y+ + ++ W+ K I HN E GLH Y + NHL D+ M
Sbjct: 35 KTHQKSYKDWEEERARRTIWEETLKFITAHNLEYSLGLHTYEVGMNHLGDMTGEEVAATM 94
Query: 75 TRLTHSRIR-RTLVRSPESNESVLIPDHLDWREKGFITPDWNQE-DCGACYAFSIASAIQ 132
T T S +V P+ L P +DWR KG +TP NQ CG+CYAFS A++
Sbjct: 95 TGYTDSGDSLDNVVHVPKQILEALPPASIDWRTKGCVTPVRNQGLFCGSCYAFSAVGALE 154
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
Q K T + S Q++VDCS GN GC GGSL + Y++ G+M++ Y Y K+
Sbjct: 155 CQWKKKTRSLVTFSPQELVDCSDSEGNNGCKGGSLNASFTYMK-KNGVMEDSAYKYTEKK 213
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
CK K+P+ + + L +E ALK + GP++V I++S F++Y SG+Y D
Sbjct: 214 EPCKKKKPSNTGVVKQFYRLHAGNETALKKAVGIDGPVSVVIDSSCPGFRMYNSGVYYDP 273
Query: 253 ACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
CT++ ++H++L+VGY ++ W++KN W WG+ GY+ + R NN CGIA+YA Y
Sbjct: 274 YCTTN-LDHSVLVVGYGTDNGNDYWLIKNSWGIGWGEKGYVKMARNRNNHCGIASYAYYL 332
Query: 308 LI 309
+
Sbjct: 333 TV 334
>gi|195379514|ref|XP_002048523.1| GJ11310 [Drosophila virilis]
gi|194155681|gb|EDW70865.1| GJ11310 [Drosophila virilis]
Length = 328
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/299 (36%), Positives = 157/299 (52%), Gaps = 17/299 (5%)
Query: 22 RKKATDSKKKLH---WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI-KEMTRL 77
R A DS++ L ++ N K I THN + G Y + N +DL P ++ + M L
Sbjct: 36 RSYAGDSEELLRRRIFEDNKKLIDTHNARYEAGKETYKMGVNEFTDLLPSEFVSRMMGSL 95
Query: 78 THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
+ + + P +N + IP+ +DWR KG ++P NQ CG+C+ F+ ++GQ F
Sbjct: 96 NRTAVTADYIYEPSAN--LQIPESIDWRTKGAVSPVKNQGTCGSCWTFAAVGTLEGQSFL 153
Query: 138 STSEIEELSIQQVVDCSIISG--NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
T + ELS Q ++DCS N GC G + L YV+ GL YPY+G Q C
Sbjct: 154 RTKRMVELSEQNLLDCSSHPPYRNHGCQRGYPYDALRYVKDNQGLDTRSSYPYQGVQGRC 213
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+F++ ++ V I + + DE AL+ +A GPIAV I+ Q Y SGIY + C
Sbjct: 214 RFRKEHVGVRIKGVATVRSGDERALQAAVAEKGPIAVGIDV--QHLQHYHSGIY-NRPCF 270
Query: 256 SDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
HA++LVGY R+ W+LKN W +WG+ GY + R + N C IAN AVY L
Sbjct: 271 GPAFLHAVVLVGYGRDRGHDYWLLKNSWG-NWGEAGYFRMARNSRNLCYIANDAVYPLF 328
>gi|545734|gb|AAB30089.1| cysteine protease [Fasciola sp.]
gi|2662308|dbj|BAA23743.1| cathepsin L [Fasciola hepatica]
Length = 325
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/309 (33%), Positives = 152/309 (49%), Gaps = 25/309 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N L+D+
Sbjct: 25 KRMYNKEY-NGAVDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQLTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYGCMGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + GP AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGI 300
+ GIY C+S VNHA+L VGY WI+KN W WG+ ++ N CGI
Sbjct: 253 SGGIYQSRTCSSLRVNHAVLAVGYGTQGGTDYWIVKNSWGSSWGERYIRMVRNRGNMCGI 312
Query: 301 ANYAVYALI 309
A+ A ++
Sbjct: 313 ASLASLPMV 321
>gi|74765984|sp|Q24940.1|CATLL_FASHE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
gi|497700|gb|AAA29136.1| cathepsin [Fasciola hepatica]
Length = 326
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 156/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM+R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMSRASDILSHG--------VPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC+GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ + V ++ + + E LK + P AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
SGIY + C+ VNHA+L VGY WI+KN W +WG+ GY+ + R N CG
Sbjct: 253 RSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGERGYIRMARNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|40060518|gb|AAR37420.1| papain-like cysteine proteinase [Trichomonas vaginalis]
Length = 284
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 152/278 (54%), Gaps = 18/278 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+T+ N L+ L P Y L R+ + ++ SN
Sbjct: 14 WLSNKRLVQEHNRAN----LGFTVALNKLAHLTPAEY----RSLLGFRMSKKQFKATASN 65
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ D DWR++G + P NQ CG+C+AFS A + Q + +++ LS Q +VDC
Sbjct: 66 --AVAADSCDWRQQGKVNPVKNQGQCGSCWAFSTVQAQESQYAIAHGQLQSLSEQNLVDC 123
Query: 154 SIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVL 212
++ GC GG + +YV + G M E+DYPY + CKF +++S+ +
Sbjct: 124 --VTECYGCNGGLMTAAYDYVIRPQGKFMLEDDYPYTARDGSCKFDSKKGTSNVASYVTV 181
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY---- 268
DE L ++T+GP A++I+AS +FQLY+SGIYD+ AC+S ++H + VGY
Sbjct: 182 NEGDEKDLAKKVSTLGPAAIAIDASAWSFQLYSSGIYDESACSSVNLDHGVGCVGYGTEG 241
Query: 269 TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAV 305
++N WI++N W WG+ GY+ +K NN+CG A A
Sbjct: 242 SKNYWIVRNSWGESWGEKGYIRMIKDKNNQCGEATMAC 279
>gi|379991182|emb|CCA61803.1| cathepsin protein CatL1-MM3p, partial [Fasciola hepatica]
Length = 326
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 157/306 (51%), Gaps = 18/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y D ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEYNGD-DDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+R S I V +N +V PD +DWRE G++T +Q +CG+C+AFS
Sbjct: 84 YLTEMSRA--SDILSHGVPYEANNRAV--PDKIDWRESGYVTEVKDQGNCGSCWAFSTTG 139
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
++GQ K+ S QQ+VDCS GN GC+GG + N Y+ QF GL E YPY
Sbjct: 140 TMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF--GLETESSYPY 197
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ C++ + V ++ + + E LK + GP AV+++ F +Y+ GI
Sbjct: 198 TAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMYSGGI 256
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y + C+ +NHA+L VGY WI+KN W +WG+ GY+ + R N CGIA+
Sbjct: 257 YQSQTCSPLGLNHAVLAVGYGTQGGTDYWIVKNSWGSYWGERGYIRMARNRGNMCGIASL 316
Query: 304 AVYALI 309
A ++
Sbjct: 317 ASLPMV 322
>gi|123391522|ref|XP_001300085.1| Clan CA, family C1, cathepsin L or K-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121881065|gb|EAX87155.1| Clan CA, family C1, cathepsin L or K-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 285
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/279 (34%), Positives = 151/279 (54%), Gaps = 19/279 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+T+ N L+ L P Y L R+ + ++ SN
Sbjct: 14 WLSNKRLVQEHNRAN----LGFTVALNKLAHLTPAEY----RSLLGFRMSKKQFKATASN 65
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ D DWR++G + P NQ CG+C+AFS A + Q + +++ LS Q +VDC
Sbjct: 66 --AVAADSCDWRQQGKVNPVKNQGQCGSCWAFSTVQAQESQYAIAHGQLQSLSEQNLVDC 123
Query: 154 SIISGNLGCAGGSLRNTLNYV--QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
++ GC GG + +YV G M E+DYPY + CKF +++S+
Sbjct: 124 --VTECYGCNGGLMTAAYDYVIRNQKGKFMLEDDYPYTARDGSCKFDSKKGTSNVASYVT 181
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L ++T+GP A++I+AS +FQLY+SGIYD+ AC+S ++H + VGY
Sbjct: 182 VNEGDEKDLAKKVSTLGPAAIAIDASAWSFQLYSSGIYDESACSSVNLDHGVGCVGYGTQ 241
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAV 305
++N WI++N W WG+ GY+ +K NN+CG A A
Sbjct: 242 GSKNYWIVRNSWGESWGEKGYIRMIKDKNNQCGEATMAC 280
>gi|473159|emb|CAA83538.1| cathepsin L [Schistosoma mansoni]
Length = 317
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/308 (36%), Positives = 166/308 (53%), Gaps = 19/308 (6%)
Query: 15 KKYKKDYRKKATDS---KKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
K++K Y K +DS ++K + +KI HN GL GYT+ N D+
Sbjct: 16 KQWKLKYNKTYSDSNEIRRKAIFMRYVEKIQQHNLRHDLGLEGYTMGLNQFCDMD----W 71
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLI-----PDHLDWREKGFITPDWNQEDCGACYAFS 126
+E+ + S++ + E + + P DWR+ G +TP NQ CG+C+AFS
Sbjct: 72 EEIKTIMLSKVFGNSPLWDDKKEELELSNDPLPSKWDWRDHGAVTPVKNQGLCGSCWAFS 131
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
A A++GQ+ K ++ LS QQ+VDCS GN GC GG++ + Y++ + E+DY
Sbjct: 132 AAGAVEGQLVKKHKKLISLSEQQLVDCSYKYGNDGCQGGTMDQSFAYLE-KYPIESEKDY 190
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
Y G S C F++ VV + + LP +DE L+ L GPI+V+I+A LY S
Sbjct: 191 KYIGHDSSCHFRKSKGVVKVKKFVDLPARDEEKLQKALYHYGPISVAIDALDDLI-LYKS 249
Query: 247 GIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIA 301
GIY+ + C+S +NH +L VGY R + W++KN W WG NGY L+R +N CGIA
Sbjct: 250 GIYESKQCSSFLLNHGVLAVGYGRENRKDYWLIKNSWGTTWGMNGYFKLRRNKHNMCGIA 309
Query: 302 NYAVYALI 309
A + L+
Sbjct: 310 TNASFPLL 317
>gi|339252572|ref|XP_003371509.1| cathepsin L1 [Trichinella spiralis]
gi|316968239|gb|EFV52542.1| cathepsin L1 [Trichinella spiralis]
Length = 448
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 173/338 (51%), Gaps = 48/338 (14%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y ++ ++ ++ + +N K+ HN++ G Y+++ N SDL +++ M
Sbjct: 112 KTYANESEENYRREVFYANRLKVIRHNEQFDGGAKSYSMKLNKYSDLTHGEFVQLMNGFK 171
Query: 79 ----HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS----- 129
R + V P L P ++DWR +G +TP +Q CG+C+AFS +
Sbjct: 172 IASKSGDYRPSSVFKPLLFTGDL-PLNVDWRSEGMVTPVKDQGHCGSCWAFSAVNSNALH 230
Query: 130 ----------AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
A++GQ + T ++ LS Q ++DCS GN GC+GG + N YV+ G
Sbjct: 231 VHSRAFQQTGALEGQNKRKTGKLVSLSEQNLIDCSRKYGNKGCSGGLMDNAFEYVKENHG 290
Query: 180 LMKEEDYPYKGKQSI----CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSIN 235
+ EE YPY+ + C+FK I + + P +E L +AT+GP++V+I+
Sbjct: 291 IDTEESYPYEAAVRMLDKKCRFKNSTIGATDKGFVDIEPGNETYLMHAVATIGPLSVAID 350
Query: 236 ASPHTFQLYAS-------------------GIYDDEACTSDYVNHAMLLVGY----TRNS 272
AS +FQ Y+S G+Y + C+S +++H +L+VGY ++
Sbjct: 351 ASHESFQFYSSGMLLMVDIFNTVEVMWTNLGVYFEPMCSSQFLDHGVLVVGYGSLKGKDY 410
Query: 273 WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
WI+KN W WG++GY+++ R NN CGIA++A Y +I
Sbjct: 411 WIVKNSWGTSWGNDGYIFMARNKNNSCGIASFASYPII 448
>gi|91092016|ref|XP_970773.1| PREDICTED: similar to cathepsin-L-like midgut cysteine proteinase
[Tribolium castaneum]
gi|270001248|gb|EEZ97695.1| cathepsin L precursor [Tribolium castaneum]
Length = 314
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 155/301 (51%), Gaps = 14/301 (4%)
Query: 17 YKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTR 76
+KK+Y K + K+ + N KI HN + + G Y N DL ++ + R
Sbjct: 20 HKKEYSTKTEEMKRLAIFTENLSKIDAHNTKYRNGEVTYFKAMNKFGDLTTDEFLAFVNR 79
Query: 77 LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
++ + + + + +DWR G ++ N++DC + ++FS A++GQ+
Sbjct: 80 NKLTKREKNEKHTKLNTTKIEYETQVDWRANGLVSDVKNEQDCSSSWSFSALGAVEGQLA 139
Query: 137 KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPYKGKQSIC 195
T+++ LS Q ++DCS + GC GG N +Y+ QF G+M E+DYPY+GK +C
Sbjct: 140 LKTNQLTSLSAQNLIDCS---ADFGCNGGHATNAYSYISQF--GIMPEKDYPYEGKAGVC 194
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+F + ++ + + P DE AL+ LA +GPIA +I A+ Q Y GI DE C
Sbjct: 195 RFDASKSITTVTGFYDIDPNDETALQGALAMMGPIAATIEATEE-LQFYKGGILLDEKCN 253
Query: 256 SDY--VNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
S +NH +L+VGY + WI+KN W WG+ GY R N CGIA+ A +
Sbjct: 254 SKVPDLNHGVLVVGYGSENGGDFWIVKNSWGSDWGEGGYYRPVRNHGNNCGIASSATLPI 313
Query: 309 I 309
+
Sbjct: 314 L 314
>gi|320543907|ref|NP_001188921.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
gi|318068589|gb|ADV37168.1| cysteine proteinase-1, isoform D [Drosophila melanogaster]
Length = 249
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 130/220 (59%), Gaps = 6/220 (2%)
Query: 96 VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI 155
V +P +DWR KG +T +Q CG+C+AFS A++GQ F+ + + LS Q +VDCS
Sbjct: 30 VTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCST 89
Query: 156 ISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
GN GC GG + N Y++ GG+ E+ YPY+ C F + + ++ +P
Sbjct: 90 KYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQG 149
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS--- 272
DE + +ATVGP++V+I+AS +FQ Y+ G+Y++ C + ++H +L+VG+ +
Sbjct: 150 DEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGE 209
Query: 273 --WILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVYALI 309
W++KN W WGD G++ L+ N+CGIA+ + Y L+
Sbjct: 210 DYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 249
>gi|158263969|gb|ABW24657.1| cathepsin L [Fasciola hepatica]
Length = 326
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 158/306 (51%), Gaps = 18/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPR 68
++ Y K+Y D + +L+ W+ N K I HN +GL Y L N +DL
Sbjct: 25 KRMYNKEY--NGADDEHRLNIWEQNVKHIEEHNLRHDRGLVTYKLGLNQFTDLTFEEFKA 82
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
Y+ EM+ ++ S + + E N+ +P +DWR+ G++T +Q CG+C+AFS
Sbjct: 83 KYLMEMSPVSES-LSDGISYEAEGND---VPASIDWRQYGYVTEVKDQGQCGSCWAFSAV 138
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
AI+GQ K S QQ+VDC+ GN GC+GG + N Y++ GL YPY
Sbjct: 139 GAIEGQYVKKFRNRMLFSEQQLVDCTKRFGNHGCSGGWMENAYRYLK-DSGLETASYYPY 197
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ + C+++R V ++ + DE L + GP AV+++A F +Y SGI
Sbjct: 198 QAWEYQCQYRRELGVAKVTGAYTVHSGDEMRLMQMVGREGPAAVAVDAQS-DFYMYQSGI 256
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
+ + CTS V HA+L VGY S WILKN W WG++GYM R NN C IA+
Sbjct: 257 FQSQTCTSQRVTHAVLAVGYGTESGTDYWILKNSWGKWWGEDGYMRFARNRNNMCAIASV 316
Query: 304 AVYALI 309
A ++
Sbjct: 317 ASVPMV 322
>gi|114796866|gb|ABI79445.1| cysteine proteinase 5 [Entamoeba histolytica]
Length = 289
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 141/242 (58%), Gaps = 17/242 (7%)
Query: 74 MTRLTHSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
MT ++ + + V + E V +P+ +DWR KG + +Q CG+CY+F+
Sbjct: 50 MTEAEYNSMLKPFVIDKQHEEIVYDSRGDVPESVDWRAKGKVPAIRDQASCGSCYSFASV 109
Query: 129 SAIQGQIF-----KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE 183
+AI+G++ K T + +LS QQ+VDCS+ GN GC GGSL + YV+ G+M+E
Sbjct: 110 AAIEGRLLVAGSKKFTVDDLDLSEQQLVDCSVSVGNKGCNGGSLLLSFRYVKL-NGIMQE 168
Query: 184 EDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
+DYPY + C + + + V I+ ++ P E AL + A GP+ +I+AS FQL
Sbjct: 169 KDYPYVAAEETCTYDKKKVAVKITGQKLVRPGSEKAL-MRAAAEGPVGAAIDASGVKFQL 227
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRC 298
Y SGIY+ + C+S +NH + +VGY T+N WI++N W WGD GY+ + R NN+C
Sbjct: 228 YKSGIYNSKECSSTQLNHGVAVVGYGTQNGTEYWIVRNSWGTIWGDQGYVLMSRNKNNQC 287
Query: 299 GI 300
GI
Sbjct: 288 GI 289
>gi|167427529|gb|ABZ80401.1| cathepsin L4, partial [Fasciola hepatica]
Length = 303
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 105/309 (33%), Positives = 154/309 (49%), Gaps = 24/309 (7%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL-------- 65
++ Y K+Y D ++ W+ N K I HN GL YTL N +D+
Sbjct: 2 KRMYNKEY-NGVDDVHRRNIWEENVKHIQEHNIRHDLGLVTYTLGLNQFTDMTFEEFKAK 60
Query: 66 HPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
H R + L+H P +P+ +DWRE G++T +Q DCG+C+AF
Sbjct: 61 HLREIPRASDMLSHG--------IPYEANDRAVPESIDWREFGYVTEVKDQGDCGSCWAF 112
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S A++GQ K+ S QQ+VDCS GN GC GG + N Y++ GL E
Sbjct: 113 STTGAVEGQYMKNPKANISFSEQQLVDCSGDYGNHGCNGGFMENAYEYLERR-GLETESS 171
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPYK ++ CK+ VV++ + + E L + GP AV+++ F +Y
Sbjct: 172 YPYKAEEGPCKYDSRLGVVEVFGYFIEHSGIESKLAHLVGDKGPAAVAVDVES-DFLMYR 230
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
GIY C+S+ +NHAML+VGY WI+KN W WGD+GY+ + R +N CGI
Sbjct: 231 GGIYASRNCSSEKLNHAMLVVGYGTQDGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGI 290
Query: 301 ANYAVYALI 309
A+ A ++
Sbjct: 291 ASAASVPVV 299
>gi|195995651|ref|XP_002107694.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
gi|190588470|gb|EDV28492.1| hypothetical protein TRIADDRAFT_36902 [Trichoplax adhaerens]
Length = 544
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 167/310 (53%), Gaps = 14/310 (4%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
IF K++K+Y+ + ++ ++ N + IH+ N++ G+T++ NHL+DL
Sbjct: 239 IFHHFASKHQKNYKDERERRFRENTFRQNLRFIHSTNRQRL----GFTVKVNHLADLTDN 294
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESV--LIPDHLDWREKGFITPDWNQEDCGACYAFS 126
IK M S + + P + + + +DWR+ G +TP +Q CG+C++F
Sbjct: 295 E-IKVMNGRKTSLKKSKTYQMPFNLTGLERYVAPTIDWRKLGAVTPVKDQGVCGSCWSFG 353
Query: 127 IASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
I+G ++ + ++ LS Q ++DC+ GN GC GG ++ GG+ E+ Y
Sbjct: 354 TTGTIEGSLYLKSGKLVSLSQQNMIDCTWGFGNNGCDGGEEFRAFEWIAKHGGIATEKSY 413
Query: 187 -PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
Y + CK + I I W +P ++ ALK+ ++ VGP+AV ++A+ +F Y+
Sbjct: 414 GQYLAQDGKCKLNKTKIGAKIRGWVQVPHGNQSALKLAVSAVGPVAVGMDAALKSFSFYS 473
Query: 246 SGIYDDEACTSDY--VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCG 299
SGIY D+ C + ++HA+L VGY ++ WI+KN WS HWGD+GY+ L NN CG
Sbjct: 474 SGIYYDKQCGNKEQDLDHAVLAVGYGNENGQDYWIIKNSWSTHWGDDGYVKLSMKNNNCG 533
Query: 300 IANYAVYALI 309
IA A + I
Sbjct: 534 IATDASFVNI 543
>gi|194883260|ref|XP_001975721.1| GG20405 [Drosophila erecta]
gi|190658908|gb|EDV56121.1| GG20405 [Drosophila erecta]
Length = 352
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 100/290 (34%), Positives = 154/290 (53%), Gaps = 26/290 (8%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM---------TRLTHSRIRRTLVRSPE 91
I N+ A G+ G+ L N L+D+ R I + R T+ I R+P
Sbjct: 68 ISLSNKNADSGISGFRLGVNMLADM-TRKEISSLLGSKVSEFGERYTNGHINFVTARNPA 126
Query: 92 SNESVLIPDHLDWREKGFITPDWNQE-DCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
S +P+ DWREKG +TP Q CGAC++F+ +++G I++ T+ + LS Q +
Sbjct: 127 SAN---LPESFDWREKGGVTPPGFQGVGCGACWSFATTGSLEGHIYRRTAVLASLSQQNL 183
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK----FKRP--NIVV 204
VDC+ GN+GC GG Y++ G + + YPY + C+ RP +V
Sbjct: 184 VDCADDYGNMGCDGGFQEYGFEYIRDHGVTLANK-YPYTQTEMKCRQNDTVGRPPRESLV 242
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
I ++ + P DE +K +AT+GP+A S+NA +F+ Y GIY+DE C VNH++
Sbjct: 243 KIRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYGGGIYEDEECNQGEVNHSVT 302
Query: 265 LVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
+VGY R+ WI+KN +S +WG+ G+M L R CGIA+ Y ++
Sbjct: 303 VVGYGSENGRDYWIIKNSYSQNWGEGGFMRLIRNAGGFCGIASECSYPIL 352
>gi|222820541|gb|ACM67632.1| cathepsin 2L [Fasciola hepatica]
Length = 326
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 157/305 (51%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++KY K+Y + ++ + W+ N K I HN GL YTL N +DL
Sbjct: 25 KRKYNKEYNGADNEHRRNV-WEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDLTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ + S + + E N+ +P +DWR+ G++T +Q CG+C+AFS
Sbjct: 84 YLIEMSPESES-LSDGISYEAEGND---VPASIDWRQYGYVTEVKDQGQCGSCWAFSAVG 139
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VDC+ GN GC+GG + N Y++ GL YPY+
Sbjct: 140 AIEGQYVKKFQNRMLFSEQQLVDCTKRFGNHGCSGGWMENAYRYLK-DSGLETASYYPYQ 198
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C+++R V ++ + DE L + GP AV+++A F +Y SGI+
Sbjct: 199 AWEYQCQYRRELGVAKVTGAYTVHSGDEMRLMQMVGREGPAAVAVDAQS-DFYMYKSGIF 257
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ CT+ V HA+L VGY S WILKN W WG++GYM R NN C IA+ A
Sbjct: 258 MSQVCTTQRVTHAVLAVGYGTESGTDYWILKNSWGKWWGEDGYMRFARNRNNMCAIASVA 317
Query: 305 VYALI 309
++
Sbjct: 318 SVPMV 322
>gi|1093503|prf||2104214A Cys protease
Length = 255
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 81/220 (36%), Positives = 130/220 (59%), Gaps = 6/220 (2%)
Query: 96 VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI 155
V +P +DWR KG +T +Q CG+C+AFS A++GQ F+ + + LS Q +VDCS
Sbjct: 36 VTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCST 95
Query: 156 ISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
GN GC GG + N Y++ GG+ E+ YPY+ C F R + ++ +P
Sbjct: 96 KYGNNGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDSCHFNRAQVGATDRGFTDIPQG 155
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS--- 272
DE + +ATVGP++V+I+AS +FQ Y+ G+Y++ C + ++H +L+VG+ +
Sbjct: 156 DEKKMPEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGE 215
Query: 273 --WILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVYALI 309
W++KN W WGD G++ L+ N+CGIA+ + Y L+
Sbjct: 216 DYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSYPLV 255
>gi|16506723|gb|AAL23917.1|AF419329_1 cathepsin L [Fasciola gigantica]
Length = 326
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 157/305 (51%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++KY K+Y + ++ + W+ N K I HN GL YTL N +DL
Sbjct: 25 KRKYNKEYNGADNEHRRNV-WEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDLTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ + S + + E N+ +P +DWR+ G++T NQ CG+C+AFS
Sbjct: 84 YLIEMSPESES-LSDGISYEAEGND---VPASIDWRQYGYVTEVKNQGQCGSCWAFSAVG 139
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VDC+ GN GC+GG + N Y++ GL YPY+
Sbjct: 140 AIEGQYVKKFRNRMLFSEQQLVDCTKRFGNHGCSGGWMENAYRYLK-DSGLETASYYPYQ 198
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C+++R V +++ + DE L + GP AV+++A F +Y SGI+
Sbjct: 199 AWEYQCQYRRELGVAEVTGAYTVHSGDEMRLMQMVGREGPAAVAVDAQS-DFYMYKSGIF 257
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ CT+ V HA+L VGY S WI KN W WG++GYM R NN C IA+ A
Sbjct: 258 MSQVCTTQRVTHAVLAVGYGTESGTDYWISKNSWGKWWGEDGYMRFARNRNNMCAIASVA 317
Query: 305 VYALI 309
++
Sbjct: 318 SVPMV 322
>gi|194882211|ref|XP_001975206.1| GG20691 [Drosophila erecta]
gi|190658393|gb|EDV55606.1| GG20691 [Drosophila erecta]
Length = 378
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 150/303 (49%), Gaps = 15/303 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A + + + S + N +G+ + N +DL ++ ++T L
Sbjct: 79 KTYLSAADRALHERAFASTKNVVDAGNAAFAKGVSTFKQSVNAFADLTHPEFLSQLTGLK 138
Query: 79 HSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
S + R+ S + V+ IPD DWRE G +TP Q CG+C+AF+ AI+G
Sbjct: 139 RSPEAK--ARAAASLKEVILPKKPIPDAFDWREHGGVTPVKFQGTCGSCWAFATTGAIEG 196
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNL--GCAGGSLRNTLNYV-QFAGGLMKEEDYPYKG 190
F+ T + LS Q +VDC + GC GG ++ + G+ + YPYK
Sbjct: 197 HTFRKTGSLPNLSEQNLVDCGPLEDFSLNGCDGGFQEAAFCFIDEVQKGVSQAGAYPYKD 256
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ CK+ + ++ +PP+DE LK +AT+GP+A S+N T + YA GIY+
Sbjct: 257 NKETCKYDGKKSGASLKGFAAIPPKDEEQLKKVVATLGPVACSVNGL-ETLKNYAGGIYN 315
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
D+ C NH++L+VGY ++ WI+KN W WG+ GY L RG N C IA Y
Sbjct: 316 DDECNKGEPNHSILVVGYGSENGQDYWIIKNSWDDTWGEQGYFRLPRGQNYCFIAEECSY 375
Query: 307 ALI 309
++
Sbjct: 376 PVV 378
>gi|156384930|ref|XP_001633385.1| predicted protein [Nematostella vectensis]
gi|156220454|gb|EDO41322.1| predicted protein [Nematostella vectensis]
Length = 548
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 154/302 (50%), Gaps = 11/302 (3%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
KK+KK+Y+ ++ H++ N + IH+ N+ GY L NHL D +
Sbjct: 250 KKHKKNYKDNKEHHTRREHFKHNLRFIHSKNRRHA----GYYLAMNHLGDRSDKELRVLR 305
Query: 75 TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
R L P+ +PD ++W +G +TP +Q CG+C++F I+G
Sbjct: 306 GRRYTKGYNGGLPYKPDMASINDVPDEMNWVIRGAVTPVKDQAVCGSCWSFGTTGTIEGT 365
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYKGKQS 193
+F T + LS Q ++DCS GN C GG + Y+ +GG+ EE Y PY G
Sbjct: 366 LFLKTKYLTRLSQQNLMDCSWGEGNNACDGGEDFRSYQYIMKSGGIATEESYGPYLGADG 425
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C K I I+ + + D ALK +A GPI+VSI+AS + Y+ G+Y +
Sbjct: 426 YCHKKDAEIGATITGYVNITEGDLSALKTAIAQKGPISVSIDASHKSLSFYSYGVYYEPK 485
Query: 254 C--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYA 307
C ++ ++H++L VGY + W++KN WS HWG NGY+ + + +N CG+A A Y
Sbjct: 486 CGNKNEDLDHSVLAVGYGTMDGKPYWMIKNSWSTHWGMNGYVLMSQKDNNCGVATAATYV 545
Query: 308 LI 309
L+
Sbjct: 546 LM 547
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 155/305 (50%), Gaps = 27/305 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y K + ++SN ++ H Q +HG T SDL P + ++
Sbjct: 55 KAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDPSAVHGVT----KFSDLTPAEFHRK 110
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
L R+ ++P + L P DWR+KG +T +Q CG+C++FS A++G
Sbjct: 111 FLGLKPLRLPAHAQKAPILPTNNL-PKDFDWRDKGAVTNVKDQGSCGSCWSFSTTGALEG 169
Query: 134 QIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
F +T E+ LS QQ+VDC + S + GC GG + N Y+ +GG+ +E+DY
Sbjct: 170 AHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGSGGVQREKDY 229
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ CKF + I +S++SV+ DE + L GP+AV+INA Q Y
Sbjct: 230 PYTGRDGTCKFDKSKIAASVSNYSVI-SLDEEQIAANLVKNGPLAVAINAV--YMQTYVG 286
Query: 247 GIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMYLKRGN 295
G+ C +++H +LLVGY + WI+KN W +WG+NGY + RG
Sbjct: 287 GVSCPYIC-GKHLDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWGENGYYKICRGR 345
Query: 296 NRCGI 300
N CG+
Sbjct: 346 NVCGV 350
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 157/309 (50%), Gaps = 27/309 (8%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + K+ K Y + + +++N ++ H + HG T SDL P
Sbjct: 44 FTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLHAKLDPSAEHGVT----KFSDLTPEE 99
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ ++ L R+ T ++P S L P++ DWR+KG +TP NQ CG+C+AFS
Sbjct: 100 FKRQYLGLKPLRLPSTANKAPILPTSDL-PENFDWRDKGAVTPVKNQGSCGSCWAFSTTG 158
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMK 182
A++G + ST E+ LS QQ+VDC + + + GC GG + N +Y+ AGG+
Sbjct: 159 ALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFDYILQAGGVQT 218
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E+DYPY G+ CKF + + ++++SV+ DE + L GP+AV INA Q
Sbjct: 219 EKDYPYSGRDETCKFDKSKVAATVANFSVV-SLDEDQIAANLVKHGPLAVGINAI--FMQ 275
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGY-----------TRNSWILKNWWSHHWGDNGYMYL 291
Y G+ C + ++H +LLVGY + WI+KN W WG++GY +
Sbjct: 276 TYIGGVSCPYICGKN-LDHGVLLVGYGAAGYAPIRFKDKPFWIIKNSWGESWGEDGYYKI 334
Query: 292 KRGNNRCGI 300
RG N CG+
Sbjct: 335 CRGKNVCGV 343
>gi|169807671|emb|CAM57981.1| silicatein alpha [Geodia cydonium]
Length = 334
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 156/304 (51%), Gaps = 13/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ ++ K Y + ++ L W +N + I+ HNQ + + G+TL NHL D+ Y E
Sbjct: 36 KTQHGKSYGSVREELERHLVWLANREYINAHNQNSH--IFGFTLAMNHLGDITDAEY--E 91
Query: 74 MTRLTHSRI---RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
T+ I T V P S L PD +DWR +T +Q CGA YAFS A
Sbjct: 92 EIYSTYCDIDHGNYTKVYDPSRYSSTL-PDAIDWRTNNAVTSIKDQGYCGASYAFSAVGA 150
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
++G +T + LS Q ++DCSI GN GC GG++ +T Y+ G+ YPY
Sbjct: 151 LEGANALATGTLTSLSEQNIIDCSIPYGNHGCKGGNMYDTYLYIVANEGIDTSASYPYSA 210
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
KQ C + IS+ + E L+ +A +GPIAV+++A F+ Y+SG+Y
Sbjct: 211 KQLGCSYSVSGRGARISNSISIESGSEEGLQSAVANIGPIAVAVDAQSSAFRYYSSGVYS 270
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAV 305
C+S + HAM++ GY + +++KN + +WG NGY+ + R N+CGIA A
Sbjct: 271 STQCSSSTLTHAMVVTGYGTYNGKEYYLVKNSFGTNWGMNGYILMARNKYNQCGIATDAS 330
Query: 306 YALI 309
Y +
Sbjct: 331 YPTL 334
>gi|195335257|ref|XP_002034291.1| GM21790 [Drosophila sechellia]
gi|194126261|gb|EDW48304.1| GM21790 [Drosophila sechellia]
Length = 382
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 151/303 (49%), Gaps = 15/303 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A + + + S + N QG++ + N +DL ++ ++T L
Sbjct: 83 KTYLSAADRALHEGAFASTKNLVDAGNAAFAQGVNTFKQAVNAFADLTHSEFLSQLTGLK 142
Query: 79 HSRIRRTLVRSPESNESV-----LIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
S + R+ S + V IP+ DWRE G +TP Q CG+C+AF+ AI+G
Sbjct: 143 RSPEAK--ARAAASLKEVDLPAKPIPEAFDWREHGGVTPVKFQGVCGSCWAFATTGAIEG 200
Query: 134 QIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPYKG 190
F+ T + LS Q +VDC + G GC GG ++ + G+ + E YPY
Sbjct: 201 HTFRKTGSLPNLSEQNLVDCGPVQDFGLNGCDGGFQEAAFCFIDEVQKGVSQAEAYPYID 260
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ CK+ + ++ +PP+DE LK +AT+GP+A S+N T + YA GIY+
Sbjct: 261 NKDTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGL-ETLKNYAGGIYN 319
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
D+ C NH++L+VGY ++ WI+KN W WG+ GY L RG N C IA Y
Sbjct: 320 DDECNKGEPNHSILVVGYGSENGQDYWIVKNSWDDTWGEQGYFRLPRGQNFCFIAEECSY 379
Query: 307 ALI 309
++
Sbjct: 380 PVV 382
>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
distachyon]
Length = 373
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 151/310 (48%), Gaps = 30/310 (9%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGL-HGYTLRENHLSDLHPRHYIKE 73
+++ K+Y A + ++L + + +Q G HG T SDL P +
Sbjct: 59 RRHGKEYSGGAEEYARRLRVFAANLARAAAHQALDPGARHGVT----PFSDLTPEEFQAR 114
Query: 74 MTRLTHSRIRRTLVRSPESN--ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+T L + + + E +P DWR KG +T Q CG+C+AFS A+
Sbjct: 115 LTGLQQQGTNNNMPAAARATAEELATLPASFDWRAKGAVTEVKMQGMCGSCWAFSTTGAV 174
Query: 132 QGQIFKSTSEIEELSIQQVVDCS----IISGN---LGCAGGSLRNTLNYVQFAGGLMKEE 184
+G F +T ++ LS QQ+VDC ++ N GC+GG + N Y+ AGGLM++
Sbjct: 175 EGAHFVATGKLLNLSEQQLVDCDHTCDAVAKNECDSGCSGGLMTNAYTYLIRAGGLMEQA 234
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY G Q C+F + V ++S++ +PP DE ++ +L GP+AV +NA+ Q Y
Sbjct: 235 AYPYTGAQGTCRFDANKVAVRVTSFTAVPPDDEDQIRASLVRAGPLAVGLNAA--FMQTY 292
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYT-----------RNSWILKNWWSHHWGDNGYMYLKR 293
G+ C +NH +LLVGY R WI+KN W WG+ GY L R
Sbjct: 293 LGGVSCPLLCPRKLINHGVLLVGYGARGLAPLRLGYRPYWIIKNSWGKEWGEGGYYRLCR 352
Query: 294 G---NNRCGI 300
G N CG+
Sbjct: 353 GARNRNVCGV 362
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 98/284 (34%), Positives = 155/284 (54%), Gaps = 16/284 (5%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
+QSN + I+ N + +TL +N +D+ Y L S R S +
Sbjct: 69 YQSNVRFINYINAQN----FSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRE 124
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
S ++P +DWR+ G +TP NQ +CG+C+AFS +A++G T ++ LS Q+++DC
Sbjct: 125 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 184
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC-KFKRPNIVVDISSWSVL 212
I SGN GC GG + N +++ GG+ +YPY G+Q IC K K N VV IS + +
Sbjct: 185 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETV 244
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PP +E L+ +A P++V+I+A + FQLY+ GI++ +NHA+ ++GY ++
Sbjct: 245 PPNNEKILQAAVAKQ-PVSVAIDAGGYEFQLYSKGIFN--GFCGKQLNHAVTVIGYGEDN 301
Query: 273 ----WILKNWWSHHWGDNGYMYLKRGNNR----CGIANYAVYAL 308
W++KN W WG+ GY + R + CGIA A Y +
Sbjct: 302 GKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 345
>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
Length = 335
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 153/328 (46%), Gaps = 37/328 (11%)
Query: 7 IIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGL-------------- 52
I + P +DY K+ WQ H K+++ +E+Q L
Sbjct: 20 FIPLVLPIPGLYEDYFKE---------WQEKHGKVYSTEEESQSRLKVFMKNVIYIDNHN 70
Query: 53 ---HGYTLRENHLSDLHPRHYIKE-MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKG 108
H Y L N +D+ + + + H +L P P +DWR KG
Sbjct: 71 KQGHSYELEVNEYADMTLDEFKDQYLMEPQHCSATHSLKSDPPKYRDP--PKAIDWRSKG 128
Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLR 168
+TP NQ CG+C+ FS ++ F T ++ LS QQ+VDC+ N GC GG
Sbjct: 129 AVTPVKNQGQCGSCWTFSTTGCLESHHFLKTGQLVSLSEQQLVDCAQAFNNNGCNGGLPS 188
Query: 169 NTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVG 228
Y+ + GGL EE YPY+ C F + +S+ + +DE L + TVG
Sbjct: 189 QAFEYIHYNGGLDSEESYPYRAHDEKCHFVPSEVSATVSNVVNITSKDEMQLYNAVGTVG 248
Query: 229 PIAVSINASPHTFQLYASGIYDDEACTSD--YVNHAMLLVGYT-----RNSWILKNWWSH 281
P++++ + S F+ Y G+Y + C +D +VNHA+L VGY + WI+KN W
Sbjct: 249 PVSIAYDVSA-DFRFYKKGVYKSKECKTDPEHVNHAVLAVGYNTTESGEDYWIVKNSWGT 307
Query: 282 HWGDNGYMYLKRGNNRCGIANYAVYALI 309
+G NGY ++ RG N CG+A+ A Y ++
Sbjct: 308 KFGINGYFWIARGENMCGLADCASYPIV 335
>gi|535600|gb|AAA29137.1| cathepsin [Fasciola hepatica]
Length = 326
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 155/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y K A D ++ W+ N K I HN GL Y L N +D+
Sbjct: 25 KRIYNKEY-KGADDDHRRNIWEQNVKHIQEHNLRHDLGLVTYKLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P +PD +DWRE G++T +Q CG+C+AF
Sbjct: 84 YLTEMPRASELLSHG--------IPYKANKRAVPDRIDWRESGYVTEVKDQGGCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S A++GQ K+ S QQ+VDCS GN GC GG + N Y++ GL E
Sbjct: 136 STTGAMEGQYMKNEKTSISFSEQQLVDCSGPFGNYGCNGGLMENAYEYLKRF-GLETESS 194
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY+ + C++ V ++ + + DE L+ + P AV+++ F +Y
Sbjct: 195 YPYRAVEGQCRYNEQLGVAKVTGYYTVHSGDEVELQNLVGCRRPAAVALDVESD-FMMYR 253
Query: 246 SGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGY--MYLKRGNNRCG 299
SGIY + C+ D +NH +L VGY + WI+KN W WG++GY M KRG N CG
Sbjct: 254 SGIYQSQTCSPDRLNHGVLAVGYGIQDGTDYWIVKNSWGTWWGEDGYIRMVRKRG-NMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASVPMV 322
>gi|449670651|ref|XP_002168787.2| PREDICTED: uncharacterized protein LOC100201231 [Hydra
magnipapillata]
Length = 1244
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 160/315 (50%), Gaps = 42/315 (13%)
Query: 27 DSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR- 84
+ +K+ H ++ N + +++HN++ Q + LR NH D+ E + H + +
Sbjct: 528 EERKRTHIFKHNARFVNSHNRKNTQ----FKLRPNHFIDMSA-----EELNMFHGNLAKD 578
Query: 85 --------TLVRSPESNESVL-----------------IPDHLDWREKGFITPDWNQEDC 119
+ P+S ++ IP+ +DWRE G +T Q C
Sbjct: 579 TKKHKGIINMADIPDSKSNIFNKADIDKKPFVQVNEDEIPEEIDWREFGAVTSVKGQGIC 638
Query: 120 GACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG 179
CYAF+ A A +G F+ T ++ E+S QQ+VDCS GN GC GG + L+++ F G
Sbjct: 639 SGCYAFTAAGAAEGGWFRKTGKLIEVSSQQLVDCSWGYGNHGCKGGHYASALSWI-FTHG 697
Query: 180 LMKEEDY-PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASP 238
+ ++ Y Y G++ C F I +S + P + ALK +A GP+AVSIN P
Sbjct: 698 VSTDKSYGKYLGQEGYCHFNDTLFGAQIDGFSYIQPYNISALKRVVAKYGPVAVSINTKP 757
Query: 239 HTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG 294
+F+ Y+ GIYDD+ C S+ +H+ L+VGY R WI+KN WS WG+ GYM L
Sbjct: 758 LSFKFYSKGIYDDKECLSNQTHHSALIVGYGKLNGREYWIIKNSWSSSWGEGGYMKLAME 817
Query: 295 NNRCGIANYAVYALI 309
N+ CG+ V +I
Sbjct: 818 NHLCGVTENPVAVVI 832
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 98/284 (34%), Positives = 155/284 (54%), Gaps = 16/284 (5%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
+QSN + I+ N + +TL +N +D+ Y L S R S +
Sbjct: 65 YQSNVRFINYINAQN----FSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRE 120
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
S ++P +DWR+ G +TP NQ +CG+C+AFS +A++G T ++ LS Q+++DC
Sbjct: 121 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 180
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC-KFKRPNIVVDISSWSVL 212
I SGN GC GG + N +++ GG+ +YPY G+Q IC K K N VV IS + +
Sbjct: 181 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETV 240
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS 272
PP +E L+ +A P++V+I+A + FQLY+ GI++ +NHA+ ++GY ++
Sbjct: 241 PPNNEKILQAAVAKQ-PVSVAIDAGGYEFQLYSKGIFN--GFCGKQLNHAVTVIGYGEDN 297
Query: 273 ----WILKNWWSHHWGDNGYMYLKRGNNR----CGIANYAVYAL 308
W++KN W WG+ GY + R + CGIA A Y +
Sbjct: 298 GKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 341
>gi|10798511|emb|CAC12806.1| cathepsin L1 [Fasciola hepatica]
Length = 311
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 154/309 (49%), Gaps = 24/309 (7%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 10 KRMYNKEY-NGADDEHRRNIWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 68
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 69 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAF 120
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S ++GQ K+ S QQ+VDCS GN GC+GG + N Y++ GL E
Sbjct: 121 STTGTMEGQYMKNEKTSISFSEQQLVDCSGPWGNNGCSGGLMENAYEYLKRF-GLETESS 179
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY+ + C++ V ++ + + E LK + + GP A+++ A F +Y
Sbjct: 180 YPYRAVEGQCRYNEQLGVAKVTGYYTVHSGSEVELKNLVGSEGPAAIAVEAESD-FMMYR 238
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
SGIY + C +NHA+L VGY WI+KN W WG+ GY+ + R N CGI
Sbjct: 239 SGIYQSQTCLPFALNHAVLAVGYGTQDGTDYWIVKNSWGLSWGERGYIRMARNRGNMCGI 298
Query: 301 ANYAVYALI 309
A+ A ++
Sbjct: 299 ASLASLPMV 307
>gi|19922198|ref|NP_610906.1| CG6347 [Drosophila melanogaster]
gi|17862554|gb|AAL39754.1| LD36817p [Drosophila melanogaster]
gi|21645444|gb|AAF58331.2| CG6347 [Drosophila melanogaster]
gi|220954310|gb|ACL89698.1| CG6347-PA [synthetic construct]
gi|220960044|gb|ACL92558.1| CG6347-PA [synthetic construct]
Length = 352
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 154/289 (53%), Gaps = 24/289 (8%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHY-------IKEM-TRLTHSRIRRTLVRSPES 92
I N+ A G+ G+ L N L+D+ + I E R T+ I R+P S
Sbjct: 68 ITLSNKNADNGVSGFRLGVNTLADMTRKEIATLLGSKISEFGERYTNGHINFVTARNPAS 127
Query: 93 NESVLIPDHLDWREKGFITPDWNQE-DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
+P+ DWREKG +TP Q CGAC++F+ A++G +F+ T + LS Q +V
Sbjct: 128 AN---LPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLV 184
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK----FKRP--NIVVD 205
DC+ GN+GC GG Y++ G + + YPY + C+ RP +V
Sbjct: 185 DCADDYGNMGCDGGFQEYGFEYIRDHGVTLANK-YPYTQTEMQCRQNETAGRPPRESLVK 243
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
I ++ + P DE +K +AT+GP+A S+NA +F+ Y+ GIY+DE C +NH++ +
Sbjct: 244 IRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTV 303
Query: 266 VGY----TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVYALI 309
VGY R+ WI+KN +S +WG+ G+M L+ CGIA+ Y ++
Sbjct: 304 VGYGTENGRDYWIIKNSYSQNWGEGGFMRILRNAGGFCGIASECSYPIL 352
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 154/305 (50%), Gaps = 27/305 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y K + ++SN ++ H Q +HG T SDL P + ++
Sbjct: 55 KSKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDPSAVHGVT----KFSDLTPAEFHRK 110
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
L R+ ++P + L P DWR+KG +T +Q CG+C++FS A++G
Sbjct: 111 FLGLKPLRLPAHAQKAPILPTNNL-PKDFDWRDKGAVTNVKDQGSCGSCWSFSTTGALEG 169
Query: 134 QIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
F +T E+ LS QQ+VDC + S + GC GG + N Y+ +GG+ +E+DY
Sbjct: 170 AHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGSGGVQREKDY 229
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ CKF + I +S++SV+ DE + L GP+AV+INA Q Y
Sbjct: 230 PYTGRDGTCKFDKSKIAASVSNYSVI-SLDEEQIAANLVKNGPLAVAINAV--YMQTYVG 286
Query: 247 GIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMYLKRGN 295
G+ C +++H +LLVGY + WI+KN W +WG NGY + RG
Sbjct: 287 GVSCPYIC-GKHLDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWGGNGYYKICRGR 345
Query: 296 NRCGI 300
N CG+
Sbjct: 346 NVCGV 350
>gi|38045864|gb|AAR08900.1| cathepsin L [Fasciola gigantica]
Length = 326
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 158/305 (51%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y D+ ++ W+ N K I HN GL YTL N +D+
Sbjct: 25 KRMYNKEY-NGVDDAHRRNIWEENVKHIQEHNIRHDLGLVTYTLGLNQFTDMTFEEFKAK 83
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y++E+ R S I + P +P+ +DWRE G++T +Q DCG+C+AFS
Sbjct: 84 YLREIPRA--SDIHSHGI--PYEANDRAVPESIDWREFGYVTEVKDQGDCGSCWAFSATG 139
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++GQ K+ S QQ+VDCS GN GC+GG + + Y+ + GL E YPYK
Sbjct: 140 AMEGQYMKNQKANISFSEQQLVDCSGDYGNRGCSGGFMEHAYEYL-YEVGLETESSYPYK 198
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
++ CK+ V ++ + E L + GP AV+++ F +Y GIY
Sbjct: 199 AEEGPCKYDSRLGVAKVNGFYFDHFGVESKLAHLVGDKGPAAVAVDVES-DFLMYRGGIY 257
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
C+S+ +NHAML+VGY WI+KN W WGD+GY+ + R +N CGIA++A
Sbjct: 258 ASRNCSSEKLNHAMLVVGYGTQDGTDYWIVKNSWGSLWGDHGYIRMARNRDNMCGIASFA 317
Query: 305 VYALI 309
++
Sbjct: 318 SLPVV 322
>gi|157862759|gb|ABV90502.1| cathepsin L, partial [Fasciola gigantica]
Length = 280
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 146/287 (50%), Gaps = 25/287 (8%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRHYIKEMTR----LTHSRIRRTLVR 88
N K I HN GL YTL N +D+ Y+ EM R L+H
Sbjct: 1 NAKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMPRASDILSHG-------- 52
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQ 148
P + +PD +DWRE G++T +Q +CG+C+AFS ++GQ K+ S Q
Sbjct: 53 IPYEANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAFSTTGTMEGQYMKNQRTSISFSEQ 112
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
Q+VDCS GN+GC+GG + N Y+ QF GL E YPY+ + C++ R VV ++
Sbjct: 113 QLVDCSGPWGNMGCSGGLMENAYEYLKQF--GLETESSYPYRAVEGQCRYNRQLGVVKVT 170
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVG 267
+ + E LK + GP AV+++ F +Y SGIY + C+ +NHA+L VG
Sbjct: 171 GYYTVHSGSEVGLKNLVGAEGPAAVAVDVESD-FMMYRSGIYQSQTCSPFGLNHAVLAVG 229
Query: 268 YTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
Y WI+KN W WG+ GY+ + R N CGIA+ A ++
Sbjct: 230 YGTQGGTDYWIVKNSWGSSWGERGYIRMVRNRGNMCGIASMASLPMV 276
>gi|121543823|gb|ABM55576.1| putative cathepsin L-like cysteine protease precursor
[Maconellicoccus hirsutus]
Length = 339
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 168/322 (52%), Gaps = 19/322 (5%)
Query: 1 MTNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLREN 60
+ ++EW + + +Y K Y D + + N +I HN+ +GL + N
Sbjct: 24 LFHEEWQLF----KTQYSKKYTTDIEDRLRMKIFIDNKYRIAQHNKLFHKGLVTFEQGIN 79
Query: 61 HLSDLHPRHYIKEM------TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDW 114
SD+ + ++M R T + ++ +P N V PD +DWR KG + P
Sbjct: 80 EYSDMLQSEFNEKMGQKSSNQRNTEANGLPSIRFTPLHN--VNPPDSVDWRTKGLVGPVG 137
Query: 115 NQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV 174
Q +C + YA+S A++GQ+ + + +S+Q V+DCS +GN GC+GG+ ++ Y+
Sbjct: 138 KQVNCSSGYAWSAIGALEGQLASDKKKFQGISVQNVIDCSESTGNKGCSGGNQHHSYFYI 197
Query: 175 QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
GG+ + YPYK + C FK+ N+V +S LP E L ++A GP+A +I
Sbjct: 198 YKQGGVDDDVSYPYKDAEEPCAFKKENVVTRVSGEITLPDGYETNLHESVAVYGPVAATI 257
Query: 235 NASPHTFQLYASGIYDDEACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGY 288
+A+ +F Y GIY + C + D VNH +L+VGY ++ WI+KN + WG++GY
Sbjct: 258 DATHQSFHSYKGGIYFEPDCGNKKDEVNHGVLVVGYGSENGQDYWIVKNSYGTDWGEDGY 317
Query: 289 MYLKRG-NNRCGIANYAVYALI 309
+ + R NN CGIA A ++
Sbjct: 318 IRMARNKNNHCGIATSASVPML 339
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 170 bits (431), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 99/287 (34%), Positives = 154/287 (53%), Gaps = 18/287 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
++ N + + HN A +G H + L N +DL Y R SR+RR+ S
Sbjct: 75 FKENLQFVDKHNAAADRGEHTFRLGMNRFADLTNEEYRTRFLR-DFSRLRRSASGKISSR 133
Query: 94 ----ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
E +PD +DWREKG + P NQ CG+C+AFS +A++G T ++ LS QQ
Sbjct: 134 YRLREGDDLPDSIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQ 193
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSW 209
+VDC+ + N GC GG + ++ GG+ EE YPY+G+ IC VV I S+
Sbjct: 194 LVDCT--TANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSY 251
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P +E +L+ +A P++V+++A+ FQLY SGI+ S NHA+ +VGY
Sbjct: 252 ENVPSHNEQSLQKAVANQ-PVSVTMDAAGRDFQLYRSGIFTGSCNIS--ANHALTVVGYG 308
Query: 269 ---TRNSWILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVYAL 308
++ +KN W +WG++GY+ ++R N +CGI +A Y +
Sbjct: 309 TENDKDYRTVKNSWGKNWGESGYIRVERNIGNPNGKCGITRFASYPV 355
>gi|195484886|ref|XP_002090862.1| GE12565 [Drosophila yakuba]
gi|194176963|gb|EDW90574.1| GE12565 [Drosophila yakuba]
Length = 353
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 101/294 (34%), Positives = 155/294 (52%), Gaps = 33/294 (11%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRR-------------TLV 87
I N+ A G+ G+ L N L+D+ KE+ L S+I T
Sbjct: 68 ITLSNKNADNGVAGFRLGVNPLADMTK----KEIATLLGSKISEFGERYTNGHVNFVTPA 123
Query: 88 RSPESNESVLIPDHLDWREKGFITPDWNQE-DCGACYAFSIASAIQGQIFKSTSEIEELS 146
R+P S +P+ DWREKG +TP Q CGAC++F+ A++G +F+ T + LS
Sbjct: 124 RNPASAN---LPEMFDWREKGGVTPPGYQGVGCGACWSFATTGALEGHLFRRTGVLASLS 180
Query: 147 IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK----FKRP-- 200
Q +VDC+ GN+GC GG Y++ G + + YPY + C+ RP
Sbjct: 181 QQNLVDCADDYGNMGCDGGFQEYGFEYIRDHGVTLANK-YPYTQAEMQCRQNETAGRPPR 239
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVN 260
+V I ++ + P DE +K +AT+GP+A S+NA +F+ Y+ GIY+DE C +N
Sbjct: 240 ESLVKIRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELN 299
Query: 261 HAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
H++ +VGY R+ WI+KN +S +WG+ G+M L R CGIA+ Y ++
Sbjct: 300 HSVTVVGYGTENGRDYWIIKNSYSQNWGEGGFMRLPRNAGGFCGIASECSYPIL 353
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 152/284 (53%), Gaps = 23/284 (8%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL------THSRIRRTLVRSP 90
N ++I HN++A +T+ N S L + K T L SR + L+ +P
Sbjct: 54 NDQRIEAHNKDASSS---FTMGHNEYSHLTFDEFKKLRTGLRVSPSYIQSRAKYALM-AP 109
Query: 91 ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
N + +P+ +DW E+G +TP NQ CG+C+AFS AI+G F S+ ++ +S Q++
Sbjct: 110 AVNMTD-VPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGAIEGAAFVSSKQLVSVSEQEL 168
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWS 210
VDC +G++GC GG + N +V+ GL KEEDYPY K+ C K+ V ++++
Sbjct: 169 VDCDH-NGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYHAKEGTCALKKCKPVTKVTAFH 227
Query: 211 VLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTR 270
+P DE ALK +A P++V+I A FQ Y SG++D T ++H +L+VGY
Sbjct: 228 DVPANDEQALKAAVAKQ-PVSVAIEADQPEFQFYKSGVFDKSCGTK--LDHGVLVVGYGE 284
Query: 271 NS----WILKNWWSHHWGDNGYMYLKR----GNNRCGIANYAVY 306
W +KN W WGD GY+ L R +CG+A Y
Sbjct: 285 EGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAMVPSY 328
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 128/215 (59%), Gaps = 6/215 (2%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +DWREKG ++ +Q CG+C++FS A++G + + LS Q +VDCS G
Sbjct: 118 PDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYG 177
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N Y+ GG+ E YPY Q CKF + +I + +P +E
Sbjct: 178 NQGCEGGLMVNAFEYIIDNGGIATESSYPYTAAQGRCKFTKSMNGANIIGYKEIPQGEED 237
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
+L LA P++V+I+AS +FQLY+SG+YD+ AC+S+ ++H +L VGY ++ +I
Sbjct: 238 SLTAALAKQ-PVSVAIDASHMSFQLYSSGVYDEPACSSEALDHGVLAVGYGTLEGKDYYI 296
Query: 275 LKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYAL 308
+KN W WG +GY+++ R N+CG+A A Y +
Sbjct: 297 IKNSWGPTWGQDGYIFMSRNAQNQCGVATMASYPI 331
>gi|9843862|emb|CAC03737.1| silicatein protein [Suberites domuncula]
Length = 330
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 154/306 (50%), Gaps = 16/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + K Y + + ++ L W SN K I HN + + G+TL N DL Y
Sbjct: 31 KSTHSKMYESQLMELERHLTWLSNKKYIEQHNVNSH--IFGFTLAMNQFGDLSELEYANY 88
Query: 74 MTRL-----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ + +T R P + P+ +DWR KG +T +Q DCGA YAFS
Sbjct: 89 LGQYRIEDKKSGNYSKTFQRDPLQD----YPEAVDWRTKGAVTAVKDQGDCGASYAFSAM 144
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G + LS Q ++DCSI GN GC GG++ + YV G+ ++ YP+
Sbjct: 145 GALEGANALAKGNAVSLSEQNIIDCSIPYGNHGCHGGNMYDAFLYVIANEGVDQDSAYPF 204
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
GKQS C + +S + E L+ ++ VGP++V+I+ + F+ Y SG+
Sbjct: 205 VGKQSSCNYNSKYKGTSMSGMVSIKSGSESDLQAAVSNVGPVSVAIDGANSAFRFYYSGV 264
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
YD C+S +NHAM++ GY + W+ KN W +WG++GY+ + R N+CGIA
Sbjct: 265 YDSSRCSSSSLNHAMVVTGYGSYNGKKYWLAKNSWGTNWGNSGYVMMARNKYNQCGIATD 324
Query: 304 AVYALI 309
A Y +
Sbjct: 325 ASYPTL 330
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 173/323 (53%), Gaps = 25/323 (7%)
Query: 2 TNKEWIIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENH 61
T++E + I+ K+ K Y ++ ++ N K + HN E + Y + N
Sbjct: 39 TDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRS----YKVGLNR 94
Query: 62 LSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN------ESVLIPDHLDWREKGFITPDWN 115
+DL Y + M T + +R ++S ++ +S ++P+ +DWRE G + P +
Sbjct: 95 FADLTNEEY-RSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKD 153
Query: 116 QEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQ 175
Q CG+C+AFS +A++G +T E+ +LS Q++VDC + GC GG + ++
Sbjct: 154 QGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTY-DAGCNGGLMDYAFEFII 212
Query: 176 FAGGLMKEEDYPYKGKQSICKFKRPNI-VVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
GG+ EEDYPY+G C +R N VV I+ + +PP DE ALK +A P++V+I
Sbjct: 213 NNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQ-PVSVAI 271
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMY 290
AS FQLY SG++ E + ++H +++VGY ++ WI++N W WG+NGY+
Sbjct: 272 EASGRAFQLYLSGVFTGECGRA--LDHGVVVVGYGTDNGADHWIVRNSWGTSWGENGYIR 329
Query: 291 LKRG-----NNRCGIANYAVYAL 308
++R +CGIA A Y +
Sbjct: 330 MERNVVDNFGGKCGIAMQASYPI 352
>gi|161347489|gb|ABW75768.2| procathepsin L [Fasciola hepatica]
Length = 311
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 157/305 (51%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN +GL Y L N +DL
Sbjct: 10 KRMYNKEY-NGADDEHRRNIWEQNVKHIEEHNLRHDRGLVTYKLGLNQFTDLTFEEFKAK 68
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ ++ S + + E N+ +P +DWR+ G++T +Q CG+C+AFS
Sbjct: 69 YLMEMSPVSES-LSDGISYEAEGND---VPASIDWRQYGYVTEVKDQGQCGSCWAFSAVG 124
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VDC+ GN GC GG + N Y++ + GL YPY+
Sbjct: 125 AIEGQYVKKFQNQTLFSEQQLVDCTRRFGNHGCGGGWMENAYKYLKNS-GLETASYYPYQ 183
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G + C++++ V ++ + DE L + GP AV+++A F +Y SGI+
Sbjct: 184 GWEYQCQYRKELGVAKVTGAYTVHSGDEMKLMQMVGREGPAAVAVDAQS-DFYMYESGIF 242
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ C+S V HA+L VGY S WILKN W WG++GYM R N C IA+ A
Sbjct: 243 QSQYCSSRRVTHAVLAVGYGTESGTDYWILKNSWGKWWGEDGYMRFARNRGNMCAIASVA 302
Query: 305 VYALI 309
++
Sbjct: 303 SVPMV 307
>gi|195584238|ref|XP_002081921.1| GD11280 [Drosophila simulans]
gi|194193930|gb|EDX07506.1| GD11280 [Drosophila simulans]
Length = 382
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 150/303 (49%), Gaps = 15/303 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A + + + S + N QG++ + N +DL ++ ++T L
Sbjct: 83 KTYLSAADRALHEGAFASTKNLVDAGNAAFAQGVNTFKQAVNAFADLTHSEFLSQLTGLK 142
Query: 79 HSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
S + R+ S + V IP+ DWRE G +TP Q CG+C+AF+ AI+G
Sbjct: 143 RSPEAK--ARAAASLKEVALPAKPIPEAFDWREHGGVTPVKFQGTCGSCWAFATTGAIEG 200
Query: 134 QIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPYKG 190
F+ T + LS Q +VDC + G GC GG ++ + G+ + YPY
Sbjct: 201 HTFRKTGSLPNLSEQNLVDCGPVQDFGLNGCDGGFQEAAFCFIDEVQKGVSQAGAYPYID 260
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ CK+ + ++ +PP+DE LK +AT+GP+A S+N T + YA GIY+
Sbjct: 261 NKDTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGL-ETLKNYAGGIYN 319
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
D+ C NH++L+VGY ++ WI+KN W WG+ GY L RG N C IA Y
Sbjct: 320 DDECNKGEPNHSILVVGYGSENGQDYWIVKNSWDDTWGEQGYFRLPRGQNYCFIAEECSY 379
Query: 307 ALI 309
++
Sbjct: 380 PVV 382
>gi|55740404|gb|AAV63978.1| cathepsin L2 precursor [Artemia franciscana]
Length = 226
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 131/222 (59%), Gaps = 7/222 (3%)
Query: 95 SVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCS 154
+V +P+ +DWREKG +TP Q C +C+AFS A++ Q F+ T ++ LS Q ++DCS
Sbjct: 5 NVTVPESVDWREKGAVTPVKYQGQCASCWAFSSTGALESQTFRKTGKLISLSEQNLIDCS 64
Query: 155 IISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPP 214
GNLGC GG + Y++ G+ E Y Y+ K++ C+ N + +P
Sbjct: 65 GEYGNLGCKGGWISQAFEYIKDNKGIDTENKYHYEAKENFCRDNPRNRGAVALGFVNIPS 124
Query: 215 QDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY--VNHAMLLVGYTRNS 272
+E LK +ATVGP++ I+ S FQ Y+ G+Y + +C + + +NHA+L++GY ++
Sbjct: 125 GEEDKLKAAVATVGPVSAVIDVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGYGSDN 184
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN WS HWGD GY+ + R N CG+A A+Y ++
Sbjct: 185 GEDYWLVKNSWSKHWGDEGYLKIARNRKNHCGVATAALYPIV 226
>gi|348542138|ref|XP_003458543.1| PREDICTED: counting factor associated protein D-like [Oreochromis
niloticus]
Length = 551
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 163/315 (51%), Gaps = 25/315 (7%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
+F + K+++ Y + K++ + N + IH+ N+ ++L N LSD
Sbjct: 247 MFSHFKDKFQRQYNDEREHEKREHAFVLNLRYIHSKNRAGMS----FSLALNSLSD---- 298
Query: 69 HYIKEMTRLTHSRIR-------RTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGA 121
+ M+ L R R R L ++ E V +P+ LDWR G +TP +Q CG+
Sbjct: 299 ---RTMSELATMRGRKRGKTPNRGLPFPFKAYERVNLPESLDWRLYGAVTPVKDQAICGS 355
Query: 122 CYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
C++F+ A++G +F T ++ LS Q ++DCS GN GC GG ++ GG+
Sbjct: 356 CWSFATTGAVEGALFVKTGSLQVLSQQMLIDCSWGFGNNGCDGGEEWRAYEWIMKHGGIA 415
Query: 182 KEEDY-PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT 240
E Y Y G C + I S++ + D+ ALK+ L GP+AVSI+AS +
Sbjct: 416 TTETYGAYMGMNGFCHVDSSELTARIQSYTNVTSGDQLALKMALFKNGPVAVSIDASHRS 475
Query: 241 FQLYASGIYDDEAC--TSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG 294
F Y+ G+Y + AC T D ++HA+L VGY S W++KN WS +WG++GY+ +
Sbjct: 476 FVFYSHGVYYEPACGNTVDDLDHAVLAVGYGTLSGEPYWLIKNSWSTYWGNDGYILMSMK 535
Query: 295 NNRCGIANYAVYALI 309
+N CG+A A Y +
Sbjct: 536 DNNCGVATDATYVTL 550
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/272 (35%), Positives = 141/272 (51%), Gaps = 30/272 (11%)
Query: 53 HGYTLRENHLSDLHPRHYIKEMTRL-THSRIRRTL----VRSPESNESVL-IPDHLDWRE 106
HG T SDL + +T L ++R + P S E V +P DWR+
Sbjct: 103 HGVT----PFSDLTREEFEARLTGLRAGGDVQRLMSGVPAAPPASKEEVARLPASFDWRD 158
Query: 107 KGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC----SIISGNL-- 160
KG +T Q CG+C+AFS A++G F +T E+ +LS QQ+VDC S ++ N
Sbjct: 159 KGAVTGVKTQGACGSCWAFSTTGAVEGANFLATGELVDLSEQQLVDCDHTCSAVAQNECN 218
Query: 161 -GCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHA 219
GCAGG + N +Y+ +GGLM++ YPY G C+F + V +++++ +P DE
Sbjct: 219 NGCAGGLMTNAYSYLMESGGLMEQSAYPYTGAAGPCRFDPTQVAVRVANFTAVPAGDEAQ 278
Query: 220 LKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYT---------- 269
++ L GP+AV +NA+ Q Y G+ C +VNH +LLVGY
Sbjct: 279 IRAALVRRGPLAVGLNAA--FMQTYVGGVSCPLICPRAWVNHGVLLVGYGARGFAALRLG 336
Query: 270 -RNSWILKNWWSHHWGDNGYMYLKRGNNRCGI 300
R WI+KN W WG+ GY L RG+N CG+
Sbjct: 337 YRPYWIIKNSWGKQWGEQGYYRLCRGSNVCGV 368
>gi|85677146|emb|CAI46305.1| silicatein alpha [Suberites domuncula]
Length = 330
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 154/306 (50%), Gaps = 16/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + K Y + + ++ L W SN K I HN + + G+TL N DL Y
Sbjct: 31 KSTHSKMYESQLMELERHLTWLSNKKYIEQHNVNSH--IFGFTLAMNQFGDLSELEYADY 88
Query: 74 MTRL-----THSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ + +T R P + P+ +DWR KG +T +Q DCGA YAFS
Sbjct: 89 LGQYRIEDKKSGNYSKTFQRDPLQD----YPEAVDWRTKGAVTAVKDQGDCGASYAFSAM 144
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G + LS Q ++DCSI GN GC GG++ + YV G+ ++ YP+
Sbjct: 145 GALEGANALAKGNAVSLSEQNIIDCSIPYGNHGCHGGNMYDAFLYVIANEGVDQDSAYPF 204
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
GKQS C + +S + E L+ ++ VGP++V+I+ + F+ Y SG+
Sbjct: 205 VGKQSSCNYNSKYKGTSMSGMVSIKSGSESDLQAAVSNVGPVSVAIDGANSAFRFYYSGV 264
Query: 249 YDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGN-NRCGIANY 303
YD C+S +NHAM++ GY + W+ KN W +WG++GY+ + R N+CGIA
Sbjct: 265 YDSSRCSSSSLNHAMVVTGYGSYNGKKYWLAKNSWGTNWGNSGYVMMARNKYNQCGIATD 324
Query: 304 AVYALI 309
A Y +
Sbjct: 325 ASYPTL 330
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 156/310 (50%), Gaps = 28/310 (9%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F+ ++++ K Y + + +++N ++ H HG T SDL P
Sbjct: 50 FLEFKRRFGKAYDSEDEHDYRYKVFKANMRRARRHQSLDPSAAHGVT----RFSDLTPSE 105
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ ++ L R+ ++P L P DWR+ G +TP NQ CG+C++FS
Sbjct: 106 FRNKVLGLRGVRLPLDANKAPILPTDNL-PSDFDWRDHGAVTPVKNQGSCGSCWSFSTTG 164
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCS-------IISGNLGCAGGSLRNTLNYVQFAGGLMK 182
A++G F ST E+ LS QQ+VDC S + GC GG + + Y+ +GG+M+
Sbjct: 165 ALEGAHFLSTGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYILKSGGVMR 224
Query: 183 EEDYPYKGKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
EEDYPY G S CKF + I ++++SV+ DE + L GP+AV+INA+
Sbjct: 225 EEDYPYSGADSGTCKFDKTKIAASVANFSVV-SLDEDQIAANLVKNGPLAVAINAA--YM 281
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMY 290
Q Y G+ C S +NH +LLVGY + WI+KN W +WG+NGY
Sbjct: 282 QTYIGGVSCPYVC-SRRLNHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGENGYYK 340
Query: 291 LKRGNNRCGI 300
+ RG N CG+
Sbjct: 341 ICRGRNICGV 350
>gi|1498185|dbj|BAA06738.1| cysteine proteinase-1 precursor [Drosophila melanogaster]
Length = 254
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 129/220 (58%), Gaps = 6/220 (2%)
Query: 96 VLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSI 155
V +P +DWR KG +T +Q CG+C+AFS A++GQ F+ + + LS Q +VDCS
Sbjct: 35 VTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCST 94
Query: 156 ISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
GN GC GG + N Y++ GG+ E+ YPY+ C F R + ++ +P
Sbjct: 95 KYGNNGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDSCHFNRAQVGATDRGFTDIPQG 154
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS--- 272
DE + + TVGP++V+I+AS +FQ Y+ G+Y++ C + ++H +L+VG+ +
Sbjct: 155 DEKKMPEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGE 214
Query: 273 --WILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVYALI 309
W++KN W WGD G++ L+ N+CGIA+ + Y L+
Sbjct: 215 DYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSYPLV 254
>gi|123502829|ref|XP_001328382.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121911324|gb|EAY16159.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 99/278 (35%), Positives = 155/278 (55%), Gaps = 19/278 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W +N + + QE + G+T+ N L+ L P Y + + V++ +SN
Sbjct: 34 WIANKRLV----QERNRANLGFTVALNRLAHLTPAEYQALLGYCNNG----VSVKAVKSN 85
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ D +DWR KG + P +Q CG+C+AFS A + Q + S +++LS Q +VDC
Sbjct: 86 AVCI--DEIDWRSKGVVNPVQDQGQCGSCWAFSAIQAQESQYAITYSNLQKLSEQNLVDC 143
Query: 154 SIISGNLGCAGGSLRNTLNYV--QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
+S GC GG + + +YV G M E+DY Y + CKF+ V I+S+
Sbjct: 144 --VSTCYGCNGGLMTSAYDYVINHQNGKFMLEKDYSYTAAEGSCKFEATKAVSKITSYIP 201
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L V +AT GP AV+I+AS +FQ+Y+SGIYD+ +C+S ++H + VG+
Sbjct: 202 VAEGDEKDLAVKIATYGPAAVAIDASAWSFQVYSSGIYDEPSCSSYNLDHGVGCVGFGKE 261
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYA 304
++N WI++N W +WG+ GY+ +K NN+CGIA A
Sbjct: 262 GSKNYWIVRNSWGEYWGEKGYIRMIKDKNNQCGIATMA 299
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 176/344 (51%), Gaps = 52/344 (15%)
Query: 10 FIFPQKKYKKDY---------RKKATDSKKKLH-WQSNHKKIHTHNQEAQQGLHGYTLRE 59
+F +++YK ++ + ++ KK+ ++SN +H+ N + Q + G
Sbjct: 170 LLFSEEQYKNEFENWIDRFEKKYDVSEFKKRFSIFKSNMDFVHSWNSKNSQTVLGL---- 225
Query: 60 NHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPE----SNESVLIPDH--LDWREKGFITPD 113
NHL+DL Y ++ TH ++ ++ +P SN + D +DWR+KG ++P
Sbjct: 226 NHLADLTNLEY-RQFYLGTH---KKAVLGTPGNHEVSNLQSVFGDSATVDWRQKGAVSPI 281
Query: 114 WNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNY 173
+Q CG+C++FS +++G + + ELS Q +VDCS GN+GC GG + Y
Sbjct: 282 KDQGQCGSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSEGNMGCNGGLMDYAFEY 341
Query: 174 VQFAGGLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAV 232
+ G+ E YPY + CK+ + N ISS+ + E L + GP++V
Sbjct: 342 IITNNGIDTESSYPYTASSGTTCKYNKANSGATISSYKNITAGSESDLADAVKNAGPVSV 401
Query: 233 SINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY------------------------ 268
+I+AS ++FQLY+ GIY D +C+S ++H +L+VGY
Sbjct: 402 AIDASHNSFQLYSHGIYYDASCSSVNLDHGVLVVGYGSGTPDSDSRVHKGSQVRVKVPKT 461
Query: 269 --TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
T+N WI+KN W WGD G++Y+ + +N CGIA+ A Y ++
Sbjct: 462 DDTKNYWIVKNSWGTSWGDKGFIYMSKDRDNNCGIASCASYPIV 505
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 106/309 (34%), Positives = 156/309 (50%), Gaps = 23/309 (7%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
+KYK+ Y K + ++ + N +I HN ++G Y++ N SD K
Sbjct: 72 EKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSD-------KTN 124
Query: 75 TRLTHSRIRRTLVRSPESNESVLI-----PDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ L R R ++ S + P +DWR KG +TP NQ DCG+C+AFS
Sbjct: 125 SELDVLRGFRHSSKASRSGSQYIPFDAAPPAEVDWRTKGAVTPVKNQGDCGSCWAFSATG 184
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY- 188
I+GQ + +T ++ LS QQ+VDCS S N GC GG + YV+ G+ E YPY
Sbjct: 185 GIEGQHYLATGKLVSLSEQQLVDCS--SSNDGCDGGLMDLAFEYVKEHKGIDTEVHYPYV 242
Query: 189 ---KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
G C F V+++ + +P E L+ + GPI+V INA +F Y
Sbjct: 243 SGNTGYARQCSFDPKYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGLPSFMAYE 302
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
SGIY D C ++H +L+VGY ++ W++KN W WG+NGY+ + R NN CG+
Sbjct: 303 SGIYSDHRCNPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILRNHNNLCGV 362
Query: 301 ANYAVYALI 309
A A Y L+
Sbjct: 363 ATMASYPLM 371
>gi|10798513|emb|CAC12807.1| procathepsin L3 [Fasciola hepatica]
Length = 306
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/305 (33%), Positives = 157/305 (51%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN +GL Y L N +DL
Sbjct: 5 KRMYNKEY-NGADDEHRRNIWEQNAKHIEEHNLRHDRGLVTYKLGLNQFTDLTFEEFKAK 63
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ ++ S + + E + +P +DWR+ G++T +Q CG+C+AFS
Sbjct: 64 YLMEMSPVSES-LSDGISYEAEGKD---VPASIDWRQYGYVTEVKDQGQCGSCWAFSPVG 119
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VDC+ GN GC GG + N Y++ + GL DYPY+
Sbjct: 120 AIEGQYVKKFQNQTLFSEQQLVDCTRRFGNHGCGGGWMENAYKYLKNS-GLETASDYPYQ 178
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G + C++++ V ++ + DE L + GP A +++A P F +Y SGI+
Sbjct: 179 GWEYQCQYRKELGVAKVTGAYTVHSGDEMKLMPMVRKKGPAAAAVDAQP-DFYMYESGIF 237
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ C+S V HA+L VG+ S WILKN W WG++GYM R N C IA+ A
Sbjct: 238 QSQYCSSRRVTHAVLAVGHGTESGTDYWILKNSWGKWWGEDGYMRFARNRGNMCAIASVA 297
Query: 305 VYALI 309
++
Sbjct: 298 SVPMV 302
>gi|391333246|ref|XP_003741030.1| PREDICTED: digestive cysteine proteinase 2-like [Metaseiulus
occidentalis]
Length = 327
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 163/306 (53%), Gaps = 17/306 (5%)
Query: 15 KKYKKDYRKK---ATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
K YK + K+ DS ++ ++ N I+ HN G Y + + +D P I
Sbjct: 26 KLYKSVHDKRYAAGEDSVRRRIFERNVAMINAHNLLHDLGQVSYRMGLSRFTDATPEE-I 84
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLI---PDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ +T L S T + S +++ I + +DWR+ G++TP +Q CG+C+AF+
Sbjct: 85 RSLTCLNISDSTSTGKSNGNSFDTIDITELSEAVDWRQNGYVTPVKDQGKCGSCWAFAAT 144
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++GQ FK T ++ LS Q +VDC S GC GG + Y++ GG+ E Y Y
Sbjct: 145 GAVEGQYFKKTGQLVSLSEQNLVDCDRSSD--GCEGGYFYESFEYIRSNGGIATESSYGY 202
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ C+F +I +S + DE AL +A++GPI+V+I+ TF+ Y+SG+
Sbjct: 203 EATAGSCRFTADSIGATVSGRDSVASGDEEALLKAVASIGPISVTIDV-IDTFRHYSSGV 261
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKR--GNNRCGIAN 302
Y D C+S NHA+L+VGY + W++KN W +G+ GY+ + R GNN CGIA+
Sbjct: 262 YYDAECSSSSRNHAVLVVGYGTEAGGDYWLVKNSWGTSFGEQGYIKMARNKGNN-CGIAS 320
Query: 303 YAVYAL 308
A Y +
Sbjct: 321 EAGYPI 326
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 158/300 (52%), Gaps = 15/300 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + ++ +Q N I HN++ ++G + + +D+ ++ +
Sbjct: 32 KTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLDLLKLQG 91
Query: 79 HSRIRRTLVRSPESNE-SVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
+ V S + + D +DWRE+G +TP +Q +CG+C+AFS AI+GQ FK
Sbjct: 92 VPALPSNAVHFDNSEDIDMEEKDAVDWREEGAVTPAKDQANCGSCWAFSAVGAIEGQFFK 151
Query: 138 STSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ LS Q++VDC+ GN GC GG + ++VQ G+ EE YPY+G++S CK
Sbjct: 152 KNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQ-DEGIQTEESYPYEGRRSSCK 210
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA-CT 255
K V + ++ + P DE + T+A GP+AV+I AS +F Y GI D+ C+
Sbjct: 211 -KSGEYVTKVKTY--VFPLDEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDERCRCS 265
Query: 256 S--DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ + +NH +L+VGY + WI+KN W WG+ GY LK+ CGI Y Y ++
Sbjct: 266 NKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWGEKGYFRLKKDVKACGIGYYNTYPIL 325
>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
Length = 373
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 155/307 (50%), Gaps = 30/307 (9%)
Query: 14 QKKYKKDY-RKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIK 72
+K++KK Y +K D + K+ +Q N ++ H HG T SDL P + K
Sbjct: 62 KKRFKKSYGSQKEHDYRFKI-FQVNLRRAARHQNLDPSATHGVT----QFSDLTPGEFRK 116
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
L R+ + +P L P DWREKG +TP NQ CG+C++FS A++
Sbjct: 117 AYLGLRRLRLPKDATEAPILPTDNL-PQDFDWREKGAVTPVKNQGSCGSCWSFSTTGALE 175
Query: 133 GQIFKSTSEIEELSIQQVVDC-------SIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
G F +T ++ LS QQ+VDC S + GC GG + + Y AGGLM+EED
Sbjct: 176 GANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREED 235
Query: 186 YPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY G + CKF + ++++SV+ DE + L GP+AV+INA Q Y
Sbjct: 236 YPYTGTDRGTCKFDNTKVAAKVANFSVV-SLDEDQIAANLFKNGPLAVAINAV--FMQTY 292
Query: 245 ASGIYDDEACTSDYVNHAMLLVGY-----------TRNSWILKNWWSHHWGDNGYMYLKR 293
G+ C S ++H +LLVGY + WI+KN W +WG+NG+ + R
Sbjct: 293 IGGVSCPYIC-SKRLDHGVLLVGYGSAGYAPVRMKDKPYWIIKNSWGENWGENGFYRICR 351
Query: 294 GNNRCGI 300
G N CG+
Sbjct: 352 GRNICGV 358
>gi|300120790|emb|CBK21032.2| unnamed protein product [Blastocystis hominis]
Length = 516
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 162/305 (53%), Gaps = 17/305 (5%)
Query: 15 KKYKKDYRKKATD---SKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
K++ KD +K D +++L++ N ++ N E + Y L+ NHL+D +
Sbjct: 218 KQFVKDNKKCYNDVEYKERQLNFLRNKARVEKVNSENRS----YKLKLNHLADRSESE-L 272
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ M L S+ ++ + + + PD +DWREKG +TP +Q CG+C+ + +
Sbjct: 273 RAMMGLKRSQ-KKDFAAHRYTPSNGVKPDFVDWREKGAVTPVKDQCMCGSCWTYGTVGVL 331
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP-YKG 190
+GQ F ++ + S Q ++DCS GN GC GG ++ GGLM +EDY Y G
Sbjct: 332 EGQYFLKYGKLVKFSEQNLLDCSWNFGNDGCNGGEDFRAYGWMLHNGGLMTDEDYGHYLG 391
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
C F + V I+ + ++ P L+ +A VGPI+V I A F YA G++D
Sbjct: 392 IDGWCHFNKSAAAVKITDYVLITPGSVEELEDAVANVGPISVGI-AVTTDFLFYAEGVFD 450
Query: 251 DEACTSDYVN--HAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
+ C+S + HA+L VGY ++ W++KN WS +WGDNGY+ + R NN CG+A A
Sbjct: 451 NPECSSAVEDQAHAVLAVGYGTENGKDYWLIKNSWSTYWGDNGYVKIARKNNICGVATAA 510
Query: 305 VYALI 309
Y ++
Sbjct: 511 SYPIL 515
>gi|395514296|ref|XP_003761355.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 262
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/251 (37%), Positives = 134/251 (53%), Gaps = 21/251 (8%)
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLD--------WREKGFI--TPDWNQEDCGACYAF 125
RL H RI R PE V P H W + G + T ++ CG+C+AF
Sbjct: 16 RLRHRRITRHGDSDPE----VAPPLHRGRANGCDGRWDQAGSVRDTSIKDKGQCGSCWAF 71
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S +++GQ F T ++ LS Q +VDCS GN GC GG + N YV+ GG+ EE
Sbjct: 72 SATGSLEGQWFHKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKKNGGIDTEES 131
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY GK C + +++ + +P E AL +ATVGPI+V+I+A +FQ Y
Sbjct: 132 YPYVGKDGTCHYNSQCSGANVTGYVDIPAGVERALAKAVATVGPISVAIDAGHSSFQFYR 191
Query: 246 SGIYDDEACTSDYVNHAMLLVGY------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRC 298
SG+Y + C+S+ ++H +L+VG+ + WI+KN W WGD GY+ + R NN C
Sbjct: 192 SGVYYEPECSSEELDHGVLVVGFGVEGKNGKKYWIVKNSWGEEWGDRGYVLMTRDHNNHC 251
Query: 299 GIANYAVYALI 309
GIA A Y +
Sbjct: 252 GIATAASYPEV 262
>gi|13774082|gb|AAK38169.1| cathepsin L-like [Fasciola hepatica]
Length = 310
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/310 (33%), Positives = 155/310 (50%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W++N K I HN GL YTL N +D+
Sbjct: 9 KRMYNKEY-NGADDEHRRNIWEANVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 67
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 68 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAF 119
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC+GG + N Y+ QF GL E
Sbjct: 120 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF--GLETES 177
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + + P A++++ F +Y
Sbjct: 178 SYPYTAVEGQCRYNRQLGVAKVTGYYTVHSGSEVELKNLVGSRRPAAIAVDVESD-FMMY 236
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
SGIY + C +NHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 237 RSGIYQSQTCLPFALNHAVLAVGYGTQDGTDYWIVKNSWGLSWGERGYIRMARNRGNMCG 296
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 297 IASLASLPMV 306
>gi|195583145|ref|XP_002081384.1| GD10987 [Drosophila simulans]
gi|194193393|gb|EDX06969.1| GD10987 [Drosophila simulans]
Length = 352
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 153/289 (52%), Gaps = 24/289 (8%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM--------TRLTHSRIRRTLVRSPES 92
I N+ A G+ G+ L N L+D+ + + R T+ I R+P S
Sbjct: 68 ITLSNKNADNGVSGFRLGVNTLADMTRKEIATLLGSKVSEFGERYTNGHINFVTARNPAS 127
Query: 93 NESVLIPDHLDWREKGFITPDWNQE-DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
+P+ DWRE+G +TP Q CGAC++F+ A++G +F+ T + LS Q +V
Sbjct: 128 AN---LPEMFDWRERGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLV 184
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK----FKRP--NIVVD 205
DC+ GN+GC GG Y++ G + + YPY + C+ RP +V
Sbjct: 185 DCADDYGNMGCDGGFQEYGFEYIRDHGVTLANK-YPYTQTEMQCRQNETAGRPPRESLVK 243
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
I ++ + P DE +K +AT+GP+A S+NA +F+ Y+ GIY+DE C +NH++ +
Sbjct: 244 IRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTV 303
Query: 266 VGY----TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVYALI 309
VGY R+ WI+KN +S +WG+ G+M L+ CGIA+ Y ++
Sbjct: 304 VGYGTENGRDYWIIKNSYSQNWGEGGFMRILRNAGGFCGIASECSYPIL 352
>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
Length = 347
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 150/303 (49%), Gaps = 28/303 (9%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH------YIKEMTRLTHSRIRRTLV 87
W HKK + +E L YT ++ L+ H + + LT + +R +
Sbjct: 46 WTIKHKKTYATAEEYNWRLRVYTANHYYVKRLNEGHGPATEFELNQFADLTFAEFKRIYL 105
Query: 88 RS--------------PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
S P +V P +DWR++ ITP +Q CG+C+AFS S +
Sbjct: 106 SSSSQHCRATTGNFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCGSCWAFSATSCLSA 165
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+ T ++ LS QQ++DCS N GC GG Y+++ GG+ E DYPYK ++
Sbjct: 166 HLALKTGQLISLSKQQLLDCSRSFNNRGCKGGLPSQAFEYIRYNGGIESERDYPYKDREE 225
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C FK + ++ E + V LA +GP+++ I+ S +F Y GIY +
Sbjct: 226 KCHFKPSLVAATVTGVVNFTQGAEDDIAVALANIGPVSIGIH-STKSFATYKKGIYQGKL 284
Query: 254 CTSD--YVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
C+ + +NHA+L+VGY + + WI KN W +WG NGY +++RG+N CG+A A Y
Sbjct: 285 CSKNPRKINHAVLIVGYDQTASGEKYWIGKNSWGTNWGMNGYFWIRRGHNACGLATCASY 344
Query: 307 ALI 309
++
Sbjct: 345 PVV 347
>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
Length = 322
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 153/292 (52%), Gaps = 12/292 (4%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + + ++ +++N +I HN +QGL Y N +DL + +
Sbjct: 35 KSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTDLTQEEFKAYLGLHV 94
Query: 79 HSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS 138
+ T+ + E +P +DWR G +T NQ CG+C++F++ + +G ++
Sbjct: 95 KPVLNNTIQYELKGLE---VPTSVDWRSAGQVTGVKNQGSCGSCWSFALTGSTEGAYYRK 151
Query: 139 TSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFK 198
++ LS QQ+VDCS S N GC GG L T Y++ GL E YPY G CK+
Sbjct: 152 HKQLVSLSEQQLVDCS-TSINYGCNGGFLDATFPYIE-QYGLQTESSYPYTGVDGSCKYD 209
Query: 199 RPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY 258
+V IS++ L + L+ + ++GP+A++++AS Y+SGIY CT+
Sbjct: 210 SSKVVTKISNYVSLHGSESKVLE-PVGSIGPVAITMDAS--YLSSYSSGIYAANKCTTTN 266
Query: 259 VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVY 306
+NHA+L+VGY +N WI+KN W WG+ GY L RG+N CG A VY
Sbjct: 267 LNHAVLVVGYGSQNGQNYWIVKNSWGSGWGEQGYFRLLRGSNECGCAQDPVY 318
>gi|157862755|gb|ABV90500.1| cathepsin L, partial [Fasciola gigantica]
Length = 251
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/249 (38%), Positives = 137/249 (55%), Gaps = 19/249 (7%)
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H R R+ +P +DWRE G++T +Q CG+C+AF
Sbjct: 9 YLSEMPRASAFLSHGMPYRAKNRA--------VPTSIDWRESGYVTEVKDQGGCGSCWAF 60
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
S A++GQ KS S QQ+VDCS GN GC+GG + Y++ GL E
Sbjct: 61 STTGAMEGQYMKSQRINISFSEQQLVDCSGDFGNHGCSGGLMEKAYEYLRHF-GLETESS 119
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY+ + C++ + V +S + ++ QDE ALK + GP AV+++ + F +Y
Sbjct: 120 YPYRADEGPCQYDKQLGVAQLSDYYIVHSQDEVALKNLIGVEGPAAVALDVNI-DFMMYK 178
Query: 246 SGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
SGIY DE C+S Y+NHA+L VGY WI+KN W WG++GY+ L R +N CGI
Sbjct: 179 SGIYQDEICSSRYLNHALLAVGYGTEDGTEYWIVKNSWGSRWGEHGYIRLARNRDNMCGI 238
Query: 301 ANYAVYALI 309
A A ++
Sbjct: 239 ATLASLPIV 247
>gi|163310848|pdb|2O6X|A Chain A, Crystal Structure Of Procathepsin L1 From Fasciola
Hepatica
Length = 310
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 155/306 (50%), Gaps = 18/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+
Sbjct: 9 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 67
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+R S I V +N +V PD +DWRE G++T +Q +CG+ +AFS
Sbjct: 68 YLTEMSRA--SDILSHGVPYEANNRAV--PDKIDWRESGYVTEVKDQGNCGSGWAFSTTG 123
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPY 188
++GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E YPY
Sbjct: 124 TMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQF--GLETESSYPY 181
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
+ C++ + V ++ + + E LK + GP AV+++ F +Y SGI
Sbjct: 182 TAVEGQCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FMMYRSGI 240
Query: 249 YDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANY 303
Y + C+ VNHA+L VGY WI+KN W WG+ GY+ + R N CGIA+
Sbjct: 241 YQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIASL 300
Query: 304 AVYALI 309
A ++
Sbjct: 301 ASLPMV 306
>gi|156367164|ref|XP_001627289.1| predicted protein [Nematostella vectensis]
gi|156214194|gb|EDO35189.1| predicted protein [Nematostella vectensis]
Length = 514
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 154/297 (51%), Gaps = 20/297 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
Q ++ K Y + SK+K ++ N + I + N++ + Y L NH DL Y +
Sbjct: 224 QGQHNKQYDSEHEVSKRKHIFRHNMRYIRSINRKNLK----YKLAPNHFVDLTDGEYDQ- 278
Query: 74 MTRLTHSRIRRTLVRSPESNES-----VLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
H + P SN S V +PD LDWR+ G ++P Q CG+CYA +
Sbjct: 279 -----HKGDSIITLYGPYSNMSHVLQRVDVPDELDWRDYGAVSPVRGQGICGSCYALAAV 333
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
A++G F T +++ELS QQV+DCS SGN GC GG ++++ G E PY
Sbjct: 334 GAVEGAYFMKTGKLKELSAQQVIDCSWGSGNRGCKGGYYNKAMSWIYLHGIASAESYGPY 393
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
G++ C+ + I +++ +P + ALK+++A GP VSIN +P + + Y+ G+
Sbjct: 394 LGQEGTCRIEGLRRAAAIDAFAFVPKYNNTALKISVARFGPAVVSINENPLSLKFYSWGL 453
Query: 249 YDDEACTSDYVN-HAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGI 300
YDD C D H++L+VGY W++KN WS WG +GY+ + N CG+
Sbjct: 454 YDDPECGRDTAAVHSVLVVGYGVEDGEPYWLVKNSWSTTWGMDGYIKIAWKRNTCGV 510
>gi|195455847|ref|XP_002074892.1| GK22908 [Drosophila willistoni]
gi|194170977|gb|EDW85878.1| GK22908 [Drosophila willistoni]
Length = 381
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 151/304 (49%), Gaps = 10/304 (3%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
++ K Y A + ++ ++ + + + N G +T N SDL ++K++
Sbjct: 79 QQTGKTYASAAEQALRQGVFEGSQNLVDSANAAFAAGTSTFTSAVNAFSDLTHLEFLKQL 138
Query: 75 TRLTHS---RIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
T S R R + IPD DWREKG +TP +Q CG+C+ F+ AI
Sbjct: 139 TGFKKSAEGESRVAAARQAVEVPAEPIPDSFDWREKGGVTPVKHQGTCGSCWTFAATGAI 198
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNL-GCAGGSLRNTLNYVQFAG-GLMKEEDYPYK 189
+G +F+ T+++ LS Q +VDC ++ L GC GG +++ A G+ E Y Y
Sbjct: 199 EGHLFRKTNQLPNLSEQNLVDCGPLNFGLNGCDGGCQEYAFAFLKEAQRGIASEAKYTYV 258
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
K+ +C + + + + P DE LK +AT+GP+ S+ A Y GI+
Sbjct: 259 DKRDVCSYTEKQAEAYVHGLATVTPNDEDLLKKVVATLGPVGCSLFADEALLH-YEKGIF 317
Query: 250 DDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
+E C +NHA+L+VGY ++ W +KN W +WG++GY L RG N CGI+
Sbjct: 318 SNETCNGQELNHAVLVVGYGSENGQDYWTIKNSWGENWGESGYFRLIRGQNFCGISLECS 377
Query: 306 YALI 309
Y ++
Sbjct: 378 YPIV 381
>gi|381283083|gb|AFG19440.1| cathepsin L, partial [Larimichthys crocea]
Length = 257
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 92/258 (35%), Positives = 137/258 (53%), Gaps = 15/258 (5%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT---RLTHSRIRRTLVRSP 90
W+ N +KI HN E G H Y L NH D+ + + M R + + +L P
Sbjct: 3 WEMNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYKRKAEGKFKGSLFMEP 62
Query: 91 ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
E+ P +DWR+ G++TP +Q CG+C+AFS A++GQ F+ T ++ LS Q +
Sbjct: 63 NFLEA---PRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNL 119
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI-VVDISSW 209
VDCS GN GC GG + YV+ GL E+ YPY G PN + + +
Sbjct: 120 VDCSRPEGNEGCNGGLMDQAFQYVKDNHGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGF 179
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P EHAL +A VGP++V+I+A +FQ Y SGIY ++ C+S+ ++H +L+VGY
Sbjct: 180 VDVPSGKEHALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYG 239
Query: 269 -------TRNSWILKNWW 279
+ WI+ N W
Sbjct: 240 FEGEDVDGKKYWIVNNSW 257
>gi|195334168|ref|XP_002033756.1| GM21493 [Drosophila sechellia]
gi|194125726|gb|EDW47769.1| GM21493 [Drosophila sechellia]
Length = 352
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 154/289 (53%), Gaps = 24/289 (8%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHY-------IKEM-TRLTHSRIRRTLVRSPES 92
I N+ A G+ G+ L N L+D+ + + E R T+ I R+P S
Sbjct: 68 ITLSNKNADNGVTGFRLGVNTLADMTRKEISTLLGSKVSEFGERYTNGHINFVTARNPAS 127
Query: 93 NESVLIPDHLDWREKGFITPDWNQE-DCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
+P+ DWRE+G +TP Q CGAC++F+ A++G +F+ T + LS Q +V
Sbjct: 128 AN---LPEMFDWRERGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLV 184
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK----FKRP--NIVVD 205
DC+ GN+GC GG Y++ G + + YPY + C+ RP +V
Sbjct: 185 DCADDYGNMGCDGGFQEYGFEYIRDHGVTLANK-YPYTQTEMQCRQNETAGRPPRESLVK 243
Query: 206 ISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLL 265
I ++ + P DE +K +AT+GP+A S+NA +F+ Y+ GIY+DE C +NH++ +
Sbjct: 244 IRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTV 303
Query: 266 VGY----TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVYALI 309
VGY R+ WI+KN +S +WG+ G+M L+ CGIA+ Y ++
Sbjct: 304 VGYGTENGRDYWIIKNSYSQNWGEGGFMRILRNAGGFCGIASECSYPIL 352
>gi|334314327|ref|XP_001368532.2| PREDICTED: cathepsin H-like [Monodelphis domestica]
Length = 344
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/287 (32%), Positives = 148/287 (51%), Gaps = 15/287 (5%)
Query: 30 KKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR 88
++LH + +N ++I HN G H YTL N SD+ + K+ L + +
Sbjct: 62 RRLHNFLNNKRRIDEHNA----GKHSYTLGLNQFSDMSFDEFKKQY--LMSEPQNCSATK 115
Query: 89 SPESNESVLIPDHLDWREKG-FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSI 147
PD +DWR+KG +++P NQ CG+C+ FS ++ + +T ++ L+
Sbjct: 116 GSHVRRVGPYPDFMDWRKKGNYVSPVKNQGGCGSCWTFSTTGGLESAVAIATGKLLSLAE 175
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDIS 207
QQ+VDC+ N GC GG Y+ + G+M E+ YPY+GK C+FK + +
Sbjct: 176 QQLVDCAQAFNNHGCNGGLPSQAFEYIMYNNGIMGEDTYPYEGKDGTCRFKPDKAIAFVK 235
Query: 208 SWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC--TSDYVNHAMLL 265
+ DE A+ +A P++ + + F Y GIY + C + D VNHA+L
Sbjct: 236 DVVNITIYDEEAMTEAVAHHNPVSFAFEVT-EDFMSYRDGIYSNPRCDKSPDKVNHAVLA 294
Query: 266 VGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
VGY +N+ WI+KN W WG+NGY ++RG N CG+A+ A Y +
Sbjct: 295 VGYGKNNGILYWIVKNSWGTSWGNNGYFLIERGKNMCGLADCASYPV 341
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/310 (33%), Positives = 156/310 (50%), Gaps = 28/310 (9%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + K+KK Y + + +++N ++ H + HG T SDL P
Sbjct: 53 FSLFKSKFKKSYGSQEEHDYRFSVFKANLRRAARHQELDPTASHGVT----QFSDLTPAE 108
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
+ K++ L R+ + +P S L P+ DWR+KG + P NQ CG+C++FS
Sbjct: 109 FRKQVLGLRRLRLPKDANEAPILPTSDL-PEDFDWRDKGAVGPIKNQGSCGSCWSFSATG 167
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCS-------IISGNLGCAGGSLRNTLNYVQFAGGLMK 182
A++G F +T E+ LS QQ+VDC S + GC GG + + Y AGGLM+
Sbjct: 168 ALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMR 227
Query: 183 EEDYPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
EEDYPY G + CKF + + ++++SV+ DE + L GP+AV+INA
Sbjct: 228 EEDYPYTGTDRDACKFDKNKVAARVANFSVV-SLDEDQIAANLVKNGPLAVAINAV--FM 284
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGY-----------TRNSWILKNWWSHHWGDNGYMY 290
Q Y G+ C S ++H +LLVGY + WI+KN W WG+NG+
Sbjct: 285 QTYIGGVSCPYIC-SRRLDHGVLLVGYGSAGYSPVRMKEKPFWIIKNSWGEKWGENGFYK 343
Query: 291 LKRGNNRCGI 300
+ RG N CG+
Sbjct: 344 ICRGRNVCGV 353
>gi|123492185|ref|XP_001326005.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121908913|gb|EAY13782.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 149/279 (53%), Gaps = 19/279 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+T+ N L+ L P Y L R+ + ++ +SN
Sbjct: 34 WLSNKRLVQEHNRANL----GFTVALNKLAHLTPAEY----NSLLGFRMNKAERKAVKSN 85
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ DWR+KG + P +Q CG+C+AFS A + Q + S ++ LS Q +VDC
Sbjct: 86 --AIANADCDWRKKGAVNPIKDQGQCGSCWAFSAIQAQESQYYISFKTLQSLSEQNLVDC 143
Query: 154 SIISGNLGCAGGSLRNTLNYV--QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
++ GC GG + +YV +G M E DYPY + CKF I S+
Sbjct: 144 --VTTCYGCNGGLMDAAYDYVVKHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSYVN 201
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L ++T+GP A++I+AS +FQLY+SGIYD+ AC+S ++H + VGY
Sbjct: 202 VAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYDESACSSYNLDHGVGCVGYGTE 261
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAV 305
++N WI++N W WG+ GY+ +K NN+CG A A
Sbjct: 262 GSKNYWIVRNSWGTAWGEKGYIRMIKDKNNQCGEATMAC 300
>gi|321477694|gb|EFX88652.1| hypothetical protein DAPPUDRAFT_304724 [Daphnia pulex]
Length = 336
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 164/303 (54%), Gaps = 20/303 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+KK ++ K + +K + N++ I HNQ G Y L+ N +D E
Sbjct: 39 KAKHKKRFKNKDREKMRKNTFVENNRGIKKHNQ---AGKSTYKLQPNQFADWSE----AE 91
Query: 74 MTRLTHSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+ +L + +P + +L +PD +D+R+ + Q CG+CY F+ +
Sbjct: 92 LEQLKGEIDTDNEIPAPVVSGDILRQAVPDAVDYRKSHCLAKVKYQGGCGSCYTFASTTP 151
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKG 190
I+ Q T + LS + ++DCS GN GC GG + NYV+ GL EE YPY+G
Sbjct: 152 IEYQRCMKTGTLVTLSEENLIDCSQKYGNAGCNGGLALRSWNYVKDV-GLNTEEAYPYQG 210
Query: 191 KQSICKFKRPNIVVDISSWS-VLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
++++C++ N ++++W+ DE A+KV +A GP+AVS++AS + Y+SGI+
Sbjct: 211 EETMCEYSASNYGGNVTTWAYATRTNDEEAIKVVVAKYGPVAVSVDAS--NWDFYSSGIF 268
Query: 250 DDEACTSDYVNHAMLLVGYTRNS------WILKNWWSHHWGDNGYMYLKRGNNRCGIANY 303
C++ NHA+++VGY +++ WI++N W WG+ GY+ L+RG N C I+
Sbjct: 269 SSPTCSNTTTNHAVVIVGYGKDTKTRKDFWIVRNSWGPEWGEGGYINLERGVNMCAISKR 328
Query: 304 AVY 306
AV+
Sbjct: 329 AVF 331
>gi|56567186|gb|AAV98582.1| cathepsin L-like cysteine proteinase precursor [Trichomonas
vaginalis]
Length = 305
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 149/279 (53%), Gaps = 19/279 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+T+ N L+ L P Y L R+ + ++ +SN
Sbjct: 34 WLSNKRLVQEHNRANL----GFTVALNKLAHLTPAEY----NSLLGFRMNKAERKAVKSN 85
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ DWR+KG + P +Q CG+C+AFS A + Q + S ++ LS Q +VDC
Sbjct: 86 --AIANADCDWRKKGAVNPIKDQGQCGSCWAFSAIQAQESQYYISFKTLQSLSEQNLVDC 143
Query: 154 SIISGNLGCAGGSLRNTLNYV--QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
++ GC GG + +YV +G M E DYPY + CKF I S+
Sbjct: 144 --VTTCYGCNGGLMDAAYDYVVKHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSYVN 201
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L ++T+GP A++I+AS +FQLY+SGIYD+ AC+S ++H + VGY
Sbjct: 202 VAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYDESACSSYNLDHGVGCVGYGTE 261
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAV 305
++N WI++N W WG+ GY+ +K NN+CG A A
Sbjct: 262 GSKNYWIVRNSWGTTWGEKGYIRMIKDKNNQCGEATMAC 300
>gi|123455797|ref|XP_001315639.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121898322|gb|EAY03416.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 102/278 (36%), Positives = 149/278 (53%), Gaps = 19/278 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+TL N L+ L P Y K M + + ++S
Sbjct: 34 WLSNKRFVQNHNRANL----GFTLALNKLAYLSPTEY-KAMLGFHNKGVHNIAIKS---- 84
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ + D +DWR KG + P +Q+ CG+C+AFS A + Q + ++++LS Q +VDC
Sbjct: 85 -NTIANDEIDWRSKGVVNPIQHQQHCGSCWAFSAIQAQESQYAITYGKLQKLSEQNLVDC 143
Query: 154 SIISGNLGCAGGSLRNTLNYV-QFAGG-LMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
+ S N GC G + +YV Q+ GG M E DYPY + CKF + I S+
Sbjct: 144 -VTSCN-GCHNGLMSAAYDYVIQYQGGKFMLETDYPYTAVEGTCKFNQAKATSKIVSYIN 201
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L ++ GP V I+AS +TFQLY+ GIYD+ C+S +NH + VGY
Sbjct: 202 VVEGDEKDLAAKVSAYGPSTVGIDASHYTFQLYSHGIYDEPHCSSFSLNHGVGCVGYGTE 261
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYA 304
T+N WI++N W WG+ GY+ +K NN CGIA A
Sbjct: 262 GTKNYWIVRNSWGLEWGEQGYVRMIKDKNNNCGIATVA 299
>gi|13242027|gb|AAK16514.1|AF331036_1 cathepsin L-like cysteine proteinase [Onchocerca volvulus]
Length = 401
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 103/286 (36%), Positives = 147/286 (51%), Gaps = 19/286 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRL-----THSRIRR---- 84
++SN N++ +QGL YT NHL+DL + + K M L TH R RR
Sbjct: 116 FESNELMTEATNRKYEQGLISYTNGLNHLADLTDKKF-KMMNGLRFPNETHLRTRRQTRH 174
Query: 85 TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
T+ + + + +P +DWR+KG +TP NQ CG+CYAF+ A++ K T ++ +
Sbjct: 175 TVGQKYTYDPNEKLPVSVDWRKKGMVTPVKNQGVCGSCYAFAAIGALEAYNKKKTGKLVD 234
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
LSIQ VDC+ GN GC GG + N + Y GL E + G + CK++ N
Sbjct: 235 LSIQNAVDCTWTLGNYGCRGGYM-NPIFYYATKFGLAMESEISVLGTEQKCKWQEENAYA 293
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
++ + DE L +A GP+ V IN S F+ Y SG+Y + C +NHA+L
Sbjct: 294 TDKGYAAIQRGDELGLMHAVAKHGPVVVGINGSKRPFKFYKSGVYSNRDCGD--LNHAVL 351
Query: 265 LVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
LVGY ++ WI+KN W WG GY Y+ R N C IA A
Sbjct: 352 LVGYGKHKTYGEYWIIKNSWGTDWGKKGYAYMARNKGNMCHIATLA 397
>gi|454890|emb|CAA54438.1| cysteine proteinase, putative [Trichomonas vaginalis]
Length = 292
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 149/279 (53%), Gaps = 19/279 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+T+ N L+ L P Y L R+ + ++ +SN
Sbjct: 21 WLSNKRLVQEHNRAN----LGFTVALNKLAHLTPAEY----NSLLGFRMNKAERKAVKSN 72
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ DWR+KG + P +Q CG+C+AFS A + Q + S ++ LS Q +VDC
Sbjct: 73 --AIANADCDWRKKGAVNPIKDQGQCGSCWAFSAIQAQESQYYISFKTLQSLSEQNLVDC 130
Query: 154 SIISGNLGCAGGSLRNTLNYV--QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
++ GC GG + +YV +G M E DYPY + CKF I S+
Sbjct: 131 --VTTCYGCNGGLMDAAYDYVVKHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSYVN 188
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L ++T+GP A++I+AS +FQLY+SGIYD+ AC+S ++H + VGY
Sbjct: 189 VAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYDESACSSYNLDHGVGCVGYGTE 248
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAV 305
++N WI++N W WG+ GY+ +K NN+CG A A
Sbjct: 249 GSKNYWIVRNSWGTAWGEKGYIRMIKDKNNQCGEATMAC 287
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 162/301 (53%), Gaps = 17/301 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + ++ +Q N I HN++ ++G + + +D+ ++ ++ +L
Sbjct: 32 KTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFL-DLLKLQ 90
Query: 79 HSRIRRTLVRSPESNESVLIP--DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
+ ++ E + + D +DWRE+G +TP +Q +CG+C+AFS AI+GQ F
Sbjct: 91 GVPALPSNAVHFDNFEDIDMEEKDAIDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFF 150
Query: 137 KSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
K + LS Q++VDC+ GN GC GG + ++VQ G+ EE YPY+G++S C
Sbjct: 151 KKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQ-DEGIQTEESYPYEGRRSSC 209
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA-C 254
K K V + ++ + P DE + T+A GP+AV+I AS +F Y GI D+ C
Sbjct: 210 K-KSGEYVTKVKTY--VFPLDEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDERCRC 264
Query: 255 TS--DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
++ + +NH +L+VGY + WI+KN W WG+ GY LK+ CGI Y Y +
Sbjct: 265 SNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWGEKGYFRLKKDVKACGIGTYNTYPV 324
Query: 309 I 309
+
Sbjct: 325 L 325
>gi|123438675|ref|XP_001310117.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121891873|gb|EAX97187.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/279 (34%), Positives = 149/279 (53%), Gaps = 19/279 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+T+ N L+ L P Y K + ++ R +V+S
Sbjct: 34 WLSNKRLVQEHNRANL----GFTVALNKLAHLTPAEY-KSLLGFRMNKAERKVVKS---- 84
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ + DWR++G + P Q CG+C+AFS A + Q + + ++ LS Q +VDC
Sbjct: 85 -NAIANADCDWRKEGAVNPIKGQGQCGSCWAFSAIQAQESQYYIAFKNLQSLSEQNLVDC 143
Query: 154 SIISGNLGCAGGSLRNTLNYV--QFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
++ GC GG + +YV +G M E DYPY + CKF I S+
Sbjct: 144 --VTTCYGCNGGLMDAAYDYVINHQSGKFMTEADYPYTARDGSCKFNAAKGTSQIKSYVN 201
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L ++T+GP A++I+AS +FQLY+SGIYD+ AC+S ++H + VGY
Sbjct: 202 VAEGDEKDLATKVSTLGPAAIAIDASAWSFQLYSSGIYDESACSSYNLDHGVGCVGYGTE 261
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAV 305
++N WI++N W WG+ GY+ +K NN+CG A A
Sbjct: 262 GSKNYWIVRNSWGTSWGEKGYIRMIKDKNNQCGEATMAC 300
>gi|224069140|ref|XP_002326284.1| predicted protein [Populus trichocarpa]
gi|118482340|gb|ABK93094.1| unknown [Populus trichocarpa]
gi|222833477|gb|EEE71954.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 99/294 (33%), Positives = 148/294 (50%), Gaps = 10/294 (3%)
Query: 22 RKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSR 81
++ T+ + KL + + + ++GL YTL N +D + + K RL ++
Sbjct: 67 KRYETEGEMKLRFAIFSESLDLIRSTNKKGLP-YTLGLNQFADWTWQEFQK--YRLGAAQ 123
Query: 82 IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSE 141
R + L+P+ DWRE+G ++P NQ CG+C+ FS A++ ++ +
Sbjct: 124 NCSATTRGNHKLTNALLPETKDWREEGIVSPVKNQGHCGSCWTFSTTGALEAAYHQAFGK 183
Query: 142 IEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN 201
LS QQ+VDC+ N GC GG Y++F GGL EE YPY GK CKF N
Sbjct: 184 GISLSEQQLVDCARAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKDDACKFSSEN 243
Query: 202 IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY--V 259
+ V + + E LK +A V P++V+ +F+LY G+Y C S V
Sbjct: 244 VGVRVVESVNITLGAEDELKHAVAFVRPVSVAFEVV-GSFRLYKEGVYTTSTCGSTPMDV 302
Query: 260 NHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
NHA+L VGY + W++KN W WGDNGY ++ G N CGIA A Y ++
Sbjct: 303 NHAVLAVGYGVENGIPYWLIKNSWGEDWGDNGYFKMEMGKNMCGIATCASYPVV 356
>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
Length = 371
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 165/319 (51%), Gaps = 38/319 (11%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F+ +++ K Y+ + + +++N ++ H HG T SDL P
Sbjct: 48 FLSFVQRFGKSYKDAEEHAYRLSIFKANLRRARRHQLLDPSAEHGVT----KFSDLTPAE 103
Query: 70 YIKEMTRLTHSRIRRTLVR--SPESNESVLIP-----DHLDWREKGFITPDWNQEDCGAC 122
+ + T L + RR L+R +NE+ ++P D DWR+ G +TP NQ CG+C
Sbjct: 104 FRR--TYLGLRKSRRALLRELGKSANEAPVLPTDGLPDDFDWRDHGAVTPVKNQGSCGSC 161
Query: 123 YAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQ 175
++FS + A++G + +T ++E LS QQ+VDC + S + GC GG + N +Y+Q
Sbjct: 162 WSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEPDSCDSGCNGGLMTNAFSYLQ 221
Query: 176 FAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSIN 235
AGGL E+DYPY G CKF + IV + ++SV+ DE + L GP+A+ IN
Sbjct: 222 KAGGLESEKDYPYTGSDDKCKFDKSKIVASVQNFSVV-SVDEGQIAANLIKHGPLAIGIN 280
Query: 236 ASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----------TRNSWILKNWWSHHWG 284
A+ Q Y G+ C ++H +LLVGY + WI+KN W +WG
Sbjct: 281 AA--YMQTYIGGVSCPYIC-GRTLDHGVLLVGYGAAGFAPIRLKDKPYWIIKNSWGENWG 337
Query: 285 DNGYMYLKRGN---NRCGI 300
+NGY + RG+ N+CG+
Sbjct: 338 ENGYYKICRGSNVRNKCGV 356
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 158/312 (50%), Gaps = 32/312 (10%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F ++K+KK Y + + ++SN ++ H + HG T SDL
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDPTASHGVT----QFSDLTSAE 108
Query: 70 YIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
+ K++ L R+ + +P +N+ +P+ DWREKG + P NQ CG+C++FS
Sbjct: 109 FRKQVLGLRKLRLPKDANTAPILPTND---LPEDFDWREKGAVGPVKNQGSCGSCWSFST 165
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDC-------SIISGNLGCAGGSLRNTLNYVQFAGGL 180
A++G F +T E+ LS QQ+VDC S + GC GG + + Y AGGL
Sbjct: 166 TGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 225
Query: 181 MKEEDYPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
M+EEDYPY G + CKF + + ++++SV+ DE + L GP+AV+INA
Sbjct: 226 MREEDYPYTGMDRGACKFDKNKVAAGVANFSVV-SLDEDQIAANLVKNGPLAVAINAV-- 282
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGY 288
Q Y G+ C S ++H +LLVGY + WI+KN W WG+NG+
Sbjct: 283 FMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGF 341
Query: 289 MYLKRGNNRCGI 300
+ RG N CG+
Sbjct: 342 YKICRGRNICGV 353
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 102/300 (34%), Positives = 150/300 (50%), Gaps = 13/300 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+Y K Y ++ + + K I +HN+ +GL YTL N +DL + K
Sbjct: 66 RYGKSYETAEEMKRRFSIFVDSLKMIRSHNK---KGLS-YTLGVNEFADLTWEEFRKH-- 119
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
RL ++ ++ + L+P DWRE G +TP NQ CG+C+ FS A++
Sbjct: 120 RLGAAQNCSATLKGNHKLTNGLLPLKKDWREVGIVTPVKNQGHCGSCWTFSTTGALEAAY 179
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
++ + LS QQ+VDC+ N GC GG Y++ GGL EE YPY G +C
Sbjct: 180 VQAFGKAIFLSEQQLVDCARAYNNFGCNGGLPSQAFEYIKANGGLDTEEAYPYTGVDGVC 239
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC- 254
KF NI V + + E LK +A V P++V+ F+LY SG+Y + C
Sbjct: 240 KFSSENIGVQVLDSVNITLGAEDELKDAVAFVRPVSVAFEVVSG-FRLYKSGVYTSDTCG 298
Query: 255 -TSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
T VNHA++ VGY + W++KN W WGDNGY ++ G N CG+A A Y ++
Sbjct: 299 NTPMDVNHAVVAVGYGVENDVPYWLIKNSWGADWGDNGYFKMEMGKNMCGVATCASYPVV 358
>gi|198285481|gb|ACH85279.1| cathepsin l-like [Salmo salar]
Length = 444
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 155/308 (50%), Gaps = 11/308 (3%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
+F ++++ + Y + K++ + N + +H+ N+ ++L N LSDL
Sbjct: 140 LFGHFKEQFGRHYGDEREHEKREHAFVHNLRYVHSMNRAGLS----FSLAVNSLSDLSMS 195
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R R L V +PD LDWR G +TP +Q CG+C++F+
Sbjct: 196 ELSAMRGRNRGKRPNNGLPFPMHLYTGVQVPDQLDWRLYGAVTPVKDQAICGSCWSFATT 255
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-P 187
A++G +F ++ ++ LS Q +VDCS GN GC GG ++ GG+ E Y
Sbjct: 256 GAVEGALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWIMKHGGIATTETYGS 315
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y G +C F + I S++ + D ALKV L GP+AVSI+A +F Y+ G
Sbjct: 316 YMGMNGLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVSIDAGHRSFVFYSHG 375
Query: 248 IYDDEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
+Y + C T+D ++HA+L VGY W++KN WS +WG++GY+ + +N CG+
Sbjct: 376 VYYEPKCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDGYILMSMKDNNCGVT 435
Query: 302 NYAVYALI 309
A Y +
Sbjct: 436 TDATYVTL 443
>gi|2146900|pir||S67481 cathepsin L-like cysteine proteinase (EC 3.4.22.-) CP1 [similarity]
- fruit fly (Drosophila melanogaster) (fragment)
Length = 218
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 79/218 (36%), Positives = 128/218 (58%), Gaps = 6/218 (2%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DWR KG +T +Q CG+C+AFS A++GQ F+ + + LS Q +VDCS
Sbjct: 1 LPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKY 60
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN GC GG + N Y++ GG+ E+ YPY+ C F R + ++ +P DE
Sbjct: 61 GNNGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDSCHFNRAQVGATDRGFTDIPQGDE 120
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----- 272
+ + TVGP++V+I+AS +FQ Y+ G+Y++ C + ++H +L+VG+ +
Sbjct: 121 KKMPEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDY 180
Query: 273 WILKNWWSHHWGDNGYM-YLKRGNNRCGIANYAVYALI 309
W++KN W WGD G++ L+ N+CGIA+ + Y L+
Sbjct: 181 WLVKNSWGTTWGDKGFIKMLRNKENQCGIASPSSYPLV 218
>gi|330800456|ref|XP_003288252.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
gi|325081708|gb|EGC35214.1| hypothetical protein DICPUDRAFT_55299 [Dictyostelium purpureum]
Length = 531
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 102/307 (33%), Positives = 161/307 (52%), Gaps = 13/307 (4%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F+ + +Y+K Y K + +++ H KI +HN + Y L NH +DL H
Sbjct: 225 FVAFKSEYEKSYENKEEHDMRFKNYKVAHNKIVSHNAKNLS----YKLGFNHYADLS-DH 279
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESV-LIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ + +R S +E + IP +DWR + +TP +Q CG+C+ F
Sbjct: 280 EFNTLIKPKVARPSNNGAHSVHDDEDIYTIPQSVDWRNQKCVTPVKDQGVCGSCWTFGST 339
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+++G + + LS QQ+VDC+ + G+ GC GG + Y+ AGG+ E DY Y
Sbjct: 340 GSLEGTNCVTNGYLVSLSEQQLVDCAYLMGSQGCNGGFAASAFQYIMDAGGIATESDYQY 399
Query: 189 KGKQSICKFKRPNIV-VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
+ ++CK K V +SS+ + +AL +AT GP+A++I+AS F+ Y SG
Sbjct: 400 LMQNALCKDKSTTFSGVGVSSYVNVTAGSINALLNAVATQGPVAIAIDASVDDFRYYQSG 459
Query: 248 IYDDEACTS--DYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
IY + +C + D ++H +L +GY T N W++KN WS +WG GY L+R NN CG A
Sbjct: 460 IYSNPSCKNGPDDLDHEVLAIGYGTLNGVDYWLVKNSWSTNWGMEGYFMLERANNLCGPA 519
Query: 302 NYAVYAL 308
+ A Y L
Sbjct: 520 SQATYPL 526
>gi|443687700|gb|ELT90593.1| hypothetical protein CAPTEDRAFT_217275 [Capitella teleta]
Length = 248
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 135/239 (56%), Gaps = 9/239 (3%)
Query: 77 LTHSRIRRTLVRSPESNESV-LIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
L RI T S E V +P +DWR KGF++ +Q C + YAF+ A++GQ+
Sbjct: 5 LKKRRIVSTAQELGLSEEQVRTLPHSVDWRTKGFVSEVQDQGACSSSYAFAALGAVEGQV 64
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F T ++ LS Q +VDC+ I GC GG L NTL+Y+ G+ DYPY G+ C
Sbjct: 65 FNKTGKLTTLSAQNIVDCAGIMLGDGCNGGILDNTLDYIMAFKGVNSLVDYPYIGQLQQC 124
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
F + + I + +L +E LK+T+ATVGP+AV+++AS F+ Y G+ D C+
Sbjct: 125 AFNQSKAISGIGGYVILSGLNETLLKITIATVGPVAVTMDASSPYFRYYRHGVLVDPECS 184
Query: 256 SDY--VNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
S++ NH++L+VGY + WI KN W WG GY++L R +N CG++ + +Y
Sbjct: 185 SNWKDQNHSVLIVGYGTDEYGVEYWIAKNSWGKQWGMKGYVHLARNHDNMCGLSTHPMY 243
>gi|194757748|ref|XP_001961124.1| GF13714 [Drosophila ananassae]
gi|190622422|gb|EDV37946.1| GF13714 [Drosophila ananassae]
Length = 352
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 97/291 (33%), Positives = 155/291 (53%), Gaps = 28/291 (9%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRT----------LVRSP 90
I N+ + G+ + L N L+D+ KE+ L S+I + P
Sbjct: 68 IALSNKNSDSGISSFQLSVNPLADMTK----KEIATLLGSKISDNGEKYTNGHINFLTGP 123
Query: 91 ESNESVLIPDHLDWREKGFITPDWNQE-DCGACYAFSIASAIQGQIFKSTSEIEELSIQQ 149
++ +V +P+ DWREKG +TP Q CG+C++F+ A++G IF+ T + LS Q
Sbjct: 124 RTS-NVNLPEKFDWREKGGVTPPGFQGVGCGSCWSFATTGALEGHIFRRTGILPSLSQQN 182
Query: 150 VVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK----FKRP--NIV 203
+VDC+ GN+GC GG Y++ G+ YPY + C+ RP +
Sbjct: 183 LVDCADDYGNMGCDGGFQEYAFEYIR-DHGITLANKYPYTQTEMQCRQNETAGRPPRESI 241
Query: 204 VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAM 263
V + ++ + P DE +K +AT+GP+A S+NA +F+ Y GIY+D+AC + VNH++
Sbjct: 242 VKVRDYASITPGDEEKMKEVIATLGPLACSMNADTISFEQYGGGIYEDDACNAGEVNHSV 301
Query: 264 LLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIANYAVYALI 309
+VGY R+ WI+KN +S +WG+ G+M L R CGIA+ Y ++
Sbjct: 302 TVVGYGSENGRDYWIIKNSYSQNWGEGGFMRLPRNAGGFCGIASECSYPIL 352
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 156/301 (51%), Gaps = 14/301 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+Y K Y A ++ + + +++ + N+ +GL Y L N SD+ + + T
Sbjct: 70 RYGKSYESAAEVRRRFRIFSESLEEVRSTNR---KGLS-YRLGINRFSDMSWEEF--QAT 123
Query: 76 RLTHSRI-RRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
RL ++ TL + ++ +P+ DWRE G ++P +Q CG+C+ FS A++
Sbjct: 124 RLGAAQTCSATLAGNHLMRDAAALPETKDWREDGIVSPVKDQSHCGSCWTFSTTGALEAA 183
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
++T + LS QQ+VDC+ N GC+GG Y+++ GG+ EE YPYKG +
Sbjct: 184 YTQATGKNISLSEQQLVDCAGGFNNFGCSGGLPSQAFEYIKYNGGIDTEESYPYKGVNGV 243
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
C +K N VV + + E LK + V P++V+ + F+ Y SG+Y + C
Sbjct: 244 CHYKAENAVVQVLDSVNITLNAEDELKNAVGLVRPVSVAFEV-INGFRQYKSGVYSSDHC 302
Query: 255 --TSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
T D VNHA+L VGY + W++KN W WGDNGY ++ G N C +A A Y +
Sbjct: 303 GTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAVATCASYPI 362
Query: 309 I 309
+
Sbjct: 363 V 363
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 99/291 (34%), Positives = 158/291 (54%), Gaps = 17/291 (5%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM--TRLTHSRIRR 84
DS+ ++ W N + + HN+ A G HG+ L N +DL + R+ +R
Sbjct: 75 DSRFRVFWD-NLRFVDAHNERA--GEHGFRLGMNQFADLTNDEFRAAYLGARIPAARSGN 131
Query: 85 TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
+ + + +P+ +DWREKG + P NQ CG+C+AFS S+++ T E+
Sbjct: 132 AVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESINQIVTGEMVT 191
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI-V 203
LS Q++V+CS GN GC GG + N++ GG+ E+DYPYK C R N V
Sbjct: 192 LSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGGIDTEDDYPYKAVDGKCDINRRNAKV 251
Query: 204 VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAM 263
V I ++ +P DE +L+ +A P++V+I A FQLY SG++ +CT++ ++H +
Sbjct: 252 VSIDAFEDVPENDEKSLQKAVAHQ-PVSVAIEAGGRQFQLYKSGVFSG-SCTTN-LDHGV 308
Query: 264 LLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
+ VGY ++ WI++N W WG+ GY+ ++R N +CGIA A Y
Sbjct: 309 VAVGYGTENGKDYWIVRNSWGPKWGEAGYIRMERNINATTGKCGIAMMASY 359
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 154/309 (49%), Gaps = 25/309 (8%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + K+ K Y K + ++SN K H HG T + + R
Sbjct: 48 FTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQ 107
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
++ RL + P +N +P+ DWREKG +TP +Q CG+C+AFS
Sbjct: 108 FLGLKKRLRLPAHAQKAPILPTTN----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTG 163
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMK 182
A++G + +T ++ LS QQ+VDC + S + GC GG + N Y+ +GG+++
Sbjct: 164 ALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQ 223
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E+DY Y G+ CKF + +V +S++SV+ DE + L GP+AV+INA+ Q
Sbjct: 224 EKDYAYTGRDGSCKFDKSKVVASVSNFSVV-TLDEDQIAANLVKNGPLAVAINAA--WMQ 280
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMYL 291
Y SG+ C ++H +LLVG+ + + WI+KN W +WG+ GY +
Sbjct: 281 TYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKI 340
Query: 292 KRGNNRCGI 300
RG N CG+
Sbjct: 341 CRGRNVCGV 349
>gi|118373972|ref|XP_001020178.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89301945|gb|EAR99933.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 339
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/290 (34%), Positives = 155/290 (53%), Gaps = 11/290 (3%)
Query: 26 TDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTH-SRIRR 84
T+S++ ++S KI HN + + YT NHL+ R ++++ + + + +
Sbjct: 54 TNSERYQLFKSRLAKIIEHNSDPNKT---YTQGINHLT-FQTREELQQLRGFQNCTALAK 109
Query: 85 TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI-FKSTSEIE 143
RS + + IPD++DWREKG +T NQ +CG+C+AF+ AI+ K+
Sbjct: 110 ENTRSFRTYDMNDIPDYVDWREKGVVTAVKNQGECGSCWAFAAVGAIESHFSLKTGKSPI 169
Query: 144 ELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIV 203
+LS QQ++DC+ N GC GG Y+ + GG+ +DYPY GK + C+F NIV
Sbjct: 170 QLSEQQLIDCARQFDNHGCDGGLPSKAFEYIAYEGGIENSKDYPYTGKNNKCQFDGENIV 229
Query: 204 VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD--YVNH 261
+ + DE L L GP+ ++ A+ F Y SGIY+ + C D VNH
Sbjct: 230 TKVKQSFNITYLDEKELIYHLVHKGPVTLAYEAADE-FDNYQSGIYEGKNCEQDPQKVNH 288
Query: 262 AMLLVGY--TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
A+L VGY T + +I+KN W WG NGY Y++ N CG+A+ A Y +I
Sbjct: 289 AVLAVGYNKTGDYYIVKNSWGDKWGMNGYFYIRANKNACGLASCASYPII 338
>gi|209155876|gb|ACI34170.1| Digestive cysteine proteinase 2 precursor [Salmo salar]
Length = 551
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 155/308 (50%), Gaps = 11/308 (3%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
+F ++++ + Y + K++ + N + +H+ N+ ++L N LSDL
Sbjct: 247 LFGHFKEQFGRHYGDEREHEKREHAFVHNLRYVHSMNRAGLS----FSLAVNSLSDLSMS 302
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R R L V +PD LDWR G +TP +Q CG+C++F+
Sbjct: 303 ELSAMRGRNRGKRPNNGLPFPMHLYTGVQVPDQLDWRLYGAVTPVKDQAICGSCWSFATT 362
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-P 187
A++G +F ++ ++ LS Q +VDCS GN GC GG ++ GG+ E Y
Sbjct: 363 GAVEGALFLTSGSLQVLSQQMLVDCSWGFGNNGCDGGEEWRAYEWIMKHGGIATTETYGS 422
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y G +C F + I S++ + D ALKV L GP+AVSI+A +F Y+ G
Sbjct: 423 YMGMNGLCHFNTSQLTARIQSYTNVTSGDAEALKVALFKHGPVAVSIDAGHRSFVFYSHG 482
Query: 248 IYDDEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
+Y + C T+D ++HA+L VGY W++KN WS +WG++GY+ + +N CG+
Sbjct: 483 VYYEPKCGNTTDSLDHAVLAVGYGVMEAEPYWLVKNSWSTYWGNDGYILMSMKDNNCGVT 542
Query: 302 NYAVYALI 309
A Y +
Sbjct: 543 TDATYVTL 550
>gi|19909509|dbj|BAB86959.1| cathepsin L [Fasciola gigantica]
Length = 324
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 154/310 (49%), Gaps = 28/310 (9%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N L+D+
Sbjct: 25 KRMYNKEY-NGAVDEHRRNIWEDNVKHIQEHNLRHDVGLVTYTLGLNQLTDMTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWR G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRGSGYVTTVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S ++GQ K+ S QQ+VDCS GN GC+GG + N Y+ QF GL E
Sbjct: 136 STTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNYGCSGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C++ R V ++ + + E LK + GP A++++ F +Y
Sbjct: 194 SYPYTAVEGQCRYNRQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAIAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
+ GIY + C +NHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 SGGIYQSQTCLR--LNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMARNRGNMCG 310
Query: 300 IANYAVYALI 309
I++ A ++
Sbjct: 311 ISSLASLPMV 320
>gi|281211531|gb|EFA85693.1| cysteine protease [Polysphondylium pallidum PN500]
Length = 366
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 157/316 (49%), Gaps = 50/316 (15%)
Query: 30 KKLH-WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHP----RHYIKEMTRLTHSRIRR 84
KK H ++SN +H+ N + + L NHL+D P + Y+ + H ++
Sbjct: 45 KKYHTFKSNMDYVHSWNAKNSDTV----LELNHLADHSPEEYKKFYLGTRVKHIHFNVQG 100
Query: 85 TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEE 144
T + + S +DWR+KG ++P +Q CG+C++FS +++G T + E
Sbjct: 101 THINTQLSTVFEDSGATVDWRKKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKTGNMVE 160
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ-SICKFKRPNIV 203
LS Q +VDCS GN+GC GG + N +Y+ G+ E+ YPY S+CKF + N+
Sbjct: 161 LSEQNLVDCSSAEGNMGCNGGLMNNAFDYIISNHGIDTEQSYPYTANTGSVCKFNKTNVG 220
Query: 204 VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAM 263
ISS+ + P E L + T GP++V+I+AS +FQLY+ GIY + C+S ++H +
Sbjct: 221 ATISSYKSITPGSETDLANAVKTAGPVSVAIDASHRSFQLYSHGIYYEWLCSSTRLDHGV 280
Query: 264 LLVGY---------------------------------------TRNSWILKNWWSHHWG 284
L+VGY ++N WI+KN WS WG
Sbjct: 281 LVVGYGSGNPPNSDMDHMILKKTAKTDHYHGKKSLKVEKVDTTSSKNYWIVKNSWSDTWG 340
Query: 285 DNGYMYLKRG-NNRCG 299
D GY+Y+ + N CG
Sbjct: 341 DKGYIYMSKDRKNNCG 356
>gi|68137209|gb|AAY85545.1| male accessory gland protein [Drosophila simulans]
Length = 362
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 147/295 (49%), Gaps = 15/295 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K Y A + + + S + N QG++ + N +DL ++ ++T L
Sbjct: 63 KTYLSAADRALHEGAFASTKNLVDAGNAAFAQGVNTFKQAVNAFADLTHSEFLSQLTGLK 122
Query: 79 HSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
S + R+ S + V IP+ DWRE G +TP Q CG+C+AF+ AI+G
Sbjct: 123 RSPEAK--ARAAASLKEVALPAKPIPEAFDWREHGGVTPVKFQGTCGSCWAFATTGAIEG 180
Query: 134 QIFKSTSEIEELSIQQVVDCSIIS--GNLGCAGGSLRNTLNYV-QFAGGLMKEEDYPYKG 190
F+ T + LS Q +VDC + G GC GG ++ + G+ + YPY
Sbjct: 181 HTFRKTGSLPNLSEQNLVDCGPVQDFGLNGCDGGFQEAAFCFIDEVQKGVSQAGAYPYID 240
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ CK+ + ++ +PP+DE LK +AT+GP+A S+N T + YA GIY+
Sbjct: 241 NKDTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGL-ETLKNYAGGIYN 299
Query: 251 DEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
D+ C NH++L+VGY ++ WI+KN W WG+ GY L RG N C IA
Sbjct: 300 DDECNKGEPNHSILVVGYGSENGQDYWIVKNSWDDTWGEKGYFRLPRGKNYCFIA 354
>gi|410911058|ref|XP_003969007.1| PREDICTED: counting factor associated protein D-like [Takifugu
rubripes]
Length = 549
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 165/312 (52%), Gaps = 19/312 (6%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
IF ++K+++ Y ++ + N + IH+ N+ GL YTL N LSD
Sbjct: 245 IFGHFKEKFQRRYEDDKEHDIRQQAFIHNLRYIHSKNR---AGL-SYTLALNSLSD---- 296
Query: 69 HYIKEMTRLTHSRIRRTLVRSP----ESNESVLIPDHLDWREKGFITPDWNQEDCGACYA 124
+ E+ + + R+T R + E+V +PD LDWR G +TP +Q CG+C++
Sbjct: 297 RTMSELGTMRGKKQRKTPNRGLPFPLKLYENVQVPDSLDWRLYGAVTPVKDQAICGSCWS 356
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
F+ I+G +F T ++ LS Q ++DCS GN C GG + ++ GG+ E
Sbjct: 357 FATTGTIEGALFLKTGFLQVLSQQILMDCSWGFGNNACDGGEEWRSYEWIMKHGGIALAE 416
Query: 185 DY-PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
Y PY G C +V I S++ + D ALK+ L GP+AVSI+AS +F
Sbjct: 417 TYGPYMGMNGFCHVNSSELVAQIQSYTNVTSGDAMALKLALFKHGPVAVSIDASHRSFVF 476
Query: 244 YASGIYDDEACTS--DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNR 297
Y+ G+Y + AC S D ++HA+L VGY + W++KN WS +WG++GY+ + +N
Sbjct: 477 YSHGVYYEPACGSTIDDLDHAVLAVGYGNLNGEPYWLIKNSWSTYWGNDGYILMSMKDNN 536
Query: 298 CGIANYAVYALI 309
CG+A A + +
Sbjct: 537 CGVATDATFVTL 548
>gi|211910923|gb|ACJ13090.1| cathepsin L [Globodera rostochiensis]
Length = 295
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 90/260 (34%), Positives = 146/260 (56%), Gaps = 3/260 (1%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
+ QK +K Y + ++++ L + S + I HNQ +G + + ENH++DL Y
Sbjct: 34 YKQKHGRKSYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEYK 93
Query: 72 K--EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
K RL +RR +P+ +DWR+KG++T NQ CG+C+AFS
Sbjct: 94 KLNGYRRLLGDNLRRNASTFLAPMNIGDLPESVDWRDKGWVTEVKNQGMCGSCWAFSSTG 153
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++ Q + T ++ LS Q ++DCS GN+GC GG + N Y++ G+ KE DYPYK
Sbjct: 154 ALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELDYPYK 213
Query: 190 GKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C FKR ++ + + + DE LK+ +AT GP++V+I+A +FQLY G+
Sbjct: 214 AKTGKKCLFKRNDVGATDTGFFDIAEGDEERLKIAVATQGPVSVAIDAGHRSFQLYTHGV 273
Query: 249 YDDEACTSDYVNHAMLLVGY 268
Y ++ C+ + ++H +L+VGY
Sbjct: 274 YFEKECSPENLDHGVLVVGY 293
>gi|313246319|emb|CBY35240.1| unnamed protein product [Oikopleura dioica]
Length = 326
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 157/304 (51%), Gaps = 13/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH---- 69
+++++ +Y + + + W N + H + G +T+ N +DL
Sbjct: 22 KEEHEVEYASQVEEVSRYGVWMKNKAFVDEHMASYEAGEKTFTVGMNKFADLTSEEFAEL 81
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKG--FITPDWNQEDCGACYAFSI 127
Y+ ++ L+ + +N + +P DWR +TP +Q CG+C+AFS
Sbjct: 82 YLAKVQDLSGPHPPMCTDSTVGANST--MPASADWRTANPPVVTPVKDQGQCGSCWAFST 139
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
++++ Q + + + LS QQ+VDCS+ GN GC+GG + Y+ G+ E YP
Sbjct: 140 IASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYIHDNNGVDTEASYP 199
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + C F N+ ++S + DE AL + VGP++V+I+AS +FQLY SG
Sbjct: 200 YTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVAIDASHMSFQLYTSG 259
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+Y + C+S +++H + VGY +S +I+KN W+ WGDNGY+ + R NN CGIA
Sbjct: 260 VYYEPNCSSQFLDHGVTAVGYGSSSGNDFFIVKNSWAATWGDNGYIMMSRNKNNNCGIAT 319
Query: 303 YAVY 306
A Y
Sbjct: 320 SASY 323
>gi|168047065|ref|XP_001775992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672650|gb|EDQ59184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 158/322 (49%), Gaps = 38/322 (11%)
Query: 7 IIIFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLH 66
++ F KYKK+Y+ + + + + K + THN+ G H Y+L N +D+
Sbjct: 26 VLHFAGFAAKYKKEYKTVEELKHRFVTFLESVKLVETHNK----GQHSYSLAVNEFADM- 80
Query: 67 PRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHL----------DWREKGFITPDWNQ 116
T R + + E N S + +H+ DWRE+G ++ NQ
Sbjct: 81 -----------TFEEFRDSRLMKGEQNCSATVGNHVLTGESLPKTKDWREEGIVSQVKNQ 129
Query: 117 EDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQF 176
CG+C+ FS A++ ++T ++ LS QQ+VDC+ N GC GG Y+++
Sbjct: 130 ASCGSCWTFSTTGALEAAHAQATGKMVLLSEQQLVDCAGEFNNFGCGGGLPSQAFEYIRY 189
Query: 177 AGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQD--EHALKVTLATVGPIAVSI 234
GG+ E+ YPY K S C+F + I + W V+ + E LK +AT+ P++V+
Sbjct: 190 NGGIDTEDSYPYNAKDSQCRFHKNTIGAQV--WDVVNITEGAETQLKHAIATMRPVSVAF 247
Query: 235 NASPHTFQLYASGIYDDEACTS--DYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNG 287
H F+LY G+Y C + VNHA+L VGY + WI+KN W WG NG
Sbjct: 248 EV-VHDFRLYNGGVYTSLNCHTGPQTVNHAVLAVGYGEDENGVPYWIIKNSWGADWGMNG 306
Query: 288 YMYLKRGNNRCGIANYAVYALI 309
Y ++ G N CG+A A Y ++
Sbjct: 307 YFNMEMGKNMCGVATCASYPVV 328
>gi|313213752|emb|CBY40632.1| unnamed protein product [Oikopleura dioica]
Length = 440
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 157/304 (51%), Gaps = 13/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH---- 69
+++++ +Y + + + W N + H + G +T+ N +DL
Sbjct: 136 KEEHEVEYASQVEEVSRYGVWMKNKAFVDEHMASYEAGEKTFTVGMNKFADLTSEEFAEL 195
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKG--FITPDWNQEDCGACYAFSI 127
Y+ ++ L+ + +N + +P DWR +TP +Q CG+C+AFS
Sbjct: 196 YLAKVQDLSGPHPPMCTDSTVGANST--MPASADWRTANPPVVTPVKDQGQCGSCWAFST 253
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYP 187
++++ Q + + + LS QQ+VDCS+ GN GC+GG + Y+ G+ E YP
Sbjct: 254 IASLESQWALAGNALTSLSEQQLVDCSMNWGNYGCSGGLMTQGFTYIHDNNGVDTEASYP 313
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y + C F N+ ++S + DE AL + VGP++V+I+AS +FQLY SG
Sbjct: 314 YTAQDGKCVFNPANVGTSLTSCYNIASGDEAALANAVQMVGPMSVAIDASHMSFQLYTSG 373
Query: 248 IYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIAN 302
+Y + C+S +++H + VGY +S +I+KN W+ WGDNGY+ + R NN CGIA
Sbjct: 374 VYYEPNCSSQFLDHGVTAVGYGSSSGNDFFIVKNSWAATWGDNGYIMMSRNKNNNCGIAT 433
Query: 303 YAVY 306
A Y
Sbjct: 434 SASY 437
>gi|384251069|gb|EIE24547.1| cysteine proteinase [Coccomyxa subellipsoidea C-169]
Length = 548
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 105/302 (34%), Positives = 156/302 (51%), Gaps = 32/302 (10%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLV---RSP 90
+ SN + THN + Y+L N +D + E L LV R
Sbjct: 236 FHSNRLYVDTHNSAQRN----YSLALNRYADWSQEEF--EAVMLPRHGAAGGLVARRRGA 289
Query: 91 ESNE--------SVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
+S E + +P +DWR G +Q CG+C+AF +A+QG + +T +
Sbjct: 290 DSGELPYERLAPAWRVPRGVDWRGTGADGVVKDQATCGSCWAFGATAALQGAYWAATGKA 349
Query: 143 EELSIQQVVDCS-IISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC------ 195
S QQ++DCS N C GG N ++Y+ GG+ KE+DY Y G+ C
Sbjct: 350 TSFSEQQLMDCSWKFGANHACDGGDYDNAIDYLVNVGGIAKEKDYEYLGQDDFCGNPFMA 409
Query: 196 KFKRPNI-VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
KRP+ +V + +S +PP D+ AL + + G IAVS++AS TF+ YASG YD+ C
Sbjct: 410 DTKRPSSSLVKVKGYSYVPPHDDEALMEAVYSRGTIAVSLDASQPTFRFYASGTYDEPNC 469
Query: 255 T--SDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYA 307
SD ++HA+ LVGY + WI+KN WS+HWGD G++ + RG++ CGI + VYA
Sbjct: 470 MWKSDDLDHAVALVGYGTDEAGVDYWIIKNSWSNHWGDGGFIKIARGHHGCGITSDPVYA 529
Query: 308 LI 309
+
Sbjct: 530 VF 531
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 162/301 (53%), Gaps = 17/301 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + ++ +Q N I HN++ ++G + + +D+ ++ ++ +L
Sbjct: 32 KTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFL-DLLKLQ 90
Query: 79 HSRIRRTLVRSPESNESVLIP--DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
+ ++ E + + D +DWRE+G +TP +Q +CG+C+AFS AI+GQ F
Sbjct: 91 GVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFF 150
Query: 137 KSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
K + LS Q++VDC+ GN GC GG + ++VQ G+ EE YPY+G++S C
Sbjct: 151 KKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQ-DEGIQTEESYPYEGRRSSC 209
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA-C 254
K K V + ++ + P DE + T+A GP+AV+I AS +F Y GI D+ C
Sbjct: 210 K-KSGEYVTKVKTY--VFPLDEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDERCRC 264
Query: 255 TS--DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
++ + +NH +L+VGY + WI+KN W WG+ GY LK+ CGI Y Y +
Sbjct: 265 SNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWGEKGYFRLKKDVKACGIGYYNTYPI 324
Query: 309 I 309
+
Sbjct: 325 L 325
>gi|440301075|gb|ELP93522.1| cysteine proteinase 2 precursor, putative [Entamoeba invadens IP1]
Length = 323
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 88/216 (40%), Positives = 131/216 (60%), Gaps = 11/216 (5%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+ + +DWRE+G +T +Q++CG+CY+F S I+ I ++ E E+LS QQ+VDC+ +
Sbjct: 105 VRNSVDWREEGKVTAIKDQKNCGSCYSFGAVSVIESLILITSGEYEDLSEQQIVDCTSNT 164
Query: 158 --GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
N GC G++ NT NY++ G+ E Y Y +S C ++ I S++ + P
Sbjct: 165 KYSNFGCTCGNVGNTFNYIKDV-GITTENKYRYIASESNCTVNEIDVKYKIQSYTSIFPP 223
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS--- 272
E ALK+ + P+AVSI+AS +FQLY SGIYD+ C YV+H + +VGY +
Sbjct: 224 TEDALKIAVNK-QPVAVSIDASNPSFQLYKSGIYDEPKC--KYVDHVVSVVGYGSQNGND 280
Query: 273 -WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
WI+KN W WGD GY+Y+ R NN+CGIA A+Y
Sbjct: 281 YWIVKNSWGTEWGDKGYIYMSRNKNNQCGIATIALY 316
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 154/309 (49%), Gaps = 25/309 (8%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + K+ K Y K + ++SN K H + HG T + + R
Sbjct: 48 FTSFKSKFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQ 107
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
++ RL + P +N +P+ DWREKG +TP +Q CG+C+AFS
Sbjct: 108 FLGLKKRLRLPAHAQKAPILPTTN----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTG 163
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMK 182
A++G + +T ++ LS QQ+VDC + S + GC GG + N Y+ +GG+++
Sbjct: 164 ALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQ 223
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E+DY Y G+ CKF + +V +S++SV+ DE + L GP+AV INA+ Q
Sbjct: 224 EKDYAYTGRDGSCKFDKSKVVASVSNFSVV-SLDEEQIAANLVKNGPLAVGINAA--WMQ 280
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMYL 291
Y SG+ C ++H +LLVG+ + + WI+KN W +WG+ GY +
Sbjct: 281 TYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKI 340
Query: 292 KRGNNRCGI 300
RG N CG+
Sbjct: 341 CRGRNVCGV 349
>gi|410990014|ref|XP_004001245.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 334
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 95/294 (32%), Positives = 153/294 (52%), Gaps = 20/294 (6%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
+K W+ N K I HN+E QG H +T+ N D+ + M L + + +
Sbjct: 47 RKAVWEKNMKMIEQHNREQSQGKHSFTMAMNGFGDMTNEEFRHMMNGLKIEKDEKGKMFK 106
Query: 90 PESNESVLIPDHLDWREKGFITPDWN-----QEDCGACYAFSIASAIQGQIFKSTSEIEE 144
+ +P +D R+ ++P Q C + +AFS A++GQ+F E
Sbjct: 107 IPFFADIYVP--VDRRQTPAVSPTETFIICFQGRCASGWAFSATGALEGQMF---PENWP 161
Query: 145 LSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVV 204
+Q ++DCS GN GC GG + N YV+ GGL EE YPY + CK++ +
Sbjct: 162 NVVQNLLDCSWPQGNEGCNGGLMSNAFQYVKNNGGLDTEESYPYVARDGPCKYRPEHSAA 221
Query: 205 DISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAML 264
+++++ +P Q+E AL + +A +GPI+ +I+AS TF+ Y GIY D C+S+ +NH +L
Sbjct: 222 NVTAFENIPQQEE-ALMMAVANMGPISAAIDASLDTFRFYKEGIYYDPKCSSEDLNHGVL 280
Query: 265 LVGY--------TRNSWILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
+VGY + W +KN W WG +GY+ + K +N CGIA A + ++
Sbjct: 281 VVGYGFQGKESDNQKYWFVKNSWGADWGMDGYIKMAKDRDNHCGIATMASFPIV 334
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 98/309 (31%), Positives = 154/309 (49%), Gaps = 25/309 (8%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + K+ K Y K + ++SN K H + HG T + + R
Sbjct: 48 FTSFKSKFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQ 107
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
++ RL + P +N +P+ DWREKG +TP +Q CG+C+AFS
Sbjct: 108 FLGLKKRLRLPAHAQKAPILPTTN----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTG 163
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMK 182
A++G + +T ++ LS QQ+VDC + S + GC GG + N Y+ +GG+++
Sbjct: 164 ALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQ 223
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E+DY Y G+ CKF + +V +S++SV+ DE + L GP+AV INA+ Q
Sbjct: 224 EKDYAYTGRDGSCKFDKSKVVASVSNFSVV-SLDEEQIAANLVKNGPLAVGINAA--WMQ 280
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMYL 291
Y SG+ C ++H +LLVG+ + + WI+KN W +WG+ GY +
Sbjct: 281 TYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKI 340
Query: 292 KRGNNRCGI 300
RG N CG+
Sbjct: 341 CRGRNVCGV 349
>gi|282158089|ref|NP_001164088.1| cathepsin L precursor [Tribolium castaneum]
Length = 552
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 100/304 (32%), Positives = 162/304 (53%), Gaps = 14/304 (4%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
+K+ K+Y+ K +K ++ N + IH+ N++ + G+TL NHL+D P +K +
Sbjct: 254 RKHGKNYQNKTETLMRKDIFRQNVRFIHSMNRQNR----GFTLTVNHLADKTPTE-LKAL 308
Query: 75 TRLTHSRIRRTLVRSPESN-ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
T+S P N +PD DWR G +TP +Q CG+C++F ++G
Sbjct: 309 RGRTYSGGYNGGAPFPYENINKEDLPDQWDWRLLGAVTPVKDQSVCGSCWSFGTVGTVEG 368
Query: 134 QIF-KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYKGK 191
+F + + LS Q +VDCS GN GC GG ++ GG+ EE Y PY G+
Sbjct: 369 ALFLHNGGRLFRLSQQALVDCSWGYGNNGCDGGEDFRAYQWMLKHGGIPTEEAYGPYLGQ 428
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C + V I+ + + DE+AL++ L GPI+V+I+AS TF Y++G+Y +
Sbjct: 429 DGYCHADKVQKVAKITGYVNVTTNDENALRLALFKHGPISVAIDASQRTFSFYSNGVYYE 488
Query: 252 EACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
C + D ++HA+L VGY N W++KN WS++WG++GY+ + +N CG+
Sbjct: 489 PKCGNKIDELDHAVLAVGYGTINGENYWLVKNSWSNYWGNDGYILMSSKDNNCGVMTTPT 548
Query: 306 YALI 309
Y +
Sbjct: 549 YVTM 552
>gi|6448469|dbj|BAA86911.1| homologue of Sarcophaga 26,29kDa proteinase [Periplaneta americana]
Length = 552
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 161/306 (52%), Gaps = 15/306 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K++ KDY +K+K ++ N + IH+ N+ G+TL NHL+D +K
Sbjct: 252 RKRHSKDYASNLEHTKRKEIFRQNLRFIHSKNRARL----GFTLDVNHLAD-RTELELKA 306
Query: 74 MTRLTHSRIRRTLVRSPESNESVL---IPDHLDWREKGFITPDWNQEDCGACYAFSIASA 130
+ ++ P +N + IPD LDWR G +TP +Q CG+C++F
Sbjct: 307 LRGKQYTDGYNGGSPFPYTNLDAIMDQIPDDLDWRIYGAVTPVKDQSVCGSCWSFGTTGT 366
Query: 131 IQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYK 189
I+G F + LS Q ++DCS GN GC GG + ++ GG+ E++Y Y
Sbjct: 367 IEGAYFLKYGHLVRLSQQALIDCSWGYGNNGCDGGEDFRSYEWMMKHGGIPLEDEYGGYL 426
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
G+ C + + I+ + + D ALKV LA GPI+V+I+AS TF Y++GIY
Sbjct: 427 GQDGYCHVENVTLTAKITGYVNVTSGDIDALKVALAKHGPISVAIDASHKTFSFYSNGIY 486
Query: 250 DDEACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANY 303
D C + D ++HA+LLVGY W++KN WS++WG++GY+ + +N CG+A
Sbjct: 487 YDPECGNKLDQLDHAVLLVGYGIINGNPYWLVKNSWSNYWGNDGYILMSPKDNNCGVATD 546
Query: 304 AVYALI 309
Y +
Sbjct: 547 PTYVTM 552
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 100/300 (33%), Positives = 150/300 (50%), Gaps = 13/300 (4%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMT 75
+Y K Y ++ + N K I +HN+ +GL Y L N +DL + ++
Sbjct: 67 RYGKRYESVEEIKQRFEVFLDNLKMIRSHNK---KGLS-YKLGVNEFTDLTWDEFRRD-- 120
Query: 76 RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
RL ++ + +V++P+ DWRE G ++P NQ CG+C+ FS A++
Sbjct: 121 RLGAAQNCSATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAY 180
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
++ + LS QQ+VDC+ N GC GG Y++ GGL EE YPY GK +C
Sbjct: 181 SQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLC 240
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC- 254
KF N+ V + + E LK +A V P++++ F+ Y SG+Y C
Sbjct: 241 KFSSENVGVKVIDSVNITLGAEDELKYAVALVRPVSIAFEV-IKGFKQYKSGVYTSTECG 299
Query: 255 -TSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
T VNHA+L VGY + W++KN W WGDNGY ++ G N CGIA A Y ++
Sbjct: 300 NTPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPVV 359
>gi|255211|gb|AAB23202.1| cathepsin S [cattle, spleen, Peptide Partial, 217 aa]
gi|227966|prf||1714236A cathepsin S
Length = 217
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 89/218 (40%), Positives = 125/218 (57%), Gaps = 7/218 (3%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+PD +DWREKG +T Q CG+C+AFS A++ Q+ T ++ LS Q +VDCS
Sbjct: 1 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAK 60
Query: 158 -GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQD 216
GN GC GG + Y+ G+ E YPYK C++ N S + LP
Sbjct: 61 YGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPFGS 120
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNS 272
E ALK +A GP++V I+AS +F LY +G+Y D +CT + VNH +L+VGY ++
Sbjct: 121 EEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQN-VNHGVLVVGYGNLDGKDY 179
Query: 273 WILKNWWSHHWGDNGYMYLKRGN-NRCGIANYAVYALI 309
W++KN W H+GD GY+ + R + N CGIANY Y I
Sbjct: 180 WLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYPEI 217
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 160/300 (53%), Gaps = 15/300 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + ++ +Q N I HN++ ++G + + +D+ ++ +
Sbjct: 32 KTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLDLLKLQG 91
Query: 79 HSRIRRTLVRSPESNESVLIP-DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
+ V ++ + D +DWRE+G +TP +Q +CG+C+AFS AI+GQ FK
Sbjct: 92 VPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFK 151
Query: 138 STSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ LS Q++VDC+ GN GC GG + ++VQ G+ EE YPY+G++S CK
Sbjct: 152 KNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQ-DEGIQTEESYPYEGRRSSCK 210
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA-CT 255
K + V + ++ + P DE + T+A GP+AV+I AS +F Y GI D++ C+
Sbjct: 211 -KSGDYVTKVKTY--VFPLDEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDEKCRCS 265
Query: 256 S--DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ + +NH +L+VGY + WI+KN W WG+ GY LK+ CGI Y Y ++
Sbjct: 266 NKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWGEKGYFRLKKDVKACGIGYYNTYPIL 325
>gi|2706547|emb|CAA75862.1| putative cathepsin L [Xenopus laevis]
Length = 231
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 88/221 (39%), Positives = 130/221 (58%), Gaps = 10/221 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWR+KG++TP +Q CG+C+A S A++GQ ++ TS++ LS Q +VDCS G
Sbjct: 11 PKSVDWRKKGYVTPVKDQGQCGSCWAPSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQG 70
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI-CKFKRPNIVVDISSWSVLPPQDE 217
N GC GG + YV+ GG+ E+ YPY K C + N + + + + E
Sbjct: 71 NEGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGCE 130
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----- 272
L +A+VGP++V+I+A +FQ Y SGIY + C+S+ ++H +L+VGY S
Sbjct: 131 KDLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPECSSEDLDHGVLVVGYGFESEDVDG 190
Query: 273 ---WILKNWWSHHWGDNGYMYL-KRGNNRCGIANYAVYALI 309
WI+KN WS WGDNGY+ + K +N CGIA A Y L+
Sbjct: 191 KKYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASYPLV 231
>gi|37911662|gb|AAR05023.1| cathepsin L-like protein [Tenebrio molitor]
Length = 336
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 154/308 (50%), Gaps = 24/308 (7%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ Y + Y ++ +K +Q + HN++ +QGL YTL N +D+ P +E
Sbjct: 31 KTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP----EE 86
Query: 74 MTRLTHSRIRRT-------LVRSPES---NESVLIPDHLDWREKGFITPDWNQEDCGACY 123
M TH I +++ E N SV P DWR++G ++P NQ CG+C+
Sbjct: 87 MKAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVSPVKNQGSCGSCW 146
Query: 124 AFSIASAIQGQ--IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLM 181
AFS AI+ Q I +S QQ+VDC + LGC+GG + + YV GG+
Sbjct: 147 AFSSTGAIESQMKIANGAGYDSSVSEQQLVDC--VPNALGCSGGWMNDAFTYVAQNGGID 204
Query: 182 KEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTF 241
E YPY+ C + + +S + L DE+ L +AT GP+AV+ +A F
Sbjct: 205 SEGAYPYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDAD-DPF 263
Query: 242 QLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNN 296
Y+ G+Y + C ++ HA+L+VGY ++ W++KN W WG +GY + R NN
Sbjct: 264 GSYSGGVYYNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGLDGYFKIARNANN 323
Query: 297 RCGIANYA 304
CGIA A
Sbjct: 324 HCGIAGVA 331
>gi|395502422|ref|XP_003755580.1| PREDICTED: pro-cathepsin H [Sarcophilus harrisii]
Length = 334
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 151/294 (51%), Gaps = 20/294 (6%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQ------GLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIR 83
KK H H ++HT + ++ G H +T+R N SD+ E + R+
Sbjct: 43 KKYHLSEYHHRLHTFLENKRRIDKHNAGNHSFTMRLNQFSDMS----FDEFKKTYLMRLP 98
Query: 84 RTLVRSPESNESVL--IPDHLDWREKG-FITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
+ + S+ L P+ +DWR+KG F++P NQ CG+C+ FS ++ + +T
Sbjct: 99 QNCSATKGSHVRRLGPYPESVDWRKKGNFVSPVKNQGGCGSCWTFSTTGGLESAVAIATG 158
Query: 141 EIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRP 200
++ L+ QQ+VDC+ N GC GG Y+ + G+M E+ YPY+GK CKF+
Sbjct: 159 KLLSLAEQQLVDCAQDFNNHGCNGGLPSQAFEYIMYNKGIMGEDTYPYEGKDGTCKFQPN 218
Query: 201 NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS--DY 258
+ + + + DE A+ +A P++ + + F Y GIY + C+ D
Sbjct: 219 KAIAFVKDVANITAYDEEAMTEAVAHHNPVSFAFEVT-DDFLSYHKGIYSNPKCSKSPDK 277
Query: 259 VNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
VNHA+L VGY + + WI+KN W WG+NGY ++RG N CG+A+ A Y +
Sbjct: 278 VNHAVLAVGYGKENGIPYWIVKNSWGTSWGNNGYFLIERGKNMCGLADCASYPI 331
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 100/293 (34%), Positives = 157/293 (53%), Gaps = 19/293 (6%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM--TRLTHSRIRR 84
D + ++ W N + + HN+ A + HG+ L N +DL + R+ SR R
Sbjct: 70 DRRFRVFWD-NLRFVDAHNERAAE--HGFRLGMNQFADLTNDEFRAAYLGARIPASRRRG 126
Query: 85 TLV--RSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
T V R + +P+ +DWREKG + P NQ CG+C+AFS S+++ T E+
Sbjct: 127 TAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEM 186
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS Q++V+CS GN GC GG + +++ GG+ E DYPYK C R N
Sbjct: 187 VTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENA 246
Query: 203 -VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNH 261
VV I + +P DE +L+ +A P++V+I A FQLY +G++ CT++ ++H
Sbjct: 247 KVVSIDGFEDVPENDEKSLQKAVAHQ-PVSVAIEAGGREFQLYKAGVFTG-TCTTN-LDH 303
Query: 262 AMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
++ VGY ++ WI++N W WG++GY+ ++R N +CGIA A Y
Sbjct: 304 GVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCGIAMMASY 356
>gi|62955235|ref|NP_001017633.1| uncharacterized protein LOC550326 precursor [Danio rerio]
gi|62202194|gb|AAH92817.1| Zgc:110239 [Danio rerio]
Length = 546
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 165/304 (54%), Gaps = 13/304 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-IK 72
++K+ + Y + +++ ++ N + +H+ N+ ++L NHL+D + +
Sbjct: 247 KEKFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLS----FSLSVNHLADRSQKELSMM 302
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ TH ++ R P S+ P+ +DWR G +TP +Q CG+C++F+ ++
Sbjct: 303 RGCQRTH-KVHRKAQPFPSEIRSIATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLE 361
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYKGK 191
G +F T ++ LS Q +VDC+ GN GC GG ++ GG+ E Y Y G
Sbjct: 362 GALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGM 421
Query: 192 QSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
+C + + ++V ++ ++ + D ALK + GP+AVSI+A+ +F Y++G+Y +
Sbjct: 422 NGLCHYDKSSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYE 481
Query: 252 EACTS--DYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAV 305
C + + ++HA+L VGY + W++KN WS +WG++GY+ + +N CG+A A+
Sbjct: 482 PECKNGINDLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDGYILMSMKDNNCGVATDAI 541
Query: 306 YALI 309
YA +
Sbjct: 542 YATL 545
>gi|195123633|ref|XP_002006308.1| GI18640 [Drosophila mojavensis]
gi|193911376|gb|EDW10243.1| GI18640 [Drosophila mojavensis]
Length = 243
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 143/248 (57%), Gaps = 17/248 (6%)
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
M L + I R P + +P+ DWREKG +TP +Q CG+C+AF+ SA++G
Sbjct: 1 MLELISTHISFLPARKPLES----LPESFDWREKGGVTPPGSQGPCGSCWAFATISALEG 56
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+F+ T + LS Q +VDC+ G GC GG Y++ G + + YPY+ ++
Sbjct: 57 HLFRRTGVLVPLSQQNLVDCADAYGTEGCDGGFQEYGFEYIRDHGVTLANK-YPYEQSEN 115
Query: 194 -ICKFKRP------NIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
+C+ + + +V + ++ +PP D+ +K +AT+GP+A S+N SP +FQ Y S
Sbjct: 116 MVCRQNQTAGAPPRDSIVKVRDYASIPPGDQDLMKQIIATLGPLACSMNGSPVSFQQYKS 175
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKR-GNNRCGIA 301
GIYDD C ++ VNH++ +VGY R+ WI+KN +S +WG+ G+ L R +N CGIA
Sbjct: 176 GIYDDPECNNEEVNHSVTVVGYGSENGRDYWIVKNSYSQNWGEAGFFRLARTPSNFCGIA 235
Query: 302 NYAVYALI 309
+ Y ++
Sbjct: 236 SECSYPIL 243
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 102/293 (34%), Positives = 157/293 (53%), Gaps = 19/293 (6%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM--TRLTHSRIRR 84
D + ++ W N + + HN+ A + HG+ L N +DL + R+ SR R
Sbjct: 127 DRRFRVFWD-NLRFVDAHNERAAE--HGFRLGMNQFADLTNDEFRAAYLGARIPASRRRG 183
Query: 85 TLV--RSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
T V R + +P+ +DWREKG + P NQ CG+C+AFS S+++ T E+
Sbjct: 184 TAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEM 243
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS Q++V+CS GN GC GG + +++ GG+ E DYPYK C R N
Sbjct: 244 VTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENA 303
Query: 203 -VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNH 261
VV I + +P DE +L+ +A P++V+I A FQLY +G++ CT++ ++H
Sbjct: 304 KVVSIDGFEDVPENDEKSLQKAVAHQ-PVSVAIEAGGREFQLYKAGVFTG-TCTTN-LDH 360
Query: 262 AMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
++ VGY T N WI++N W WG++GY+ ++R N +CGIA A Y
Sbjct: 361 GVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCGIAMMASY 413
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 159/301 (52%), Gaps = 17/301 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + ++ +Q N I HN++ ++G + + +D+ ++ ++ +L
Sbjct: 32 KTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFL-DLLKLQ 90
Query: 79 HSRIRRTLVRSPESNESVLIP--DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIF 136
+ ++ E + + D +DWRE+G +TP +Q +CG+C+AFS AI+GQ F
Sbjct: 91 GVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFF 150
Query: 137 KSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
K + LS Q++VDC+ GN GC GG + ++VQ G+ EE YPY+G++S C
Sbjct: 151 KKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQ-DEGIQTEESYPYEGRRSSC 209
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
K K V + ++ + P DE + T+A GP+AV+I AS +F Y GI D+
Sbjct: 210 K-KSGEYVTKVKTY--VFPLDEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDERCRC 264
Query: 256 SDY---VNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
S+ +NH +L+VGY + WI+KN W WG+ GY LK+ CGI Y Y +
Sbjct: 265 SNKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWGEKGYFRLKKDVKACGIGYYNTYPI 324
Query: 309 I 309
Sbjct: 325 F 325
>gi|350425511|ref|XP_003494144.1| PREDICTED: counting factor associated protein D-like [Bombus
impatiens]
Length = 549
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 158/305 (51%), Gaps = 14/305 (4%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY--I 71
+K + K+Y +K ++ N + IH+ N+ + GY L NHL D +
Sbjct: 250 KKTHNKEYVNHVDQLMRKEVFRQNLRFIHSTNRANK----GYQLSVNHLVDRTELELKAL 305
Query: 72 KEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ H + + E E +PD LDWR G +TP +Q CG+C++F A+
Sbjct: 306 RGKQYTAHYNGGQPFPHNAEK-EVTEVPDSLDWRLYGAVTPVKDQSVCGSCWSFGTTGAV 364
Query: 132 QGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYKG 190
+G + ++ LS Q ++DCS GN GC GG + ++ GGL E+DY Y G
Sbjct: 365 EGAYYMKYGKLVRLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGGLPTEDDYGGYLG 424
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ C + I+ + + D +ALKV +A GPI+V+I+AS TF Y+ G+Y
Sbjct: 425 QDGYCHINNATVTAKITGYVNVTSGDANALKVAIAKHGPISVAIDASHKTFSFYSHGVYY 484
Query: 251 DEAC--TSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYA 304
DE+C T + ++HA+L VGY ++ W++KN WS++WG++GY+ + + N CG+
Sbjct: 485 DESCGNTEESLDHAVLAVGYGSLNGKDYWLVKNSWSNYWGNDGYILMSQEKNNCGVLTAP 544
Query: 305 VYALI 309
Y +
Sbjct: 545 TYVTM 549
>gi|195995653|ref|XP_002107695.1| hypothetical protein TRIADDRAFT_51454 [Trichoplax adhaerens]
gi|190588471|gb|EDV28493.1| hypothetical protein TRIADDRAFT_51454 [Trichoplax adhaerens]
Length = 549
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 102/308 (33%), Positives = 164/308 (53%), Gaps = 25/308 (8%)
Query: 21 YRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSD--------LHPRHYIK 72
YR +K++ +++ + + I + N++ + Y L+ NH +D RH+
Sbjct: 244 YRDHHQHAKRQNYFRQHLRFIQSTNRKGLK----YQLKLNHFADRSDKELWNFTQRHFKL 299
Query: 73 EMTRLTHSR----IRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+L H R L + + N+S I DHLDWR++G I+ +Q C +C+A S
Sbjct: 300 NYGKLEHDVKNFITRYKLPFALDLNQS--IADHLDWRDRGAISKIRDQGMCSSCWALSTT 357
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-P 187
AI+ ++ T E ELS Q +VDCS GN GC+GG + +TL ++ G+ + Y
Sbjct: 358 QAIESALYIQTGEKVELSSQSLVDCSWPFGNQGCSGGFISHTLAWIIKQRGIPTKSSYGQ 417
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
Y G+ C + +S +P + ALKV LA VGP+ VSIN SP +F Y+SG
Sbjct: 418 YLGRNGYCHHHDLTTGAKLKGYSQIPQGNMKALKVALAKVGPVTVSINTSPKSFLFYSSG 477
Query: 248 IYDDEACTSDY--VNHAMLLVGYTR----NSWILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
++ ++ C DY ++H +L +GY R + ++KN WS +WGD+GY+ + NN CG+A
Sbjct: 478 VFYNDQCLGDYAHLDHNVLAIGYGRQHDQDYILIKNSWSAYWGDHGYIKISTRNNNCGLA 537
Query: 302 NYAVYALI 309
A Y ++
Sbjct: 538 TNAYYPIL 545
>gi|340369133|ref|XP_003383103.1| PREDICTED: silicatein-like [Amphimedon queenslandica]
Length = 340
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 110/312 (35%), Positives = 162/312 (51%), Gaps = 18/312 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ + K Y + + + W SN K I HN+ ++ GYTL+ N DL + +
Sbjct: 31 KATHGKSYESDHEELGRHIVWLSNKKYIDEHNKYSET--FGYTLKMNKFGDLTNAEFSEL 88
Query: 74 MTRLT----HSRIRRTL-----VRSPESNESVL--IPDHLDWREKGFITPDWNQEDCGAC 122
MT + H+ + L ++ E S + +PD +DWR G +T +Q CG
Sbjct: 89 MTCVQDYKRHNATDKLLSMKKGIKISEYKASGVSSLPDTVDWRTGGAVTWVKDQLRCGCS 148
Query: 123 YAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMK 182
YAF+ A+A++G + + LS Q +VDCSI GN GC+ G + N YV GGL
Sbjct: 149 YAFAAAAALEGAAALARGSLVSLSAQNIVDCSIPFGNHGCSCGDVNNAFMYVIDNGGLDT 208
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E YPY KQ CKFK I + + DE +L LAT GP+AV I+AS +FQ
Sbjct: 209 ENVYPYVSKQDGCKFKSNGIGATATGIIRIASGDETSLASALATAGPVAVYIDASHSSFQ 268
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNR 297
Y G+ + C+ ++HAM+L+GY R W+LKN W +WG GY+ + +G +N+
Sbjct: 269 FYHYGVLNVPNCSRTNLSHAMILIGYGKYGGREYWLLKNSWGPNWGTAGYIMMSKGKSNQ 328
Query: 298 CGIANYAVYALI 309
CGIA YA + +
Sbjct: 329 CGIATYASFPTL 340
>gi|326932936|ref|XP_003212567.1| PREDICTED: counting factor associated protein D-like [Meleagris
gallopavo]
Length = 573
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 159/308 (51%), Gaps = 11/308 (3%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
+F ++++ + Y ++ + N + +H+ N+ A Y+L NHL+D P+
Sbjct: 269 VFHHYRRRFGRHYGSARELEHRQRIFVHNMRFVHSKNRAALS----YSLALNHLADRTPQ 324
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
R L E +++P+ LDWR G +TP +Q CG+C++F+
Sbjct: 325 EMAAMRGRRRSGDPNHGLPFPAEHYAGIILPESLDWRLYGAVTPVKDQAVCGSCWSFATT 384
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-P 187
A++G +F T + LS Q ++DCS GN C GG +++ GG+ E Y
Sbjct: 385 GAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGT 444
Query: 188 YKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASG 247
YKG+ +C + + ++ I+ + + + A+K + GP+AVSI+AS TF Y++G
Sbjct: 445 YKGQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNG 504
Query: 248 IYDDEACT--SDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
IY + C S ++HA+L VGY W++KN WS +WG++GY+ + +N CG+A
Sbjct: 505 IYYEPKCANKSGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDGYILMAMKDNNCGVA 564
Query: 302 NYAVYALI 309
A Y ++
Sbjct: 565 TEATYPIL 572
>gi|332384364|gb|AEE69034.1| cysteine protease [Taenia pisiformis]
Length = 338
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/276 (33%), Positives = 147/276 (53%), Gaps = 8/276 (2%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-MTRLTHSRIRRTLVRSPESNESVL-I 98
I N+ + GL Y+ N +DL + + + +R+ R ++ +S +
Sbjct: 64 IKGQNRRFEAGLESYSTGLNQFADLELSEFTERFLGTRPENRVAGKCGRVWKALKSFADL 123
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +DWR+K +T NQ +CG+C+AFS A++ + K T ++ LS QQ+VDCS+ +G
Sbjct: 124 PDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEAALAKKTGKLISLSEQQLVDCSLKNG 183
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N Y++ + E YPY+ C++ V ++ +P +E
Sbjct: 184 NDGCNGGYMSNAFKYLE-DHSIEPESAYPYRATDGPCRYNESLGVGTVTDIGEIPEGNET 242
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WI 274
AL +ATVGPI+++I+AS F Y GIY C+S ++NH +L VGY + W+
Sbjct: 243 ALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKFLNHGVLAVGYGKLDGKPYWL 302
Query: 275 LKNWWSHHWGDNGY-MYLKRGNNRCGIANYAVYALI 309
+KN W WG GY M K +N CGIA+ A + +
Sbjct: 303 VKNSWGSGWGMKGYIMMAKDYHNMCGIASLADFPYV 338
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 157/312 (50%), Gaps = 32/312 (10%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F ++K+KK Y + + ++SN ++ H + HG T SDL
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDPTASHGVT----QFSDLTSAE 108
Query: 70 YIKEMTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
+ K++ L R+ + +P +N+ +P+ DWREKG + P NQ CG+C++FS
Sbjct: 109 FRKQVLGLRKLRLPKDANTAPILPTND---LPEDFDWREKGAVGPVKNQGSCGSCWSFST 165
Query: 128 ASAIQGQIFKSTSEIEELSIQQVVDC-------SIISGNLGCAGGSLRNTLNYVQFAGGL 180
A++G F +T E+ LS QQ+VDC S + GC GG + + Y AGGL
Sbjct: 166 TGALEGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGL 225
Query: 181 MKEEDYPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPH 239
M+EEDYPY G + CKF + + ++++S + DE + L GP+AV+INA
Sbjct: 226 MREEDYPYTGMDRGACKFDKNKVAAGVANFSAV-SLDEDQIAANLVKNGPLAVAINAV-- 282
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGY 288
Q Y G+ C S ++H +LLVGY + WI+KN W WG+NG+
Sbjct: 283 FMQTYIGGVSCPYIC-SRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGF 341
Query: 289 MYLKRGNNRCGI 300
+ RG N CG+
Sbjct: 342 YKICRGRNICGV 353
>gi|167427525|gb|ABZ80399.1| cathepsin L3, partial [Fasciola hepatica]
Length = 306
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 154/305 (50%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y + ++K+ W+ N K I HN GL YTL N +DL
Sbjct: 5 KRIYNKEYNGADNEHRRKI-WEQNVKHIQEHNLRHDLGLVTYTLGLNQFTDLTFEEFKAK 63
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ L + + E N+ +P +DWR+ G++T +Q CG+C+AFS
Sbjct: 64 YLIEMS-LESESLSDGISYEAEGND---VPASIDWRQYGYVTEVKDQGQCGSCWAFSAVG 119
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VD + GN GC GG + N Y++ + GL YPY+
Sbjct: 120 AIEGQYVKKFQNQTLFSEQQLVDRTRRFGNHGCGGGWMENAYKYLKNS-GLETASYYPYQ 178
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C+++R V ++ + DE L + GP AV+++A F +Y SGI+
Sbjct: 179 AWEYPCQYRRELGVAKVTGAYTVHSGDEMRLMQMVGREGPAAVAVDAQS-DFYMYKSGIF 237
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ CTS V H +L VGY S WILKN W WG++GYM R NN C IA+ A
Sbjct: 238 QSQTCTSQRVTHPVLAVGYGTESGTDYWILKNRWGKWWGEDGYMRFARNRNNMCAIASVA 297
Query: 305 VYALI 309
++
Sbjct: 298 SVPMV 302
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 101/314 (32%), Positives = 154/314 (49%), Gaps = 35/314 (11%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F ++++ K Y + +++N ++ H Q +HG T SDL P
Sbjct: 49 FAVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQQLDPAAVHGVT----QFSDLTPTE 104
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVL----IPDHLDWREKGFITPDWNQEDCGACYAF 125
+ ++ L RR + +L +P DWR++G +TP NQ CG+C++F
Sbjct: 105 FRRKFLGLN----RRLKFPADAKTAPILPTDELPSDFDWRDRGAVTPVKNQGTCGSCWSF 160
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCS-------IISGNLGCAGGSLRNTLNYVQFAG 178
S A++G F +T ++ LS QQ+VDC S + GC GG + + Y AG
Sbjct: 161 STTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKAG 220
Query: 179 GLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
GLM+EEDYPY G +C+F + I ++++SV+ DE + L GP+AV+INA
Sbjct: 221 GLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVV-SLDEDQIAANLVKNGPLAVAINAV 279
Query: 238 PHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----------TRNSWILKNWWSHHWGDN 286
Q Y G+ C S ++H +LLVGY + WI+KN W WG+N
Sbjct: 280 --FMQTYIGGVSCPYIC-SKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGESWGEN 336
Query: 287 GYMYLKRGNNRCGI 300
GY + RG N CG+
Sbjct: 337 GYYKICRGRNVCGV 350
>gi|154415085|ref|XP_001580568.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121914787|gb|EAY19582.1| Clan CA, family C1, cathepsin L-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 305
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 147/278 (52%), Gaps = 19/278 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
W SN + + HN+ G+TL N L+ L P Y K M ++ +S
Sbjct: 34 WLSNKRFVQNHNRANL----GFTLALNKLAHLSPAEY-KAMLGFRNNGTHNIATKS---- 84
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
+ + D +DWR KG + P +Q CG+C+AFS + Q + ++++LS Q +VDC
Sbjct: 85 -NTIANDGIDWRTKGVVNPIQDQAQCGSCWAFSAIQVQESQYAITYGQLQKLSEQNLVDC 143
Query: 154 SIISGNLGCAGGSLRNTLNY-VQFAGG-LMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
++ GC GG + +Y +Q+ GG M E+DYPY CKF + I S+
Sbjct: 144 --VTSCDGCGGGLMSAAYDYAIQYQGGKFMLEKDYPYTALDGTCKFNKAKATSKIVSYIN 201
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--- 268
+ DE L ++ GP +V+I+AS +FQ Y+ GIYD+ C+S ++H + VGY
Sbjct: 202 VVEGDEKDLAAKVSAYGPSSVAIDASQISFQFYSQGIYDEPYCSSYSLDHGVGCVGYGTE 261
Query: 269 -TRNSWILKNWWSHHWGDNGYM-YLKRGNNRCGIANYA 304
T+N WI++N W WGD GY+ +K NN+CGIA A
Sbjct: 262 GTKNYWIVRNSWGLGWGDQGYIRMIKDKNNQCGIATMA 299
>gi|211910909|gb|ACJ13083.1| cathepsin L [Globodera pallida]
Length = 293
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/260 (34%), Positives = 145/260 (55%), Gaps = 3/260 (1%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
+ QK +K Y + ++++ L + S + I HNQ +G + + ENH++DL Y
Sbjct: 34 YKQKHGRKSYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEYK 93
Query: 72 K--EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
K RL +RR +P+ +DWR+KG++T NQ CG+C+AFS
Sbjct: 94 KLNGYRRLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAFSSTG 153
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++ Q + T ++ LS Q ++DCS GN+GC GG + N Y++ G+ KE DYPYK
Sbjct: 154 ALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELDYPYK 213
Query: 190 GKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C FKR ++ + + + DE LK+ +AT GP +V+I+A +FQLY G+
Sbjct: 214 AKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGV 273
Query: 249 YDDEACTSDYVNHAMLLVGY 268
Y ++ C+ + ++H +L+VGY
Sbjct: 274 YFEKECSPENLDHGVLVVGY 293
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 102/308 (33%), Positives = 157/308 (50%), Gaps = 32/308 (10%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++K+KK Y + + ++SN ++ H + HG T SDL + K+
Sbjct: 63 KRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDPTASHGVT----QFSDLTSAEFRKQ 118
Query: 74 MTRLTHSRIRRTLVRSP--ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAI 131
+ L R+ + ++P +N+ +P+ DWREKG + P NQ CG+C++FS A+
Sbjct: 119 VLGLRKLRLPKDANKAPILPTND---LPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGAL 175
Query: 132 QGQIFKSTSEIEELSIQQVVDCS-------IISGNLGCAGGSLRNTLNYVQFAGGLMKEE 184
+G F +T E+ LS QQ+VDC S + GC GG + + Y AGGLM+EE
Sbjct: 176 EGAHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 235
Query: 185 DYPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQL 243
DYPY G + CKF + + ++++SV+ DE + L GP+AV+ NA Q
Sbjct: 236 DYPYTGMDRGACKFDKDKVAAGVANFSVV-SLDEDQIAANLVKNGPLAVATNAV--FMQT 292
Query: 244 YASGIYDDEACTSDYVNHAMLLVGY-----------TRNSWILKNWWSHHWGDNGYMYLK 292
Y G+ C S ++H +LLVGY + WI+KN W WG+NG+ +
Sbjct: 293 YIGGVSCPYIC-SRRLDHGVLLVGYGSAGYAPVRMKEKPYWIIKNSWGESWGENGFYKIC 351
Query: 293 RGNNRCGI 300
RG N CG+
Sbjct: 352 RGRNICGV 359
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 152/305 (49%), Gaps = 27/305 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K+ K Y K + ++SN ++ H + +HG T SDL P + ++
Sbjct: 60 KAKFAKTYATKEEHDHRFGVFKSNLRRARLHAKLDPSAVHGVT----KFSDLTPAEFRRQ 115
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
L R ++P L P DWR+KG +T +Q CG+C++FS A++G
Sbjct: 116 FLGLKPLRFPAHAQKAPILPTKDL-PKDFDWRDKGAVTNVKDQGACGSCWSFSTTGALEG 174
Query: 134 QIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
+ +T E+ LS QQ+VDC + + + GC GG + N Y+ +GG+ KE+DY
Sbjct: 175 AHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKEKDY 234
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PY G+ CKF + + +S++SV+ DE + L GP+AV+INA Q Y
Sbjct: 235 PYTGRDGTCKFDKTKVAATVSNYSVV-SLDEEQIAANLVKNGPLAVAINAV--FMQTYVG 291
Query: 247 GIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMYLKRGN 295
G+ C +++H +LLVGY + WI+KN W WG+NGY + RG
Sbjct: 292 GVSCPYIC-GKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGENGYYKICRGR 350
Query: 296 NRCGI 300
N CG+
Sbjct: 351 NVCGV 355
>gi|333827692|gb|AEG19548.1| cathepsin L-like cysteine protease [Taenia pisiformis]
Length = 338
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/276 (33%), Positives = 147/276 (53%), Gaps = 8/276 (2%)
Query: 41 IHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE-MTRLTHSRIRRTLVRSPESNESVL-I 98
I N+ + GL Y+ N +DL + + + +R+ R ++ +S +
Sbjct: 64 IKGQNRRFEAGLESYSTGLNQFADLELSEFTERFLGTRPENRVAGKCGRVWKALKSFADL 123
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +DWR+K +T NQ +CG+C+AFS A++ + K T ++ LS QQ+VDCS+ +G
Sbjct: 124 PDTVDWRDKNLVTEVKNQGNCGSCWAFSSTGALEAALAKKTGKLISLSEQQLVDCSLKNG 183
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N Y++ + E YPY+ C++ V ++ +P +E
Sbjct: 184 NDGCNGGYMSNAFKYLE-DHSIEPESAYPYRATDGPCRYNESLGVGTVTDIGEIPEGNET 242
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS----WI 274
AL +ATVGPI+++I+AS F Y GIY C+S ++NH +L VGY + W+
Sbjct: 243 ALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSSKFLNHGVLAVGYGKLDGKPYWL 302
Query: 275 LKNWWSHHWGDNGY-MYLKRGNNRCGIANYAVYALI 309
+KN W WG GY M K +N CGIA+ A + +
Sbjct: 303 VKNSWGSGWGMKGYIMMAKDYHNMCGIASLADFPYV 338
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 107/316 (33%), Positives = 153/316 (48%), Gaps = 40/316 (12%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F+ ++++ K Y + + ++SN + H +HG T SDL P
Sbjct: 45 FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVT----RFSDLTPME 100
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLI------PDHLDWREKGFITPDWNQEDCGACY 123
+ HS + V P +S I P DWRE G +TP NQ CG+C+
Sbjct: 101 F-------RHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTPVKNQGSCGSCW 153
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCS-------IISGNLGCAGGSLRNTLNYVQF 176
+FS A++G F ST E+ LS QQ+VDC S + GC GG + + Y+
Sbjct: 154 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSGCNGGLMNSAFEYILN 213
Query: 177 AGGLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSIN 235
GG+M+EEDYPY G CKF + I ++++SV+ +DE + L GP+AV+IN
Sbjct: 214 NGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVV-SRDEDQIAANLVKNGPLAVAIN 272
Query: 236 ASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWG 284
A Q Y G+ C S +NH +LLVGY S WI+KN W +WG
Sbjct: 273 AV--YMQTYVGGVSCPYVC-SKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENWG 329
Query: 285 DNGYMYLKRGNNRCGI 300
+NGY + RG N CG+
Sbjct: 330 ENGYYKICRGRNICGV 345
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 153/317 (48%), Gaps = 41/317 (12%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F+ ++++ K Y + + ++SN + H +HG T SDL P
Sbjct: 45 FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDPSAVHGVT----QFSDLTPME 100
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLI------PDHLDWREKGFITPDWNQEDCGACY 123
+ HS + V P +S I P DWRE G +TP NQ CG+C+
Sbjct: 101 F-------QHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTPVKNQGSCGSCW 153
Query: 124 AFSIASAIQGQIFKSTSEIEELSIQQVVDCS--------IISGNLGCAGGSLRNTLNYVQ 175
+FS A++G F ST E+ LS QQ+VDC S + GC GG + + Y+
Sbjct: 154 SFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDSGCNGGLMNSAFEYIL 213
Query: 176 FAGGLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSI 234
GG+M+EEDYPY G CKF + I ++++SV+ +DE + L GP+AV+I
Sbjct: 214 NNGGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVV-SRDEDQIAANLVKNGPLAVAI 272
Query: 235 NASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHW 283
NA Q Y G+ C S +NH +LLVGY S WI+KN W +W
Sbjct: 273 NAV--YMQTYVGGVSCPYVC-SKKLNHGVLLVGYGSESYAPIRMKQKPYWIIKNSWGENW 329
Query: 284 GDNGYMYLKRGNNRCGI 300
G+NGY + RG N CG+
Sbjct: 330 GENGYYKICRGRNICGV 346
>gi|195379496|ref|XP_002048514.1| GJ14012 [Drosophila virilis]
gi|194155672|gb|EDW70856.1| GJ14012 [Drosophila virilis]
Length = 327
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 105/302 (34%), Positives = 160/302 (52%), Gaps = 18/302 (5%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM- 74
+Y+K Y + + ++ N + I HN+ G Y + N +D+ + K M
Sbjct: 35 EYEKSYEDDGEEQLRMQIFKDNKQLIDRHNERYAAGEETYEMGVNQFTDMLATEFRKIML 94
Query: 75 TRLTHSRIRRTL--VRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
L S ++ + SP + E IP +DWREKG +TP NQ CG+C+AFS A A++
Sbjct: 95 VNLNISDFTSSIEYIYSPANAE---IPSQVDWREKGAVTPVKNQGRCGSCWAFSAAGALE 151
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ F T ++ LS Q ++DCS N GC GG L YV+ G+ + YPY+G
Sbjct: 152 GQHFIQTKQLIPLSEQNLLDCSSRYNNHGCGGGWPAAALMYVRDNRGMDNDRAYPYEGHV 211
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+F+R ++ ++ + +DE AL +AT GP++V+++A+ FQ Y G+Y
Sbjct: 212 GRCRFRRYSVSATVTQ-VMQVRRDEVALANAVATKGPVSVAVDAT--YFQHYRGGVYSHR 268
Query: 253 ACTSDYVNHAMLLVGYTRNS-----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
C NHAML+VGY + W++KN W WG+ GYM L R N C +A+YAV+
Sbjct: 269 -CRQQ-ANHAMLVVGYGSDQRGGDFWLIKNSWG-GWGEQGYMRLARNQGNLCHVASYAVF 325
Query: 307 AL 308
+
Sbjct: 326 PI 327
>gi|260821900|ref|XP_002606341.1| hypothetical protein BRAFLDRAFT_118507 [Branchiostoma floridae]
gi|229291682|gb|EEN62351.1| hypothetical protein BRAFLDRAFT_118507 [Branchiostoma floridae]
Length = 546
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 155/289 (53%), Gaps = 18/289 (6%)
Query: 34 WQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESN 93
+ +N + IH+ N+ GY+L NHL+D +K M T+S + E
Sbjct: 263 FTTNSRLIHSKNRANL----GYSLAINHLAD-RTDDELKMMRGRTYSPEPNNGLPFDEEE 317
Query: 94 ESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVD 152
+V IPD+++W +G +TP +Q CG+C++F A AI+G FK T + LS Q ++D
Sbjct: 318 LAVTDIPDNINWAIRGAVTPVKDQAVCGSCWSFGAAEAIEGAWFKKTGLLVPLSQQNLMD 377
Query: 153 CSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY-PYKGKQSICKFKRPNIVVDISSWSV 211
CS GN C GG +V GG+ Y PY G C F ++ ++ +
Sbjct: 378 CSWGYGNNACDGGEEWRAYEWVMKHGGIATAAHYGPYTGADGKCHFDPKHVGAVVTGYVN 437
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS--DYVNHAMLLVGY- 268
+ ++ ALK+ LA GP+AV I+A+ TF YA+G+Y D+ C + + ++HA+L VGY
Sbjct: 438 VTSENGTALKMALANFGPVAVGIDAAVKTFSFYANGVYYDKDCGNKPEDLDHAVLAVGYG 497
Query: 269 --------TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
++ W++KN WS HWGDNGY+ + + +N CG+A A Y +
Sbjct: 498 SMPDSKGVMQDYWLIKNSWSTHWGDNGYVLISQKDNNCGVATDATYVKV 546
>gi|391346471|ref|XP_003747496.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 333
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 159/306 (51%), Gaps = 16/306 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHY-IK 72
+ + K+Y A ++ + + S ++I HN ++G Y L N LSDL
Sbjct: 28 KASHGKEYASPAAEAARLEAFNSKLQQIEAHNARFERGEVSYFLGLNALSDLTDAEIRAT 87
Query: 73 EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
++ + IR L S P+ +DWR +T NQ CG+CY FS A AI+
Sbjct: 88 KLGAVVDEEIRAGLNASISEVTMPNAPESVDWRGTA-VTSVKNQASCGSCYTFSAAGAIE 146
Query: 133 GQIFKSTSEIEELSIQQVVDCS----IISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+ K++ + +LS QQ+VDC+ I + N GC GG T+ + G+ +E +YPY
Sbjct: 147 STLLKNSRKNYDLSEQQLVDCTLNRYIHNMNFGCGGGDPATTIQHA-LRHGISQEHEYPY 205
Query: 189 KGKQSI----CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
+ + C ++ ++ + DE+AL +AT GPIAV++N F Y
Sbjct: 206 RSGNTQTHGRCSSTSGSVSLNNLRLMQVKAGDENALANAVATHGPIAVTLNGENSDFYSY 265
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGI 300
+ GIY++ +C + +NHA+LLVGY ++ WI+KN W WG+NG+M L RG+NRCGI
Sbjct: 266 SGGIYNNRSCPTQ-INHAVLLVGYGSSNGQPYWIIKNSWGSTWGENGFMKLARGSNRCGI 324
Query: 301 ANYAVY 306
+ A Y
Sbjct: 325 VSAASY 330
>gi|50403821|gb|AAT76664.1| cathepsin L1 proteinase [Fasciola hepatica]
Length = 326
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 151/310 (48%), Gaps = 26/310 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL----HPRH 69
++ Y K+Y A D ++ W+ N K I HN GL YTL N +DL
Sbjct: 25 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQYTDLTFEEFKAK 83
Query: 70 YIKEMTR----LTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF 125
Y+ EM R L+H P + +PD +DWRE G++T +Q +CG+C+AF
Sbjct: 84 YLTEMPRASDILSHG--------IPYEANNRAVPDKIDWRESGYVTGVKDQGNCGSCWAF 135
Query: 126 SIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYV-QFAGGLMKEE 184
S +GQ K+ S QQ+VDCS GN GC GG + N Y+ QF GL E
Sbjct: 136 STTGTTEGQYMKNERTSISFSEQQLVDCSGPWGNNGCGGGLMENAYEYLKQF--GLETES 193
Query: 185 DYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
YPY + C+ + V ++ + + E LK + P AV+++ F +Y
Sbjct: 194 SYPYTAVEGQCRHSKQLGVAKVTGYYTVHSGSEVELKNLVGAERPAAVAVDVESD-FMMY 252
Query: 245 ASGIYDDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCG 299
SGIY + C+ VNHA+L VGY WI+KN W WG+ GY+ + R N CG
Sbjct: 253 RSGIYQSQTCSPLSVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCG 312
Query: 300 IANYAVYALI 309
IA+ A ++
Sbjct: 313 IASLASLPMV 322
>gi|211910935|gb|ACJ13096.1| cathepsin L [Globodera mexicana]
Length = 293
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/260 (34%), Positives = 145/260 (55%), Gaps = 3/260 (1%)
Query: 12 FPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYI 71
+ QK +K Y + ++++ L + S + I HNQ +G + + ENH++DL Y
Sbjct: 34 YKQKHGRKAYADQDVENERMLTYLSAKQFIDKHNQAYIEGKVTFRVGENHIADLPFSEYK 93
Query: 72 K--EMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
K RL +RR +P+ +DWR+KG++T NQ CG+C+AFS
Sbjct: 94 KLNGYRRLLGDNLRRNASTFLAPMNVGDLPESVDWRDKGWVTEVKNQGMCGSCWAFSSTG 153
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
A++ Q + T ++ LS Q ++DCS GN+GC GG + N Y++ G+ KE DYPYK
Sbjct: 154 ALEAQHARQTGQLISLSEQNLIDCSKKYGNMGCNGGIMDNAFQYIKDNNGVDKELDYPYK 213
Query: 190 GKQS-ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGI 248
K C FKR ++ + + + DE LK+ +AT GP +V+I+A +FQLY G+
Sbjct: 214 AKTGKKCLFKRNDVGATDTGFFDIAEGDEEKLKIAVATQGPASVAIDAGHRSFQLYTHGV 273
Query: 249 YDDEACTSDYVNHAMLLVGY 268
Y ++ C+ + ++H +L+VGY
Sbjct: 274 YFEKECSPENLDHGVLVVGY 293
>gi|146168075|ref|XP_001016705.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145247|gb|EAR96460.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 343
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/217 (38%), Positives = 127/217 (58%), Gaps = 6/217 (2%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ-IFKSTSEIEELSIQQVVDCSII 156
+P ++DWR KG +TP +Q +CG+C+ FS A++ + + LS QQ++DC+
Sbjct: 122 LPQYVDWRTKGVVTPVKDQGECGSCWTFSTTGALESHWALHTGNAPLLLSEQQLIDCAGA 181
Query: 157 SGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQD 216
N GC GG Y+ +AGGL E DYPY+G + C+F R + + S + QD
Sbjct: 182 FNNFGCDGGLPSQAYEYISYAGGLETEGDYPYEGTDNSCEFNRAQVAAKVVSSYNITFQD 241
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTS--DYVNHAMLLVGY--TRNS 272
E+ L LATVGP++++ + F Y GIY + +C+ + VNHA+L VGY T N
Sbjct: 242 ENELIYHLATVGPVSIAYECT-DDFMDYEGGIYSNPSCSKSPEDVNHAVLAVGYNLTGNY 300
Query: 273 WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+I+KN W WG NGY Y++ G+N CG+A+ A Y ++
Sbjct: 301 YIVKNSWGEDWGINGYFYIELGSNMCGLADCASYPIV 337
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 99/293 (33%), Positives = 157/293 (53%), Gaps = 19/293 (6%)
Query: 27 DSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM--TRLTHSRIRR 84
D + ++ W N + + HN+ A + HG+ L N +DL + R+ +R R
Sbjct: 67 DRRFRVFWD-NLRFVDAHNERAAE--HGFRLGMNQFADLTNDEFRAAYLGARIPAARRRG 123
Query: 85 TLV--RSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
T V R + +P+ +DWREKG + P NQ CG+C+AFS S+++ T E+
Sbjct: 124 TAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEM 183
Query: 143 EELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI 202
LS Q++V+CS GN GC GG + +++ GG+ E DYPYK C R N
Sbjct: 184 VTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENA 243
Query: 203 -VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNH 261
VV I + +P DE +L+ +A P++V+I A FQLY +G++ CT++ ++H
Sbjct: 244 KVVSIDGFEDVPENDEKSLQKAVAHQ-PVSVAIEAGGREFQLYKAGVFSG-TCTTN-LDH 300
Query: 262 AMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
++ VGY ++ WI++N W WG++GY+ ++R N +CGIA A Y
Sbjct: 301 GVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCGIAMMASY 353
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 160/300 (53%), Gaps = 15/300 (5%)
Query: 19 KDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLT 78
K YR + ++ +Q N I HN++ ++G + + +D+ ++ +
Sbjct: 32 KTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLDLLKLQG 91
Query: 79 HSRIRRTLVRSPESNESVLIP-DHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFK 137
+ V ++ + D +DWRE+G +TP +Q +CG+C+AFS AI+GQ FK
Sbjct: 92 VPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAIEGQFFK 151
Query: 138 STSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK 196
+ LS Q++VDC+ GN GC GG + ++VQ G+ EE YPY+G++S CK
Sbjct: 152 KNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQ-DEGIQTEESYPYEGRRSSCK 210
Query: 197 FKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA-CT 255
K + V + ++ + P DE + T+A GP+AV+I AS +F Y GI D++ C+
Sbjct: 211 -KSGDYVTKVKTY--VFPLDEQEMARTVAAKGPVAVAIEASQLSF--YDKGIVDEKCRCS 265
Query: 256 S--DYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ + +NH +L+VGY + WI+KN W WG+ GY LK+ CGI Y Y ++
Sbjct: 266 NKREDLNHGVLVVGYGSENGVDYWIVKNSWGADWGEKGYFRLKKDVKACGIDYYNTYPIL 325
>gi|10798509|emb|CAC12805.1| procathepsin L3 [Fasciola hepatica]
Length = 311
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 102/305 (33%), Positives = 154/305 (50%), Gaps = 16/305 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSD----LHPRH 69
++ Y K+Y + ++ + W N K I HN +GL Y L N +D
Sbjct: 10 KRMYNKEYNGADEEHRRNI-WGKNVKHIEEHNLRHDRGLVTYKLGLNQFTDPTFEEFQAK 68
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
Y+ EM+ ++ S + + E N+ +P +DWRE G++T +Q CG+C+AFS
Sbjct: 69 YLMEMSPVSES-LSDGVSYEAEGND---VPASIDWREYGYVTEVKDQGQCGSCWAFSAVG 124
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYK 189
AI+GQ K S QQ+VDC+ GN GC GG + N Y++ + GL YPY+
Sbjct: 125 AIEGQYVKKFQNQTLFSEQQLVDCTRRFGNHGCGGGWMENAYKYLKNS-GLETASYYPYQ 183
Query: 190 GKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIY 249
+ C++++ V ++ + DE L + GP AV+++A F +Y SGI+
Sbjct: 184 AVEYQCQYRKELGVAKVTGAYTVHSGDEMKLMPMVGREGPAAVAVDAQS-DFYMYESGIF 242
Query: 250 DDEACTSDYVNHAMLLVGYTRNS----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
+ CTS V HA+L VGY S WILKN W WG++GYM R N C IA+ A
Sbjct: 243 QSQTCTSRSVTHAVLAVGYGTESGTDYWILKNSWGKWWGEDGYMRFARNRGNMCAIASVA 302
Query: 305 VYALI 309
++
Sbjct: 303 SVPMV 307
>gi|2499879|sp|Q40143.1|CYSP3_SOLLC RecName: Full=Cysteine proteinase 3; Flags: Precursor
gi|1235545|emb|CAA88629.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
Length = 356
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 145/279 (51%), Gaps = 13/279 (4%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESV 96
N K I +HN+ +GL Y L N +DL + K +L S+ + +V
Sbjct: 84 NLKMIRSHNR---KGLS-YKLGINEFTDLTWDEFRKH--KLGASQNCSATTKGNLKLTNV 137
Query: 97 LIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
++P+ DWR+ G ++P Q CG+C+ FS A++ ++ + LS QQ+VDC+
Sbjct: 138 VLPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGA 197
Query: 157 SGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQD 216
N GC GG Y++F GGL EE YPY GK ICKF + NI V + S +
Sbjct: 198 FNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGICKFSQANIGVKVISSVNITLGA 257
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC--TSDYVNHAMLLVGYTRNS-- 272
E+ LK +A V P++V+ F+ Y SG+Y C T VNHA+L VGY +
Sbjct: 258 EYELKYAVALVRPVSVAFEVV-KGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVENGT 316
Query: 273 --WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
W++KN W WG++GY ++ G N CG+A A Y ++
Sbjct: 317 PYWLIKNSWGADWGEDGYFKMEMGKNMCGVATCASYPIV 355
>gi|159464745|ref|XP_001690602.1| cystein endopsptidase [Chlamydomonas reinhardtii]
gi|158280102|gb|EDP05861.1| cystein endopsptidase [Chlamydomonas reinhardtii]
Length = 616
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 147/300 (49%), Gaps = 35/300 (11%)
Query: 40 KIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVL-- 97
K+ HN Y L+ N +D HP + M L + R++ + + P+ + ++
Sbjct: 301 KVQAHNTRPDAS---YKLKLNQWADWHPEEWRSAM--LPNLRLKPQVQQRPDGTDVLISE 355
Query: 98 ----------------------IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQI 135
+P +DWR G +Q CG+CY F+ + G
Sbjct: 356 MPGAPYRPKSTFRGTPGLSRADLPPSVDWRGTGADPGVKDQGMCGSCYTFAATGTMDGTW 415
Query: 136 FKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSIC 195
F +T + S QQ++DC+ G GC GG + LNYV GG+ E+DY Y+G+ C
Sbjct: 416 FVATGQRRSFSEQQIIDCAWDYGPNGCFGGYYQPVLNYVAEQGGMALEQDYTYRGEPGYC 475
Query: 196 KFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACT 255
+ V S + + ++E AL +A GPIAVS+NA P F Y+ G++D+ ACT
Sbjct: 476 RASNHTRVGLFSGYMNVESRNELALMEAVAKYGPIAVSVNADPEAFSFYSEGVFDEPACT 535
Query: 256 SDY--VNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
+ ++H + L GY ++ W+++N WSH WGD+GY+ + RG + CGIA AL+
Sbjct: 536 TRMRDLDHTVTLFGYGSQDGKDYWLVRNSWSHFWGDDGYIKIVRGKHDCGIATDPAVALV 595
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 97/309 (31%), Positives = 155/309 (50%), Gaps = 25/309 (8%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F + K+ K Y K + +++N K H + HG T + + R
Sbjct: 43 FTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQ 102
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIAS 129
++ RL + P +N +P+ DWREKG +TP +Q CG+C+AFS
Sbjct: 103 FLGLNKRLRLPAHAQKAPILPTTN----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTG 158
Query: 130 AIQGQIFKSTSEIEELSIQQVVDCSII-------SGNLGCAGGSLRNTLNYVQFAGGLMK 182
A++G + +T ++ LS QQ+VDC + S + GC GG + N Y+ +GG+++
Sbjct: 159 ALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNGGLMNNAFEYLLQSGGVVQ 218
Query: 183 EEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQ 242
E+DY Y G+ CKF + +V +S++SV+ DE + L GP+AV+INA+ Q
Sbjct: 219 EKDYAYTGRDGSCKFDKSKVVASVSNFSVV-SLDEEQIAANLVKNGPLAVAINAA--WMQ 275
Query: 243 LYASGIYDDEACTSDYVNHAMLLVGYTRNS-----------WILKNWWSHHWGDNGYMYL 291
Y SG+ C ++H +LLVG+ + + WI+KN W +WG+ GY +
Sbjct: 276 AYMSGVSCPYVCAKARLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGEQGYYKI 335
Query: 292 KRGNNRCGI 300
RG N CG+
Sbjct: 336 CRGRNVCGV 344
>gi|45550334|gb|AAS67923.1| cathepsin L [Artemia franciscana]
Length = 226
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 129/222 (58%), Gaps = 7/222 (3%)
Query: 95 SVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCS 154
+V +P+ +DWREKG +TP Q C +C AFS A++ Q F+ T ++ LS Q ++DCS
Sbjct: 5 NVTVPESVDWREKGAVTPVKYQGQCASCLAFSPTGALESQTFRKTGKLISLSEQNLIDCS 64
Query: 155 IISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPP 214
GNLGC GG + Y++ G+ E Y Y+ K++ C+ N + +P
Sbjct: 65 GEYGNLGCKGGWISQAFEYIKDNKGIDTENKYHYEAKENFCRDNPRNRGAVALGFVNIPS 124
Query: 215 QDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDY--VNHAMLLVGYTRNS 272
+E LK +ATVGP++ I+ S FQ Y+ G+Y + +C + + +NHA+L++G ++
Sbjct: 125 GEEDKLKAAVATVGPVSAVIDVSHEGFQFYSKGVYYEPSCKTSFEHLNHAVLVIGCGSDN 184
Query: 273 ----WILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN WS HWGD GY+ + R N CG+A A+Y ++
Sbjct: 185 GEDYWLVKNSWSKHWGDEGYLKIARNRKNHCGVATAALYPIV 226
>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 387
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 157/315 (49%), Gaps = 35/315 (11%)
Query: 10 FIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRH 69
F ++++ K Y + ++ +++N ++ H +HG T SDL P
Sbjct: 59 FSLFKRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFDPSAIHGVT----QFSDLTPFE 114
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDH-----LDWREKGFITPDWNQEDCGACYA 124
+ K L R+R + ++N + ++P DWR+ G +T NQ CG+C++
Sbjct: 115 FRKAFLGLRGHRLRLPV----DTNAAPILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWS 170
Query: 125 FSIASAIQGQIFKSTSEIEELSIQQVVDCS-------IISGNLGCAGGSLRNTLNYVQFA 177
FS A++G F +T E+ LS QQ+VDC + + GC GG + + Y A
Sbjct: 171 FSTTGALEGANFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLKA 230
Query: 178 GGLMKEEDYPYKG-KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINA 236
GGLMKE+DYPY G ++ C F + I I+++SV+ DE + L GP+A++INA
Sbjct: 231 GGLMKEQDYPYAGIDRNTCNFDKSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINA 290
Query: 237 SPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----------TRNSWILKNWWSHHWGD 285
Q Y G+ C S ++H +LLVGY ++ WI+KN W WG+
Sbjct: 291 V--FMQTYIGGVSCPFIC-SKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGE 347
Query: 286 NGYMYLKRGNNRCGI 300
NGY + RG N CG+
Sbjct: 348 NGYYKICRGRNICGV 362
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/299 (33%), Positives = 154/299 (51%), Gaps = 15/299 (5%)
Query: 9 IFIFPQKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPR 68
+F K+Y K Y A S + +++N + I HN A YT+ N +DL
Sbjct: 41 MFTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANAS---YTMGLNEFADLSFE 96
Query: 69 HYIKEMTRLTHSRIRRTLVRSPESNESV-LIPDHLDWREKGFITPDWNQEDCGACYAFSI 127
+ + H + R RS ++ V P +DWR +TP +Q CG+C+AFS
Sbjct: 97 EFKGKYFGYKH--VEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSA 154
Query: 128 ASAIQGQ-IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDY 186
+I+G + + + LS QQ+VDCS GN GC GG + Y+ G+ E Y
Sbjct: 155 TGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAY 214
Query: 187 PYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYAS 246
PYKG +C+ K VV IS + + DE +L + TVGP++V+I A FQ Y+S
Sbjct: 215 PYKGVGGLCQ-KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSS 273
Query: 247 GIYDDEACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNNRCGIA 301
G++ C + ++H +L VGY +++ WI+KN W WG++GY+ + R N+CGIA
Sbjct: 274 GVFSG-TCGHN-LDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRMIRNKNQCGIA 330
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,185,496,068
Number of Sequences: 23463169
Number of extensions: 211331059
Number of successful extensions: 505656
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5230
Number of HSP's successfully gapped in prelim test: 1671
Number of HSP's that attempted gapping in prelim test: 484643
Number of HSP's gapped (non-prelim): 7947
length of query: 309
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 167
effective length of database: 9,027,425,369
effective search space: 1507580036623
effective search space used: 1507580036623
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)