BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022267
(300 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/302 (77%), Positives = 258/302 (85%), Gaps = 3/302 (0%)
Query: 1 MASSHLFLTTCLLILGVI---SSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAA 57
M +S F T LL++G I SQ A VS LKL+S ILQDSI+K+VN NPKAGWKA
Sbjct: 1 METSLCFSTLLLLLIGAIFTFQSQVIAVEPVSDLKLNSRILQDSILKKVNGNPKAGWKAT 60
Query: 58 RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 117
N FSNYTV QFK+LLGVKPTPK L G+PV +H KSL+LP+ FDAR+AWPQCSTI +I
Sbjct: 61 MNHHFSNYTVAQFKYLLGVKPTPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKI 120
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
LDQGHCGSCWAFGAVE+LSDRFCIH+GMN+SLSVNDLLACCGFLCG GC+GGYPISAWRY
Sbjct: 121 LDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWRY 180
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
FVHHGVVTEECDPYFD GCSHPGCEP YPTPKC RKCV KNQLW+ SKHY + YRI+S
Sbjct: 181 FVHHGVVTEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDS 240
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
DPE IMAEIYKNGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGTS+DGE Y
Sbjct: 241 DPESIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAY 300
Query: 298 WV 299
W+
Sbjct: 301 WL 302
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 227/297 (76%), Positives = 256/297 (86%), Gaps = 17/297 (5%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
S+ + + SKLKL+S ILQ+SIIK+VNENP AGW+AA NPQ SN+TVGQFK+LLG KPT
Sbjct: 23 SRVISTELDSKLKLNSRILQESIIKKVNENPDAGWEAAMNPQLSNFTVGQFKYLLGAKPT 82
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ-----------------GH 122
PK L+GVP+ +H K+LKLPK FDAR+AWP CSTI +IL Q GH
Sbjct: 83 PKKELMGVPMISHPKTLKLPKEFDARTAWPHCSTIGKILGQLLSFYNIFSIFFFLFLEGH 142
Query: 123 CGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
CGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCGDGCDGGYP+ AWRYFVHHG
Sbjct: 143 CGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHG 202
Query: 183 VVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
VVTEECDPYFD+ GCSHPGCEP +PTPKCVRKC+ KNQLWR SKHYS++AYRI+SDP D+
Sbjct: 203 VVTEECDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDPHDV 262
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
MAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG+VMGGHAVKLIGWGTSD+GEDYW+
Sbjct: 263 MAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWL 319
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 230/302 (76%), Positives = 261/302 (86%), Gaps = 3/302 (0%)
Query: 1 MASSHLFLTTCLLILGVISS---QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAA 57
MASSH +L+ LL L + + Q +AE V K KLD+ ILQ+SI++ VNE+P+AGWKA
Sbjct: 1 MASSHFYLSLSLLFLAAVCTFHHQVYAEEQVLKFKLDADILQESIVRHVNEHPQAGWKAT 60
Query: 58 RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 117
NP+FSNY+V QFK+LLGVK TP+ L PV +H KSLKLPKSFDAR AWPQC +I I
Sbjct: 61 MNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISIGTI 120
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
LDQGHCGSCWAFGAVE+LSDRFCIHF MN++LSVNDLLACCGF+CGDGCDGGYPISAWRY
Sbjct: 121 LDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWRY 180
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
FV HGVVTE+CDPYFD+TGCSHPGCEPAYPTP+CVR CV KNQ+WR +KHY +SAYR+
Sbjct: 181 FVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAYRVKR 240
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
DP DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+DDGEDY
Sbjct: 241 DPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY 300
Query: 298 WV 299
W+
Sbjct: 301 WL 302
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 230/303 (75%), Positives = 261/303 (86%), Gaps = 4/303 (1%)
Query: 1 MASSHLFLTTCLLILGVISS----QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKA 56
MASSH +L+ LL L + + Q +AE V K KLD+ ILQ+SI++ VNE+P+AGWKA
Sbjct: 1 MASSHFYLSLSLLFLAAVCTFHHQQVYAEEQVLKFKLDADILQESIVRHVNEHPQAGWKA 60
Query: 57 ARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISR 116
NP+FSNY+V QFK+LLGVK TP+ L PV +H KSLKLPKSFDAR AWPQC +I
Sbjct: 61 TMNPRFSNYSVSQFKYLLGVKQTPEKDLKSTPVLSHPKSLKLPKSFDAREAWPQCISIGT 120
Query: 117 ILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 176
ILDQGHCGSCWAFGAVE+LSDRFCIHF MN++LSVNDLLACCGF+CGDGCDGGYPISAWR
Sbjct: 121 ILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPISAWR 180
Query: 177 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 236
YFV HGVVTE+CDPYFD+TGCSHPGCEPAYPTP+CVR CV KNQ+WR +KHY +SAYR+
Sbjct: 181 YFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAYRVK 240
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
DP DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+DDGED
Sbjct: 241 RDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED 300
Query: 297 YWV 299
YW+
Sbjct: 301 YWL 303
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 486 bits (1251), Expect = e-135, Method: Compositional matrix adjust.
Identities = 225/301 (74%), Positives = 258/301 (85%), Gaps = 2/301 (0%)
Query: 1 MASSHLFLTTCLLILG--VISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAAR 58
MA +H+ L T LL++G V+ Q AE +S+ K +S ILQDSI+K+VNEN KAGWKAA
Sbjct: 1 MAMNHMSLVTFLLLIGASVLVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAAL 60
Query: 59 NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
NP+FSN+TV QFK LLGVKPT KG L G+P+ TH K L+LP+ FDAR AWP CSTI RIL
Sbjct: 61 NPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIGRIL 120
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDLLACCGFLCGDGCDGGYP+ AW+YF
Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDGCDGGYPLQAWKYF 180
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
V GVVT+ECDPYFD+ GCSHPGCEPAYPTPKC RKCVK+N LW SKH+ ++AY I+SD
Sbjct: 181 VRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSD 240
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
P IM E+YKNGPVEVSFTVYEDFAHYKSGVYKH+TGDVMGGHAVKLIGWGTS+DGEDYW
Sbjct: 241 PHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYW 300
Query: 299 V 299
+
Sbjct: 301 L 301
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 482 bits (1241), Expect = e-134, Method: Compositional matrix adjust.
Identities = 222/302 (73%), Positives = 258/302 (85%), Gaps = 4/302 (1%)
Query: 2 ASSHLFLTTCLLILGVISSQTFAEGV----VSKLKLDSHILQDSIIKEVNENPKAGWKAA 57
++ L L + L+LG++SS +GV +SK KL+S ILQ+ I+K+VN+NP AGWKAA
Sbjct: 6 TTTKLCLVSVFLLLGLVSSSFDLQGVKAENLSKQKLNSKILQEEIVKKVNQNPDAGWKAA 65
Query: 58 RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 117
N +FSN TV +FK LLGVKPTPK LGVP+ +HD+SLKLPK FDAR+AWPQC++I I
Sbjct: 66 INDRFSNATVAEFKRLLGVKPTPKKHFLGVPIVSHDRSLKLPKEFDARTAWPQCTSIGNI 125
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
LDQGHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+Y
Sbjct: 126 LDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQY 185
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
F + GVVTEECDPYFD TGCSHPGCEPAYPTPKC+RKCV NQLW SKHYS+S Y + S
Sbjct: 186 FSYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQLWSQSKHYSVSTYTVKS 245
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
+P+DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGT+D+GEDY
Sbjct: 246 NPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDY 305
Query: 298 WV 299
W+
Sbjct: 306 WL 307
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 222/301 (73%), Positives = 256/301 (85%), Gaps = 2/301 (0%)
Query: 1 MASSHLFLTTCLLILG--VISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAAR 58
MA +H+ LTT L++G +I Q AE +S+ K +S ILQDSI+K+VNEN KAGWKAA
Sbjct: 1 MALNHMSLTTLFLLIGASIIVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAAL 60
Query: 59 NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
NP+FSN+TV QFK LLGVKPT KG L G+P+ TH K L+LP+ FDAR AW CSTI RIL
Sbjct: 61 NPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWSNCSTIGRIL 120
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDL ACCGFLCGDGCDGGYP+ AW+YF
Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGYPLQAWKYF 180
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
V GVVT+ECDPYFD+ GCSHPGCEPAYPTPKC RKCVK+N LW SKH+ ++AY I+SD
Sbjct: 181 VRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSD 240
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
P IM E+YKNGPVEVSFTVYEDFAHYKSGVYKH+TGD+MGGHAVKLIGWGTS+DGEDYW
Sbjct: 241 PHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYW 300
Query: 299 V 299
+
Sbjct: 301 L 301
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 229/302 (75%), Positives = 262/302 (86%), Gaps = 3/302 (0%)
Query: 1 MASSHLFLTTCLLILGVISS---QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAA 57
MA + L L T LL+LG IS+ + A VS+LK ++ ILQ+S+++ +N NPKAGWKAA
Sbjct: 1 MAMNQLCLATILLLLGAISTFHPEVVALKSVSQLKFNTKILQESMVELINANPKAGWKAA 60
Query: 58 RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 117
NP+FSNY+VGQF HLLGVKPT + L GVPV TH K+LKLPK FDAR+AWPQCSTI +I
Sbjct: 61 MNPRFSNYSVGQFMHLLGVKPTLQKDLEGVPVITHPKTLKLPKHFDARTAWPQCSTIGKI 120
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
LDQGHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCG GCDGGYP+ AWRY
Sbjct: 121 LDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGYPLYAWRY 180
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
F+HHGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKC +NQLWR +K Y SAYRI+S
Sbjct: 181 FIHHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQLWRKAKRYGQSAYRISS 240
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
DP IMAE+YKNGPVEV+FTVYEDFAHY+SGVY++ TGDVMGGHAVKLIGWGT+DDGEDY
Sbjct: 241 DPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDY 300
Query: 298 WV 299
W+
Sbjct: 301 WI 302
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 224/302 (74%), Positives = 253/302 (83%), Gaps = 3/302 (0%)
Query: 1 MASSHLF-LTTCLLILGVISSQTFAEGV--VSKLKLDSHILQDSIIKEVNENPKAGWKAA 57
MAS+HL L T L+L Q ++ LKL+SHILQ+S KE+NENP+AGW+AA
Sbjct: 1 MASTHLLPLATFFLLLSASYLQIAGAEAQPLTSLKLNSHILQESTAKEINENPEAGWEAA 60
Query: 58 RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 117
NP+FSNYTV QFK LLGVKP PK L P +H K+LKLPK+FDAR+AW QCSTI RI
Sbjct: 61 INPRFSNYTVEQFKRLLGVKPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRI 120
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
LDQGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY
Sbjct: 121 LDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWRY 180
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
HHGVVTEECDPYFD GCSHPGCEPAY TPKCV+KCV NQ+W+ SKHYS+SAYR+NS
Sbjct: 181 LAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVSAYRVNS 240
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
DP DIMAE+YKNGPVEV+FTVYEDFA+YKSGVYKHITG +GGHAVKLIGWGT+DDGEDY
Sbjct: 241 DPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDY 300
Query: 298 WV 299
W+
Sbjct: 301 WL 302
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 221/294 (75%), Positives = 251/294 (85%), Gaps = 2/294 (0%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
LFL + L + ++T + ++ LKL+SHILQ+SI K++NENP+AGW+AA NP+FSN+
Sbjct: 11 LFLAFSVSYLSIGDAET--DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNF 68
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
TVGQFK LLGVK PK LL PV TH KSLKLPK FDAR+AW QCSTI +ILDQGHCGS
Sbjct: 69 TVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILDQGHCGS 128
Query: 126 CWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
CWAFGAVE+L DRFCIHF MN+SLSVNDLLACCGFLCG GCDGG PI AWRY HHGVVT
Sbjct: 129 CWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVT 188
Query: 186 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 245
EECDPYFD GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE
Sbjct: 189 EECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAE 248
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YKNGPVEV+FTV+EDFAHYKSGVYKHITG +GGHAVKLIGWGTSD+GEDYW+
Sbjct: 249 VYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWL 302
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 221/294 (75%), Positives = 251/294 (85%), Gaps = 2/294 (0%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
LFL + L + ++T + ++ LKL+SHILQ+SI K++NENP+AGW+AA NP+FSN+
Sbjct: 13 LFLAFSVSYLSIGDAET--DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNF 70
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
TVGQFK LLGVK PK LL PV TH KSLKLPK FDAR+AW QCSTI +ILDQGHCGS
Sbjct: 71 TVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILDQGHCGS 130
Query: 126 CWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
CWAFGAVE+L DRFCIHF MN+SLSVNDLLACCGFLCG GCDGG PI AWRY HHGVVT
Sbjct: 131 CWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVT 190
Query: 186 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 245
EECDPYFD GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE
Sbjct: 191 EECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAE 250
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YKNGPVEV+FTV+EDFAHYKSGVYKHITG +GGHAVKLIGWGTSD+GEDYW+
Sbjct: 251 VYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWL 304
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 221/298 (74%), Positives = 252/298 (84%), Gaps = 4/298 (1%)
Query: 6 LFLTTCLLILGVISSQTFAEGV----VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQ 61
L L + LG++ S +G+ +SK KL S ILQ+ I+KEVNENP AGWKAA N +
Sbjct: 8 LHLASVFFFLGLLISSFNLQGIAAENLSKQKLTSWILQNEIVKEVNENPNAGWKAAFNDR 67
Query: 62 FSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 121
F+N TV +FK LLGVKPTPK LGVP+ +HD SLKLPK FDAR+AW QC+++ RILDQG
Sbjct: 68 FANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSVGRILDQG 127
Query: 122 HCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
HCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HH
Sbjct: 128 HCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHH 187
Query: 182 GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED 241
GVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV NQLWR SKHY +SAY++ S P+D
Sbjct: 188 GVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDD 247
Query: 242 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
IMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+
Sbjct: 248 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWL 305
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 477 bits (1227), Expect = e-132, Method: Compositional matrix adjust.
Identities = 224/302 (74%), Positives = 253/302 (83%), Gaps = 3/302 (0%)
Query: 1 MASSHLF-LTTCLLILGVISSQTFAEGV--VSKLKLDSHILQDSIIKEVNENPKAGWKAA 57
MAS+HL L T L+L Q ++ LKL+SHILQ+S KE+NENP+AGW+AA
Sbjct: 1 MASTHLLPLATFFLLLSASYLQIAGAEAQPLTSLKLNSHILQESTAKEINENPEAGWEAA 60
Query: 58 RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRI 117
NP+FSNYTV QFK LLGVKP PK L P +H K+LKLPK+FDAR+AW QCSTI RI
Sbjct: 61 INPRFSNYTVEQFKRLLGVKPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRI 120
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
LDQGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY
Sbjct: 121 LDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWRY 180
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
HHGVVTEECDPYFD GCSHPGCEPAY TPKCV+KCV NQ+W+ SKHYS+SAYR+NS
Sbjct: 181 LAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVSAYRVNS 240
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
DP DIMAE+YKNGPVEV+FTVYEDFA+YKSGVYKHITG +GGHAVKLIGWGT+DDGEDY
Sbjct: 241 DPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGEDY 300
Query: 298 WV 299
W+
Sbjct: 301 WL 302
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 476 bits (1226), Expect = e-132, Method: Compositional matrix adjust.
Identities = 222/299 (74%), Positives = 251/299 (83%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
+ S+ +F LLI Q A +SK KL S ILQ+ I+KEVNENP AGWKA+ N
Sbjct: 9 LHSASVFFCLGLLISSFNLLQGIAAENLSKQKLTSWILQNEIVKEVNENPNAGWKASFND 68
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+F+N TV +FK LLGVKPTPK LGVP+ +HD SLKLPK FDAR+AW QC++I RILDQ
Sbjct: 69 RFANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQ 128
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
GHCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF H
Sbjct: 129 GHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKH 188
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 240
HGVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV NQLWR SKHY +SAY++ S P+
Sbjct: 189 HGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPD 248
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+
Sbjct: 249 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWL 307
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 219/294 (74%), Positives = 249/294 (84%), Gaps = 2/294 (0%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
LFL + L + ++T + ++ LKL+SHILQ+SI K++NENP+AGW+AA NP+FSN+
Sbjct: 13 LFLAFSVSYLSIGDAET--DEKLNGLKLNSHILQESIAKQINENPEAGWEAAINPRFSNF 70
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
TVGQFK LLGVK PK LL PV TH KSLKLPK FDAR+AW QCSTI +ILDQGHCGS
Sbjct: 71 TVGQFKRLLGVKQAPKKELLSTPVVTHPKSLKLPKEFDARAAWSQCSTIGKILDQGHCGS 130
Query: 126 CWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
CWAFGAVE+L DRFC HF MN+SLSVNDLLACCGFLCG GCDGG PI AWRY HHGVVT
Sbjct: 131 CWAFGAVESLQDRFCSHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVT 190
Query: 186 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 245
EECDPYFD GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIM E
Sbjct: 191 EECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMTE 250
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YKNGPVEV+FTV+EDFAHYKSGVYKHITG +GGHAVKLIGWGTSD+GEDYW+
Sbjct: 251 VYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWL 304
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 473 bits (1216), Expect = e-131, Method: Compositional matrix adjust.
Identities = 221/301 (73%), Positives = 251/301 (83%), Gaps = 2/301 (0%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGV--VSKLKLDSHILQDSIIKEVNENPKAGWKAAR 58
MAS+ L L T L+L Q ++ LKL+S ILQ+SI KE+NENP+AGW+AA
Sbjct: 1 MASTLLPLATFFLVLSASYLQIAGAKAQPLTSLKLNSPILQESIAKEINENPEAGWEAAI 60
Query: 59 NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
NP FSNYTV QFK LLGVKPTPK L P +H KSLKLPK+FDAR+AW QCSTI RIL
Sbjct: 61 NPHFSNYTVEQFKRLLGVKPTPKKELRSTPAISHPKSLKLPKNFDARTAWSQCSTIGRIL 120
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AW+Y
Sbjct: 121 DQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWQYL 180
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
HHGVVTEECDPYFD GCSHPGCEPAY TPKCV+KCV NQ+W+ SKHYS++AYR++SD
Sbjct: 181 AHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVNAYRVSSD 240
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
P DIM E+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGT++DGEDYW
Sbjct: 241 PHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGEDYW 300
Query: 299 V 299
+
Sbjct: 301 L 301
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 213/272 (78%), Positives = 244/272 (89%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
++K KL+S ILQD I+K+VN+NP AGWKAA N +FSN TV +FK LLGVKPTPK LGV
Sbjct: 33 LTKQKLNSKILQDEIVKKVNQNPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 147
PV +HD SLKLPK+FDAR+AWPQC++I +ILDQGHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 93 PVVSHDPSLKLPKAFDARTAWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNI 152
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
TP+C+RKCV N+LW SKHYS+S Y +NS P+DIMAE+YKNGPVEVSFTVYEDFAHYKS
Sbjct: 213 TPRCLRKCVSDNKLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSFTVYEDFAHYKS 272
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVYKHITG +GGHAVKLIGWGTS++GEDYW+
Sbjct: 273 GVYKHITGSNIGGHAVKLIGWGTSNEGEDYWL 304
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 217/279 (77%), Positives = 247/279 (88%)
Query: 21 QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTP 80
Q A ++K KL+S ILQ+ I+K+VNE+P AGWKAA N +FSN TV +FK LLGVKPTP
Sbjct: 27 QGVAAENLTKQKLNSKILQEEIVKKVNEHPNAGWKAAINDRFSNATVAEFKRLLGVKPTP 86
Query: 81 KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
K LLLGVPV +HD+SLKLPKSFDAR+ WPQC++I +ILDQGHCGSCWAFGAVE+LSDRFC
Sbjct: 87 KKLLLGVPVVSHDQSLKLPKSFDARTHWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFC 146
Query: 141 IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHP 200
I FGMN++LSVNDLLACCGF CGDGCDGGYPISAW+YF + GVVTEECDPYFD TGCSHP
Sbjct: 147 IQFGMNITLSVNDLLACCGFRCGDGCDGGYPISAWQYFSYSGVVTEECDPYFDQTGCSHP 206
Query: 201 GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
GCEPAY TP+C+RKCV +NQLW SKHYSI+ Y + S+P+DIMAEIYKNGPVEVSFTVYE
Sbjct: 207 GCEPAYNTPQCLRKCVGRNQLWSESKHYSINTYVVESNPQDIMAEIYKNGPVEVSFTVYE 266
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DFAHYKSGVYKHITG +GGHAVKLIGWGT+DDGEDYW+
Sbjct: 267 DFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDGEDYWL 305
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 217/280 (77%), Positives = 243/280 (86%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
Q AE VSKLKL+S ILQDSI+++VNENPKAGW+A NPQFSNY+VG+FK+LLGVK T
Sbjct: 5 QQATAEEPVSKLKLNSRILQDSIVQKVNENPKAGWEATMNPQFSNYSVGEFKYLLGVKQT 64
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
P+ L GVP+ H KS+KLP FDAR+AWP CSTI RILDQGHCGSCWAFGAVE+LSDRF
Sbjct: 65 PRKELRGVPLLRHPKSMKLPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRF 124
Query: 140 CIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
CIH+GMNLSLSVNDLLACCG++CG GCDGG PI AWRYFV GVVTEECDPYFD GCSH
Sbjct: 125 CIHYGMNLSLSVNDLLACCGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCSH 184
Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
PGCEP +PTPKC RKC KN+LW SKH+S++AYRI+SDP IMAE+ NGPVEV+FTVY
Sbjct: 185 PGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAFTVY 244
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDFAHYKSGVYKHITGD MGGHAVKLIGWGTS+DGEDYW+
Sbjct: 245 EDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWL 284
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 470 bits (1209), Expect = e-130, Method: Compositional matrix adjust.
Identities = 214/272 (78%), Positives = 241/272 (88%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
++K KLDS ILQD I+K+VNENP AGWKAA N +FSN TV +FK LLGVKPTPK LGV
Sbjct: 33 LTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 147
P+ +HD SLKLPK+FDAR+AWPQC++I ILDQGHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 93 PIVSHDPSLKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNI 152
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
TPKC RKCV N+LW SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKS
Sbjct: 213 TPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKS 272
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVYKHITG +GGHAVKLIGWGTS +GEDYW+
Sbjct: 273 GVYKHITGSNIGGHAVKLIGWGTSSEGEDYWL 304
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 466 bits (1200), Expect = e-129, Method: Compositional matrix adjust.
Identities = 212/272 (77%), Positives = 240/272 (88%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
+S LKL+S ILQ+SI KE+NENP AGW+AA +P+FSNYTV QFK LLGVKP+PK L
Sbjct: 31 LSTLKLNSRILQESIAKEINENPGAGWEAAISPRFSNYTVAQFKRLLGVKPSPKKELRST 90
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 147
PV +H +SLKLPKSFDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIH +N+
Sbjct: 91 PVVSHPRSLKLPKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHLDVNV 150
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
SLSVNDLLACCGFLCG GCDGGYP+ AWRY HHGVVTEECDPYFD GCSHPGCEPAY
Sbjct: 151 SLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQ 210
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
TPKCVRKCVK NQ+W+ SK++S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKS
Sbjct: 211 TPKCVRKCVKGNQIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKS 270
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVYKHITG +GGHAVKLIGWGT+D+GEDYW+
Sbjct: 271 GVYKHITGSQLGGHAVKLIGWGTTDEGEDYWL 302
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 212/272 (77%), Positives = 239/272 (87%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
++K KLDS ILQD I+K+VNENP AGWKAA N +FSN TV +FK LLGVKPTPK LGV
Sbjct: 33 LTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 147
P+ +HD SLKLPK+FDAR+AWPQC++I IL GHCGSCWAFGAVE+LSDRFCI FGMN+
Sbjct: 93 PIVSHDPSLKLPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFGMNI 152
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEPAYP
Sbjct: 153 SLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYP 212
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
TPKC RKCV N+LW SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKS
Sbjct: 213 TPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKS 272
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVYKHITG +GGHAVKLIGWGTS +GEDYW+
Sbjct: 273 GVYKHITGSNIGGHAVKLIGWGTSSEGEDYWL 304
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 214/299 (71%), Positives = 246/299 (82%), Gaps = 3/299 (1%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M + L L T L+ ++T+ +S++KL+SHILQ+SI +++NENP+AGW+A NP
Sbjct: 1 MTPTILSLATLFLVFFFGEAKTYE---LSEVKLNSHILQESIARQINENPEAGWEATINP 57
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+FSN+TVGQFK LLGVK TP+ L PV TH KSLKLPK FDAR+AW QCSTI RILDQ
Sbjct: 58 RFSNFTVGQFKRLLGVKQTPRSELSSAPVVTHPKSLKLPKDFDARTAWSQCSTIGRILDQ 117
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
GHCGSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW Y H
Sbjct: 118 GHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAH 177
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 240
HGVVTEECDPYFD GCSHPGCEP Y TPKCV+KCV NQLW SKHYS+ AY +NSDP+
Sbjct: 178 HGVVTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQ 237
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKL+GWGTS +GEDYW+
Sbjct: 238 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWL 296
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 462 bits (1190), Expect = e-128, Method: Compositional matrix adjust.
Identities = 215/290 (74%), Positives = 245/290 (84%), Gaps = 3/290 (1%)
Query: 13 LILG---VISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 69
L+LG ++ Q AE +S+ KL+S ILQDSI+K VNEN +AGWKAA NPQ SN+TV Q
Sbjct: 12 LLLGAFFILILQVAAEKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFTVSQ 71
Query: 70 FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
FK LLGVKP +G L G+PV TH + +LPK FDAR AWPQCSTI +ILDQGHCGSCWAF
Sbjct: 72 FKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAF 131
Query: 130 GAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 189
GAVE+LSDRFCIH+ +++SLSVNDLLACC FLCG GCDGGYPI+AWRYF GVVTEECD
Sbjct: 132 GAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECD 191
Query: 190 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
PYFD+TGCSHPGCEP YPTPKC RKCVK N LWR SKHY ++AYR++ DP+ IMAE+YKN
Sbjct: 192 PYFDTTGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIMAEVYKN 251
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GPVEVSFTVYEDFAHYKSGVYKH+TG MGGHAVKLIGWGTS+ GEDYW+
Sbjct: 252 GPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWL 301
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 221/323 (68%), Positives = 252/323 (78%), Gaps = 39/323 (12%)
Query: 16 GVISS---QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
G IS+ + A VS+LK ++ ILQ+S+++ +N NPKAGWKAA NP+FSNY+VGQF H
Sbjct: 14 GAISTFHPEVVALKSVSQLKFNTKILQESMVELINANPKAGWKAAMNPRFSNYSVGQFMH 73
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL-------------- 118
LLGVKPT + L GVPV TH K+LKLPK FDAR+AWPQCSTI +IL
Sbjct: 74 LLGVKPTLQKDLEGVPVITHPKTLKLPKHFDARTAWPQCSTIGKILGRLLDSFSSYFDDF 133
Query: 119 ----------------------DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
DQGHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLA
Sbjct: 134 FCFGCTDALYFSYHLLVPFYIKDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLA 193
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGFLCG GCDGGYP+ AWRYF+HHGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKC
Sbjct: 194 CCGFLCGSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCT 253
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
+NQLWR +K Y SAYRI+SDP IMAE+YKNGPVEV+FTVYEDFAHY+SGVY++ TGD
Sbjct: 254 DENQLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGD 313
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
VMGGHAVKLIGWGT+DDGEDYW+
Sbjct: 314 VMGGHAVKLIGWGTTDDGEDYWI 336
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 213/299 (71%), Positives = 246/299 (82%), Gaps = 3/299 (1%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
++ + LFL L ++T+ +S++KL+SHILQ+SI +++NENP+AGW+A NP
Sbjct: 6 LSLATLFLVFFAPYLRFGEAKTYE---LSEVKLNSHILQESIARQINENPEAGWEATINP 62
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+FSN+TVGQFK LLGVK TP+ L PV TH KSLKLPK FDAR+AW QCSTI RILDQ
Sbjct: 63 RFSNFTVGQFKRLLGVKQTPRSELSSAPVVTHPKSLKLPKDFDARTAWSQCSTIGRILDQ 122
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
GHCGSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW Y H
Sbjct: 123 GHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAH 182
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 240
HGVVTEECDPYFD GCSHPGCEP Y TPKCV+KCV NQLW SKHYS+ AY +NSDP+
Sbjct: 183 HGVVTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQ 242
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKL+GWGTS +GEDYW+
Sbjct: 243 DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWL 301
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 208/291 (71%), Positives = 241/291 (82%), Gaps = 3/291 (1%)
Query: 12 LLILGVISS---QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 68
LL++G IS Q A V+ ++D ILQD I+K VNENP+AGWKA NP+FS++TV
Sbjct: 7 LLLIGAISLLILQVVAVKPVTLTEVDPKILQDEIVKTVNENPEAGWKADMNPRFSDFTVS 66
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
QFK LLGVK PK LL PV TH K ++LPK+FDAR+AWPQC +I+ ILDQGHCGSCWA
Sbjct: 67 QFKRLLGVKKAPKSLLKRTPVVTHSKEIELPKTFDARTAWPQCLSIADILDQGHCGSCWA 126
Query: 129 FGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 188
FGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF GVVT EC
Sbjct: 127 FGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSEC 186
Query: 189 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
DPYFD TGCSHPGCEPAYPTP C +KCVKKN LW SKH+S++AYR+NSD IM E+Y
Sbjct: 187 DPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYT 246
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGP EVSFTVYEDFAHYKSGVYKH+TG MGGHAVKLIGWGTS+DGEDYW+
Sbjct: 247 NGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWL 297
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 208/291 (71%), Positives = 241/291 (82%), Gaps = 3/291 (1%)
Query: 12 LLILGVISS---QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 68
LL++G IS Q A V+ ++D ILQD I+K VNENP+AGWKA NP+FS++TV
Sbjct: 7 LLLIGAISLLILQVVAVKPVTLTEVDPKILQDEIVKTVNENPEAGWKADMNPRFSDFTVS 66
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
QFK LLGVK PK LL PV TH K ++LPK+FDAR+AWPQC +I+ ILDQGHCGSCWA
Sbjct: 67 QFKRLLGVKKAPKSLLKRTPVVTHSKEIELPKTFDARTAWPQCLSIADILDQGHCGSCWA 126
Query: 129 FGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 188
FGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF GVVT EC
Sbjct: 127 FGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSEC 186
Query: 189 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
DPYFD TGCSHPGCEPAYPTP C +KCVKKN LW SKH+S++AYR+NSD IM E+Y
Sbjct: 187 DPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYT 246
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGP EVSFTVYEDFAHYKSGVYKH+TG MGGHAVKLIGWGTS+DGEDYW+
Sbjct: 247 NGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWL 297
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 202/273 (73%), Positives = 233/273 (85%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG 86
+++K S I+QD IIK +N++P AGW AARNP F+NYT QFKH+LGVKPTP +L
Sbjct: 31 LMTKEGGSSRIIQDDIIKAINKHPNAGWTAARNPYFANYTTAQFKHILGVKPTPHSVLND 90
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
VPVKT+ +SL LPK FDARSAW QC+TI ILDQGHCGSCWAFGAVE L DRFCIHF MN
Sbjct: 91 VPVKTYPRSLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFNMN 150
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 206
+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEPAY
Sbjct: 151 ISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAY 210
Query: 207 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
PTP C +KC +NQ+W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYK
Sbjct: 211 PTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYK 270
Query: 267 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SGVYKHITG +MGGHAVKLIGWGT+D GEDYW+
Sbjct: 271 SGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWL 303
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 203/268 (75%), Positives = 231/268 (86%), Gaps = 2/268 (0%)
Query: 34 DSH--ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
D+H I+Q+ II+ VN +P AGW A+RNP FSNYT+ QFKH+LGVKP P+ L VPVKT
Sbjct: 27 DNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALSNVPVKT 86
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 151
+ +SL+LPK FDARSAW +CSTI ILDQGHCGSCWAFGAVE L DRFCIH M++ LSV
Sbjct: 87 YSRSLELPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQDRFCIHLNMSILLSV 146
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
NDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEPAYPTPKC
Sbjct: 147 NDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKC 206
Query: 212 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
+KC ++NQ+W+ KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK
Sbjct: 207 EKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK 266
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
HITG +MGGHAVKLIGWGTSD GEDYW+
Sbjct: 267 HITGGIMGGHAVKLIGWGTSDAGEDYWL 294
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 446 bits (1147), Expect = e-123, Method: Compositional matrix adjust.
Identities = 202/268 (75%), Positives = 231/268 (86%), Gaps = 2/268 (0%)
Query: 34 DSH--ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
D+H I+Q+ II+ VN +P AGW A+RNP FSNYT+ QFKH+LGVKP P+ L VPVKT
Sbjct: 27 DNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALSNVPVKT 86
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 151
+ +SL+LPK FDARSAW +CSTI IL+QGHCGSCWAFGAVE L DRFCIH M++ LSV
Sbjct: 87 YSRSLELPKEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQDRFCIHLNMSILLSV 146
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
NDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEPAYPTPKC
Sbjct: 147 NDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKC 206
Query: 212 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
+KC ++NQ+W+ KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK
Sbjct: 207 EKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK 266
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
HITG +MGGHAVKLIGWGTSD GEDYW+
Sbjct: 267 HITGGIMGGHAVKLIGWGTSDAGEDYWL 294
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 206/295 (69%), Positives = 239/295 (81%), Gaps = 2/295 (0%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
HL + LL+ + Q A +SK KL S ILQ+ I+KEVNENP AGWKAA N +F+N
Sbjct: 10 HLLASVFLLLFSSFNLQGIAAENLSKQKLTSLILQNEIVKEVNENPNAGWKAAFNDRFAN 69
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
TV +FK LLGV TPK LGVP+ HD SLKLPK FDAR+AW C++I RIL GHCG
Sbjct: 70 ATVAEFKRLLGVIQTPKTAYLGVPIVRHDLSLKLPKEFDARTAWSHCTSIRRIL--GHCG 127
Query: 125 SCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
SCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVV
Sbjct: 128 SCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVV 187
Query: 185 TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 244
T+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW SKHY + AYRIN DP+DIMA
Sbjct: 188 TQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMA 247
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG +GGHAVKLIGWGTSDDGEDYW+
Sbjct: 248 EVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWL 302
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 199/263 (75%), Positives = 227/263 (86%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q+ II+ VN++P AGW A NP F+NYT+ QFKH+LGVKPTP GLL GVP+KTH KS
Sbjct: 40 IIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKTHPKSA 99
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
LPK FDAR+ W CSTI ILDQGHCG+CWAF AVE+L DRFCIH M++SLSVNDLLA
Sbjct: 100 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLA 159
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC
Sbjct: 160 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCK 219
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
+NQ+W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG
Sbjct: 220 VENQVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 279
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
VMGGHAVKLIGWGTSD GEDYW+
Sbjct: 280 VMGGHAVKLIGWGTSDAGEDYWL 302
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 199/263 (75%), Positives = 226/263 (85%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q+ II+ VN++P AGW A NP F+NYT+ QFKH+LGVKPTP GLL GVP+KTH KS
Sbjct: 40 IIQEDIIQTVNDHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKTHPKSA 99
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
LPK FDAR+ W CSTI ILDQGHCG+CWAF AVE+L DRFCIH M++SLSVNDLLA
Sbjct: 100 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLA 159
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC
Sbjct: 160 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCK 219
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
+NQ+W+ +KH S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG
Sbjct: 220 VENQVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 279
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
VMGGHAVKLIGWGTSD GEDYW+
Sbjct: 280 VMGGHAVKLIGWGTSDAGEDYWL 302
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 194/263 (73%), Positives = 225/263 (85%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q+ II+ +N +P AGW A +N F+NYT+ QFKH+LGVKPTP GLL GVP KT+ +S
Sbjct: 37 IIQNDIIETINNHPNAGWTAGQNSYFANYTIAQFKHILGVKPTPPGLLRGVPTKTYSRST 96
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
LPK FDARS W CSTI ILDQGHCGSCWAFGAVE L DRFCIH MN+SLSVNDL+A
Sbjct: 97 DLPKEFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLVA 156
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGF+CGDGCDGGYPISAW+Y V +GVVT+ECDPYFD GC HPGCEPAYPTP C +KC
Sbjct: 157 CCGFMCGDGCDGGYPISAWQYLVENGVVTDECDPYFDQVGCKHPGCEPAYPTPACEKKCK 216
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
+NQ+W+ KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVY+HITG+
Sbjct: 217 VQNQVWQEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGE 276
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
+MGGHAVKLIGWGTS DG+DYW+
Sbjct: 277 MMGGHAVKLIGWGTSADGKDYWL 299
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 198/263 (75%), Positives = 222/263 (84%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q II+ VN +P AGW A NP +NYT+ QFKH+LGVKPTP GLL GV KTH +S
Sbjct: 35 IIQKGIIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLLAGVRTKTHPRSE 94
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
+LPK FDARS W CSTI +ILDQGHCGSCWAFGAVE L DRFCIH MN+SLS NDL+A
Sbjct: 95 QLPKEFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNISLSANDLVA 154
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGF+CGDGCDGGYPISAW+YFV +GVVTEECDPYFD GC HPGCEPAYPTP C +KC
Sbjct: 155 CCGFMCGDGCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCKHPGCEPAYPTPVCEKKCK 214
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
+NQ+W+ KH+SI AY++NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG
Sbjct: 215 VQNQVWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGG 274
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
VMGGHAVKLIGWGTSD GEDYW+
Sbjct: 275 VMGGHAVKLIGWGTSDAGEDYWL 297
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 206/315 (65%), Positives = 239/315 (75%), Gaps = 20/315 (6%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
HL + LL+ + Q A +SK KL S ILQ+ I+KEVNENP AGWKAA N +F+N
Sbjct: 10 HLLASVFLLLFSSFNLQGIAAENLSKQKLTSLILQNEIVKEVNENPNAGWKAAFNDRFAN 69
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD----- 119
TV +FK LLGV TPK LGVP+ HD SLKLPK FDAR+AW C++I RIL
Sbjct: 70 ATVAEFKRLLGVIQTPKTAYLGVPIVRHDLSLKLPKEFDARTAWSHCTSIRRILVGYILN 129
Query: 120 ---------------QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGD 164
GHCGSCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG
Sbjct: 130 NVLLWSTITLWFWFLLGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGF 189
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
GC+GG+P+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW
Sbjct: 190 GCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGE 249
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
SKHY + AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG +GGHAVK
Sbjct: 250 SKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVK 309
Query: 285 LIGWGTSDDGEDYWV 299
LIGWGTSDDGEDYW+
Sbjct: 310 LIGWGTSDDGEDYWL 324
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 217/295 (73%), Positives = 246/295 (83%), Gaps = 1/295 (0%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
HL LLI I T A G +SK KL S ILQ+ I+KEVNENP AGWKA+ N +F+N
Sbjct: 11 HLAFVFLLLISSFILQGT-AAGNLSKQKLTSLILQNEIVKEVNENPNAGWKASLNDRFAN 69
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
TV +FK LLGVKPTPK LGVP+ HD SLKLPK FDAR+AW QC++I RILDQGHCG
Sbjct: 70 ATVAEFKRLLGVKPTPKTAYLGVPIVRHDLSLKLPKEFDARTAWSQCTSIPRILDQGHCG 129
Query: 125 SCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
SCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVV
Sbjct: 130 SCWAFGAVESLSDRFCIKYNLNVSLSANDVVACCGLLCGLGCNGGFPMGAWLYFKYHGVV 189
Query: 185 TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 244
TEECDPYFD+TGCSHPGCEP YPTPKCVRKCV +NQLW SKHY +SAYRIN DP+DIMA
Sbjct: 190 TEECDPYFDNTGCSHPGCEPGYPTPKCVRKCVSENQLWGESKHYGVSAYRINHDPQDIMA 249
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+
Sbjct: 250 EVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWL 304
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 195/263 (74%), Positives = 220/263 (83%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q II+ +N++P AGW A N +NYT+ QFKH+LGVKPTP GLL GVP KT+ KS
Sbjct: 33 IIQKDIIETINKHPNAGWTAGHNAYLANYTIEQFKHILGVKPTPPGLLAGVPTKTYSKSE 92
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
+LPK FDARS W CSTI ILDQGHCGSCWAFGAVE L DRFCIH +N+SLS NDL+A
Sbjct: 93 ELPKQFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHQNINISLSANDLVA 152
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGF+CGDGCDGGYPI AW+YFV GVVTEECDPYFD GC HPGCEPAY TPKC +KC
Sbjct: 153 CCGFMCGDGCDGGYPIKAWQYFVQSGVVTEECDPYFDQVGCKHPGCEPAYDTPKCEKKCK 212
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
+NQ+W KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG
Sbjct: 213 VQNQVWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGG 272
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
VMGGHAVKLIGWGTSD GEDYW+
Sbjct: 273 VMGGHAVKLIGWGTSDAGEDYWL 295
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 430 bits (1106), Expect = e-118, Method: Compositional matrix adjust.
Identities = 192/264 (72%), Positives = 221/264 (83%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
I+Q+ II+ +N++P AGW A NP F+NYT+ QFKH+LGVKPTP LL GVP K++ +S
Sbjct: 35 RIIQNDIIETINKHPNAGWTAGHNPYFANYTITQFKHILGVKPTPPALLAGVPTKSYSRS 94
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 155
+KLP FDARS W CSTI ILDQGHCGSCWAFGAVE L DRFCIH MN+SLSVNDLL
Sbjct: 95 MKLPTEFDARSQWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLL 154
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 215
ACCGFLCG GC+GGYPISAWRYF GVVT+ECDPYFD GC HPGCEPAY TPKC +KC
Sbjct: 155 ACCGFLCGSGCNGGYPISAWRYFRRKGVVTDECDPYFDQVGCKHPGCEPAYRTPKCEKKC 214
Query: 216 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 275
+N++W+ KH+S+ AYR++S+P DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG
Sbjct: 215 KVQNEVWKEQKHFSVDAYRVHSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITG 274
Query: 276 DVMGGHAVKLIGWGTSDDGEDYWV 299
VMGGHAVKLIGWGTSD GEDYW+
Sbjct: 275 GVMGGHAVKLIGWGTSDAGEDYWL 298
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 430 bits (1106), Expect = e-118, Method: Compositional matrix adjust.
Identities = 203/318 (63%), Positives = 234/318 (73%), Gaps = 45/318 (14%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV------------------- 67
+++K S I+QD IIK +N++P AGW AARNP F+NYTV
Sbjct: 31 LMTKEGGSSRIIQDDIIKAINKHPNAGWTAARNPYFANYTVNNNTLLLLFSFFFLRGHLP 90
Query: 68 --------------------------GQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 101
QFKH+LGVKPTP +L VPVKT+ +SL LPK
Sbjct: 91 VVVSIAYIKTFISCLFGGLNNPPVQTAQFKHILGVKPTPHSVLNDVPVKTYPRSLMLPKE 150
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFL 161
FDARSAW QC+TI ILDQGHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+
Sbjct: 151 FDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFM 210
Query: 162 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 221
CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+
Sbjct: 211 CGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQV 270
Query: 222 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 281
W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGH
Sbjct: 271 WLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGH 330
Query: 282 AVKLIGWGTSDDGEDYWV 299
AVKLIGWGT+D GEDYW+
Sbjct: 331 AVKLIGWGTTDAGEDYWL 348
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 426 bits (1094), Expect = e-117, Method: Compositional matrix adjust.
Identities = 193/238 (81%), Positives = 214/238 (89%)
Query: 62 FSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 121
F+N TV +FK LLGVKPTPK LGVP+ +HD SLKLPK FDAR+AW QC++I RILDQG
Sbjct: 1 FANATVAEFKRLLGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQG 60
Query: 122 HCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
HCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HH
Sbjct: 61 HCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHH 120
Query: 182 GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED 241
GVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV NQLWR SKHY +SAY++ S P+D
Sbjct: 121 GVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDD 180
Query: 242 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
IMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+
Sbjct: 181 IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWL 238
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 191/258 (74%), Positives = 216/258 (83%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 101
II+ VN +P AGW A NP +NYT+ QFKH+LGVKPTP GL V KTH +S +LPK
Sbjct: 1 IIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLRAAVRTKTHSRSEQLPKV 60
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFL 161
FDARS W CSTI +ILDQGHCGSCWAFGAVE L DRFCIH MN++LS NDL+ACCGF+
Sbjct: 61 FDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCIHHNMNITLSANDLVACCGFM 120
Query: 162 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 221
CGDGCDGGYPISAW+YFV +GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+
Sbjct: 121 CGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQV 180
Query: 222 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 281
W KH+SI+AY++NSDP DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG VMGGH
Sbjct: 181 WEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGH 240
Query: 282 AVKLIGWGTSDDGEDYWV 299
AVKLIGWGTSD GEDYW+
Sbjct: 241 AVKLIGWGTSDAGEDYWL 258
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 194/265 (73%), Positives = 222/265 (83%), Gaps = 3/265 (1%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q II+ VN++P AGW A NP F+NYT+ QFKH+LGVKPTP GLL GVP+K H + +
Sbjct: 37 IIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE-M 95
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
LPK FDAR+ W CSTI ILDQGHCG+CWAF AVEAL DRFCIH M++SLSVNDLLA
Sbjct: 96 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLA 155
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC
Sbjct: 156 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCK 215
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHIT 274
+NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT + DFAHYKSGVYKHIT
Sbjct: 216 VENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHIT 275
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G VMGGHAVKLIGWGTSD GEDYW+
Sbjct: 276 GGVMGGHAVKLIGWGTSDAGEDYWL 300
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 197/300 (65%), Positives = 237/300 (79%), Gaps = 5/300 (1%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVS-KLKLDSHILQDSIIKEVNENPKAGWKAARN 59
MAS LF CL++L +++ A V S + IL++ I++E+N +PKAGWKA N
Sbjct: 1 MASRLLF---CLMVLVAMAATPQASLVESFPAQSQDRILKEPIVEEINRHPKAGWKAGMN 57
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
+FSN+TVGQFK LLGV PTP+ LL VPV+T+ K L LPK FDAR AWPQC+++ ILD
Sbjct: 58 SRFSNHTVGQFKRLLGVLPTPRNLLENVPVRTYPKGLNLPKQFDARKAWPQCTSVRTILD 117
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
QGHCGSCWAFGAVEALSDRFCIH+ +N++LS NDL+ACCGF CGDGCDGGYP+SAW+YF+
Sbjct: 118 QGHCGSCWAFGAVEALSDRFCIHYKVNVTLSENDLVACCGFRCGDGCDGGYPLSAWQYFI 177
Query: 180 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
GVVT ECDPYFD GC HPGCEP YPTP+CV++C +NQ W NSK +S +AYRI S P
Sbjct: 178 STGVVTAECDPYFDEAGCQHPGCEPLYPTPQCVKQCKDENQNWGNSKRFSATAYRITSKP 237
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DIMAE+Y GPVEV F VYEDFAHYKSGVYK+ITGD +GGHAVKLIGWGT ++G DYW+
Sbjct: 238 YDIMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGT-ENGTDYWL 296
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 422 bits (1085), Expect = e-116, Method: Compositional matrix adjust.
Identities = 204/300 (68%), Positives = 231/300 (77%), Gaps = 34/300 (11%)
Query: 3 SSHLFLTTCLLILGVI---SSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN 59
+S L+L T L++ + SQ A VSKLKL+S ILQDSI+++VNENP AGW+A N
Sbjct: 2 ASPLYLGTLFLLVAALFTFRSQVIAVEPVSKLKLNSRILQDSIVQKVNENPNAGWEATMN 61
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
PQFSNY+VG+FK+LLGVKPTP L GVP+
Sbjct: 62 PQFSNYSVGEFKYLLGVKPTPGKELRGVPL------------------------------ 91
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
GHCGSCWAFGAVE+LSDRFCIH+GMNLSLSVNDLLACCG++CGDGCDGGYPI AWRYFV
Sbjct: 92 -GHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRYFV 150
Query: 180 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
GVVTEECDPYFD GCSHPGCEP +PTPKC RKC KN+LW SKH+S++AYRI+SDP
Sbjct: 151 QSGVVTEECDPYFDDIGCSHPGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDP 210
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
IMAE+ NGPVEV+FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW+
Sbjct: 211 HSIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWL 270
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 422 bits (1085), Expect = e-116, Method: Compositional matrix adjust.
Identities = 194/299 (64%), Positives = 233/299 (77%), Gaps = 3/299 (1%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
MAS LF T L+ + + E +K + IL++ I++E+N +P AGWKA N
Sbjct: 1 MASRLLFCLTVLVAMAATLQASLLESFPAKNQ--DRILKEPIVEEINRHPNAGWKAGMNS 58
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+FSN+TVGQFK LLGV PTP+ L VPV T+ K + LPK FDAR AWPQC+++ ILDQ
Sbjct: 59 RFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGINLPKQFDAREAWPQCTSVQTILDQ 118
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
GHCGSCWAFGAVEALSDRFCIH +N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+
Sbjct: 119 GHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPISAWQYFIS 178
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 240
GVVT ECDPYFD GC HPGCEP YPTP+CV++C +NQ W NSK +S +AYRI+S P
Sbjct: 179 TGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPY 238
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DIMAE+Y NGPVEVSF+VYEDFAHYKSGVYK+ GD MGGHAVKL+GWGT +DG DYW+
Sbjct: 239 DIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWL 296
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 194/265 (73%), Positives = 222/265 (83%), Gaps = 3/265 (1%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q II+ VN++P AGW A NP F+NYT+ QFKH+LGVKPTP GLL GVP+K H + +
Sbjct: 37 IIQKDIIQTVNKHPNAGWTAGHNPYFANYTIEQFKHILGVKPTPPGLLAGVPIKIHPE-M 95
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
LPK FDAR+ W CSTI ILDQGHCG+CWAF AVEAL DRFCIH M++SLSVNDLLA
Sbjct: 96 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLA 155
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC
Sbjct: 156 CCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCK 215
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHIT 274
+NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT + DFAHYKSGVYKHIT
Sbjct: 216 VENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHIT 275
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G VMGGHAVKLIGWGTSD GEDYW+
Sbjct: 276 GGVMGGHAVKLIGWGTSDAGEDYWL 300
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 193/299 (64%), Positives = 232/299 (77%), Gaps = 3/299 (1%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M S LF T L+ + + E +K + IL++ I++E+N +P AGWKA N
Sbjct: 1 MTSRLLFCLTVLVAMAATLQASLLESFPAKNQ--DRILKEPIVEEINRHPNAGWKAGMNS 58
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+FSN+TVGQFK LLGV PTP+ L VPV T+ K + LPK FDAR AWPQC+++ ILDQ
Sbjct: 59 RFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGMNLPKQFDAREAWPQCTSVQTILDQ 118
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
GHCGSCWAFGAVEALSDRFCIH +N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+
Sbjct: 119 GHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPISAWQYFIS 178
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 240
GVVT ECDPYFD GC HPGCEP YPTP+CV++C +NQ W NSK +S +AYRI+S P
Sbjct: 179 TGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPY 238
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DIMAE+Y NGPVEVSF+VYEDFAHYKSGVYK+ GD MGGHAVKL+GWGT +DG DYW+
Sbjct: 239 DIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWL 296
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 191/265 (72%), Positives = 220/265 (83%), Gaps = 2/265 (0%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q+ II+ VN +P AGW A NP +NYT+ QFKH+LGVKPTP GLL GVP KT+ +S
Sbjct: 34 IIQEDIIRTVNSHPNAGWTAGHNPYLANYTIEQFKHILGVKPTPPGLLAGVPTKTYSRSE 93
Query: 97 K--LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 154
K LPK FDARS W CSTI +ILDQGHCG+CWAFGAVE L DRFCIH +N+SLSVNDL
Sbjct: 94 KAELPKEFDARSKWSGCSTIGKILDQGHCGACWAFGAVECLQDRFCIHHSVNVSLSVNDL 153
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 214
+ACCGFLCGDGCDGGYPI AW+YFV +GVVT+ECDP+FD GC HPGCEPAYPTP C +K
Sbjct: 154 VACCGFLCGDGCDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQHPGCEPAYPTPVCEKK 213
Query: 215 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
C +NQ+W KH+SI AY++NSDP DIMAE+YKNGPVEVSF +YEDFAHYKSGVYK IT
Sbjct: 214 CKVQNQVWEEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQIT 273
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G ++GGHA KLIGWGTSD GEDYW+
Sbjct: 274 GRMVGGHAAKLIGWGTSDAGEDYWL 298
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/301 (63%), Positives = 236/301 (78%), Gaps = 8/301 (2%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSH--ILQDSIIKEVNENPKAGWKAAR 58
MA++ L + T +L+ + S G+ S L+S ILQ S ++ +N++P AGWKAA
Sbjct: 1 MATTILTVFTTVLLACIKVS-----GLESFHSLESQRPILQKSFVEHINKHPNAGWKAAM 55
Query: 59 NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
+ +FSNYTV +F HLLGV PTP+ LL VPV+ + K LKLP FDAR AWP C++ IL
Sbjct: 56 STRFSNYTVREFAHLLGVLPTPQKLLETVPVRVYPKGLKLPSKFDARKAWPHCTSTRSIL 115
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQGHCGSCWAF AVEALSDRFCIHF +N +LS NDL+ACCGF CG GC+GG+P+SAWRYF
Sbjct: 116 DQGHCGSCWAFAAVEALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGFPLSAWRYF 175
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
GVVT+ECDPYFD+ GC+HPGCEP+YPTP+CV+ C K NQ W +SKHYS +AYRI SD
Sbjct: 176 SRRGVVTDECDPYFDNDGCNHPGCEPSYPTPRCVKNC-KDNQRWSHSKHYSANAYRIKSD 234
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
P +IMAE++ NGPVEVSF+VYEDFAHY++GVYKH+ G +GGHAVKLIGWGT+DDG DYW
Sbjct: 235 PYNIMAEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYW 294
Query: 299 V 299
+
Sbjct: 295 L 295
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 174/271 (64%), Positives = 207/271 (76%), Gaps = 3/271 (1%)
Query: 30 KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV-P 88
KL L +LQ SI+ VN +P AGWKA N +F N+TV FK L GV P + + P
Sbjct: 30 KLDLGRPLLQKSIVDIVNNDPNAGWKAGFNERFINHTVRDFKRLCGVLPKSSEEVQPLRP 89
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 148
+++H ++L LPK FDAR AWPQCS+I ILDQGHCGSCWAFGAVEAL+DRFCI N+S
Sbjct: 90 LRSHPRTLDLPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILNNENVS 149
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 208
LS NDL+ACC CG GCDGGYP +AW YF GVVT +CDPYFD GC HPGCEP Y T
Sbjct: 150 LSENDLVACCS-SCGFGCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEYDT 208
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
P CV++CV N+ WR+SKH+++ Y +NSD DI AEIYKNGPVEVS+TVYEDFAHYKSG
Sbjct: 209 PVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSG 267
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH+ G+V+GGHAVK IGWGT+DDG+DYW+
Sbjct: 268 VYKHVFGEVLGGHAVKFIGWGTTDDGKDYWI 298
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 365 bits (937), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 177/289 (61%), Positives = 211/289 (73%), Gaps = 5/289 (1%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL V AE KL L +LQ SI+ VN +P AGWKA N +F N+TV FK
Sbjct: 3 LLFSAVAQGVRVAES--GKLDLGRPLLQKSIVDIVNNDPNAGWKAGFNERFINHTVRDFK 60
Query: 72 HLLGVKPTPKGLLLGV-PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
L GV P + + P+++H ++L LPK FDAR AWPQC++I ILDQGHCGSCWAFG
Sbjct: 61 RLCGVLPKSSEEVQPLRPLRSHPRTLDLPKHFDAREAWPQCASIKTILDQGHCGSCWAFG 120
Query: 131 AVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
AVEAL+DRFCI N+SLS NDL+ACC CG GC+GGYP +AW YF GVVT +CDP
Sbjct: 121 AVEALTDRFCILNNENVSLSENDLVACCS-SCGFGCEGGYPYAAWEYFAQTGVVTSQCDP 179
Query: 191 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 250
YFD GC HPGCEP Y TP CV++CV N+ WR+SKH+++ Y +NSD DI AEIYKNG
Sbjct: 180 YFDGKGCKHPGCEPEYDTPVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNG 238
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
PVEVS+TVYEDFAHYKSGVYKH+ G V+GGHAVK IGWGT+DDG+DYW+
Sbjct: 239 PVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWI 287
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 170/290 (58%), Positives = 204/290 (70%), Gaps = 3/290 (1%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL+ VI + A L+ I Q ++ +VN +P+A WKA N +F +T+ K
Sbjct: 7 LLLCSVILAAQAARVEPDLLESKRLIHQQLLVDKVNAHPRATWKAGFNDRFEGHTIEHLK 66
Query: 72 HLLGVKPTPKGLLL-GVPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
+ G K TP L + TH K L LPK FDAR W CSTI ILDQGHCGSCWAF
Sbjct: 67 KICGAKMTPANELEPSIERVTHKHKKLVLPKEFDARKHWGHCSTIGAILDQGHCGSCWAF 126
Query: 130 GAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 189
GA E+L+DRFCIH ++SLS NDLLACCGF CGDGCDGGYPI AWRYF GVVT +CD
Sbjct: 127 GAAESLTDRFCIHMNESVSLSENDLLACCGFECGDGCDGGYPIRAWRYFKRTGVVTSKCD 186
Query: 190 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
PYFD GC HPGC P Y TPKCV+ CV ++LW SKH S++AY ++ +PED+MAE+Y N
Sbjct: 187 PYFDQIGCGHPGCYPTYRTPKCVKHCV-DDELWVKSKHLSVNAYEVSKEPEDLMAELYTN 245
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GP+EVSF V+EDFAHYK+GVYKH+ G +GGHAVKLIGWGT+DDG DYW
Sbjct: 246 GPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWT 295
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 347 bits (890), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 211/295 (71%), Gaps = 3/295 (1%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L L + L++ G+I + A L+ + I Q S++ ++N +P A WKA N +F+ +
Sbjct: 7 LKLGSVLVLCGLILASQAARPEPDLLENNRLIHQQSLVDKINAHPGATWKAGLNDRFAKH 66
Query: 66 TVGQFKHLLGVKPTPKGLLL-GVPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHC 123
TV K + G K TP + + TH K+L LP FDAR W CSTI ILDQGHC
Sbjct: 67 TVEHLKKMCGAKMTPANEVEPSIERVTHKHKNLDLPTEFDARKHWSHCSTIGDILDQGHC 126
Query: 124 GSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
GSCWAFGAVE+L+DRFCIH ++SLS NDLLACCGF CGDGC+GGYPI AW+YF GV
Sbjct: 127 GSCWAFGAVESLTDRFCIHLNESVSLSENDLLACCGFECGDGCEGGYPIRAWQYFKRTGV 186
Query: 184 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 243
VT +CDPYFD GC HPGC P Y TPKC ++CV ++LW +SKH +SAY ++ +PE++M
Sbjct: 187 VTSKCDPYFDQKGCGHPGCYPTYDTPKCFKRCV-DDELWVSSKHLGVSAYEVSMEPEELM 245
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
AE++ NGP+EV+F V+EDFAHYK+GVYKH+ G +GGHAVKL+GWGT+DDG DYW
Sbjct: 246 AELFTNGPIEVAFDVFEDFAHYKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVDYW 300
>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
unguiculata]
Length = 195
Score = 343 bits (879), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 155/189 (82%), Positives = 173/189 (91%)
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
VPV +H KSLKLP +FDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF +N
Sbjct: 7 VPVISHPKSLKLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVN 66
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 206
+SLSVNDLLACCGFLCG GC+GGYP+SAWRY +HGVVTEECDPYFD TGCSHPGCEPAY
Sbjct: 67 ISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPAY 126
Query: 207 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
TPKCV+KCV NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHYK
Sbjct: 127 RTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHYK 186
Query: 267 SGVYKHITG 275
SGVYKH+TG
Sbjct: 187 SGVYKHVTG 195
>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
Length = 201
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 154/192 (80%), Positives = 173/192 (90%)
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM 145
+ V +H KSLKLP +FDAR+AW QCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHF +
Sbjct: 6 ALTVISHPKSLKLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDV 65
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 205
N+SLSVNDLLACCGFLCG GC+GGYP+SAWRY +HGVVTEECDPYFD TGCSHPGCEPA
Sbjct: 66 NISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPA 125
Query: 206 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
Y TPKCV+KCV NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 126 YRTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 185
Query: 266 KSGVYKHITGDV 277
KSGVYKH+TG V
Sbjct: 186 KSGVYKHVTGYV 197
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 337 bits (864), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 157/265 (59%), Positives = 192/265 (72%), Gaps = 3/265 (1%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL-GVPVKTHD-K 94
I Q +++ +VN +P A W A N +F+ +T+ K + G TP L + +H K
Sbjct: 40 IHQQALVDKVNAHPGATWTAGFNERFAKHTIEHLKKMCGAILTPANKLEPSIETISHKHK 99
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 154
L LPK FDAR W C TI IL QGHCGSCWAFGAVE+L+DRFCIH ++SLS NDL
Sbjct: 100 KLYLPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTDRFCIHLNESVSLSENDL 159
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 214
LACCGF CG GC+GGYPI AW+YF H GVVT +CDPYFD GC+HPGC P Y TPKC ++
Sbjct: 160 LACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGCAHPGCYPTYETPKCEKQ 219
Query: 215 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
CV ++ W SKH ++AY ++ +PED+MAE+Y NGPVEV+F VYEDFAHYK+GVYKH+
Sbjct: 220 CV-DDEFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLF 278
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G MGGHAVKLIGWGT+DDG DYW
Sbjct: 279 GGFMGGHAVKLIGWGTTDDGVDYWT 303
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 323 bits (829), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 143/179 (79%), Positives = 161/179 (89%)
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
GHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV
Sbjct: 1 GHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVR 60
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 240
+GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+W KH+S++AYR+NSDP
Sbjct: 61 NGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPH 120
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+
Sbjct: 121 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWL 179
>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 218
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 151/208 (72%), Positives = 172/208 (82%), Gaps = 3/208 (1%)
Query: 13 LILG---VISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 69
L+LG ++ Q AE +S+ KL+S ILQDSI+K VNEN +AGWKAA NPQ SN+TV Q
Sbjct: 10 LLLGAFFILILQVAAEKPISEAKLESAILQDSIVKRVNENAEAGWKAAFNPQLSNFTVSQ 69
Query: 70 FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
FK LLGVKP +G L G+PV TH + +LPK FDAR AWPQCSTI +ILDQGHCGSCWAF
Sbjct: 70 FKRLLGVKPAREGDLEGIPVLTHPRLKELPKEFDARKAWPQCSTIGKILDQGHCGSCWAF 129
Query: 130 GAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 189
GAVE+LSDRFCIH+ +++SLSVNDLLACC FLCG GCDGGYPI+AWRYF GVVTEECD
Sbjct: 130 GAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECD 189
Query: 190 PYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
PYFD+TGCSHPGCEP YPTPKC RKCVK
Sbjct: 190 PYFDTTGCSHPGCEPLYPTPKCHRKCVK 217
>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
Length = 206
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 134/164 (81%), Positives = 144/164 (87%)
Query: 39 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 98
Q+SI KEVNENP AGWKAA NP+FSN TVGQFK LLGVK TP+ L +PV TH KSL L
Sbjct: 43 QESIAKEVNENPGAGWKAAINPRFSNSTVGQFKRLLGVKQTPRNELSSIPVVTHPKSLNL 102
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACC 158
PK FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LSDRFCIHFG+++ LSVNDLLACC
Sbjct: 103 PKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFGVDVPLSVNDLLACC 162
Query: 159 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 202
GFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD GCSHPGC
Sbjct: 163 GFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 206
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 283 bits (723), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 126/155 (81%), Positives = 141/155 (90%)
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 204
M++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEP
Sbjct: 1 MSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEP 60
Query: 205 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
AYPTPKC +KC ++NQ+W+ KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 61 AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAH 120
Query: 265 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+
Sbjct: 121 YKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWL 155
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 109/146 (74%), Positives = 122/146 (83%)
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
L F G GGYP+ AWRY HHGVVTEECDPYFD GCSHPGCEPAY TPKCVR
Sbjct: 9 FLHAVAFSVGLAVMGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVR 68
Query: 214 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
KCVK NQ+W+ SKH+S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHI
Sbjct: 69 KCVKGNQIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHI 128
Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYWV 299
TG +GGHAVKLIGWGT+D+GEDYW+
Sbjct: 129 TGSQLGGHAVKLIGWGTTDEGEDYWL 154
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 107/134 (79%), Positives = 121/134 (90%)
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGGYPISAW+YF HHGVVTEECDPYFD GCSHPGCEP Y TPKCVRKCVK NQ+W+ S
Sbjct: 1 CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
KHYS+ Y++NSDP++IM E+YKNGPVEV+F+VYEDFAHYKSGVYKHITG +GGHAVKL
Sbjct: 61 KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120
Query: 286 IGWGTSDDGEDYWV 299
GWGTSD+GEDYW+
Sbjct: 121 NGWGTSDEGEDYWL 134
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 177/308 (57%), Gaps = 41/308 (13%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 EE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
C PY C H P C TPKC + C + ++ KHY +
Sbjct: 170 GGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYN 228
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG
Sbjct: 229 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV- 287
Query: 292 DDGEDYWV 299
++G YW+
Sbjct: 288 ENGTPYWL 295
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 175/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW + G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFLTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 233 bits (593), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H + D ++ VN+ W+A N F N +G
Sbjct: 10 CLLVLANARSRP-----------SFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMGYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H + D ++ VN+ W+A N F N +G
Sbjct: 10 CLLVLANARSRP-----------SFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMGYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 168/282 (59%), Gaps = 28/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
H L D ++ VN+ W+A N F N V K L G P P ++
Sbjct: 24 HPLSDELVNYVNKR-NTTWQAGHN--FYNVDVSYLKKLCGTFLGGPKPPQRVM------F 74
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ LKLP+SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+
Sbjct: 75 TEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 134
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 198
DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 135 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 194
Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C + ++ KHY ++Y +++ DIMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFS 254
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 255 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 295
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H L D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGAFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H L D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 103/128 (80%), Positives = 115/128 (89%)
Query: 172 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 231
+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW SKHY +
Sbjct: 1 MGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVG 60
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG +GGHAVKLIGWGTS
Sbjct: 61 AYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTS 120
Query: 292 DDGEDYWV 299
DDGEDYW+
Sbjct: 121 DDGEDYWL 128
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 168/282 (59%), Gaps = 28/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTH 92
H L D ++ VN+ W+A N F N + K L G P P ++
Sbjct: 8 HPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYLKRLCGTFLGGPKPPQRVM------F 58
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ LKLP SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+
Sbjct: 59 TEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVS 118
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCS 198
DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 119 AEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGS 178
Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+
Sbjct: 179 RPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFS 238
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 239 VYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 279
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 175/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H L D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 176/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H L D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC T+ I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTVKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 127/284 (44%), Positives = 172/284 (60%), Gaps = 33/284 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-----KPTPKGLLLGVPVKTH 92
L D ++ VN+ WKA N F N + K L G K P+ ++L
Sbjct: 26 LSDEMVNYVNK-LNTTWKAGHN--FRNVDMSYVKKLCGTVMGGAKQLPQRVMLA------ 76
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D +KLP++FDAR WP+C TI I DQG CGSCWAFGAVEA+SDR C+H + + +S
Sbjct: 77 DDDMKLPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHTNGYITIEVS 136
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PG 201
DLL+CCG CG+GC+GG+P AW+Y++ G+V+ C PY C H G
Sbjct: 137 AEDLLSCCGLQCGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNG 195
Query: 202 CEPAYP-----TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
PA TPKC +KC + +++ KHY +AY + S ++IMAEIYKNGPVE +
Sbjct: 196 SRPACTGEGGDTPKCNKKCEAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGA 255
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VY DF YKSGVY+H+TGD++GGHA++++GWG +DG YW+
Sbjct: 256 FIVYADFLQYKSGVYQHVTGDMLGGHAIRVLGWGV-EDGVPYWL 298
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 175/307 (57%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H + D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 126/283 (44%), Positives = 166/283 (58%), Gaps = 30/283 (10%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 93
H L D ++ VN+ W+A N F N + K L G LG P
Sbjct: 15 HPLSDELVNYVNKQ-NTTWQAGHN--FYNVDLSYLKRLCGT-------FLGGPKPPQRVK 64
Query: 94 --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 151
+ L LP+SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V
Sbjct: 65 FAEDLNLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 124
Query: 152 N--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 197
+ DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 125 SAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNG 184
Query: 198 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
S P C TPKC + C + ++ KHY +Y ++++ DIMAEIYKNGPVE +F
Sbjct: 185 SRPPCTGEGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAF 244
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 245 SVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 286
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 174/307 (56%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H L D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPLSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGP E +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 125/276 (45%), Positives = 164/276 (59%), Gaps = 19/276 (6%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLP 99
++I+K VN+ WKA+ N + Y K L GVK G + + +K+P
Sbjct: 55 NAIVKTVNK-ANTTWKASLNFDPTYYVPEDLKLLCGVKEDKHGYSKLETSYHNLEGIKIP 113
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 157
FD+R WP C +IS I DQG CGSCWAFGAVEA+SDR+CI + + +S DLL+C
Sbjct: 114 NQFDSRKQWPHCPSISYIRDQGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDLLSC 173
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEP 204
CGF CGDGC+GG+P SAW+Y+ G+VT C PY C H P C
Sbjct: 174 CGFECGDGCNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPY-QIKPCEHHVPGDRPKCSE 232
Query: 205 AYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
TP CV KC + + KHY +S+Y + SDP I EI +GPVE +FTVY DF
Sbjct: 233 GGGTPSCVSKCKGNTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFP 292
Query: 264 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVYKH+TG V+GGHA++++GWG S++G YW+
Sbjct: 293 TYKSGVYKHVTGGVLGGHAIRILGWG-SENGVAYWL 327
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 174/307 (56%), Gaps = 39/307 (12%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H + D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSRCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 186 EE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY S P C TPKC + C + ++ KHY ++
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG +
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-E 288
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 289 NGTPYWL 295
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/283 (44%), Positives = 165/283 (58%), Gaps = 30/283 (10%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 93
H L + ++ VN+ W+A N F N + K L G LG P
Sbjct: 36 HPLSEELVNYVNKQ-NTTWQAGHN--FYNVDLSYLKRLCGT-------FLGGPKPPQRVK 85
Query: 94 --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV 151
+ L LP+SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V
Sbjct: 86 FAEDLNLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 145
Query: 152 N--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 197
+ DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 146 SAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEHHVNG 205
Query: 198 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
S P C TPKC + C ++ KHY ++Y +++ DIMAEIYKNGPVE +F
Sbjct: 206 SRPPCTGEGDTPKCSKSCEPGYTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAF 265
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 266 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 307
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 127/287 (44%), Positives = 166/287 (57%), Gaps = 30/287 (10%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
K SH L D +I +N+ W+A RN F N + K L G +LG P
Sbjct: 20 KPSSHPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPNLP 69
Query: 92 H----DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +
Sbjct: 70 ERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRV 129
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 193
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 194 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 295
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 126/281 (44%), Positives = 166/281 (59%), Gaps = 26/281 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKT-HD 93
H L D +I +N+ W+A RN F N + K L G V PK +P +
Sbjct: 24 HPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPK-----LPERVGFS 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +N+ +S
Sbjct: 76 EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 135
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSH 199
DLL CCG CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 136 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSR 195
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV
Sbjct: 196 PPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTV 255
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 256 FSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 295
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 130/305 (42%), Positives = 171/305 (56%), Gaps = 32/305 (10%)
Query: 14 ILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 73
+L +S G S+L + L D ++ VN+ WKA N F N + L
Sbjct: 4 LLTTLSCLVMLTGAQSRLPFRA--LSDELVDYVNKR-NTTWKAGHN--FHNVDPSYLRRL 58
Query: 74 LGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
G LG P K+L LP+SFDAR WP C TI I DQG CGSCWAF
Sbjct: 59 CGT-------FLGGPKLPQRVQFAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAF 111
Query: 130 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
GAVEA+SDR CI +N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+
Sbjct: 112 GAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGG 171
Query: 188 -------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYR 234
C PY S P C TPKC + C + ++ KHY S+Y
Sbjct: 172 LYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKEDKHYGCSSYS 231
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
++ + ++IMAEIYKNGPVE +FTVY DF YKSGVY+H+TG++MGGHAV+++GWG +DG
Sbjct: 232 VSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDG 290
Query: 295 EDYWV 299
YW+
Sbjct: 291 TPYWL 295
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 133/295 (45%), Positives = 173/295 (58%), Gaps = 31/295 (10%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL- 83
E ++ L+ D+ D II +VN + WKA N SNY KH+ G+ T G
Sbjct: 29 EKLIENLEHDNF---DDIIAKVN-SADLSWKAGANFN-SNYAP---KHVAGLCGTIMGDD 80
Query: 84 LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH- 142
L V +D L+LP +FD+R AWP C +IS + DQG CGSCWAFGA EA+SDR CIH
Sbjct: 81 RLPVNHLLNDADLELPANFDSREAWPDCPSISEVRDQGSCGSCWAFGASEAISDRTCIHS 140
Query: 143 -FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG 201
LS DLL+CCG++CG+GC+GG+P +AW Y+V +G+V+ + TGC
Sbjct: 141 NAAFTFDLSSEDLLSCCGYVCGNGCNGGFPQAAWEYWVQNGLVS---GGLYHGTGCQPYA 197
Query: 202 CEPAY---------------PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAE 245
EP TPKC KCV + KHY AYRI ++ + IM E
Sbjct: 198 IEPCEHHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQDKHYGSVAYRIPANEKAIMNE 257
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
IYKNGPVE +F VYEDF YKSGVY H TG +GGHA++++GWG ++GE YW+C
Sbjct: 258 IYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWG-EENGEKYWLC 311
>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
Length = 166
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 103/126 (81%), Positives = 111/126 (88%)
Query: 77 KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
K TP+ L +PV TH KSL LPK FDAR+AWPQCSTI RILDQGHCGSCWAFGAVE+LS
Sbjct: 41 KQTPRNELSSIPVVTHPKSLNLPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLS 100
Query: 137 DRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG 196
DRFCIHFG+++ LSVNDLLACCGFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD G
Sbjct: 101 DRFCIHFGVDVPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIG 160
Query: 197 CSHPGC 202
CSHPGC
Sbjct: 161 CSHPGC 166
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/281 (44%), Positives = 166/281 (59%), Gaps = 26/281 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKT-HD 93
H L D +I +N+ W+A RN F N + K L G V PK +P +
Sbjct: 24 HPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPK-----LPERVGFS 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +N+ +S
Sbjct: 76 EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 135
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSH 199
DLL CCG CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 136 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSR 195
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV
Sbjct: 196 PPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTV 255
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 256 FSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 295
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 182/308 (59%), Gaps = 25/308 (8%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ LL++G++++ F + K H L D +I +N+ WKA RN S ++
Sbjct: 1 MLKSLLVVGLLAAVCFGREIHPKR---WHPLSDQMINFINK-INTTWKAGRNFDKS-ISM 55
Query: 68 GQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
+ L+GV P K L P H++ LP+SFDAR W C++I+ I DQ CGSC
Sbjct: 56 SYIRGLMGVNPKSKEYRL--PEFVHEEIPDDLPESFDAREKWSHCASINLIRDQSTCGSC 113
Query: 127 WAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAFGA EA+SDR CIH G+ +++S DLL CC CG GCDGGYP +AW Y+ G+V
Sbjct: 114 WAFGAAEAMSDRVCIHSEGGIQVNISAEDLLDCCDS-CGAGCDGGYPAAAWEYWKESGLV 172
Query: 185 TEE-------CDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
++ C PY + T S P C PTPKCV C K + +++ KH+
Sbjct: 173 SDGLYGTPDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQHDKHFGKK 232
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
Y I+S+ + I EI+KNGPVE FTVY DF YKSGVY+H +GDV+GGHA++++GWGT
Sbjct: 233 VYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRILGWGT- 291
Query: 292 DDGEDYWV 299
++G YW+
Sbjct: 292 ENGTPYWL 299
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 127/285 (44%), Positives = 167/285 (58%), Gaps = 33/285 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 93
H L D ++ VN+ W+A RN F N + K L G LG P
Sbjct: 24 HPLSDELVNYVNK-LNTTWQAGRN--FHNVDISYVKRLCGT-------YLGGPRLPQRVQ 73
Query: 94 --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ L LP+SFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 74 FAEDLDLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--- 199
S DLL+CCG LCG+GC+GGYP AW+Y+ G+V+ C PY C H
Sbjct: 134 SAEDLLSCCGPLCGEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPY-SIPPCEHHVN 192
Query: 200 ---PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
P C TPKC + C + ++ K+Y S+Y + S ++IMAEIYKNGPVE
Sbjct: 193 GTRPKCTGEGGDTPKCSKTCEPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEA 252
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+F+V+ DF YKSGVYKH+ G+V+GGHA++++GWG ++G YW+
Sbjct: 253 AFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWG-KENGVPYWL 296
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 179/308 (58%), Gaps = 25/308 (8%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ LL++G++++ F + K H L D +I +N+ WKA RN S ++
Sbjct: 1 MLKSLLVVGLLAAVCFGREIHPK---KWHPLSDQMINFINK-INTTWKAGRNFDKS-ISM 55
Query: 68 GQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
+ L+GV P K L V HD+ LP+SFDAR W C++I I DQ CGSC
Sbjct: 56 SYIRGLMGVHPKSKEYRLAEFV--HDEIPDDLPESFDAREKWSHCASIHLIRDQSTCGSC 113
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAFGA EA+SDR CIH + + +S DLL CC CG GC+GGYP +AW Y+ G+V
Sbjct: 114 WAFGAAEAMSDRVCIHSKGKIQVDISAEDLLDCCDS-CGAGCNGGYPAAAWEYWKESGLV 172
Query: 185 T-------EECDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
T + C PY + T S P C PTPKCV C K + +++ KH+
Sbjct: 173 TGGLYGTSDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQDDKHFGRK 232
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
Y I+SD + I EI+KNGPVE FTVY DF YKSGVY+H +GDV+GGHA++++GWGT
Sbjct: 233 VYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGT- 291
Query: 292 DDGEDYWV 299
++G YW+
Sbjct: 292 ENGTPYWL 299
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 183/317 (57%), Gaps = 47/317 (14%)
Query: 7 FLTT--CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
FL T CL++L +K +L L D ++ +N+ W+A N F N
Sbjct: 4 FLATLCCLVVL-----------TSAKSRLSIPPLSDEMVNHINK-LNTTWQAGHN--FLN 49
Query: 65 YTVGQFKHLLGV-----KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
+ K L G K P+ ++L ++KLP++FDAR WP C TI I D
Sbjct: 50 ADMSYVKKLCGTFMGGAKLLPQRMILA-------DNMKLPENFDAREQWPNCPTIKEIRD 102
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
QG CGSCWAFGAVEA+SDR C+H N+ +S DLL+CCG CGDGC+GG+P AW +
Sbjct: 103 QGSCGSCWAFGAVEAISDRICVHSNGNANVEVSAEDLLSCCGSECGDGCNGGFPAGAWNF 162
Query: 178 FVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKCVRKCVKK-NQLW 222
+ G+V+ C PY C H G PA TP C +KC + + +
Sbjct: 163 WTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACTGEEGDTPTCRKKCEEGYSTQY 221
Query: 223 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
++ K+Y ++Y + S ++IMAEIYKNGPVE +F+VYEDF HYKSGVY+H+ G+++GGHA
Sbjct: 222 KDDKNYGSTSYSVPSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHA 281
Query: 283 VKLIGWGTSDDGEDYWV 299
++++GWG ++G YW+
Sbjct: 282 IRILGWGV-ENGIRYWL 297
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 125/282 (44%), Positives = 161/282 (57%), Gaps = 31/282 (10%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L D ++ VN+ WKA N F N + K L G +LG P
Sbjct: 49 LSDELVNYVNKR-NTTWKAGHN--FHNVDLSYVKRLCGT-------ILGGPKLPQRVWLA 98
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
+ L LP+SFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI + +N+ +S
Sbjct: 99 EDLVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGNVNVEVSA 158
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCS 198
DLL CCGF CG+GC+GG+P AW ++ G+V+ C PY G
Sbjct: 159 EDLLTCCGFQCGEGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSR 218
Query: 199 HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P TPKC R C ++ KH+ S+Y + S +IMAEIYKNGPVE +F+
Sbjct: 219 PPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFS 278
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DF YKSGVY+H+TG++MGGHAV+++GWG +DG YW+
Sbjct: 279 VYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDGTPYWL 319
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 177/317 (55%), Gaps = 37/317 (11%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LT+ L I +I TF E +S L D II +NE+P AGW+A ++ +F +
Sbjct: 1 MLTSILCIASLI---TFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P P H D ++++P SFD+R WP+C +I+ I DQ CGS
Sbjct: 58 DARIQ-MGARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGS 116
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CWAFGAVEA+SDR CI G N+ LS DLL+CC CG GC+GG AW Y+V G+
Sbjct: 117 CWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGILGPAWDYWVKEGI 175
Query: 184 VTEECDPYFDSTGCSHPGCEP--------------------AYPTPKCVRKCVKKNQL-W 222
VT S+ +H GCEP Y TP+C + C KK + +
Sbjct: 176 VT-------GSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPY 228
Query: 223 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
KH S+Y + +D + I EI K GPVE FTVYEDF +YKSG+YKHITG+ +GGHA
Sbjct: 229 TQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHA 288
Query: 283 VKLIGWGTSDDGEDYWV 299
+++IGWG ++ YW+
Sbjct: 289 IRIIGWGV-ENKTPYWL 304
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 125/293 (42%), Positives = 165/293 (56%), Gaps = 31/293 (10%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG 86
++ +L+ L D ++ VN+ WKA N F N + K L G K LG
Sbjct: 15 TTARSRLEFQPLSDELVNYVNKQ-NTTWKAGHN--FYNVDLSYVKKLCGTK-------LG 64
Query: 87 VPVKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
P SL LP+SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 65 GPKLPQRLSLAGDIALPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIR 124
Query: 143 FG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-- 191
N+ +S DLL CCGF CG+GC+GG+P AW ++ G+V+ C PY
Sbjct: 125 SNGLQNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSI 184
Query: 192 ----FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEI 246
G P TPKC + C + ++ KH+ Y + SD ++IM EI
Sbjct: 185 PPCEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVPSDEKEIMVEI 244
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKNGPVE +F+VY DF YKSGVY+H+TG+++GGHAV+++GWG ++G YW+
Sbjct: 245 YKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGV-ENGTPYWL 296
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 111/219 (50%), Positives = 145/219 (66%), Gaps = 16/219 (7%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--D 153
LKLP SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+ D
Sbjct: 1 LKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAED 60
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPG 201
LL CCG +CGDGC+GGYP AW ++ G+V+ C PY S P
Sbjct: 61 LLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPP 120
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+VY
Sbjct: 121 CTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS 180
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 181 DFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 218
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 166/282 (58%), Gaps = 28/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKT-HD 93
H L D +I +N+ W+A RN F N + K L G V PK +P +
Sbjct: 7 HPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPK-----LPERVGFS 58
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ + LP+SFDAR W C TI++I DQG CGS WAFGAVEA+SDR CIH +N+ +S
Sbjct: 59 EDINLPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRVNVEVSA 118
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 199
DLL CCG CGDGC+GGYP AW ++ G+V+ C PY C H
Sbjct: 119 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGA 177
Query: 200 -PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FT
Sbjct: 178 RPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFT 237
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 238 VFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 278
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 121/281 (43%), Positives = 167/281 (59%), Gaps = 26/281 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVK-THD 93
H L D +I +N+ W+A RN F N + K L G + PK +P +
Sbjct: 24 HPLSDDLINYINKR-NTTWQAGRN--FHNVDISYLKRLCGTIMGGPK-----LPERVAFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ ++LP++FDAR W C TI +I DQG CGSCWAFGAV A+SDR CIH +N+ +S
Sbjct: 76 EDMELPENFDAREQWSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVNVEVSA 135
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSH 199
DLL CCG CGDGC+GGYP AW +++ G+V+ C PY S
Sbjct: 136 EDLLTCCGSQCGDGCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEHHVNGSR 195
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P C TPKC + C + ++ KHY ++Y ++++ ++IMAEIYKNGPVE +FTV
Sbjct: 196 PQCTGEGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPVEGAFTV 255
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ DF YKSGVYKH GD+MGGHA++++GWG ++ YW+
Sbjct: 256 FSDFLTYKSGVYKHEAGDIMGGHAIRILGWGV-ENSVPYWL 295
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 159/282 (56%), Gaps = 25/282 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHDKSL 96
D +I+ VNE A WKAAR+ +FSN V FK HL + TP+ P HD S
Sbjct: 26 FSDELIRFVNEESGASWKAARSTRFSN--VDHFKLHLGALSETPEERNALRPTIKHDISK 83
Query: 97 K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+SFDARS WPQC TIS I DQ CGSCWA A A+SDR CIH M L+ D
Sbjct: 84 NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAAD 143
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-- 204
L+CC + CG GC GGYP AW Y++ G+VT C P+ T C H G
Sbjct: 144 PLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKY 201
Query: 205 ------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
YPTP C R C N+ + K Y S+Y + IM EI KNGPVEV+F
Sbjct: 202 SRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFA 261
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+++DF Y+SG+Y H+ G +G HAV++IGWG ++G +YW+
Sbjct: 262 IFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWL 302
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 125/283 (44%), Positives = 165/283 (58%), Gaps = 33/283 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--- 94
L ++ +N+ WKA N F N + K L G LG P K ++
Sbjct: 26 LSSDLVNHINK-LNTTWKAGHN--FYNTDMSYVKQLCGT-------FLGGP-KLPERVDF 74
Query: 95 --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
++LP SFD+R+ WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 75 AGDMELPDSFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVS 134
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGC 197
DLL+CCGF CG GC+GGYP AWRY+ G+V+ C PY G
Sbjct: 135 AEDLLSCCGFECGMGCNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHVNGS 194
Query: 198 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
P TP+C R C + ++ KHY I++Y + ++IMAEIYKNGPVE +F
Sbjct: 195 RPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAF 254
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYEDF YKSGVY+H+TG+ +GGHA++L+GWG D+G YW+
Sbjct: 255 IVYEDFLMYKSGVYQHVTGEQVGGHAIRLLGWGV-DNGTPYWL 296
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 219 bits (559), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 121/286 (42%), Positives = 170/286 (59%), Gaps = 21/286 (7%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 90
+ ++H L D IK + ++ + W+A RN + ++ F+ L+GV P K + G
Sbjct: 15 VSANNHFLSDKFIKML-QSEDSTWEAGRNFN-RHLSIRYFRRLMGVHPDSKYHMPGYEAH 72
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLS 148
++ +PK FD+R+AWP C TI I DQG CGSCWAFGAVE +SDR CIH N
Sbjct: 73 KIPENFDMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFH 132
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH-- 199
S +L++CC LCG GC+GG+P +A++Y+VH G+V T+ C PY + C H
Sbjct: 133 YSSENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCEHHV 190
Query: 200 ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
P C TPKCV++C + + + H+ AY I D + I EI KNGPVE
Sbjct: 191 PGPRPKCSEGGGTPKCVKRCENGYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEG 250
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+FTVY DF HYKSGVY+H G +GGHA++++GWG ++G YW+C
Sbjct: 251 AFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWG-EENGTPYWLC 295
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 110/218 (50%), Positives = 144/218 (66%), Gaps = 16/218 (7%)
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DL 154
KLP SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+ DL
Sbjct: 1 KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 60
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGC 202
L CCG +CGDGC+GGYP AW ++ G+V+ C PY S P C
Sbjct: 61 LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPC 120
Query: 203 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+VY D
Sbjct: 121 TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSD 180
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 181 FLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 217
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 176/310 (56%), Gaps = 23/310 (7%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LT+ L I +I TF E +S L D II +NE+P AGW+A ++ +F +
Sbjct: 1 MLTSILCIASLI---TFLEAHISVKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P P H D ++++P +FD+R WP C +I+ I DQ CGS
Sbjct: 58 DARIQ-MGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 116
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CW+FGAVEA+SDR CI G N+ LS DLL CC CG GC+GG AW Y+V G+
Sbjct: 117 CWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGI 175
Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
VT C+PY T +P C Y TP+C + C +K + + KH
Sbjct: 176 VTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRG 235
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWG 295
Query: 290 TSDDGEDYWV 299
++ YW+
Sbjct: 296 V-ENKTPYWL 304
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 174/312 (55%), Gaps = 39/312 (12%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CLL+L ++ L L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLLVL-----------TSARSSLHFPPLSDEMVNYVNKQ-NTTWKAGHN--FYNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
+ K L G +L G + D + LP SFDAR WP C TI I DQG
Sbjct: 51 DLSYVKKLCGA------ILGGPKLPQRDAFAADMVLPDSFDAREQWPNCPTIKEIRDQGS 104
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDGC+GG+P AW ++
Sbjct: 105 CGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTK 164
Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
G+V+ C PY S P C TPKC + C + +++ KH
Sbjct: 165 KGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKDDKH 224
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
+ S+Y ++S+ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++G
Sbjct: 225 FGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILG 284
Query: 288 WGTSDDGEDYWV 299
WG +D YW+
Sbjct: 285 WGVEND-TPYWL 295
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 174/312 (55%), Gaps = 39/312 (12%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CLL+L ++ L L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLLVL-----------TSARSSLHFPPLSDEMVNYVNKQ-NTTWKAGHN--FYNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
+ K L G +L G + D + LP SFDAR WP C TI I DQG
Sbjct: 51 DLSYVKKLCGA------ILGGPKLPQRDAFAADMVLPDSFDAREQWPNCPTIKEIRDQGS 104
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDGC+GG+P AW ++
Sbjct: 105 CGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTK 164
Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
G+V+ C PY S P C TPKC + C + +++ KH
Sbjct: 165 KGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKDDKH 224
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
+ S+Y ++S+ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++G
Sbjct: 225 FGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILG 284
Query: 288 WGTSDDGEDYWV 299
WG +D YW+
Sbjct: 285 WGVEND-TPYWL 295
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 175/313 (55%), Gaps = 35/313 (11%)
Query: 12 LLILGV--ISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 69
LL+LGV +S+ + + +++ + ++ I P A WKA N F +
Sbjct: 7 LLLLGVWTVSAIPPKDELFKFIRVFRPMSEEMINFLNMPGPGATWKAGNNFPFIRNLDDK 66
Query: 70 F---KHLLGVK---PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 123
K L G K P P +PVK + LP +FDAR+ WP C T+ + DQG C
Sbjct: 67 LLYAKRLCGTKLNNPNP------LPVKNIEPLRDLPTNFDARTQWPNCPTVKEVRDQGDC 120
Query: 124 GSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWAFGAVEA+SDR CI + +N +S DLLACC CG+GC GG+P AWRY+
Sbjct: 121 GSCWAFGAVEAMSDRICIASNGKVNAEISAEDLLACCSS-CGEGCQGGFPAEAWRYYERE 179
Query: 182 GVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSK 226
G+VT + C PY C H P + TPKC +KC N +++ K
Sbjct: 180 GLVTGGLYNSSQGCQPYM-IPACDHHVVGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDK 238
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
HY ++Y ++S E IM EI NGPVE +FTVYEDF YKSGVY+H TG +GGHAVK++
Sbjct: 239 HYGKNSYSVDS-VEKIMTEIMTNGPVEAAFTVYEDFLSYKSGVYQHRTGQELGGHAVKIL 297
Query: 287 GWGTSDDGEDYWV 299
GWG D+G YW+
Sbjct: 298 GWG-EDNGTPYWI 309
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/291 (42%), Positives = 163/291 (56%), Gaps = 32/291 (10%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
++ L L D ++ +N+ W A N F N + K L G LG P
Sbjct: 17 ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT-------FLGGP 66
Query: 89 VKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ LPKSFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 67 KLPQRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 126
Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 195
+N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY
Sbjct: 127 GRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIP 185
Query: 196 GCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
C H P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYK
Sbjct: 186 PCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYK 245
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPVE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG ++G YW+
Sbjct: 246 NGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWL 295
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 110/221 (49%), Positives = 146/221 (66%), Gaps = 16/221 (7%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+ LKLP SFDAR WPQC TI I DQG CGS WAFGAVEA+SDR CIH ++S+ V+
Sbjct: 3 EDLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSA 62
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 199
DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY +H
Sbjct: 63 EDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGAR 122
Query: 200 PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+V
Sbjct: 123 PPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV 182
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 183 YSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 222
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/285 (42%), Positives = 164/285 (57%), Gaps = 34/285 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ Y GC
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHV 191
Query: 198 --SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE
Sbjct: 192 NGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEG 251
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+
Sbjct: 252 AFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWL 295
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 123/305 (40%), Positives = 172/305 (56%), Gaps = 26/305 (8%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
L++ ++ +++F + + L D I+ +N WKAA+ +F T+ +
Sbjct: 7 LIMYALLCAESFRAEYIPSFE----SLSDEIVHYINHKANTTWKAAKYQRFK--TISDVR 60
Query: 72 HLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
+LG P P G L + + + +LP+SFDAR WP CS+I+ I DQ +CGSCWAFG
Sbjct: 61 RVLGAVPDPNGFGLEKRCLLSTIREQELPESFDAREKWPYCSSIAEIRDQSNCGSCWAFG 120
Query: 131 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 185
A A+SDR CI G +S DL+ CC CG GC GGYP AW Y+V +G+VT
Sbjct: 121 AAGAISDRICIASGGKHQPRISPEDLVDCCA-DCGMGCQGGYPAQAWEYWVRNGLVTGDL 179
Query: 186 ----EECDPYFDSTGCSHPGCEPAYP------TPKCVRKCVKK-NQLWRNSKHYSISAYR 234
+ C PY C H P P TP+CV+KC + + + N K Y + AY
Sbjct: 180 YNTTDTCRPY-SFPPCEHHVVGPRKPCTGDPTTPQCVKKCQPEYPKTYENDKWYGLKAYS 238
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
I+SD E IM ++ GP+EV F VY DF Y SGVY+H+ G ++GGHAV+L+GWG +DG
Sbjct: 239 IHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGV-EDG 297
Query: 295 EDYWV 299
DYW+
Sbjct: 298 ADYWL 302
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 120/279 (43%), Positives = 160/279 (57%), Gaps = 26/279 (9%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT----HDKSLK 97
II +N W+A +N +F++ K +G P G +L P K+ +
Sbjct: 28 IIDYINNKANTTWRAGKNKRFTDALSA--KSQMGSLFNPGGSML--PTKSFYLSSTQKAA 83
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLL 155
LP FDAR AWP C TI I DQG CGSCWAFGA EA+SDR CIH + +S +DLL
Sbjct: 84 LPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDLL 143
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGC 202
+CCG CG GC+GG P +AWRY+ G+V+ C PY + C H P C
Sbjct: 144 SCCGLFCGFGCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPY-EIPPCEHHTSGNRPDC 202
Query: 203 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
+ TPKC R+CV+ + ++ KH++ + Y + + EDIM EI GPVE F VY D
Sbjct: 203 KGNSKTPKCQRQCVESFDGKYQADKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYAD 262
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
F YKSGVY+H+ G +GGHAVK++GWG ++G YW+C
Sbjct: 263 FLTYKSGVYQHVKGGFLGGHAVKILGWG-EENGVPYWLC 300
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 174/308 (56%), Gaps = 28/308 (9%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
++L +I + KLK + + L D I +N + K+ WKA RN N+ +G
Sbjct: 3 IVLSIIFAVVLVTSQAKKLKSNKYFNPLSDEFINHIN-SMKSTWKAGRNFG-KNFPMGAL 60
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
++GV P L P+K + + +P++FDAR WP C TI I DQG CGSCW
Sbjct: 61 TQMMGVHPDSN--LYMPPLKNVSQMYSNQAIPEAFDAREQWPDCPTIQEIRDQGSCGSCW 118
Query: 128 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH +N LS +L++CC + CG GC+GG+P +AW ++V G+VT
Sbjct: 119 AFGAVEAMSDRICIHSKGEVNAHLSAENLVSCC-YTCGFGCNGGFPGAAWSHWVKKGIVT 177
Query: 186 -------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 231
+ C PY C H P C TPKC++ C + + HY S
Sbjct: 178 GGNFNSSQGCQPYI-IPACEHHTTGDRPPCSEGGGTPKCLKTCEDGYTVDYTQDLHYGAS 236
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y ++ EDI EI NGPVE + TVYEDF YKSGVY+H+ G +GGHA++++GWG
Sbjct: 237 SYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGV- 295
Query: 292 DDGEDYWV 299
++G YW+
Sbjct: 296 EEGVPYWL 303
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 109/217 (50%), Positives = 143/217 (65%), Gaps = 16/217 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLL 155
LP SFDAR WPQC TI I DQG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL
Sbjct: 1 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 203
CCG +CGDGC+GGYP AW ++ G+V+ C PY S P C
Sbjct: 61 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 204 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF
Sbjct: 121 GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 180
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 181 LLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 216
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 123/291 (42%), Positives = 162/291 (55%), Gaps = 32/291 (10%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
++ L L D ++ +N+ W A N F N + K L G LG P
Sbjct: 17 ARESLHFQPLSDELVNFINKQ-NTTWTAGHN--FYNVDLSYVKKLCGT-------FLGGP 66
Query: 89 VKTHDKSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ LPK FDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 67 KLPQRAAFAADMILPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSN 126
Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 195
+N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY
Sbjct: 127 GRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIP 185
Query: 196 GCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
C H P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYK
Sbjct: 186 PCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYK 245
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPVE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG ++G YW+
Sbjct: 246 NGPVEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWL 295
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 119/280 (42%), Positives = 167/280 (59%), Gaps = 22/280 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVND 153
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-----FDSTGCSHPG 201
L++CC CGDGC GG+P AW Y+V G+VT EE C PY T +P
Sbjct: 148 LISCCED-CGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPA 206
Query: 202 C-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C Y TP+C + C K + + KHY Y + S+ + I EI GPVE +F VY
Sbjct: 207 CGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVY 266
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDF +YKSG+Y+H+TG ++GGHA+++IGWG + G+ YW+
Sbjct: 267 EDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGKPYWL 305
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 122/281 (43%), Positives = 167/281 (59%), Gaps = 21/281 (7%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
H L D IK + ++ + W+A RN + ++ F+ L+GV P K + V ++
Sbjct: 19 HFLSDKFIKLL-QSEDSTWEAGRNFN-KHLSIRYFRRLMGVHPDSKYHMPKYEVHQIPEN 76
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVND 153
+LPK FD+R+AWP C TI I DQG CGSCWAFGAVE +SDR CIH N S +
Sbjct: 77 FELPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAEN 136
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH------P 200
L++CC LCG GC+GG+P +A++Y+VH G+V T+ C PY + C H P
Sbjct: 137 LVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVPGPRP 194
Query: 201 GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKC + C K + + + H+ AY I D + I EI KNGPVE +FTVY
Sbjct: 195 KCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVY 254
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
DF HYKSGVY+H G +GGHA++++GWG ++G YW+C
Sbjct: 255 VDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLC 294
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 118/280 (42%), Positives = 157/280 (56%), Gaps = 19/280 (6%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHD 93
+ +L + E WKA N +F + + +GV + P L + +P K
Sbjct: 16 AELLNQQDMSEYINKLGTTWKAGVNKRFEGLSEVDIRRQMGVLQGGP--LDIKLPEKDIT 73
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 153
+P FDAR WP C TI I DQG CGSCWAFGAVE++SDRFCIHF + +S D
Sbjct: 74 PLKDVPDMFDARMQWPDCPTIKEIRDQGACGSCWAFGAVESMSDRFCIHFNQSAHISAED 133
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST------GCSHP 200
L+ACC CG GC+GGY +AWRYF H G+VT E C PY ++ G P
Sbjct: 134 LMACCE-TCGMGCNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP 192
Query: 201 GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
TP+C + C + + KH+ SAY + S E I EI NGPVE +FTVY
Sbjct: 193 CASKEEHTPRCSKTCEAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVY 252
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY+H +G ++GGHA++++GWGT ++G YW+
Sbjct: 253 ADFPTYKSGVYQHTSGAMLGGHAIRILGWGT-ENGTPYWL 291
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 119/286 (41%), Positives = 168/286 (58%), Gaps = 21/286 (7%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 90
+ SH L D I+++ ++ + W+A RN + ++ F+ L+GV P K +
Sbjct: 14 VNASSHFLSDKFIRQL-QSEDSTWEAGRNFN-KHLSIKYFRRLMGVHPDSKFHMPKYEAH 71
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLS 148
++ ++PK FD+R+AWP C TI I DQG CGSCWAFGAVE +SDR CIH N
Sbjct: 72 QIPENFEMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFH 131
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH-- 199
S +L++CC LCG GC+GG+P +A++Y+VH G+V T+ C PY + C H
Sbjct: 132 YSAENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCEHHV 189
Query: 200 ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
P C TPKC + C K + + + H+ AY I D + I EI NGPVE
Sbjct: 190 SGPRPKCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGPVEG 249
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+FTVY DF HYKSGVY+H G +GGHA++++GWG ++G YW+C
Sbjct: 250 AFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLC 294
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 216 bits (550), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 177/318 (55%), Gaps = 31/318 (9%)
Query: 1 MASSHLFLTT--CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAAR 58
M S + F + CL+ L E ++ K L +I +N WKAA
Sbjct: 1 MTSYNYFCSVLFCLIFLNY-------EIEANRHKFMHQPLSSELIHFINHEANTTWKAAP 53
Query: 59 NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRI 117
+P+F +V + +LG P P G L + SL +LPK FDAR WP C +IS I
Sbjct: 54 SPRFK--SVSDIRRMLGALPDPNGGHLPTLCTGYTPSLDELPKEFDARKYWPHCPSISEI 111
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYPISAW 175
DQ CGSCWAFGAVEA+SDR CI G++ LS +L+ACC CG GC+GG+P SAW
Sbjct: 112 RDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCS-SCGMGCNGGFPHSAW 170
Query: 176 RYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQL 221
Y+ G+VT + C PY + C H P CE TPKC C N
Sbjct: 171 SYWKRSGIVTGDLYNPTDGCQPY-EFPPCEHHVVGPRPSCEGDVETPKCKTTCQPGYNIP 229
Query: 222 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 281
+ K Y + YR++S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGH
Sbjct: 230 YNKDKWYGKTVYRVHSNQEAIMKEVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGH 289
Query: 282 AVKLIGWGTSDDGEDYWV 299
AV+L+GWG ++G YW+
Sbjct: 290 AVRLLGWG-EENGVPYWL 306
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 178/319 (55%), Gaps = 30/319 (9%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M L +T +L+ + F + K + Q II++VN + + WKA N
Sbjct: 1 MKHQALIITAGILLATLTGFVAFEAFRYKQEKYHDKLKQ--IIQKVNSS-NSTWKAGENT 57
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT-HDKSLKLPKSFDARSAW-PQCSTISRIL 118
++ N + K +GVK G G+ ++T ++ LP+ FDAR W +CS++ +
Sbjct: 58 KWINSDIAGVKAHMGVK---LGQESGIKLETVSAQANGLPEEFDARVQWGDKCSSLWEVR 114
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQ CGSCWAFGA E+LSDR CIH G ++ LS +LL CC CGDGCDGG+P +A Y+
Sbjct: 115 DQSTCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLLTCCA-ACGDGCDGGWPEAAMDYY 173
Query: 179 VHHGVVTEECDPYFDSTGCS---------------HPGCEPAYPTPKCVRKCVKKNQL-- 221
V+ G+VT D Y +++ C +P C PTP C+ C +
Sbjct: 174 VNTGLVTG--DLYGNNSWCQAYTFAPCAHHVTSDIYPPCTGELPTPPCINSCDSNSTHTI 231
Query: 222 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+ H AY I D + IMAEIYKNGP+EV+ TVYEDF YK+GVY+H+TGD +GG
Sbjct: 232 PYSKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGG 291
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVK++GWG ++G YW
Sbjct: 292 HAVKMVGWGV-ENGTPYWT 309
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/281 (43%), Positives = 166/281 (59%), Gaps = 23/281 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-K 94
H L D I +N K+ W A RN + ++ L+GV P K + PV TH +
Sbjct: 18 HPLSDEFINSINA-AKSTWTAGRNFA-QDKSMDYIIKLMGVLPDHKNYM--PPVLTHKLE 73
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+L++P FDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH N S +
Sbjct: 74 ALEIPADFDARQQWPHCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSD 133
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHP 200
DL++CC + CG GC+GGYP +AW Y+V G+V+ + C PY T S P
Sbjct: 134 DLVSCC-WTCGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEHHTNGSRP 192
Query: 201 GCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C+ + TPKC + C ++ + N H+ AY I+SD + I AEI +NGPVE +F+V
Sbjct: 193 ACDASEGNTPKCAKSCESNYKINYSNDLHFGSKAYSISSDVKQIQAEILQNGPVEGAFSV 252
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y DF +YK+GVY+HI G +GGHA+++ GWG ++ YW+
Sbjct: 253 YADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENN-TPYWL 292
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 172/312 (55%), Gaps = 39/312 (12%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CLL+L S + L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLLVLTSARSSLYFP-----------PLSDELVNFVNKQ-NTTWKAGHN--FYNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
+ K L G +L G + D + LP+SFDAR WP C TI I DQG
Sbjct: 51 DLSYVKKLCGA------ILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGS 104
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S D+L CC CGDGC+GG+P AW ++
Sbjct: 105 CGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCDGECGDGCNGGFPSGAWNFWTK 164
Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
G+V+ C PY S P C TPKC + C + ++ KH
Sbjct: 165 KGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKH 224
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++G
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILG 284
Query: 288 WGTSDDGEDYWV 299
WG ++G YW+
Sbjct: 285 WGV-ENGTPYWL 295
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 170/307 (55%), Gaps = 29/307 (9%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQ 69
C+L+ V+ A +S L+ H L D I +N + K WKA RN F +T +
Sbjct: 3 CVLLCAVV----LATIALSYGGLNPHPLSDEFINAIN-SKKTTWKAGRN--FDIHTPLAN 55
Query: 70 FKHLLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARSAWPQC-STISRILDQGHCGSCW 127
K LLGV P K + +K H + +P+SFDAR AWP+C S I I DQ CGSCW
Sbjct: 56 IKKLLGVLPK-KANARQLELKVHSVDVNAIPESFDAREAWPECASIIGDIRDQASCGSCW 114
Query: 128 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGA EA+SDR CIH + +S+S DL CC + CGDGC+GG+P AW Y+ G+VT
Sbjct: 115 AFGAAEAMSDRICIHSNATVKVSISTEDLNTCC-YECGDGCNGGWPAEAWAYWAETGIVT 173
Query: 186 -------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 232
+ C Y C H P C PTP+C ++C + S SA
Sbjct: 174 GGKYETKDGCKAYT-VPPCEHHTEGDLPACGDIVPTPQCKKECDAGVDIEYKSDLRKGSA 232
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y+ +SD I EI NGPVE F VYEDF +YKSGVY+ TG+ GGHA+K++GWG +
Sbjct: 233 YQTSSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGV-E 291
Query: 293 DGEDYWV 299
DG YW+
Sbjct: 292 DGTPYWL 298
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 157/281 (55%), Gaps = 25/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHDKSL 96
D +I+ VNE A WKAAR+ +FSN V FK LG + TP+ P HD S
Sbjct: 3 FSDELIRFVNEESGASWKAARSTRFSN--VDHFKLDLGALSETPEERNALRPTIKHDISK 60
Query: 97 K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+SFDARS WPQC TIS I DQ CGSCWA A A+SDR CIH M L+ D
Sbjct: 61 NDLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAAD 120
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-- 204
L+CC + CG GC GGYP AW Y++ G+VT C P+ T C H G
Sbjct: 121 PLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKY 178
Query: 205 ------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
YP P C R C N+ + K Y S+Y + IM EI KNGPVEV+F
Sbjct: 179 SRCPHYTYPKPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFA 238
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+++DF Y+SG+Y H+ G +G HAV++IGWG ++G +YW
Sbjct: 239 IFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYW 278
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 177/316 (56%), Gaps = 28/316 (8%)
Query: 8 LTTCLLILGVISSQTFA--EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L T ++ + ++++ + A + L+ + ++ +N+ K + A +P+F+N+
Sbjct: 2 LKTAIVAVVLVTAVSAASWQNAKKNLQEAEKLTGRELVDYINKAQKL-FTAKLSPRFANF 60
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQGHC 123
+ L+G K V KTH +PKSFD+R+ WP+C ++ I DQ C
Sbjct: 61 PNEIKRRLMGSKYVALPAKYRVNEKTHSDIDDTTIPKSFDSRTNWPECPSLYSIRDQSSC 120
Query: 124 GSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWA GAVEA++DR CI N +++S +DLL+CC CG GCDGG P +AW Y+V +
Sbjct: 121 GSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGGDPYAAWSYWVSN 179
Query: 182 GVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQLWRNS 225
G+VT Y +GC +P CE YPT C KC + NS
Sbjct: 180 GIVTGS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNS 237
Query: 226 -KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
KHY S Y + D I EI NGPVEV+F VYEDF HY SG+YKH TGD +GGHAVK
Sbjct: 238 DKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVK 297
Query: 285 LIGWGTSDDGEDYWVC 300
++GWGT ++G DYW+C
Sbjct: 298 MLGWGT-ENGTDYWIC 312
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 183/317 (57%), Gaps = 38/317 (11%)
Query: 13 LILGVISSQTFAEGVV-----SKLKLD--SHILQD---SIIKEVNENPKAGWKAARNPQF 62
LIL ++ S GV SK D + Q+ +I K+VN + K W+A N ++
Sbjct: 5 LILTLVLSSLIGFGVYVYSKHSKFTFDEPNQAYQNKLGNIAKKVN-SLKTTWQAGENQRW 63
Query: 63 SNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW-PQCSTISRILDQ 120
N + K +GV + + G+ L K LPK+FD+R W +C +++ + DQ
Sbjct: 64 QNMDIAGIKAHMGVLRESKSGINLE---KVSTVVENLPKNFDSRKQWGSKCPSLNEVRDQ 120
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAF A E+LSDR CIH G ++ LS +L++CC CGDGC+GGYP +A +YFV
Sbjct: 121 STCGSCWAFAAAESLSDRICIHTGEDVRLSTENLVSCCSS-CGDGCNGGYPEAAMQYFVK 179
Query: 181 HGVVTEECDPYFDSTGCS---------------HPGCEPAYPTPKCVRKCVKKNQLWR-- 223
G+VT D + D+ C +P C+ PTP+C +KC +++ R
Sbjct: 180 TGLVTG--DLFGDNNFCQAYSFPPCAHHVASTKYPPCKGEVPTPECKKKCDDDSKVKRPY 237
Query: 224 NSKHYS-ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
N Y +Y ++SDP+ IM EI NGPVEV+FTVYEDF YKSGVY+H+TG+ +GGHA
Sbjct: 238 NEDLYKGQKSYSVSSDPKAIMTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHA 297
Query: 283 VKLIGWGTSDDGEDYWV 299
VK+IGWG +D YW+
Sbjct: 298 VKMIGWGVEND-TPYWL 313
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/310 (40%), Positives = 177/310 (57%), Gaps = 23/310 (7%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LT+ L I +I T + +S L D II +NE+P AGW+A ++ +F +
Sbjct: 1 MLTSVLCIASLI---THLDAHISIKNEKFKPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P P H++ ++++P +FD+R WP C +I+ I DQ CGS
Sbjct: 58 DARIQ-MGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 116
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CWAFGAVEA+SDR CI G N+ LS DLL+CC CG GC+GG AW ++V G+
Sbjct: 117 CWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCE-SCGLGCEGGILGPAWDFWVKEGI 175
Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
VT C+PY T +P C Y TP+C + C KK + + KH
Sbjct: 176 VTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRG 235
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG
Sbjct: 236 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWG 295
Query: 290 TSDDGEDYWV 299
++ YW+
Sbjct: 296 V-ENKTPYWL 304
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 172/317 (54%), Gaps = 31/317 (9%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M +F+ LL + T+ + K + Q + +EVN N WKA N
Sbjct: 1 MKRQTIFIVAALLSAALTGFYTYEALKHKEFKYSDRLKQ--LAEEVN-NANTTWKAGENI 57
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW-PQCSTISRILD 119
++ N + K LG G L PV K+ LP +FDAR W +C+++ + D
Sbjct: 58 KWINADIAGVKAHLGALEGDNGENL--PVSNAVKA-DLPTAFDARQQWGDKCTSLWEVRD 114
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
Q +CGSCWAFGAVE+L+DR CIH G ++ LS ++L CC CG GC+GGYP SA Y+V
Sbjct: 115 QSNCGSCWAFGAVESLTDRHCIHLGQDIRLSAQNMLTCCA-TCGQGCNGGYPASAMSYYV 173
Query: 180 HHGVVTEECDPYFDSTG---------CSH-------PGCEPAYPTPKCVRKC-VKKNQLW 222
G+VT + +++TG C+H P C PTPKC + C Q +
Sbjct: 174 KTGLVTGD---LYNTTGWCQAYSFAPCAHHVDTPLYPACTGELPTPKCAKTCDSGSGQTY 230
Query: 223 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
+ H AY + E IM EI NGPVE +FTVYEDF +YKSGVYKH+TG +GGHA
Sbjct: 231 --TVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHA 288
Query: 283 VKLIGWGTSDDGEDYWV 299
+K++GWG ++ YW+
Sbjct: 289 IKIVGWGVENN-TPYWI 304
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 161/288 (55%), Gaps = 29/288 (10%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT-PKGLLLGVPV 89
L LD+ I+ VN W A N +F+ T+ K+L G K PK +PV
Sbjct: 152 LGLDAPAQSRDIVDFVNA-LGTTWTAGHNKRFTYNTLRHVKNLCGAKKGGPK-----LPV 205
Query: 90 KTHDKSLKLPKSFDAR--SAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 144
K K + LP SFD R S WP C +++ + DQG CGSCWAFGA EA++DR CI +
Sbjct: 206 KRIPKKMALPTSFDPRDGSKWPACKDSLNHVRDQGSCGSCWAFGAAEAMTDRICIASNGQ 265
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS DL +CC CG GC+GGYP +AW YF G+VT + C PY
Sbjct: 266 NNFYLSAEDLTSCCDS-CGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACD 324
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
TG P C PTP C C + N W + KH+ S+Y + +D + IM EIY NGP
Sbjct: 325 HHVTGKYQP-CGDIQPTPACANSC-QNNATWSSDKHFGASSYSVGTDQQSIMTEIYTNGP 382
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VE S+ VY DF YKSGVY+H+TGD +GGHAVK+IGWG D YW+
Sbjct: 383 VEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGV-DGSTPYWI 429
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 170/311 (54%), Gaps = 26/311 (8%)
Query: 13 LILGVISSQTFAEGVV--SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
L++G+++ + V + +++ +L+ + + + +KA FS+Y
Sbjct: 8 LLVGLVAVNAYNVEVKHGDAIPVEAQMLRGQELVDYVNKVQTSFKAELGSYFSSYPDTIK 67
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
K L+G K V TH + +P SFD+R+AWP C +IS+I DQ CGSCWA
Sbjct: 68 KQLMGAKMVEIPEEYRVFEMTHPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWA 127
Query: 129 FGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
A E +SDR CI LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT
Sbjct: 128 VSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG 187
Query: 187 ECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL-WRNSKHYS 229
Y D TGC +P CE YPT KC R C L ++ H+
Sbjct: 188 --GSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFG 245
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
SAY ++ +I EI +GPVEV+FTVYEDF HY GVY H G +GGHAVK++GWG
Sbjct: 246 QSAYAVSKKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWG 305
Query: 290 TSDDGEDYWVC 300
D+G YW+C
Sbjct: 306 V-DNGTPYWLC 315
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/289 (43%), Positives = 166/289 (57%), Gaps = 24/289 (8%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
VS+ + H L +I +N+ WKA N F + G K+L G KG L +
Sbjct: 15 VSRGRPHIHPLSSDMINYINK-LNTTWKAGHN--FHDVDYGYVKNLCGT--LLKGPKLPI 69
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 147
V++ +KLPK FDAR WP+C T+ I DQG CGSCWAFGA EA+SDR CIH +
Sbjct: 70 MVQSAG-GMKLPKQFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRICIHTKGKV 128
Query: 148 SLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 192
S+ ++ DLL CC CG GC+GGYP +AW ++ G+VT C PY
Sbjct: 129 SVEISSQDLLTCCDS-CGMGCNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEPCEH 187
Query: 193 DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
G P TP+CV +C ++ KHY ++Y + S+ E I +EIYKNGP
Sbjct: 188 HVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGP 247
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VE +F VYEDF YKSGVY+H+TG +GGHA+K+IGWG ++G YW+C
Sbjct: 248 VEGAFIVYEDFPSYKSGVYQHVTGSALGGHAIKMIGWG-EENGVPYWLC 295
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 177/308 (57%), Gaps = 25/308 (8%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ LL++G++++ F + K H L D +I +N+ WKA RN S ++
Sbjct: 1 MLKSLLVVGLLAAVCFGREIHPK---KWHPLSDQMINFINK-INTTWKAGRNFDKS-ISM 55
Query: 68 GQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
+ L+GV P K L V HD+ LP+SFDAR WP C++I I DQ CGSC
Sbjct: 56 SYIRGLMGVHPKSKEYRLAEFV--HDEIPDDLPESFDAREKWPHCNSIHLIRDQSTCGSC 113
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAFGA EA+SDR CIH + +++S DLL CC CG GC+GG P +AW Y+ G+V
Sbjct: 114 WAFGAAEAMSDRVCIHSKGKIQVNISAEDLLDCCDS-CGAGCNGGTPAAAWEYWKESGLV 172
Query: 185 T-------EECDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
T + C PY + T S P C PTPKCV C K + +++ KH+
Sbjct: 173 TGGLYGTNDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQDDKHFGKK 232
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
Y I+SD + I EI+KNGPVE F V DF YKSGVY+H + DV+GGHA++++GWGT
Sbjct: 233 VYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQHHSDDVIGGHAIRILGWGT- 291
Query: 292 DDGEDYWV 299
++G YW+
Sbjct: 292 ENGTPYWL 299
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 120/291 (41%), Positives = 172/291 (59%), Gaps = 25/291 (8%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS-NYTVGQFKHLLGVKPTPKGLLL 85
+ L + H L I+++NE ++ WKA P F+ N + + L+GV P K +
Sbjct: 12 TAASLSVAVHPLSKEFIQQINEK-QSTWKAG--PNFAENVPMSYIRRLMGVPPNSKYHMP 68
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-- 143
V D ++++P FDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH
Sbjct: 69 SVKRHLLD-AMEIPDDFDARKQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKG 127
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 196
+N+ LS +DL++CC + CG GC+GG+P +AW Y+V+ G+V+ + C PY +
Sbjct: 128 AVNVRLSADDLVSCC-YSCGMGCNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPY-EIAP 185
Query: 197 CSH--PGCEPA-----YPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
C H G P TP C ++C K N ++ K++ AY I+S+ + I EI
Sbjct: 186 CEHHVNGTRPPCTGDDNKTPSCKQQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMT 245
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPVE +F VYED YK GVY+H+ G+ +GGHA++++GWGT + G YW+
Sbjct: 246 NGPVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGT-EKGTPYWL 295
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/310 (40%), Positives = 177/310 (57%), Gaps = 23/310 (7%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LT+ L I +I T + +S L D II +NE+P AGW+A ++ +F +
Sbjct: 6 MLTSVLCIASLI---THLDAHISIKNEKFKPLSDDIISYINEHPNAGWRAEKSNRFHSLD 62
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P P H++ ++++P +FD+R WP C +I+ I DQ CGS
Sbjct: 63 DARIQ-MGARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGS 121
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CWAFGAVEA+SDR CI G N+ LS DLL+CC CG GC+GG AW ++V G+
Sbjct: 122 CWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGILGPAWDFWVKEGI 180
Query: 184 VTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
VT C+PY T +P C Y TP+C + C KK + + KH
Sbjct: 181 VTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRG 240
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG
Sbjct: 241 KSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWG 300
Query: 290 TSDDGEDYWV 299
++ YW+
Sbjct: 301 V-ENKTPYWL 309
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 162/281 (57%), Gaps = 29/281 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHD-K 94
+L +I +N+ W A +N F N K L G PK +P H+ +
Sbjct: 22 LLSSEMIDFINK-VNTTWTAGQN--FHNVDSSYVKGLCGTFLKGPK-----LPQVLHNTE 73
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSL--SVN 152
++LP SFDAR WP C TI +I DQG CGSCWAFGA EA+SDR CIH G +SL S
Sbjct: 74 GIRLPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAE 133
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------ 199
DLL+CC CG GC GGYP SAW ++ G+VT C PY + C H
Sbjct: 134 DLLSCCD-ECGMGCSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAP-CEHHVNGTR 191
Query: 200 PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P C+ TPKC +KC+ + KH+ +Y + S E IM E+YKNGPVE +FTV
Sbjct: 192 PPCQGTQETPKCEKKCIDGYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTV 251
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y DF YK+GVY+H+TG+V+GGHA+K++GWG + G YW+
Sbjct: 252 YADFLLYKTGVYQHVTGEVLGGHAIKILGWG-EESGTPYWL 291
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 106/220 (48%), Positives = 143/220 (65%), Gaps = 17/220 (7%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
++LP SFD+R WP C TI+ I DQG CGSCWAFGAVEA+SDR C+H +N+ +S D
Sbjct: 68 VELPDSFDSRKQWPSCPTINEIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEISAED 127
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHP 200
LL+CCGF CG GC+GGYP AW+Y+ G+V+ C PY + G P
Sbjct: 128 LLSCCGFECGMGCNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGTRPP 187
Query: 201 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
TP+CV+KC ++ KHY +++Y I ++IMAEIYKNGPVE +F VY
Sbjct: 188 CSGEGGETPECVKKCEDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVY 247
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY+H++G+ +GGHA++++GWG D+G YW+
Sbjct: 248 SDFLMYKSGVYQHVSGEEVGGHAIRILGWGV-DNGTPYWL 286
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/283 (42%), Positives = 163/283 (57%), Gaps = 30/283 (10%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNG 193
Query: 198 SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAF 253
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
TV+ DF YKSGVYKH GD+MGGHA++++ WG ++G YW+
Sbjct: 254 TVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWL 295
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 126/303 (41%), Positives = 166/303 (54%), Gaps = 28/303 (9%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L ++S + S LD L D +I VN + W AAR+P+F +
Sbjct: 6 CLLVLFAVAS------IASAKPLDFQALSDDVIDYVN-SLNTTWTAARSPRFPSGNEVDV 58
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
K L GV L P K +P +FDAR W C +IS I DQG CGSCWA G
Sbjct: 59 KDLCGVLDVKHTL----PYKEKVSVGAIPDTFDARQKWSDCPSISDIRDQGSCGSCWALG 114
Query: 131 AVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 185
AVEA+SDR+C+ F N+ +S +L+ CC F CG+GC GG+ AW Y+V G+VT
Sbjct: 115 AVEAMSDRYCVSFQENVHISAENLMTCCKF-CGNGCAGGFLQQAWEYWVKDGLVTGGQYG 173
Query: 186 --EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 236
E C PY C+H PG C TP+C R C + HY AY ++
Sbjct: 174 SDEGCQPYLIPK-CNHHEPGPYENCTGEGKTPQCERTCRSGYTTSYEADLHYGEKAYAVH 232
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
+ E I EI NGPVE +FTVY DF YKSGVY+H+ G +GGHA++++GWGT ++G
Sbjct: 233 REVEAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGT-ENGVP 291
Query: 297 YWV 299
YW+
Sbjct: 292 YWL 294
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/285 (42%), Positives = 162/285 (56%), Gaps = 34/285 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ Y GC
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHV 191
Query: 198 --SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKN PVE
Sbjct: 192 NGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEG 251
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTV+ DF YKSGVYKH GD+MGGHA++++GWG +G YW+
Sbjct: 252 AFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVG-NGVPYWL 295
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/279 (44%), Positives = 162/279 (58%), Gaps = 24/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L D +I +N+ WKA RN N V K L+GV P K L P+ H+ K
Sbjct: 27 LSDEMINFINK-LNTTWKAGRNFD-KNTPVSYLKGLMGVHPDSKNYRL--PLFYHEDIPK 82
Query: 98 -LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
LP+SFDAR W C++I I DQ CGSCWAFGA EA+SDR CIH + +++S DL
Sbjct: 83 DLPESFDAREKWSHCNSIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAEDL 142
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 201
L CC CG GC+GGYP +AW ++ G+VT + C PY+ C H P
Sbjct: 143 LTCCD-SCGAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPP-CEHHTVGPLPN 200
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C PTP+CVR C K + + KHY+ Y +++D I EI+KNGPVE FTVY
Sbjct: 201 CTGIKPTPQCVRDCRKGYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVEADFTVYA 260
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY+ + D +GGHA++++GWGT ++G YW+
Sbjct: 261 DFVSYKSGVYQRHSDDALGGHAIRILGWGT-ENGVPYWL 298
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 160/280 (57%), Gaps = 23/280 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
H L + +I+ VN WKA RN T+ + LLGV L P H
Sbjct: 25 HPLSEKMIEYVN-FMNTTWKAGRNFH-EGVTMKYIRGLLGVHKDNHKYRL--PSIRHAVP 80
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+SFD+R WP C TIS I DQG CGSCWAFGA EA+SDR CIH +N+ +S D
Sbjct: 81 GDLPESFDSREQWPNCPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAED 140
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------P 200
LL CC CG GC+GG+P SAW Y+V G+VT C PY ++ C H P
Sbjct: 141 LLTCCD-SCGMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIAS-CEHHTKGKLP 198
Query: 201 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TP+CV C K N +R K++ +Y I+ + I EI NGPVE +FTVY
Sbjct: 199 PCGDIVDTPQCVHMCEKGYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNGPVEAAFTVY 258
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY+H+TG+ MGGHAV+++GWGT + G YW+
Sbjct: 259 ADFVTYKSGVYRHVTGEEMGGHAVRILGWGT-ESGTPYWL 297
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/282 (42%), Positives = 160/282 (56%), Gaps = 31/282 (10%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L ++ +N+ WKA N F N + K L G LG P
Sbjct: 26 LSSDLVNHINKL-NTTWKAGHN--FHNTDMSYVKKLCGT-------FLGGPKLPERVDFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+ LP +FD+R WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 ADIDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCS 198
DLL+CCGF CG GC+GGYP AWRY+ G+V+ C PY G
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEHHVNGSR 195
Query: 199 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P TP+C R C + ++ KHY I++Y + ++IMAEIYKNGPVE +F
Sbjct: 196 PPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFI 255
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+
Sbjct: 256 VYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWL 296
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/285 (42%), Positives = 163/285 (57%), Gaps = 34/285 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RN F N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ Y GC
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHV 191
Query: 198 --SHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
S P C T +C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE
Sbjct: 192 NGSRPPCTGEGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEG 251
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+
Sbjct: 252 AFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWL 295
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 121/285 (42%), Positives = 166/285 (58%), Gaps = 33/285 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 93
H L D ++ +N+ W+A N F N + K L G LG P
Sbjct: 24 HPLSDELVNYINKQ-NTTWQAGHN--FHNVHLSYVKRLCGT-------YLGGPRLPQRIK 73
Query: 94 --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP+SFDAR WP C TI I DQG CGSCWAFGAV A+SDR CIH +N+ +
Sbjct: 74 FAEIVDLPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHTNGHVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--- 199
S DLL+CCG CGDGC+GGYP +AW+Y+ G+V+ C PY C H
Sbjct: 134 SAEDLLSCCGLECGDGCNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVN 192
Query: 200 ---PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
P C TPKC + C + ++ KH+ +Y ++S+ ++IMAEIYKNGPVE
Sbjct: 193 GTRPQCTGEGGDTPKCSKTCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNGPVEG 252
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTV+ DF YK+GVYKH+ G+++GGHA++++GWG ++G YW+
Sbjct: 253 AFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWG-KENGVPYWL 296
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/314 (40%), Positives = 178/314 (56%), Gaps = 39/314 (12%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
L T C L++ + S+Q+ +L L D ++ VN+ W+A N F +
Sbjct: 3 QLLATLCCLVV-LTSAQS---------RLYFKPLSDELVNHVNK-LNTTWQAGHN--FYD 49
Query: 65 YTVGQFKHLLGVKPTPKGLLLG--VPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQG 121
+ K L G LL G +P + H + + LP++FDAR WP C TI I DQG
Sbjct: 50 VDMSYVKRLCGT------LLNGPKLPQRVHLAEEMDLPENFDARENWPNCPTIKEIRDQG 103
Query: 122 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
CGSCWAFGAVEA+SDR CIH +N+ +S DLL CC CGDGC+GG+P AW ++
Sbjct: 104 SCGSCWAFGAVEAISDRVCIHTNGNVNVEVSAEDLLTCCHMECGDGCNGGFPAGAWNFWT 163
Query: 180 HHGVVTEE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNS 225
G+V+ C PY S P C+ TPKC + C + ++
Sbjct: 164 KKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKED 223
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
KHY S+Y + S ++IMAEIYKNGPVE +F+VY DF YKSGVY+H+TG+ +GGHA+++
Sbjct: 224 KHYGYSSYGVPSSEQEIMAEIYKNGPVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRI 283
Query: 286 IGWGTSDDGEDYWV 299
+GWG ++G YW+
Sbjct: 284 LGWGV-ENGTPYWL 296
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 122/282 (43%), Positives = 160/282 (56%), Gaps = 32/282 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTHD 93
+L +I+ +N WKA +N F N + + L G KPT +P H
Sbjct: 24 LLSSEMIQYINR-LNTTWKAGQN--FYNVDLSYVQGLCGTLQNKPT-------LPELEHP 73
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+KLP +FDAR WP C TI I DQG CGSCWAFGA EA+SDR CIH + + +S
Sbjct: 74 AGVKLPDTFDARQQWPNCPTIQDIRDQGSCGSCWAFGAAEAISDRLCIHSNAKITVEISA 133
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----- 199
DLL+CC CG GC GGYP +AW Y+ G+VT + C PY C H
Sbjct: 134 EDLLSCCE-ECGMGCFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPY-SIPPCEHHVNGT 191
Query: 200 -PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C+ TPKC KC+ + K++ Y + S E IM E+YKNGPVE +F+
Sbjct: 192 RPPCQGEGDTPKCQTKCIDGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFS 251
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYEDF YKSGVY+H+TGD++GGHA+K++GWG ++ YW+
Sbjct: 252 VYEDFLLYKSGVYQHLTGDMLGGHAIKILGWGKENN-TPYWL 292
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 119/263 (45%), Positives = 160/263 (60%), Gaps = 22/263 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA N F+N + K L G G L D ++LP SFD+R+AWP C T
Sbjct: 41 WKAGHN--FANADLHYVKRLCGTHLN--GPQLQKRFGFAD-GMELPDSFDSRAAWPNCPT 95
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 171
I + DQG CGSCWAFGAVEA+SDR C+H +N+ +S DLL+CCGF CG GC+GGYP
Sbjct: 96 IREVRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFECGMGCNGGYP 155
Query: 172 ISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAY-PTPKCVRKCVKK 218
AW+++ G+V+ C PY S P C+ TPKCV++C
Sbjct: 156 SGAWKFWTETGLVSGGLYDSHLGCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEDG 215
Query: 219 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
++ + KH+ ++Y + S ++IMAEIYKNGPVE +F VY DF YKSGVY+H TG+
Sbjct: 216 YAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVYQHETGEE 275
Query: 278 MGGHAVKLIGWGTSDDGEDYWVC 300
+GGHA+K++GWG ++G YW+C
Sbjct: 276 LGGHAIKILGWGV-ENGTPYWLC 297
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 118/283 (41%), Positives = 162/283 (57%), Gaps = 30/283 (10%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
+L D ++ VN+ WKA N F + + L G +LG P S
Sbjct: 24 QLLSDELVDYVNKR-NTTWKAGHN--FYHVEPSYLRRLCGT-------ILGGPKLPQRVS 73
Query: 96 ----LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 149
+ LP++FDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI + +N+ +
Sbjct: 74 FAEDMVLPENFDAREHWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGHVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 197
S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY
Sbjct: 134 SAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNG 193
Query: 198 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
S P C TPKC + C ++ KHY ++Y +++ ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAF 253
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+V+ DF YKSGVY+H+TG++MGGHAV+++GWG +D YW+
Sbjct: 254 SVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVEND-TPYWL 295
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 118/302 (39%), Positives = 173/302 (57%), Gaps = 23/302 (7%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S TF E V ++ L D +I +NE+P AGWKA ++ +F +++ + L+G
Sbjct: 8 IVSLFTFLEAHVTTRNNQRIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ + V HD ++++P FD+R WP C +IS+I DQ CGSCWAFGAVE
Sbjct: 66 ARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVE 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------ 185
A++DR CI G + LS DL++CC CGDGC GG+P AW Y+V G+VT
Sbjct: 126 AMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKEN 184
Query: 186 -EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINS 237
C PY T +P C Y TP+C + C K + + KHY +Y + +
Sbjct: 185 HTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN 244
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
+ + I +I GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + Y
Sbjct: 245 NEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPY 303
Query: 298 WV 299
W+
Sbjct: 304 WL 305
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 131/326 (40%), Positives = 174/326 (53%), Gaps = 49/326 (15%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +C+++ + E V+ +LD D +I VNEN W A + +FS+
Sbjct: 4 LLFLSCIVVAAYCACNDNLESVLEAAELDG----DDLIDYVNENQNL-WTAKKQRRFSS- 57
Query: 66 TVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSAWPQCST 113
+ G K L+GV KT D L +P+SFD+R WP+C +
Sbjct: 58 -------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDNWPKCDS 110
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 171
I I DQ CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG GC+GG P
Sbjct: 111 IKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGGDP 169
Query: 172 ISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKC 215
++AWRY+V G+VT Y + GC P CE YPTPKC +KC
Sbjct: 170 LAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKC 227
Query: 216 VKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
V ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y GVY H
Sbjct: 228 VSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHT 287
Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYWV 299
G + GGHAVKLIGWG DDG YW
Sbjct: 288 GGKLGGGHAVKLIGWGI-DDGIPYWT 312
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 118/277 (42%), Positives = 159/277 (57%), Gaps = 23/277 (8%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLP 99
+SI ++N GWKA N +F N T+ + +G + +G + + VK + LP
Sbjct: 34 ESIANDINAR-NVGWKAGVNERFVNVTMDYIRKQMGTRL--EGSPVTLDVKHVEVPADLP 90
Query: 100 KSFDARSAW-PQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLA 156
SFD+R+ W C ++ + DQ +CGSCWAFGAVEA++DR CI +S DLL
Sbjct: 91 TSFDSRTQWGSMCPSVKEVRDQANCGSCWAFGAVEAMTDRTCIASKGAQTPHISAEDLLT 150
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCE 203
CC F CGDGC+GGYP +AW Y+ + G+VT + C PY +TG P C
Sbjct: 151 CCTFTCGDGCNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKP-CG 209
Query: 204 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
PTP C R C + N + N KH+ S+Y + + I EI NGPVE +FTVY DF
Sbjct: 210 DIVPTPACKRSCRQGYNVTYPNDKHFGASSYGVRG-VDQIATEIMTNGPVEAAFTVYSDF 268
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVY+H +G +GGHA+K+IGWG DG DYW+
Sbjct: 269 LSYKSGVYQHTSGQPLGGHAIKIIGWGVQ-DGTDYWI 304
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 109/222 (49%), Positives = 142/222 (63%), Gaps = 18/222 (8%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +N+ +S
Sbjct: 3 EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 62
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 199
DLL CCG CGDGC+GGYP AW ++ G+V+ C PY C H
Sbjct: 63 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGA 121
Query: 200 -PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FT
Sbjct: 122 RPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFT 181
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 182 VFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 222
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/263 (45%), Positives = 156/263 (59%), Gaps = 22/263 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA N F+N V K L G L L LP SFD+R+AWP C T
Sbjct: 41 WKAGHN--FANADVHYVKRLCGTHLNGPQLQKRFGFA---DDLDLPDSFDSRAAWPNCPT 95
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 171
I I DQG CGSCWAFGAVEA+SDR C+H +N+ +S DLL+CCGF CG GC+GGYP
Sbjct: 96 IREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFKCGMGCNGGYP 155
Query: 172 ISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAY-PTPKCVRKCVKK 218
AWR++ G+V+ C PY S P C+ TPKC++ C +
Sbjct: 156 SGAWRFWTETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPSCKGEEGDTPKCMKTCEEG 215
Query: 219 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
+ + KH+ ++Y + S ++IMA+IYKNGPVE +F VY DF YKSGVY+H TG+
Sbjct: 216 YTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEE 275
Query: 278 MGGHAVKLIGWGTSDDGEDYWVC 300
+GGHA+K++GWG ++G YW+C
Sbjct: 276 LGGHAIKILGWGV-ENGTPYWLC 297
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 141/221 (63%), Gaps = 16/221 (7%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +N+ +S
Sbjct: 8 EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 67
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSH 199
DLL CCG CGDGC+GGYP AW ++ G+V+ C PY S
Sbjct: 68 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSR 127
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV
Sbjct: 128 PPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTV 187
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 188 FSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 227
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 112/224 (50%), Positives = 142/224 (63%), Gaps = 17/224 (7%)
Query: 87 VPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG- 144
+P+KT + KLP +FD+R+ WP C TI I DQG CGSCWAFGAVE++SDR C+H G
Sbjct: 1 LPLKTSFSGNWKLPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGG 60
Query: 145 -MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF---- 192
N+ +S DLL+CCGF CG GC+GGYP AW+Y+ G+V+ C PY
Sbjct: 61 KQNVEVSAEDLLSCCGFECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPC 120
Query: 193 -DSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
S P C TPKCV+KC + K Y SAY + S PE IM EIYK+
Sbjct: 121 EHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKD 180
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
GPVE +FTVYEDF YKSGVY+H TG+ +GGHA+K++GWG ++
Sbjct: 181 GPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAIKILGWGIENN 224
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/291 (42%), Positives = 164/291 (56%), Gaps = 46/291 (15%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL----LGVKPTPKGLLLGVPVKT 91
H L D +I +N+ W+A RNP N + K L LG P + G
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRNPY--NVDISYLKKLCGTVLGGPKLPGRVAFG----- 75
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 76 --EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
S DLL CCG CGDGC+GGYP AW ++ G+V+ Y+DS H GC P Y P
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVS---GGYYDS----HIGCLP-YTIP 185
Query: 210 KC----------------VRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIMAEIYK 248
C R+C K + ++ KH+ ++Y +++ + IMAEIYK
Sbjct: 186 PCEHHVNGSRPPCTGEGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIYK 245
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPVE +FTV+ DF YKSGVYKH GD+MGGHA++++ WG ++G YW
Sbjct: 246 NGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWA 295
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 119/266 (44%), Positives = 161/266 (60%), Gaps = 28/266 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH---DKSLKLPKSFDARSAWPQ 110
WKA N F+N + K L G LL G ++ L+LP SFD+R+AWP
Sbjct: 41 WKAGHN--FANADLHYVKRLCGT------LLKGPQLQKRFGFADGLELPDSFDSRAAWPN 92
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 168
C TI I DQG CGSCWAFGAVEA+SDR C+H +N+ +S DLL+CCG CG GC+G
Sbjct: 93 CPTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGMGCNG 152
Query: 169 GYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAY-PTPKCVRKC 215
GYP AW+++ G+V+ C PY S P C+ TPKCV++C
Sbjct: 153 GYPSGAWQFWTETGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQC 212
Query: 216 VKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
+ + + KH+ ++Y + + ++IMAEIYKNGPVE +F VY DF YKSGVY+H T
Sbjct: 213 EEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVYQHET 272
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWVC 300
G+ +GGHA+K++GWG ++G YW+C
Sbjct: 273 GEELGGHAIKILGWGV-ENGTPYWLC 297
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 177/332 (53%), Gaps = 51/332 (15%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
L +C+++ + E V+ K + +DS + D +I VNEN W A +
Sbjct: 3 LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 61
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
+FS+ + G K L+GV KT D L +P+SFD+R
Sbjct: 62 RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 113
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG G
Sbjct: 114 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 172
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
C+GG P++AWRY+V G+VT Y + GC P CE YPTP
Sbjct: 173 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 230
Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
KC +KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y
Sbjct: 231 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 290
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY H G + GGHAVKLIGWG DDG YW
Sbjct: 291 GVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWT 321
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/332 (39%), Positives = 177/332 (53%), Gaps = 51/332 (15%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
L +C+++ + E V+ K + +DS + D +I VNEN W A +
Sbjct: 4 LLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAKKQ 62
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
+FS+ + G K L+GV KT D L +P+SFD+R
Sbjct: 63 RRFSS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDN 114
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG G
Sbjct: 115 WPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFG 173
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
C+GG P++AWRY+V G+VT Y + GC P CE YPTP
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 231
Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
KC +KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y
Sbjct: 232 KCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 291
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY H G + GGHAVKLIGWG DDG YW
Sbjct: 292 GVYVHTGGKLGGGHAVKLIGWGI-DDGIPYWT 322
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 170/311 (54%), Gaps = 26/311 (8%)
Query: 13 LILGVISSQTFAEGVV--SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
L++G+++ + V + L++ +L+ + + + +KA FS+Y
Sbjct: 8 LLVGLVAVNAYNVEVKHGDSIPLEAQMLRGQDLVDYVNKQQTSFKAKLGSYFSSYPDTIK 67
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
K L+G K V TH + L +P SFD+R+ WP C +IS+I DQ CGSCWA
Sbjct: 68 KQLMGAKMIEIPDEYRVFEMTHPEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWA 127
Query: 129 FGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
A E +SDR CI + LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT
Sbjct: 128 VSAAETISDRICIASNGKTQLSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG 187
Query: 187 ECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL-WRNSKHYS 229
Y + TGC +P CE YPT KC R C L + H+
Sbjct: 188 --GSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYTQDLHFG 245
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
SAY ++ +I EI +GPVEV+F+VYEDF HY GVY H G +GGHAVK++GWG
Sbjct: 246 QSAYAVSKKVTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWG 305
Query: 290 TSDDGEDYWVC 300
D+G YW+C
Sbjct: 306 V-DNGTPYWLC 315
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/284 (42%), Positives = 162/284 (57%), Gaps = 35/284 (12%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L ++ +N+ G +A N F N + K L G LG P
Sbjct: 26 LSSDLVNHINKLNTTG-RAGHN--FHNTDMSYVKKLCGT-------FLGGPKAPERVDFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+ + LP +FD R WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE----- 203
DLL+CCGF CG GC+GGYP AWRY+ G+V+ Y GC + P CE
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNG 193
Query: 204 -------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
TP+C R C + ++ KHY I++Y + ++IMAEIYKNGPVE +
Sbjct: 194 SRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGA 253
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+
Sbjct: 254 FIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWL 296
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/218 (50%), Positives = 140/218 (64%), Gaps = 18/218 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +N+ +S DLL
Sbjct: 1 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGC 202
CCG CGDGC+GGYP AW ++ G+V+ C PY C H P C
Sbjct: 61 TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPC 119
Query: 203 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ D
Sbjct: 120 TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 179
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F YKSGVYKH GDVMGGHA++++GWG ++G YW+
Sbjct: 180 FLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWL 216
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 115/280 (41%), Positives = 165/280 (58%), Gaps = 22/280 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINKHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVND 153
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSHPG 201
L++CC CGDGC GG+P AW Y+V G+VT C PY T +P
Sbjct: 148 LISCCED-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPA 206
Query: 202 C-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C Y TP+C +KC K + + K+Y Y + S+ + I EI GPVE +F VY
Sbjct: 207 CGTKIYKTPQCKQKCQKGYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYGPVEAAFDVY 266
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDF +YKSG+Y+H+ G ++GGHA+++IGWG + G+ YW+
Sbjct: 267 EDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGKPYWL 305
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 172/311 (55%), Gaps = 26/311 (8%)
Query: 13 LILGVISSQTFAEGV--VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
L++G+++ Q + V + +++ +L+ + + + + A FS+Y
Sbjct: 8 LLVGLVAVQAYNVEVKHADAIPVEAQMLRGQELVDYVNKQQTTFTAKLGSYFSSYPDTIK 67
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
K L+G K V TH + L +P SFD+R+ WP C +IS+I DQ CGSCWA
Sbjct: 68 KQLMGAKMVEIPEEYRVFEMTHPEVLDTAVPDSFDSRTQWPNCPSISKIRDQSSCGSCWA 127
Query: 129 FGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
A E +SDR CI + +S+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT
Sbjct: 128 VSAAETISDRICIASNGKTQISISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG 187
Query: 187 ECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL-WRNSKHYS 229
Y + +GC +P CE P+ YPT KC C L + H+
Sbjct: 188 --GSYQEKSGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPLTYTQDLHFG 245
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
SAY ++ P +I EI +GPVEV+FTVYEDF HY GVY H G +GGHAVK++GWG
Sbjct: 246 QSAYAVSKKPAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWG 305
Query: 290 TSDDGEDYWVC 300
D+G YW+C
Sbjct: 306 V-DNGTPYWLC 315
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 170/306 (55%), Gaps = 34/306 (11%)
Query: 14 ILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 73
+L +S G S+ +L D ++ VN+ WKA N F N + L
Sbjct: 4 LLACLSCLVVLAGAQSRPPF--QLLSDELVNYVNKR-NTTWKAGHN--FHNVDPSYLRRL 58
Query: 74 LGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
G LG P +++ LP++FDAR WP C TI I DQG CGSCWAF
Sbjct: 59 CGT-------FLGGPKLPQRVWFAENMVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAF 111
Query: 130 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
GAVEA+SDR CI +N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+
Sbjct: 112 GAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGG 171
Query: 188 -------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAY 233
C PY C H P C TPKC + C ++ KHY S+Y
Sbjct: 172 LYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKFCEPGYTPSYKEDKHYGCSSY 230
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
++S ++IMAEIYKNGPVE +FTVY DF YKSGVY+H+TG++MGGHAV+++GWG ++
Sbjct: 231 SVSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EN 289
Query: 294 GEDYWV 299
G YW+
Sbjct: 290 GTPYWL 295
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/294 (42%), Positives = 163/294 (55%), Gaps = 22/294 (7%)
Query: 22 TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 81
T E + K L +I +N WKAA +F TV + +LG P P
Sbjct: 21 TLNEIDARRHKRMYQPLSMELINFINYEANTTWKAAPTTRFR--TVSDIRRMLGALPDPN 78
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
G L + T S +LPKSFDAR WP C +IS I DQ CGSCWAFGAVEA+SDR CI
Sbjct: 79 GEQLET-LCTGYISDELPKSFDARVEWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICI 137
Query: 142 HFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF 192
LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY
Sbjct: 138 KSKGKHKPFLSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY- 195
Query: 193 DSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAE 245
+ C H P C+ TP C C N + K Y YRI+S+PE IM E
Sbjct: 196 EFPPCEHHVIGPLPSCDGDVETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNPEAIMLE 255
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ +NGPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW+
Sbjct: 256 LMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWL 308
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 175/311 (56%), Gaps = 24/311 (7%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+ T L I+ ++S +++ ++ L D +I +N++P AGW A+R+ +F +
Sbjct: 1 MMNTVLCIVSLMS--ILTAHILTDNEVQFEPLSDEMIAYINQHPDAGWTASRSDRFK--S 56
Query: 67 VGQFKHLLGVKPTPKGLLLGV-PVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCG 124
V + LLG + L P H SL++P SFD+R W QC +IS I DQ CG
Sbjct: 57 VEDARILLGAMSEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCG 116
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
CWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y+V G
Sbjct: 117 PCWAFAAVEAMSDRICIQSKGKKSVELSAVDLLSCC-TECGLGCQGGFPGAAWDYWVEEG 175
Query: 183 VVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHY 228
+VT C PY T +P C E Y TPKC +KC K + ++ K+Y
Sbjct: 176 IVTGSSKENHTGCQPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYKKDKYY 235
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 288
+Y + S + I EI +GPVE +FTVY DF +YKSG+YKH+ G V+GGHAV++IGW
Sbjct: 236 GKLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGW 295
Query: 289 GTSDDGEDYWV 299
G + YW+
Sbjct: 296 GV-EKKTPYWL 305
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 119/279 (42%), Positives = 163/279 (58%), Gaps = 24/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSL 96
L I+ VN WKA ++S +V + K+L G P G L P+ H +++
Sbjct: 65 LSQEIVDYVNTKADTTWKAEVTSKWS--SVAEVKNLCGSLKDPNGSRL--PIMRHKLEAV 120
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP FDAR W C TI + DQG CGSCWAFGAVEA+SDR CI N+ +S DL
Sbjct: 121 NLPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIASKGNVHAHISSEDL 180
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGCSHPG 201
L+CC CG GC+GG+P +AW YF G+V+ + C PY + G P
Sbjct: 181 LSCCSS-CGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP- 238
Query: 202 CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C PTPKC R C K ++ + + K++ +AY +++D + IM EI NGPVE +FTVY
Sbjct: 239 CSGEGPTPKCERTCEKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAFTVYA 298
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY+H++G +GGHA++++GWG +DG YW+
Sbjct: 299 DFPTYKSGVYQHVSGGELGGHAIRVLGWGV-EDGTPYWL 336
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/305 (40%), Positives = 172/305 (56%), Gaps = 35/305 (11%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF- 70
LLI G+ S+ + + L D I +N + + W+A RN F+ T ++
Sbjct: 9 LLICGIFSAS-----------IPTDPLSDEFIDYIN-SLQTTWRAGRN--FAPNTPKKYL 54
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
K L GV +P + + LPK FDAR WP C++I+ I DQG CGSCWAFG
Sbjct: 55 KSLAGVHKDANNAFT-LPKRQVSLDVTLPKEFDARKHWPNCTSIAEIRDQGSCGSCWAFG 113
Query: 131 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 185
AVEA+SDR CIH + + LS +L++CC CG GCDGGYP SAW Y+ + G+V+
Sbjct: 114 AVEAMSDRICIHSNGKLQVHLSAENLVSCCDS-CGFGCDGGYPASAWDYWQNVGIVSGGN 172
Query: 186 ----EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYR 234
+ C PY + C H P C TP C +C K++ + + +Y SAY
Sbjct: 173 YGSKQGCQPYSIAP-CEHHVPGPRPACSGEGSTPDCRNQCDKRSGISYDKDLYYGESAYS 231
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
+ + + I AEI KNGPVE +FTVYED +YK GVY+H+ G V+GGHA+K++GWG +D
Sbjct: 232 LEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVEND- 290
Query: 295 EDYWV 299
YW+
Sbjct: 291 TPYWL 295
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 118/283 (41%), Positives = 166/283 (58%), Gaps = 23/283 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKTHDKS 95
L D +I +N+ P WKA R +F+ ++ K ++GV + L + +D +
Sbjct: 32 LSDELITFINKQPNIEWKADRTTRFT--SIHHAKSMMGVLLNSVDQHKLHHPIIHHNDIN 89
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+KLPK FD+R W CS+I I DQ CGSCWAFGAVE++SDR CIH +++ LS +
Sbjct: 90 IKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVN 149
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 200
LL+CC CG GC+GG P AW Y+ G+VT C PY ST +H
Sbjct: 150 LLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHS 208
Query: 201 GCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
CE Y TP+C + C + + N K+Y S+Y + SD IM EI NGPVE +F V
Sbjct: 209 SCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYV 268
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWVC 300
++DF +YK+GVYK++TG ++GGHA+++IGWG S + YW+C
Sbjct: 269 FDDFLNYKTGVYKYVTGSLLGGHAIRIIGWGVSTLNHTPYWLC 311
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/287 (43%), Positives = 163/287 (56%), Gaps = 28/287 (9%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
++ +L +H D +I +N ++ W A N F N K L G KG L
Sbjct: 15 ARPQLHTH---DEMISFINA-ARSTWTAGVN--FDNVPKEYLKSLCGT--VLKGPRLPHT 66
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 148
VK H ++KLP SFD R WP C T+S+I DQG CGSCWAFGAVE++SDR CIH S
Sbjct: 67 VK-HSTNVKLPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGAVESISDRICIHSKGKQS 125
Query: 149 --LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 199
+S DLL+CC CG GC GG+P AW Y+ G+VT C PY C H
Sbjct: 126 PEISAEDLLSCCD-QCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPY-SIAPCEH 183
Query: 200 ------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
P C TPKC C+ K + ++ KH+ Y + SD + IM E+Y NGPV
Sbjct: 184 HVNGTRPPCSGEQDTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYTNGPV 243
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E +FTVYEDF YKSGVY+H+TG +GGHAVK++GWG ++G +W+
Sbjct: 244 EAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWG-EENGTPFWL 289
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 120/263 (45%), Positives = 154/263 (58%), Gaps = 23/263 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA N F N + L G KG L + V+ + LKLP FDAR WP+C T
Sbjct: 40 WKAGHN--FHNVDYSYVRRLCGT--MLKGPKLPIMVQ-YAGGLKLPAEFDAREQWPECPT 94
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 171
+ I DQG CGSCWAFGA EA+SDR CIH G +S+ ++ DLL CC CG GC+GGYP
Sbjct: 95 LKEIRDQGSCGSCWAFGAAEAISDRVCIHSGGKISVEISSEDLLTCCDS-CGMGCNGGYP 153
Query: 172 ISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKC-VK 217
SAW ++ G+V+ C PY S G P TP+C+ +C
Sbjct: 154 SSAWDFWTKEGLVSGGLYNSHIGCRPYTISPCEHHVNGSRPPCTGEGGDTPECISRCEAG 213
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
+ ++ KHY S+Y + E I AEI KNGPVE +FTVYEDF YKSGVY+H++G V
Sbjct: 214 YSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSV 273
Query: 278 MGGHAVKLIGWGTSDDGEDYWVC 300
+GGHA+K++GWG +DG YW+C
Sbjct: 274 LGGHAIKVLGWG-EEDGIPYWLC 295
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/267 (42%), Positives = 159/267 (59%), Gaps = 21/267 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVND 153
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSHPG 201
L++CC CGDGC GG+P AW Y+V G+VT C PY T +P
Sbjct: 148 LISCCED-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPA 206
Query: 202 C-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C Y TP+C +KC K + + KHY +Y + S+ + I EI NGPVE +F VY
Sbjct: 207 CGTKIYKTPQCKQKCQKGYKTPYEQDKHYGEESYNVISNEKAIQKEIMMNGPVEAAFDVY 266
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLI 286
EDF +YKSG+Y+H+TG ++GGHA+++I
Sbjct: 267 EDFLNYKSGIYRHVTGSIVGGHAIRII 293
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 210 bits (534), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 120/264 (45%), Positives = 157/264 (59%), Gaps = 28/264 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHD-KSLKLPKSFDARSAWPQC 111
W A +N F N K L G + PK +P HD + +KLP SFD R WP C
Sbjct: 40 WTAGQN--FHNKDSSFVKGLCGTILKGPK-----LPELAHDVEGIKLPDSFDPREQWPNC 92
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGG 169
T+ +I DQG+CGSCWAFGA EA+SDR CI G ++L +S DLL CC CG GC GG
Sbjct: 93 PTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKISLEISAEDLLTCCD-ECGMGCFGG 151
Query: 170 YPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCV 216
+P +AW ++ + G+VT C PY + C H P C+ TPKCV +C
Sbjct: 152 FPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAP-CEHHVNGSRPPCQGEVETPKCVTQCN 210
Query: 217 KKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 275
L + KH+ +Y I S E IM E+YKNGPVE +F+VY DF YK+GVY+H+TG
Sbjct: 211 NGYSLSYPKDKHFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQHVTG 270
Query: 276 DVMGGHAVKLIGWGTSDDGEDYWV 299
D++GGHAVK++GWG ++G YW+
Sbjct: 271 DMLGGHAVKILGWG-EENGTPYWL 293
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 178/314 (56%), Gaps = 34/314 (10%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
LTT L +S + + + D+ D +I+ +N W+A RNP F +
Sbjct: 9 LLTTVAL---AVSEDALRDRYLIPAETDAS--SDKMIQYINY-LNTTWQAGRNPGFED-- 60
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCG 124
+ LLGV +P+ +P + D S LP++FD+R WP+C+TI I DQG CG
Sbjct: 61 PAYVRGLLGV--SPENHRYRLPERRLDLSSLGPLPENFDSRENWPECTTIGEIRDQGSCG 118
Query: 125 SCWAFGAVEALSDRFCIHFG----MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
SCWAFGAVEA+SDR CIH + LS +DLL+CC CG+GC+GG+P SAW ++V
Sbjct: 119 SCWAFGAVEAMSDRTCIHSPSGGPKRVHLSADDLLSCC-RTCGNGCNGGFPGSAWSFWVK 177
Query: 181 HGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL-WRNS 225
G+VT + C PY C H P + PTP+CV C K + + +
Sbjct: 178 TGIVTGGNYDSDDGCMPY-PIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDVDYHDD 236
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
KHY S+Y + S+ + I AEI NGPVE FTVY DF HYKSGVY+ T + +GGHA++L
Sbjct: 237 KHYGKSSYSVPSEEKQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRL 296
Query: 286 IGWGTSDDGEDYWV 299
+GWG ++G YW+
Sbjct: 297 LGWGV-ENGVPYWL 309
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 174/318 (54%), Gaps = 31/318 (9%)
Query: 1 MASSHLFLTT--CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAAR 58
M S + F + CL+ L E ++ K L +I +N WKAA
Sbjct: 1 MTSYNYFCSVLFCLIFLNY-------EIEANRHKYMHQPLSSELIHFINHEANTTWKAAP 53
Query: 59 NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRI 117
+ +F +V + +LG P P G L + SL +LPK FDAR WP C +IS I
Sbjct: 54 SSRFK--SVSDIRRMLGALPDPNGGYLPTLCTGYTPSLDELPKEFDARKHWPHCPSISEI 111
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYPISAW 175
DQ CGSCWAFGAVEA+SDR CI G++ LS +L+ACC CG GC+GG+P SAW
Sbjct: 112 RDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCS-SCGMGCNGGFPHSAW 170
Query: 176 RYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQL 221
Y+ G+VT + C PY + C H P C TPKC C N
Sbjct: 171 SYWKRSGIVTGDLYNTTDGCQPY-EFPPCEHHVVGPRPSCGGDVETPKCKTTCQPGYNIP 229
Query: 222 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 281
+ K Y + YR++S+ E IM E+ +GPVEV F VY DF +YKSGVY+H++G ++GGH
Sbjct: 230 YNKDKWYGKTVYRVHSNQEAIMKEVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGH 289
Query: 282 AVKLIGWGTSDDGEDYWV 299
AV+L+GWG ++G YW+
Sbjct: 290 AVRLLGWG-EENGVPYWL 306
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 122/295 (41%), Positives = 162/295 (54%), Gaps = 22/295 (7%)
Query: 22 TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 81
T E + K L +I +N WKA +F TV + +LG P P
Sbjct: 20 TLNENDARRHKHMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77
Query: 82 GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
G L ++ +L +LPKSFDAR W C +IS I DQ CGSCWAFGAVEA+SDR C
Sbjct: 78 GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137
Query: 141 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 191
I LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196
Query: 192 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 244
+ C H P C+ TP C R C N + N K Y YR+ S+ E IM
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW+
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWL 309
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 122/295 (41%), Positives = 162/295 (54%), Gaps = 22/295 (7%)
Query: 22 TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 81
T E + K L +I +N WKA +F TV + +LG P P
Sbjct: 20 TLNENDARRHKRMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77
Query: 82 GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
G L ++ +L +LPKSFDAR W C +IS I DQ CGSCWAFGAVEA+SDR C
Sbjct: 78 GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137
Query: 141 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 191
I LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196
Query: 192 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 244
+ C H P C+ TP C R C N + N K Y YR+ S+ E IM
Sbjct: 197 -EFPPCEHNTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW+
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWL 309
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 124/290 (42%), Positives = 160/290 (55%), Gaps = 31/290 (10%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH- 92
D H+L D I+ V K W RN S + G + L+GV P L P K+
Sbjct: 23 DPHMLSDEFIELVRSKAKT-WTPGRNFDAS-VSEGHIRGLMGVHPDAHKFTL--PEKSQV 78
Query: 93 ------DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
D LP+SFDAR+AWP C TI I DQG CGSCWAFGAVEA+SDR CIH
Sbjct: 79 LGNLVGDDGDDLPESFDARTAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGT 138
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 197
+N S DL++CC CG GC+GG+P +AW Y+ H G+V+ E C PY + C
Sbjct: 139 VNFHFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPY-EIEPC 196
Query: 198 SH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
H P C+ TP C +C + + KH+ +Y I +P +I EI NG
Sbjct: 197 EHHVNGTRPPCKNGR-TPSCKHQCESSYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNG 255
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWV 299
PVE +FTVYED YKSGVYKH+ G +GGHA++++GWG D + YW+
Sbjct: 256 PVEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRILGWGVWGDSKVPYWL 305
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 122/304 (40%), Positives = 171/304 (56%), Gaps = 28/304 (9%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 68
CL+ + + T G+ +K D +L S + EVN K W A+ N + + ++G
Sbjct: 10 CLVAVFALLLATTVSGLYAKPS-DFPLLGKSFVAEVNSKAKGQWTASANNGYLVTGKSLG 68
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+ FDA WP C TIS I DQ +CGSCWA
Sbjct: 69 EVRKLMGVTDMSTEAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWA 128
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++V G+ TE+
Sbjct: 129 IAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATED 187
Query: 188 CDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQL----WRNSKHYSISAYR 234
C PY FD CSH G YP TPKC C ++N++ ++ S YS+ +
Sbjct: 188 CQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTC-ERNEMDLVKYKGSTSYSVKGEK 244
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
++M E+ NGP+E++ VY DF YKSGVYKH+ GD +GGHAVKL+GWGT DG
Sbjct: 245 ------ELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGT-QDG 297
Query: 295 EDYW 298
YW
Sbjct: 298 VPYW 301
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/289 (42%), Positives = 163/289 (56%), Gaps = 24/289 (8%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
VS + H L ++ +N+ WKA N F N + L G KG L V
Sbjct: 15 VSLARPHLHPLSSEMVNHINK-LNTTWKAGHN--FHNVDYSYVRKLCGT--MLKGPKLPV 69
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
V+ + +KLPK FDAR WP C T+ I DQG CGSCWAFGA EA+SDR CIH +
Sbjct: 70 MVQ-YAGDVKLPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNGKV 128
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS---- 194
N+ +S DLL CC CG GC+GGYP +AW ++ G+V+ C PY +
Sbjct: 129 NVEISSEDLLTCCDS-CGMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEH 187
Query: 195 --TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
G P TP+CVR+C + KHY ++Y + SD + I EIYKNGP
Sbjct: 188 HVNGSRPPCTGEGGDTPECVRQCESGYTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNGP 247
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VE +FTVYEDF YK+GVY+H++G +GGHA+K++GWG ++G YW+C
Sbjct: 248 VEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAIKVLGWG-EENGTPYWLC 295
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/285 (43%), Positives = 163/285 (57%), Gaps = 31/285 (10%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH-----DK 94
D +I VN N + WKA + +FS Y KH G+ + L V K H D
Sbjct: 59 DELINYVNNNQQL-WKAKKQRRFSMYKGENDKHKWGLMGVNH-VRLSVKGKQHLSKTKDL 116
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
+ +P+SFD+R WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + +SLS +
Sbjct: 117 DMDIPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSAD 176
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------ 203
DLL+CC CG GC+GG P++AWRY+V G+VT + ++GC P CE
Sbjct: 177 DLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGS--NFTANSGCKPYPFPPCEHHSKKT 233
Query: 204 -------PAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
YPTPKC ++C + ++ + K Y SAY + D E I E+ +GP+E+
Sbjct: 234 HFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEI 293
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+F VYEDF +Y GVY H G + GGHAVKLIGWG +DG YW
Sbjct: 294 AFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGIPYWT 337
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/295 (41%), Positives = 162/295 (54%), Gaps = 22/295 (7%)
Query: 22 TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 81
T E + K L +I +N WKA +F TV + +LG P P
Sbjct: 20 TLNENDARRHKRMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77
Query: 82 GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
G L ++ +L +LPKSFDAR W C +IS I DQ CGSCWAFGAVEA+SDR C
Sbjct: 78 GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137
Query: 141 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 191
I LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196
Query: 192 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 244
+ C H P C+ TP C R C N + N K Y YR+ S+ E IM
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW+
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWL 309
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/282 (40%), Positives = 159/282 (56%), Gaps = 24/282 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L +II VN WKA + +F++ + Q + LG P P G L V +
Sbjct: 39 LSSAIIDYVNRI-NTTWKAEPSRRFTSPS--QVRQQLGALPDPMGRRLPVLYSLSENYKS 95
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH------FGMNLSLSV 151
LP SFD R WP C T+ I DQG CGSCWAFGA EA+SDR CI + + LS
Sbjct: 96 LPASFDPRKKWPNCKTLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVMVRLSA 155
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE------------ECDPYFDSTGCSH 199
+DLL+CC CG GC+GG+P AW ++ H G+V+ E P +
Sbjct: 156 DDLLSCC-RDCGMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHHVNGTR 214
Query: 200 PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P CE PTPKC C ++ ++ ++ KHY++ Y ++S+ + I E+ +GPVE F V
Sbjct: 215 PPCEGDAPTPKCKNVCQEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEV 274
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
Y DF YKSGVY+H++G ++GGHA+KL+GWG +DG YW+C
Sbjct: 275 YADFPTYKSGVYQHVSGALLGGHAIKLMGWG-EEDGVPYWLC 315
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/295 (41%), Positives = 162/295 (54%), Gaps = 22/295 (7%)
Query: 22 TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 81
T E + K L +I +N WKA +F TV + +LG P P
Sbjct: 20 TLNENDARRHKHMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77
Query: 82 GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
G L ++ +L +LPKSFDAR W C +IS I DQ CGSCWAFGAVEA+SDR C
Sbjct: 78 GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRIC 137
Query: 141 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 191
I LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196
Query: 192 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 244
+ C H P C+ TP C R C N + N K Y YR+ S+ E IM
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW+
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWL 309
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/333 (38%), Positives = 176/333 (52%), Gaps = 51/333 (15%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLK---LDSHILQ---DSIIKEVNENPKAGWKAARN 59
L L +CL + E + K + +D + D +I VN N W+A +
Sbjct: 4 LLLLSCLAVAVYCGCNDNVESTLDKFRNREIDDEAAELDGDELINYVNNNQDL-WRAKKQ 62
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGLLLGVP------------VKTHDKSLKLPKSFDARSA 107
+F++ + G K L+GV KT D + +P++FD+R
Sbjct: 63 RRFTS--------VYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDMDIPENFDSREN 114
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 165
WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + +SLS +DLL+CC CG G
Sbjct: 115 WPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSCGFG 173
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTP 209
C+GG P++AWRY+V G+VT Y ++GC P CE YPTP
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGS--NYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTP 231
Query: 210 KCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
KC +KC+ ++ + K Y SAY + D E I E+ +GP+E++F VYEDF +Y
Sbjct: 232 KCEKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDG 291
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
GVY H G + GGHAVKL+GWG ++G YW C
Sbjct: 292 GVYVHTGGKLGGGHAVKLVGWGI-ENGIPYWTC 323
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/302 (38%), Positives = 173/302 (57%), Gaps = 26/302 (8%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+L V + ++ G VS + IL I +N++ K W+A N + +
Sbjct: 7 FLLTVYAGAAYSRGAVS-----NGILSKDYIDSINKDSKT-WRAGSNFD-EEISTSYIRG 59
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
L+GV P K L + T + ++P++FD+R WP C TIS I DQG CGSCWAFGAV
Sbjct: 60 LMGVLPNHKDYLPPA-LPTLLGTEQIPENFDSRQKWPHCPTISLIRDQGSCGSCWAFGAV 118
Query: 133 EALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 185
EA+SDR CIH +++S +LL+CC + CG GC+GG+P +AW ++ G+V+
Sbjct: 119 EAMSDRLCIHSNKIVNVSAENLLSCC-YSCGFGCNGGFPGAAWSFWKKKGLVSGGLYGSH 177
Query: 186 EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL--WRNSKHYSISAYRINS 237
+ C PY + C H P C TPKC C ++ + K + S+Y + S
Sbjct: 178 KGCQPYAIAP-CEHHANGTRPPCSGGGRTPKCHTFCENEDYSLPYEKDKSFGRSSYSVKS 236
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
DP+ I EI NGPVE +F+VY DF +YKSGVY+H+ G ++GGHA++++GWG ++G Y
Sbjct: 237 DPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGV-ENGTPY 295
Query: 298 WV 299
W+
Sbjct: 296 WL 297
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 121/317 (38%), Positives = 174/317 (54%), Gaps = 29/317 (9%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M S +F LLI +F + +++D + L D I +N + + W A RN
Sbjct: 1 MFKSIIFALVGLLIF------SFGRVDGATVRVDLNPLSDEFIDHIN-SIQYYWSAGRNF 53
Query: 61 QFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
+ + K L+GV + + L + +D S LP++FDAR WP C TI + D
Sbjct: 54 H-KDTPISYIKGLMGVHEKNAEYPKLEQLLTYNDASTDLPETFDARERWPNCPTIREVRD 112
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
QG CGSCWAFGAVEA+SDR CIH N S +L++CC + CG GC+GG+P +AW Y
Sbjct: 113 QGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNY 171
Query: 178 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-W 222
+ G+V+ PY + GC + C+ TP CV+KC + ++ +
Sbjct: 172 WKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPY 229
Query: 223 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
H+ SAY I +D + I EIY NGPVE +FTVYEDF Y++GVYKH+ G +GGHA
Sbjct: 230 AQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHA 289
Query: 283 VKLIGWGTSDDGEDYWV 299
++++GWG + YW+
Sbjct: 290 IRILGWGVQNGEIPYWL 306
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 130/312 (41%), Positives = 171/312 (54%), Gaps = 32/312 (10%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNY 65
L C L+ G +S+ V + K L D +I +N+ WKA +N +
Sbjct: 4 LVLCALVAGAMSAL-----VEFRDKDIFEPLSDEMIWFINK-LNTTWKAGQNFHHIAKDD 57
Query: 66 TVGQFKHLLGVK-PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ K + G TP L L P K + LP SFD+R+ WP C T+ + DQG CG
Sbjct: 58 RLAHVKMMCGTYLNTPPELRL--PEKKMEPLKDLPASFDSRTQWPNCPTLKEVRDQGACG 115
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAFGAVEA+SDR CI N+ +S DL +CC CG+GC+GG+P +AW Y+ G
Sbjct: 116 SCWAFGAVEAMSDRICIKSQGKENVHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKRDG 174
Query: 183 VVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKH 227
+VT + C PY C H P + PTPKC C N + KH
Sbjct: 175 LVTGGQYNSHQGCQPY-TIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKDKH 233
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y +SAY ++ E IM EI NGPVE +FTVY DF YKSGVYKH TG +GGHA+K++G
Sbjct: 234 YGMSAYSVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILG 292
Query: 288 WGTSDDGEDYWV 299
WGT ++G+DYW+
Sbjct: 293 WGT-ENGDDYWL 303
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 123/282 (43%), Positives = 163/282 (57%), Gaps = 32/282 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L D ++ VN+ WKA N F N K L G LG P
Sbjct: 26 LSDELVHYVNKQ-NTTWKAGHN--FHNVDQSYLKKLCGT-------FLGGPKPPQRLWFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+++ LP+SFD+R WP C TI I DQG CGSCWAFGAVEA+SDR CI ++S+ V+
Sbjct: 76 ENMILPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 199
D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C H
Sbjct: 136 EDMLTCCGDQCGDGCNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPY-SIPPCEHHVNGS 194
Query: 200 -PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P C TPKC + C ++ KHY S+Y ++S ++IMAEIYKNGPVE +F+
Sbjct: 195 RPPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFS 254
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DF YKSGVY+H+TG++MGGHAV+++GWG ++G YW+
Sbjct: 255 VYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGV-ENGTPYWL 295
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 105/210 (50%), Positives = 136/210 (64%), Gaps = 16/210 (7%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+KLP++FD+R+ WP+C TI I DQG CGSCWAFGAVEA+SDR CIH +N+ +S D
Sbjct: 11 VKLPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAED 70
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPG 201
LL+CCG CG GC+GGYP AW ++ G+V+ C PY S P
Sbjct: 71 LLSCCGMECGFGCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEHHVNGSRPS 130
Query: 202 CEPAY-PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKCV +C + KH+ ++Y ++S+ DI EIYKNGPVE +FTVY
Sbjct: 131 CTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVY 190
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
EDF YKSGVYKH+TGD +GGHA++++GWG
Sbjct: 191 EDFLQYKSGVYKHVTGDAVGGHAIRILGWG 220
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 119/281 (42%), Positives = 160/281 (56%), Gaps = 21/281 (7%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
H L D I+ + +N K WKA RN N + K L+GV K + V +
Sbjct: 19 HPLSDKFIQLL-QNEKTTWKAGRNFN-KNLPMRYLKSLMGVHADSKFHMSPVHKHKIPEG 76
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
K+PK FD+R+AW C TIS I DQG CGSCWAFGAVE ++DR CIH N S +
Sbjct: 77 FKIPKEFDSRTAWSMCPTISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAEN 136
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH------P 200
L++CC LCG GC+GG+P +A++Y+VH G+V T+ C PY + C H P
Sbjct: 137 LVSCC-HLCGFGCNGGFPGAAFQYWVHSGIVSGGAFNSTQGCQPY-EIAPCEHHVSGPRP 194
Query: 201 GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKC + C + + + H+ Y ++ D I +I NGPVE +FTVY
Sbjct: 195 KCAEGGSTPKCHKNCESNYVVDYESDLHHGSKHYSVDKDETQIKYDIMTNGPVEGAFTVY 254
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
DF HYKSGVY+H G +GGHA++++GWG +DG YW+C
Sbjct: 255 VDFLHYKSGVYQHTHGLPLGGHAIRVLGWG-EEDGTPYWLC 294
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/268 (46%), Positives = 155/268 (57%), Gaps = 29/268 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQC 111
WKA N + N LLGV+P L P +T D S LP++FDAR WP C
Sbjct: 76 WKAGHNSGYDNPE--DVIPLLGVRPENSRYRL--PERTLDVSALRVLPENFDAREHWPDC 131
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHF-----GMNLSLSVNDLLACCGFLCGDGC 166
TI I DQG CGSCWAFGAVEA+SDR CIH + L+ +D+L+CC CG GC
Sbjct: 132 PTIREIRDQGSCGSCWAFGAVEAISDRTCIHSPEGKPRVIAHLAADDVLSCC-TECGAGC 190
Query: 167 DGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCV 212
+GG+P SAW Y+VH G+VT E C PY C H P + PTP+CV
Sbjct: 191 NGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPY-PIKACDHHVNGTLGPCDKTIPPTPRCV 249
Query: 213 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
R C K + + + KHY AY + + + I AEI NGPVE FTVYEDF HYKSGVY+
Sbjct: 250 RMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQ 309
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
T +GGHA++L+GWG ++G YW+
Sbjct: 310 RHTDSALGGHAIRLLGWGV-ENGVPYWL 336
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/319 (37%), Positives = 174/319 (54%), Gaps = 30/319 (9%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M + L L+ L++ + FA + + K + + I E N WKA N
Sbjct: 1 MKHTALILSASFLLIALTG---FATYEIFRFKHQKYHDRLKQIAEKVNNSNTTWKAGENI 57
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHDKSLKLPKSFDARSAW-PQCSTISRIL 118
++ N + K +G K GV + K + ++ LP FD+R W +CS++ +
Sbjct: 58 KWINSDIAGVKAHMGTLLNQKS---GVKLEKVNRQANNLPSEFDSRVQWGDKCSSLWEVR 114
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
DQ +CGSCWAFGA E+LSDR CIH G ++ LS +L+ CC CG GCDGG+P +A Y+
Sbjct: 115 DQSNCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLVTCCD-ECGFGCDGGWPEAAMDYY 173
Query: 179 VHHGVVTEECDPYFDSTGCS---------------HPGCEPAYPTPKCVRKCVKKNQL-- 221
V++G+VT D Y +++ C +P C PTP CV+ C +
Sbjct: 174 VNNGLVTG--DLYGNNSWCQAYSLAPCAHHVTSDVYPPCTGELPTPPCVKSCDSNSTYTI 231
Query: 222 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+ H AY I+ + + IM EI NGP+EV+FTVYEDF YKSGVY+H+TG +GG
Sbjct: 232 PYPKDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGG 291
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVK++GWG ++G YW+
Sbjct: 292 HAVKMVGWGV-ENGTPYWI 309
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 163/295 (55%), Gaps = 27/295 (9%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG 86
+ + + D H+L + ++ V K W RN S + + L+GV P L
Sbjct: 12 IAAATEDDPHMLSEEFMELVRGKAKT-WTVGRNFDAS-VSEHHIRGLMGVHPDAHKFTLP 69
Query: 87 VPVKTHDKSLK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
+ ++ LP+ FDAR+AWP C TI I DQG CGSCWAFGAVEA+SDR CI
Sbjct: 70 EKSQVLGNLMEADGGDLPEEFDARTAWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCI 129
Query: 142 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF 192
H +N S +DL++CC CG GC+GG+P +AW Y+ H G+V+ E C PY
Sbjct: 130 HSNATVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPY- 187
Query: 193 DSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAE 245
+ C H P C TP+C+ KC + + KH+ AY +N +P DI E
Sbjct: 188 EVEPCEHHVNGTRPPCHSG-STPRCMHKCESGYSVDYAKDKHFGAKAYSVNRNPLDIQRE 246
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
I NGPVE +FTVYED YK+GVY+H+ G +GGHA++++GWG D+ YW+
Sbjct: 247 IMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRILGWGVWGDNKVPYWL 301
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 116/253 (45%), Positives = 147/253 (58%), Gaps = 23/253 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
W +N QF N +G LLG K + +PV D ++K P SFD+R+AW C+T
Sbjct: 39 WVEEKNDQFDNIKIGS---LLGFKKSLN--RPSIPVLNADPNIKAPASFDSRTAWSNCTT 93
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 173
I I +Q CGSCWAFGAVE+ DR CIH G+++ LS DL+ C DGC+GG +S
Sbjct: 94 IGYIENQARCGSCWAFGAVESAQDRICIHKGLDVQLSFLDLVTC--DQSDDGCEGGDDVS 151
Query: 174 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRNS 225
AW + GVVT+EC PY + P C PA TP CV++C + L +
Sbjct: 152 AWNFLKKQGVVTQECKPY------TIPTCPPAQQPCLNFVNTPNCVKQCESNSTLIYSQD 205
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
KH Y INS E IM EI NGPVE F+VYEDF YKSGVY+H TG +GGH VK+
Sbjct: 206 KHKMAKIYSINS-VEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKI 264
Query: 286 IGWGTSDDGEDYW 298
G+GT +G +YW
Sbjct: 265 FGYGTL-NGVNYW 276
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 123/286 (43%), Positives = 159/286 (55%), Gaps = 33/286 (11%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPVKT 91
H L D I +N K+ WKA RN + + K LLGV P TPK +P K
Sbjct: 24 HPLSDDFINRINSR-KSTWKAGRNFDI-DTPISHIKQLLGVLPETENTPK-----LPKKI 76
Query: 92 HD-KSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNL 147
H + ++P SFDAR AWP C+ I I DQ CGSCWAFGAVEA+SDR CIH + +
Sbjct: 77 HSINAQEIPDSFDAREAWPDCAPIIGNIRDQSTCGSCWAFGAVEAMSDRICIHSNATVKV 136
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------- 199
++S D L CC +CG GC+GG P AW ++ +G+VT Y D+ GC
Sbjct: 137 NISAEDPLDCC-TICGMGCNGGMPAMAWLHWTVNGIVTG--GNYEDTNGCKAYSFAPCEH 193
Query: 200 ------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
P C P PTP C ++C + L + S Y I+ P+ I EI NGPVE
Sbjct: 194 HVDGDLPPCGPTKPTPDCKKECDSGSSLTYQNDLTHGSNYGIDPYPKQIQTEIMTNGPVE 253
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SF+VYEDF YKSGVY+H+ G+ GGHA+K++GWG +D YW+
Sbjct: 254 ASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVEND-TPYWL 298
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 126/314 (40%), Positives = 172/314 (54%), Gaps = 42/314 (13%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CL++L S+ + + L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLVVLTNARSRPYFQ-----------PLSDELVNYVNKR-NTTWKAGHN--FHNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQG 121
+ K L G LG P + + LP++FDAR WP C TI I DQG
Sbjct: 51 DLSYVKRLCGT-------FLGGPKLPQRVWFAEDVVLPENFDAREQWPNCPTIKEIRDQG 103
Query: 122 HCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFV 179
CGSCWAFGAVEA+SDR CI ++S+ V+ D+L CCG CGDGC+GG+P AW ++
Sbjct: 104 SCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWT 163
Query: 180 HHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNS 225
G+V+ C PY G P TPKC + C + ++
Sbjct: 164 KQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCSKICEPGYSPSYKED 223
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
KHY S+Y ++S ++IMAEI+KNGPVE +FTVY DF YKSGVY+H+ GD+MGGHAV++
Sbjct: 224 KHYGCSSYSVSSSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRI 283
Query: 286 IGWGTSDDGEDYWV 299
+GWG ++G YW+
Sbjct: 284 LGWGV-ENGTPYWL 296
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 118/264 (44%), Positives = 155/264 (58%), Gaps = 25/264 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA N F N + L G KG L V V+ + LKLP+ FDAR WP C T
Sbjct: 40 WKAGHN--FHNVDYSYIQRLCGT--MLKGPKLPVMVQ-YTGDLKLPEEFDAREQWPNCPT 94
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 171
+ I DQG CGSCWAFGA EA+SDR CIH +S+ ++ DLL CC CG GC+GGYP
Sbjct: 95 LKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLTCC-MSCGMGCNGGYP 153
Query: 172 ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-V 216
+AW ++ G+V+ C PY + C H P C TP+C+ KC
Sbjct: 154 SAAWDFWTKEGLVSGGLYDSHIGCRPYTIAP-CEHHVNGSRPSCTGEGGDTPQCITKCEA 212
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
++ KH+ ++Y + SD E I +EI+KNGPVE +F VYEDF YKSGVY+H++G
Sbjct: 213 GYTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGS 272
Query: 277 VMGGHAVKLIGWGTSDDGEDYWVC 300
+GGHA+K++GWG +DG YW+C
Sbjct: 273 AVGGHAIKILGWGV-EDGVPYWLC 295
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 170/308 (55%), Gaps = 37/308 (12%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF- 70
L+I GV+ + F S + H + N K W+A N F + +
Sbjct: 3 LIIFGVLIAMVFTMPKNSMFQSHIHTIN---------NMKTTWEAGEN--FGPHITSDYI 51
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQ-CSTISRILDQGHCGSCWA 128
++L G TP L +P+K K + LP FDAR W C ++ + DQG CGSCWA
Sbjct: 52 RNLCGALKTP--LSKKLPIKDLSKEVHDLPIEFDARKEWGSICPSLLEVRDQGECGSCWA 109
Query: 129 FGAVEALSDRFCIHF-GMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
FGA EA++DR CI G N + +S DLL CC CG GC+GGYP SAW +F G+VT
Sbjct: 110 FGAAEAMTDRICIATKGKNQVRISTEDLLTCCD-SCGFGCNGGYPQSAWEFFKTKGIVTG 168
Query: 187 ECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
PY GC S C + PTPKC + C K N ++N KHY ++
Sbjct: 169 --GPYNSHKGCQPYAIPACDHHVPHSKNPCNGSLPTPKCEKVCEKGYNITYKNDKHYGVT 226
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y IN+D +IM EI NGPVE +FTV+ DF +YKSGVY+H++G+ +GGHA+K++GWG
Sbjct: 227 SYSINNDQNEIMREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVE 286
Query: 292 DDGEDYWV 299
++ YW+
Sbjct: 287 NN-TPYWL 293
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/262 (43%), Positives = 152/262 (58%), Gaps = 23/262 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA N F+ V K+L G P L P+ H+ + LPKSFD+R W C +
Sbjct: 42 WKAGTN--FAGLPVSYVKYLCGALEDPNHFQL--PIHVHEDTSDLPKSFDSRDKWRMCPS 97
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 171
I I DQG CGSCW+FGAVE+++DR CIH + + +S DL+ CC CG GC+GG+
Sbjct: 98 IREIRDQGSCGSCWSFGAVESITDRICIHSNGKVKVHISAEDLMTCCT-SCGMGCNGGFL 156
Query: 172 ISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK 218
AW Y+V++G+VT + C PY + C H C PTPKC +KC
Sbjct: 157 PQAWHYWVNNGIVTGGQYHSHKGCQPY-EIPKCEHHVKGPFKACGKELPTPKCSQKCQPG 215
Query: 219 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
N+ + KH+ +Y I ++ + I EI NGPVE +FTVY DF YKSGVY+H TG
Sbjct: 216 YNKTFNQDKHFGKKSYSITNNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGP 275
Query: 278 MGGHAVKLIGWGTSDDGEDYWV 299
+GGHAVK++GWGT ++ YW+
Sbjct: 276 LGGHAVKILGWGTENN-TPYWL 296
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/284 (41%), Positives = 163/284 (57%), Gaps = 23/284 (8%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVK 90
K + IL +S I VNE + WKA P F T + + L+GV P + L P+
Sbjct: 19 KTYNSILSESFIASVNEEAQI-WKAG--PNFHPETSSNYIRSLMGVLPNHRDYLP-PPLP 74
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 150
+ +P +FDAR WP C +I I DQG CGSCWAFGA EA+SDR CIH N+++S
Sbjct: 75 NLLGTESIPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTHKNVNIS 134
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGC 197
+LL+CC + CG GC+GG+P +AWR++ + G+V+ + C PY G
Sbjct: 135 AENLLSCC-YTCGFGCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGT 193
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVS 255
P C TPKC + C KN K S S+Y I SDP+ I +I NGPVE +
Sbjct: 194 RKP-CAEGGRTPKCHKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAA 252
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F+VY DF YKSGVY+H+ G ++GGHA++++GWG + G YW+
Sbjct: 253 FSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGM-EKGTPYWL 295
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/306 (38%), Positives = 166/306 (54%), Gaps = 37/306 (12%)
Query: 27 VVSKLKLDSHILQDSIIKEVNE-NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL 85
V S D + I++EVN N + WKA N +F + Q + ++G TP ++
Sbjct: 12 VASVQAFDFKLFTSEIMEEVNNYNTGSTWKAGYNKRFEGMSFDQIQAMMGTIATPVHMIP 71
Query: 86 G---VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
P +T ++L LP+SFD R A+P+C ++ ++ DQ +CGSCWAFG VEA+SDR CI
Sbjct: 72 DERYTPFETI-QNLSLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIA 130
Query: 143 FGM--NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVT------------E 186
G +S +LL+CC F CG GC+GGY AW Y+V G+V+
Sbjct: 131 SGQKDQTRISSENLLSCCRGTFACGMGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKT 190
Query: 187 ECDPYFDSTGCSH------PGCE--PAYPTPKCVRKCVKKNQLWRNSK----HYSISAYR 234
EC PY CSH C P + TPKC +C +Q +NS H +S+Y
Sbjct: 191 ECQPY-SFPPCSHHVQGEYQACTDLPQFNTPKCYTEC--NSQYTQNSYEQDLHKGVSSYS 247
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
+ E I AEIY+ G SF VY DF Y SGVY++ +G MGGHA+K++GWG ++G
Sbjct: 248 VPKSEEQIKAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGV-ENG 306
Query: 295 EDYWVC 300
YW+C
Sbjct: 307 TPYWLC 312
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 117/282 (41%), Positives = 160/282 (56%), Gaps = 30/282 (10%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG--VPVKTH-DK 94
L + ++ +N+ + WKA N F N + L G +L G +PVK
Sbjct: 25 LSNEMVNHINK-VNSTWKAGLN--FQNVDYSYLRRLCGT------MLKGPKLPVKLQFTA 75
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
++LP FDAR WPQC T+ + DQG CGSCWAFGA EA+SDR CIH MN+ +S
Sbjct: 76 DVQLPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAAEAISDRLCIHSNGLMNVEISAE 135
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSH 199
DLL+CC CG GC+GGYP +AW ++ G+V+ C PY + G
Sbjct: 136 DLLSCCDS-CGMGCNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEHHVNGSRP 194
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P TP+C +KC + KHY +Y ++ ++I EIYKNGPVE +FTV
Sbjct: 195 PCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTV 254
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
YEDF YK+GVY+H+TG +GGHA+K++GWG ++G YW+C
Sbjct: 255 YEDFLLYKTGVYQHVTGSAVGGHAIKVLGWG-EENGTPYWLC 295
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/274 (41%), Positives = 151/274 (55%), Gaps = 15/274 (5%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D + + EVN+ K W A + + + T K L+G K +L +
Sbjct: 27 DGRFITREFVAEVNKLNKGIWTARYDTKMARLTRQGVKRLMGAKLRDAPVLPRRHFTEEE 86
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVN 152
LP+SFDA +AWP C TI RI DQ CGSCWA A A+SDRFC+ G+ +L +S
Sbjct: 87 LRAPLPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCVTGGVRDLGISAG 146
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 207
DLL+CC CGDGCDGGYP AW YF G+V++ C PY C H G P
Sbjct: 147 DLLSCC-TSCGDGCDGGYPDEAWLYFTESGLVSDYCQPY-PFPPCKHSGGRSKNPSCHDM 204
Query: 208 ---TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
TPKC C K ++++ +Y + + ED E+Y GP EV+FTVYEDF
Sbjct: 205 HFHTPKCNATCTDKRIP--VVRYFASESYSLQGE-EDYKRELYLRGPFEVAFTVYEDFLA 261
Query: 265 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
Y+SGVYKH++G +GGHAV+++GWG +G YW
Sbjct: 262 YESGVYKHVSGGPVGGHAVRVVGWGER-NGVPYW 294
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/308 (38%), Positives = 165/308 (53%), Gaps = 29/308 (9%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+I ++ F+ G +++D L D I +N + + W A RN N + K
Sbjct: 5 IIFALVGLLIFSFGCCDDIRVDLDPLSDEFIDHIN-SIQYYWSAGRNFH-KNTPMSYLKG 62
Query: 73 LLGVKPT----PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
L+GV + PK L V D LP++FDAR WP C TI + DQG CGSCWA
Sbjct: 63 LMGVHESNAHYPK---LEQLVSYTDTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCWA 119
Query: 129 FGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
FGAVEA+SDR CIH N S +L++CC CG GC+GG+P +AW Y+ G+V+
Sbjct: 120 FGAVEAMSDRVCIHSKGAKNFHFSAENLVSCC-RTCGFGCNGGFPGAAWHYWKTKGIVSG 178
Query: 187 ECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSIS 231
PY GC + C+ TP CV+KC ++ + H S
Sbjct: 179 --GPYGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKS 236
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
AY + +D + I EIY NGPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG
Sbjct: 237 AYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQ 296
Query: 292 DDGEDYWV 299
+ YW+
Sbjct: 297 NGEIPYWL 304
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 120/288 (41%), Positives = 163/288 (56%), Gaps = 25/288 (8%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
VS K +L +++ +N N W A +N F N + K L G KG L
Sbjct: 15 VSWAKPRLPLLSPEMVQYIN-NADTTWTAGQN--FHNVDISYVKSLCGT--LLKGPRLPE 69
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
V++ D+ + LP SFDAR WP C TI I DQG CGSCWAFGA EA+SDR+CIH +
Sbjct: 70 LVQS-DEDMSLPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKV 128
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 198
++ +S DLL+CC CG GC GG+P +AW Y+ G+VT C PY + C
Sbjct: 129 SVEISAEDLLSCCD-ACGMGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAP-CE 186
Query: 199 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
H P C TPKCV +C ++ K + Y + + IM E+YKNGP
Sbjct: 187 HHVNGTRPPCTGEGDTPKCVSECNAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGP 246
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VE +F+VYEDF YK+GVY+H+TG ++GGHA+K++GWG ++ YW+
Sbjct: 247 VEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAIKILGWG-KENNTPYWL 293
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 120/289 (41%), Positives = 162/289 (56%), Gaps = 24/289 (8%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
VS+ + L ++ +N+ WKA N F N + L G KG L +
Sbjct: 15 VSQARPRLKPLSSEMVNYINK-VNTTWKAGHN--FHNVDFSYVQRLCGT--MLKGPKLPI 69
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 147
V+ + +KLPK+FD+R WP C T+ I DQG CGSCWAFGA EA+SDR CIH +
Sbjct: 70 MVQ-YAGDMKLPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNAKV 128
Query: 148 S--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 192
S +S DLL CC CG GC+GGYP +AW ++ G+V+ C PY
Sbjct: 129 SVEISAEDLLTCCD-SCGMGCNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEH 187
Query: 193 DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
G P TP+C+ +C +R KHY ++Y + SD +I EIYKNGP
Sbjct: 188 HVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGP 247
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG ++G YW+C
Sbjct: 248 VEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWG-EENGVPYWLC 295
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/287 (42%), Positives = 163/287 (56%), Gaps = 32/287 (11%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSN-YTVGQFKHLLGVKPTPKGLLLGVPVKTH-----D 93
D +I +N+N W A + +F++ Y K G+ + L V K H D
Sbjct: 44 DELIDYINDNQNL-WTAKKQKRFTSVYGETDDKAKWGLMGVNH-VRLSVKGKQHLSKTKD 101
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
L +P+SFD+R WP+C +I I DQ CGSCWAFGAVEA+SDR CI H + +SLS
Sbjct: 102 LDLDIPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSA 161
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE----- 203
+DLL+CC CG GC+GG P++AWRY+V G+VT Y ++GC P CE
Sbjct: 162 DDLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGS--NYTANSGCKPYPFPPCEHHSKK 218
Query: 204 --------PAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
YPTPKC +KC+ ++ + K Y SAY + D E I E+ +GP+E
Sbjct: 219 THFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLE 278
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
++F VYEDF +Y GVY H G + GGHAVKLIGWG +DG YW C
Sbjct: 279 IAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGIPYWTC 324
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/303 (39%), Positives = 167/303 (55%), Gaps = 26/303 (8%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 68
CL+ + + T G+ +K D +L S + EVN K W A+ + + + ++G
Sbjct: 10 CLVAVFALLLATTVSGLYAKPS-DFPLLGKSFVAEVNSKAKGQWTASADNGYLVTGKSLG 68
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+ FDA WP C TIS I DQ +CGSCWA
Sbjct: 69 EVRKLMGVTDMSTEAVPPRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWA 128
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++V G+ TE+
Sbjct: 129 IAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATED 187
Query: 188 CDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ---LWRNSKHYSISAYRI 235
C PY FD CSH G YP TPKC C + ++ S YS+ +
Sbjct: 188 CQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTCERSEMDLVKYKGSTSYSVKGEK- 244
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
++M E+ NGP+E++ VY DF YKSGVYKH+ G+ +GGHAVKL+GWGT DG
Sbjct: 245 -----ELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLVGWGT-QDGV 298
Query: 296 DYW 298
YW
Sbjct: 299 PYW 301
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 110/284 (38%), Positives = 152/284 (53%), Gaps = 22/284 (7%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK 90
L+ S D + + ++ W + N ++ + K +G +
Sbjct: 13 LRFQSQTFYDFVNSQ-----QSTWVSGHNQRWEQFNEATLKTQMGTFLDEPDFMKLPEST 67
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 150
++L++P+SFDAR WP C +I + DQ CGSCWAFGA EA+SDR CI G +S
Sbjct: 68 VQFENLEIPESFDARQQWPNCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIATGKQTRIS 127
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH---- 199
DLL CCG CG GC+GG+P AW YF + G+VT + C PY C H
Sbjct: 128 TEDLLTCCGITCGMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPY-TFPPCDHHVDD 186
Query: 200 ---PGCEPAYPTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C + PTP CV+ C ++ + + + K SI +Y ++S E I EI GPVE S
Sbjct: 187 GKYGPCGDSQPTPACVKSCTAQSGRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEAS 246
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FTVYEDF YKSGVY+++ G +GGHAVK+IGWG + YW+
Sbjct: 247 FTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKN-VPYWL 289
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 119/280 (42%), Positives = 158/280 (56%), Gaps = 25/280 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 96
L + I +N PK W A RN +N K L+G +L +P THD L
Sbjct: 26 LSEDFINILNSKPKT-WTAGRNFP-ANTPFAHIKMLMGALKDDN--ILKLPKMTHDAELI 81
Query: 97 -KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP++FD R WP C T++ I DQG CGSCWAFGAVEA++DR C + + S D
Sbjct: 82 ASLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHFSAED 141
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH--PG--- 201
LL+CC +CG GC+GG P AW Y+ H G+V T+ C PY + C H PG
Sbjct: 142 LLSCCP-ICGLGCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPY-EVPPCEHHVPGNRL 199
Query: 202 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKC + C N ++ KHY Y ++ + ++I AE++KNGPVE +FTVY
Sbjct: 200 PCNGDTKTPKCQKTCEAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNGPVEGAFTVY 259
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
D YKSGVY+H G +GGHAVK++GWG ++G YW+
Sbjct: 260 SDLLSYKSGVYQHTDGSALGGHAVKILGWGV-ENGSKYWL 298
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 105/222 (47%), Positives = 132/222 (59%), Gaps = 19/222 (8%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P++FDAR W QC +I I DQ HCGSCWA A E +SDR CIH +N+ LS D
Sbjct: 93 VEIPENFDAREKWSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSATD 152
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY 206
+L+CCG CG GC GGYPI AWRYF+ HGV T + C PY C H E Y
Sbjct: 153 ILSCCGTTCGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHP-CGHHRNEIYY 211
Query: 207 --------PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
PTP+C + C + + K Y SAY + ++ + I EI NGPV+ +F
Sbjct: 212 GECPKEIFPTPQCTQSCQAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTNGPVQAAFM 271
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYEDF+ Y+SG+Y H G GGHAVKLIGWG DDG YW+
Sbjct: 272 VYEDFSRYRSGIYVHTAGRREGGHAVKLIGWGVDDDGNKYWL 313
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 116/281 (41%), Positives = 158/281 (56%), Gaps = 24/281 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
H L ++ +N+ WKA N F N + L G KG L + V+ +
Sbjct: 23 HPLSSDMVNYINK-LNTTWKAGHN--FKNADYSYVQKLCGT--MLKGPKLPIMVQ-YAGD 76
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--D 153
+KLP FDAR+ WP C T+ I DQG CGSCWAFGA EA+SDR CIH +S+ ++ D
Sbjct: 77 VKLPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNARVSVEISSED 136
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHP 200
LL CC CG GC+GGYP +AW ++ G+VT C PY G P
Sbjct: 137 LLTCCE-SCGMGCNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEHHVNGTRPP 195
Query: 201 GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
TP+C+ +C ++ KHY ++Y + ++ I EIYKNGPVE +F VY
Sbjct: 196 CTGEGGDTPQCINQCESGYTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVY 255
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
EDF YKSGVY+H++G ++GGHA+K++GWG +DG YW+C
Sbjct: 256 EDFPMYKSGVYQHVSGSLIGGHAIKILGWGV-EDGVPYWLC 295
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 172/307 (56%), Gaps = 30/307 (9%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
L+++ + + T A SK + L I+E+N W+A +N + ++ +
Sbjct: 5 LVVIALAAVGTNAAAGGSK----KYPLSSKFIEEINTKATT-WRAGQNFH-PDTSLTYIR 58
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
L+GV P P HD S +LP++FD+R WP C TI I DQG CGSCWAF
Sbjct: 59 GLMGVHPDADKFR--EPEILHDLSDGDELPENFDSREQWPNCPTIREIRDQGSCGSCWAF 116
Query: 130 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
GAVEA+SDR C+ G ++ S DL++CC CG GC+GG+P +AW Y+V G+V+
Sbjct: 117 GAVEAMSDRVCVASGGKIHFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGG 175
Query: 188 -------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
C PY + C H P CE TPKCV+KC + N ++ K + S+
Sbjct: 176 PFGSNLGCQPYAIAP-CEHHVNGTRPSCEGEGGKTPKCVKKCQESYNVPYQKDKRFGASS 234
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y I I EI NGPVE +FTVYED HYK GVY+H+TG ++GGHA++++GWG +
Sbjct: 235 YSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGV-E 293
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 294 NGTKYWL 300
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 117/281 (41%), Positives = 154/281 (54%), Gaps = 21/281 (7%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
S L D I +N WKA RN + V + L+G + L D
Sbjct: 21 SEPLSDDFINLINSKQDT-WKAGRNFPV-DTPVKHIQKLMGTLKDDRFTTLVTLQHEVDL 78
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
LP++FD R WP C T++ + DQG CGSCWAFGAVEA++DR C + + S
Sbjct: 79 IASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAE 138
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH--PG-- 201
DLL+CC +CG GC+GG P AW Y+ H G+V T+ C PY + C H PG
Sbjct: 139 DLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPY-EIPPCEHHVPGNR 196
Query: 202 --CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C TPKC++KC N ++ KHY Y + + I AE+YKNGPVE +FTV
Sbjct: 197 LPCSGDTKTPKCIKKCEDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAFTV 256
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y D YKSGVYKH+ GD +GGHA+K++GWG ++G YW+
Sbjct: 257 YADLLSYKSGVYKHVAGDALGGHAIKIMGWGV-ENGNKYWL 296
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 122/281 (43%), Positives = 161/281 (57%), Gaps = 27/281 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARN--PQFS-NYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
IL I +NE + WKA RN P+ S NY + L+GV P K L P+ +
Sbjct: 25 ILSSEYIHSINEASEI-WKAGRNFHPETSSNY----LRSLMGVLPNHKDHLP-PPLPSLL 78
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 153
+ LP FDAR WP C +I I DQG CGSCWAFGA EA+SDR CIH N+++S +
Sbjct: 79 GTEALPSDFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHTNKNVNISAEN 138
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 200
LL+CC + CG GC+GG+P +AW+Y+ G+V+ C PY D C H
Sbjct: 139 LLSCC-YSCGFGCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPY-DIEPCEHHVNGTRQ 196
Query: 201 GCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C TPKC R C +N K S S+Y I SDP+ I EI NGPVE +F+V
Sbjct: 197 PCAEGGRTPKCHRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSV 256
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y DF + KSGVY+H+ G ++GGHA++++GWG + G YW+
Sbjct: 257 YSDFMNDKSGVYRHVKGSLLGGHAIRILGWGV-EKGTPYWL 296
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 169/312 (54%), Gaps = 32/312 (10%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNY 65
L C L+ G +S+ V + K L D +I +N+ WKA +N +
Sbjct: 4 LVLCALVAGAMSAL-----VEFRDKDIFEPLSDEMIWFINKM-NTTWKAGQNFHHIAKDD 57
Query: 66 TVGQFKHLLGVK-PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ K + G TP L L P K + LP +FD+R+ WP C T+ + DQG CG
Sbjct: 58 RLAHVKMMCGTYLNTPPELRL--PEKKMEPLKDLPATFDSRTQWPNCPTLKEVRDQGACG 115
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAFGAVEA+SDR CI N +S DL +CC CG+GC+GG+P +AW Y+ G
Sbjct: 116 SCWAFGAVEAMSDRICIKSQGKENTHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKKDG 174
Query: 183 VVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKH 227
+VT + C PY C H P + PTPKC C N + KH
Sbjct: 175 LVTGGQYNSHQGCLPY-TIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKDKH 233
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y SAY ++ E IM EI NGPVE +FTVY DF YKSGVYKH TG +GGHA+K++G
Sbjct: 234 YGSSAYSVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILG 292
Query: 288 WGTSDDGEDYWV 299
WGT ++G+DYW+
Sbjct: 293 WGT-ENGDDYWL 303
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 118/311 (37%), Positives = 166/311 (53%), Gaps = 26/311 (8%)
Query: 13 LILGVISSQTFAEGVV--SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
L++G+++ + V ++ ++ +L+ + + + + A FS+Y
Sbjct: 8 LLVGLVAVNAYNIEVKHGEEIPVEVQMLRGQELVDYINKKQTTFTAKLGAYFSDYPDTIK 67
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
K L+G K V H + L +P SFD+R+ WP C +IS+I DQ CGSCWA
Sbjct: 68 KQLMGAKMVEIPEEYRVFEMEHPEVLDAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWA 127
Query: 129 FGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
A E +SDR CI +S+S +D+ ACCG CG+GC+GGYPI AWR++V +G VT
Sbjct: 128 VSAAETISDRICIASKGQTQVSISADDINACCGMACGNGCNGGYPIEAWRHYVKNGYVTG 187
Query: 187 ECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL-WRNSKHYS 229
Y + TGC +P CE YPT KC R C L ++ H+
Sbjct: 188 --GSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKCERSCQAGYSLTYKQDLHFG 245
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
SAY ++ +I EI NGPVEV+FTVY DF Y GVY H G +GGHAVK++GWG
Sbjct: 246 QSAYAVSKKATEIQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKMLGWG 305
Query: 290 TSDDGEDYWVC 300
D+G YW+C
Sbjct: 306 V-DNGTPYWLC 315
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 122/301 (40%), Positives = 166/301 (55%), Gaps = 36/301 (11%)
Query: 28 VSKLKLDSH---ILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTP--- 80
++ L L+ H IL D ++ V + K W RN F T + ++ L+GV P
Sbjct: 10 LALLALNVHGDDILSDRFMEIVRQKAKT-WTVGRN--FHKLTPMSHYRQLMGVHPDAHYY 66
Query: 81 ----KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
K ++L + +PK FD+R+ WP C TI I DQG CGSCWAFGAVEA+S
Sbjct: 67 ALPDKRMVLREEELVGLGNDMIPKEFDSRNQWPHCPTIWEIRDQGSCGSCWAFGAVEAMS 126
Query: 137 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS 194
DR CIH +N S +DL++CC CG GC+GG+P +AW Y+V G+V+ PY S
Sbjct: 127 DRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWGYWVRKGIVSG--GPYGSS 183
Query: 195 TGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 238
GC + P CE Y TP+C KC ++ ++ KH+ AY I+ +
Sbjct: 184 QGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKN 243
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
DI EI NGPVE +FTVYED YK GVY+H+ G +GGHA+++IGWG D YW
Sbjct: 244 VRDIQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-TPYW 302
Query: 299 V 299
+
Sbjct: 303 L 303
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 115/277 (41%), Positives = 151/277 (54%), Gaps = 26/277 (9%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHDKSLKL 98
+ + EVN+ + W A N +F+ T K +GV + P+ +P K L
Sbjct: 35 EQVAAEVNQ-AQTSWTAGVNSRFARATDDFIKSQMGVLEGGPQ-----LPEKDIAVLADL 88
Query: 99 PKSFDARSAW-PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
P +FD+R W C + I DQ CGSCWAFGAVE+++DR CI +L +S DL+
Sbjct: 89 PTAFDSREQWGSTCPSTKEIRDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDLM 148
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGC 202
CC F CG GC GGYP +AW +F G+VT + C PY C H P C
Sbjct: 149 TCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPY-SLPNCDHHVSGQYPAC 207
Query: 203 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
PTP C + C N + N KH+ +AY + + + I EI NGPVE +FTVYED
Sbjct: 208 SGEGPTPACKKSCEAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAFTVYED 267
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
YKSGVY+H TG V+GGHA+K+IGWG + G DYW
Sbjct: 268 LLTYKSGVYQHTTGQVLGGHAIKIIGWGV-ESGVDYW 303
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 108/226 (47%), Positives = 134/226 (59%), Gaps = 22/226 (9%)
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
S +P FDAR WP C +I+ I DQ CGSCWAF A EA+SDR CI + +N LS
Sbjct: 79 SDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSE 138
Query: 153 DLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGC 197
DLL+CC F CG+GC+GGYPI AW+++V HG+VT C PY + G
Sbjct: 139 DLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGV 198
Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
P C E PTPKCV C KN + KH+ +AY + E I EI NGP+E
Sbjct: 199 KWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIE 258
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW+
Sbjct: 259 VAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWL 303
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 118/282 (41%), Positives = 159/282 (56%), Gaps = 24/282 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDK 94
H L I ++N WKA P FS T F + L+GV + V + +
Sbjct: 28 HPLSQKFIDQINSKATT-WKAG--PNFSPETSMSFIRGLMGVHKDADKFMPPVYLHEMEA 84
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
P++FD+R+ WP C TI I DQG CGSCWAFGAVEA+SDR CIH ++ +S
Sbjct: 85 DDDFPENFDSRTQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSE 144
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------ 199
DL++CC CG GC+GG+P +AW Y+V G+V+ + C PY + C H
Sbjct: 145 DLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAP-CEHHVNGSR 202
Query: 200 PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P CE TPKCV+KC N + K Y S+Y I + + I EI NGPVE +FT
Sbjct: 203 PSCEGEGGKTPKCVKKCQASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAFT 262
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYED +YK GVY H+ G ++GGHA++++GWG +DG YW+
Sbjct: 263 VYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGV-EDGTKYWL 303
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 120/292 (41%), Positives = 165/292 (56%), Gaps = 27/292 (9%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLL 85
V++ K + L D I +N + WKA RN P+ +++ K ++GV
Sbjct: 14 VLAAAKDLPYPLSDEFINTINLKQNS-WKAGRNFPRDTSFA--HLKKIMGVIEDEH--FA 68
Query: 86 GVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 143
+P+KTH L LP++FD R WP C T++ + DQG CGSCWAFGAVEA++DR C +
Sbjct: 69 TLPIKTHKIDLIAGLPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYS 128
Query: 144 G--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS 194
+ S DLL+CC +CG GC GG P AW Y+ H G+V+ + C PY +
Sbjct: 129 NGTKHFHFSAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EI 186
Query: 195 TGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 247
C H PG C TPKC +KC + ++ K Y Y ++ D + I AE++
Sbjct: 187 PPCEHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELF 246
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KNGPVE +FTVY D YKSGVYKH GD +GGHAVK++GWG +D + YW+
Sbjct: 247 KNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNK-YWL 297
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 167/308 (54%), Gaps = 32/308 (10%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL + V+S T A K L + I E+N W+A RN + ++ +
Sbjct: 3 LLAVAVVSGTTAAGSGNKKYALSA-----KFIDEINSKAST-WRAGRNFH-PDVSLSYIR 55
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLK---LPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
L+GV P HD S LP++FD+R WP C TI I DQG CGSCWA
Sbjct: 56 GLMGVHQ--DAYKFREPEFVHDLSADVDDLPENFDSREQWPNCPTIREIRDQGSCGSCWA 113
Query: 129 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
FGAVEA+SDR CI G ++ S DL++CC CG GC+GG+P +AW Y+VH G+V+
Sbjct: 114 FGAVEAMSDRVCIASGGKIHFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVHKGLVSG 172
Query: 187 E-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSIS 231
C PY + C H P CE TPKCV+KC + + K Y
Sbjct: 173 GPFGSNLGCQPYAIAP-CEHHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPYAKDKRYGSK 231
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y I + I EI NGPVE +FTVYED HYK GVY+H+TG ++GGHA++++GWG
Sbjct: 232 SYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVE 291
Query: 292 DDGEDYWV 299
++ + YW+
Sbjct: 292 NNTK-YWL 298
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 172/318 (54%), Gaps = 40/318 (12%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
F+ C+ L F + V++ L ++ +L D ++ V K W RN S
Sbjct: 5 FVIICIAFL------AFGQ-VLANLDAENDLLSDEFLEIVRSKAKT-WTPGRNYDKS-VP 55
Query: 67 VGQFKHLLGVKPTP-------KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
F+ L+GV P K L+LG V D + P+ FDAR AWP C TI I D
Sbjct: 56 RSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLADSDV--PEEFDARKAWPNCPTIGEIRD 113
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
QG CGSCWAFGAVEA+SDR CIH ++ S +DL++CC CG GC+GG+P +AW Y
Sbjct: 114 QGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFSADDLVSCC-HTCGFGCNGGFPGAAWAY 172
Query: 178 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL- 221
+ G+V+ PY S GC + P C+ + TP C +C K +
Sbjct: 173 WTRKGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCDGEHGKTPSCRHECQKSYDVD 230
Query: 222 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 281
++ KH+ +Y + + +DI EI +NGPVE +FTVYED YK GVY+H+ G +GGH
Sbjct: 231 YKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGH 290
Query: 282 AVKLIGWGTSDDGEDYWV 299
A++++GWG ++ YW+
Sbjct: 291 AIRILGWGV-ENKTPYWL 307
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 105/225 (46%), Positives = 139/225 (61%), Gaps = 19/225 (8%)
Query: 84 LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF 143
+L P ++K+P +FDAR+ WPQC +I+ I DQ CGSCWAFGAVEA+SDR CI
Sbjct: 1 MLAGPPDFDYPNVKIPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIAS 60
Query: 144 GMNL--SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-- 199
+ LS D+L+CC CG GC+GG+P AWR+F HG+ TE PY C H
Sbjct: 61 NGTVKDELSAEDMLSCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKYPYVFPP-CEHHI 119
Query: 200 -----PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
C P+ PTPKCVR KK +++ S Y ++ P I AEI NGPVE
Sbjct: 120 NKTHYKPCGPSQPTPKCVRASEKK------PRYHGKSVYSVS--PAKIQAEIMTNGPVEA 171
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTVY+DF Y+SGVY+H++G +GGHA+K++GWG + G YW+
Sbjct: 172 AFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGV-EAGNKYWL 215
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 176/312 (56%), Gaps = 26/312 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+ T L I+ +S ++ + ++ L D +I +N++P AGW A+R+ +F +
Sbjct: 1 MMNTVLCIISFMS--ILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLE 58
Query: 67 VGQFKHLLGVKPTPKGLLLGV-PVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ LLG + L P H SL++P SFD+R W QC +IS I DQ CG
Sbjct: 59 DARI--LLGAMHEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCG 116
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y+V G
Sbjct: 117 SCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCC-TECGLGCQGGFPGAAWDYWVEDG 175
Query: 183 VVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKH 227
+VT C PY +TG +P C E Y TPKC +KC K + ++ K+
Sbjct: 176 IVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKY 234
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y +Y + ++ I EI +GPVE +FTV+ DF +YKSG+YK++TG +GGHAV++IG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIG 294
Query: 288 WGTSDDGEDYWV 299
WG + YW+
Sbjct: 295 WGV-EKKTPYWL 305
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 119/295 (40%), Positives = 161/295 (54%), Gaps = 22/295 (7%)
Query: 22 TFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 81
T + + K L +I +N WKA +F TV + +LG P P
Sbjct: 20 TLNDNDARRHKRMHQPLSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPN 77
Query: 82 GLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
G L ++ ++ +LPKSFDAR W C +IS I DQ CGS WAFGAVEA+SDR C
Sbjct: 78 GEQLETLCTGYELTVNELPKSFDARKEWTHCPSISEIRDQSSCGSYWAFGAVEAMSDRIC 137
Query: 141 IHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY 191
I LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY
Sbjct: 138 IESKGKYKPFLSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY 196
Query: 192 FDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMA 244
+ C H P C+ TP C R C N + N K Y YR+ S+ E IM
Sbjct: 197 -EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMK 255
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW+
Sbjct: 256 ELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWL 309
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 119/283 (42%), Positives = 159/283 (56%), Gaps = 27/283 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
H L D+ I+ +N W+A RN F T L+G + +P HD
Sbjct: 23 HPLSDAFIRLINSKQNT-WRAGRN--FPTTTPFAHINKLMGALQDDN--VAKMPKVEHDA 77
Query: 95 SL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
L LP++FD R WP C T++ I DQG CGSCWAFGAVEA++DR+C + + S
Sbjct: 78 DLIASLPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFS 137
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH--PG 201
DLL+CC +CG GC+GG P AW Y+ H G+V T+ C PY + C H PG
Sbjct: 138 SEDLLSCCP-ICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPY-EIPPCEHHVPG 195
Query: 202 ----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
C TPKC + C N +++ K Y Y +++ + I AE+YKNGPVE +F
Sbjct: 196 NRMPCSGDTKTPKCQKNCENGYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAF 255
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
TVY D YKSGVYKHI GD +GGHA+K++GWG +D + YW+
Sbjct: 256 TVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNK-YWL 297
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 176/312 (56%), Gaps = 26/312 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+ T L I+ +S ++ + ++ L D +I +N++P AGW A+R+ +F +
Sbjct: 1 MMNTVLCIVSFMS--ILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLE 58
Query: 67 VGQFKHLLGVKPTPKGLLLGV-PVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ LLG + L P H SL++P SFD+R W QC +IS I DQ CG
Sbjct: 59 DARI--LLGAMREDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCG 116
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y+V G
Sbjct: 117 SCWAFTAVEAMSDRICIESKGKKSVELSAVDLLSCC-TECGLGCQGGFPGAAWDYWVEDG 175
Query: 183 VVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKH 227
+VT C PY +TG +P C E Y TPKC +KC K + ++ K+
Sbjct: 176 IVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKY 234
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y +Y + ++ I EI +GPVE +FTV+ DF +YKSG+YK++TG +GGHAV++IG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIG 294
Query: 288 WGTSDDGEDYWV 299
WG + YW+
Sbjct: 295 WGV-EKKTPYWL 305
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 176/312 (56%), Gaps = 26/312 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+ T L I+ +S ++ + ++ L D +I +N++P AGW A+R+ +F +
Sbjct: 1 MMNTVLCIISFMS--ILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLE 58
Query: 67 VGQFKHLLGVKPTPKGLLLGV-PVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ LLG + L P H SL++P SFD+R W QC +IS I DQ CG
Sbjct: 59 DARI--LLGAMHEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWHQCKSISNIRDQSRCG 116
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y+V G
Sbjct: 117 SCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDG 175
Query: 183 VVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKH 227
+VT C PY +TG +P C E Y TPKC +KC K + ++ K+
Sbjct: 176 IVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKY 234
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y +Y + ++ I EI +GPVEV+FTV+ DF +YKSG+YK++TG +G HAV++IG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIG 294
Query: 288 WGTSDDGEDYWV 299
WG + YW+
Sbjct: 295 WGV-EKKTPYWL 305
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 115/278 (41%), Positives = 156/278 (56%), Gaps = 21/278 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L D I +N + WKA RN S+ K L+G + L +
Sbjct: 24 LSDDFINLINSKQDS-WKAGRNFP-SDTPFKHIKKLMGTLRDDRFTTLVTMQHEVELIAS 81
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP++FD R WP C T++ + DQG CGSCWAFGAVEA++DR C + + S DLL
Sbjct: 82 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRICTYSNGTKHFHFSAEDLL 141
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG----C 202
+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C H PG C
Sbjct: 142 SCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRLPC 199
Query: 203 EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TPKCV++C ++ ++ KHY Y + + I AE+YKNGPVE +FTVY D
Sbjct: 200 SGDTKTPKCVKECESGYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAFTVYAD 259
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVYKH+TGD +GGHA+K++GWG ++G YW+
Sbjct: 260 LLSYKSGVYKHVTGDALGGHAIKIMGWGV-ENGNKYWL 296
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/279 (42%), Positives = 156/279 (55%), Gaps = 24/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L ++ +N+ WKA N F + + L G KG L + V+ + LK
Sbjct: 25 LSKEMVNYINKM-NTTWKAGHN--FRDVDYSYVRRLCGT--MLKGPKLPIMVQ-YAGGLK 78
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLL 155
LP FD+R WP+C T+ I DQG CGSCWAFGA EA+SDR CIH G +S+ ++ DLL
Sbjct: 79 LPAQFDSREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSGSKVSVEISSEDLL 138
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 203
CC CG GC+GGYP +AW ++ G+V+ C PY S P C
Sbjct: 139 TCCD-ACGMGCNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPYTIPPCEHHVNGSRPHCS 197
Query: 204 -PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TPKCV C + + KHY S+Y + + E I AEI +NGPVE +F VYED
Sbjct: 198 GEGGDTPKCVHSCEAGYSPTYTKDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYED 257
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
F YKSGVY+H TG +GGHA+K++GWG +DG YW+C
Sbjct: 258 FVMYKSGVYQHTTGSALGGHAIKVLGWG-EEDGVPYWLC 295
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 174/312 (55%), Gaps = 26/312 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+ T L I+ +S +++ ++ L D II +N++P AGW A+R+ +F +
Sbjct: 1 MMNTVLCIVSFMS--ILTAHILTGNEMQFEPLSDEIIAYINQHPDAGWTASRSDRFK--S 56
Query: 67 VGQFKHLLGVKPTPKGLLLGV-PVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCG 124
V + LLGV + L P H SL++P +FD+R W QC +IS I DQ CG
Sbjct: 57 VEDARILLGVMREDEKLRKKRRPTVDHQNVSLEIPSTFDSRKKWSQCKSISSIHDQSRCG 116
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
S WAF AVE +SDR CI ++ LS DLL+CC CG GC GG+P SAW Y+V G
Sbjct: 117 SGWAFAAVEVMSDRICIQSKGEKSVELSAVDLLSCC-RECGLGCLGGFPGSAWDYWVEEG 175
Query: 183 VVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKH 227
VVT C PY ++TG +P C + Y TPKC +KC K + ++ KH
Sbjct: 176 VVTGSSGENHTGCQPYPFPKCEHNTTG-KYPACGQKIYETPKCQKKCQKGYKTPYKKDKH 234
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y AY + ++ + I EI +GPV FTVY DF +YKSG+YKH+ G +G H V+++G
Sbjct: 235 YGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIVG 294
Query: 288 WGTSDDGEDYWV 299
WG + G YW+
Sbjct: 295 WGV-EKGTPYWL 305
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 130/316 (41%), Positives = 180/316 (56%), Gaps = 38/316 (12%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAA-RNPQFSN 64
+FL L IL V+ S + + V+S + SI + VN + + W+A + +F
Sbjct: 3 VFLAVVLFILPVVFSVPY-DPVLSYAES-----MRSIAERVN-SLQTTWRATPSSKRFEG 55
Query: 65 YTVGQFKHLLGVKPTPKGLLLG---VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 121
T + L G LL G +PVK + +P +FDAR WP C TI + DQG
Sbjct: 56 VTENYVRSLCGT------LLHGGPTLPVKEIEVPAVIPDTFDARQKWPDCPTIGTVRDQG 109
Query: 122 HCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF--- 178
CGSCWAFGAVEA+SDR+CI F +++S +LL+CC CG GCDGGYP +AWR++
Sbjct: 110 ACGSCWAFGAVEAMSDRYCISFKEQVNISAENLLSCCE-TCGSGCDGGYPAAAWRHWADK 168
Query: 179 -VHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 223
++ G+VT C PY C H PG C + TP C R C+ ++ +R
Sbjct: 169 LLYEGIVTGGQYDSNAGCQPY-TIPKCDHHEPGPYENCSGSQSTPSCKRSCISSYDKSYR 227
Query: 224 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
+ KHY ++Y I+SD I EI NGPVE +F+VY DF Y SGVY+H TG +GGHA+
Sbjct: 228 SDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAI 287
Query: 284 KLIGWGTSDDGEDYWV 299
K++GWGT ++G YW+
Sbjct: 288 KILGWGT-ENGVPYWL 302
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 126/307 (41%), Positives = 173/307 (56%), Gaps = 30/307 (9%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 69
+LGV++S EG +L + +++ L D ++ +N WKA N + +
Sbjct: 7 FFLLGVLASVRAEEG---RLMVPTYLAPLSDKMVDYIN-FINTTWKAGHNEGHRDLETVR 62
Query: 70 FKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
K LGV L P HD + +P FD+R W C TI I DQG CGSCWA
Sbjct: 63 RK--LGVSRDNHKYRL--PELVHDTLEMDIPAQFDSRQQWQDCPTIREIRDQGACGSCWA 118
Query: 129 FGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 185
FGAVE++SDR CIH G + L+ +D+L+CC + CG GC+GG+P +AW Y+V G+VT
Sbjct: 119 FGAVESMSDRHCIHSGAKNIVHLAADDVLSCC-WGCGSGCNGGFPGAAWSYWVEKGIVTG 177
Query: 186 ------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
E C PY C H C PTPKCVR C K N +++ KHY S+
Sbjct: 178 GNYDTDEGCMPY-PVPSCDHHVNGTLGPCGQDPPTPKCVRLCRKGYNIDFKDDKHYGKSS 236
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y ++S+ I EI KNGPVE +FTVY DF YKSGVYK + D +GGHA++++GWG +
Sbjct: 237 YSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGV-E 295
Query: 293 DGEDYWV 299
+G +W+
Sbjct: 296 NGVPFWL 302
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 107/223 (47%), Positives = 135/223 (60%), Gaps = 22/223 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P FDAR WP C +I+ I DQ CGSCWAF A EA+SDR CI + +N LS DLL
Sbjct: 83 IPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSQDLL 142
Query: 156 ACCGFL--CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 200
+CC L CG+GC+GGYPI AW+++V HG+VT C PY + G + P
Sbjct: 143 SCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 202
Query: 201 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
C + PTPKCV C N + KH+ +AY + E I EI KNGPVEV+F
Sbjct: 203 KCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVAF 262
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
TVYEDF Y +GVY H +G +GGHAVK++GWG D+G YW+
Sbjct: 263 TVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGV-DNGTPYWL 304
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/299 (40%), Positives = 161/299 (53%), Gaps = 18/299 (6%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQ--FSNYTVG 68
CL+ + V+ T + +K D +L S + E N K W A+ + + ++
Sbjct: 10 CLVAVFVVLLATTVSALYAKPS-DIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLE 68
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+SFDA WP C TI I DQ +CGSCWA
Sbjct: 69 EVRKLMGVTSMSTEAVPPRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWA 128
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S +LL+CC F+CG GC GG P AW ++V GV TE
Sbjct: 129 IAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTEL 187
Query: 188 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
C PY CSH G YP TPKC C N K+ +S+Y I +
Sbjct: 188 CQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGER 244
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E +M E+ NGP+EV+ VY DF YKSGVYKH++GD +GGHAVKL+GWG DG YW
Sbjct: 245 E-LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGV-KDGIPYW 301
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/307 (40%), Positives = 167/307 (54%), Gaps = 38/307 (12%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF- 70
LLI G+ S+ + + L D I +N + + W+A RN F+ T ++
Sbjct: 9 LLICGIFSAS-----------IPTDPLSDEFIDYIN-SLQTTWRAGRN--FAPNTPKKYL 54
Query: 71 KHLLGV--KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
K L G K T G L P++ + LP FDAR WP CSTI I DQG CGSCWA
Sbjct: 55 KSLAGGVHKNTKNGFTL--PIRDVSLDITLPDEFDARKQWPNCSTIGEIRDQGSCGSCWA 112
Query: 129 FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 185
FGAVEA+SDR CIH + + LS +LL+CC CGDGC GG P SAW Y+ G+V+
Sbjct: 113 FGAVEAMSDRLCIHSNGKLQVHLSAENLLSCCD-SCGDGCLGGSPESAWEYWHKFGIVSG 171
Query: 186 ------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 232
+ C PY + C H P C TPKC ++C K + + + +Y
Sbjct: 172 GNYGSKQGCQPYSIAP-CEHSIHGSSPACGGVTDTPKCKKQCEKGYSIPYDKAFYYGQPG 230
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y I +D + I AEI KNGP+ SF VYED YK GVY+H+ G+ +GGH +K+ GWG +
Sbjct: 231 YAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGGHVIKIFGWGI-E 289
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 290 NGTPYWL 296
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 118/270 (43%), Positives = 155/270 (57%), Gaps = 31/270 (11%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH-----DKSLKLPKSFDARSAW 108
WKA ++ +F +Y L+GV + L V K H D + +P++FDAR W
Sbjct: 76 WKAKKHRRFVHYPDRTKWGLMGVN----NVHLSVKAKQHLSSTKDLDIDIPETFDARQHW 131
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGC 166
C +I I DQ CGSCWAFGAVEA+SDR CI + + ++LS +DLL+CC CG GC
Sbjct: 132 SNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQVTLSADDLLSCC-RTCGFGC 190
Query: 167 DGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKC 211
+GG P+ AW+Y+V HG+VT + C PY C H P YPTPKC
Sbjct: 191 EGGDPMFAWQYWVDHGIVTGSNFTANQGCKPY-PFPPCEHHSNKTRFDPCRHDLYPTPKC 249
Query: 212 VRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
+KCV K + + + + Y +AY + +D I EI +GPVEV+F VYEDF HY G+
Sbjct: 250 SKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTHGPVEVAFEVYEDFLHYAGGI 309
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y H G + GGHAVKLIGWG D G YW+
Sbjct: 310 YVHTGGKLGGGHAVKLIGWGI-DQGTPYWL 338
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 175/312 (56%), Gaps = 26/312 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+ T L I+ +S ++ + ++ L D +I +N++P AGW A+R+ +F +
Sbjct: 1 MMNTVLCIISFMS--ILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLE 58
Query: 67 VGQFKHLLGVKPTPKGLLLGV-PVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ LLG + L P H SL++P SFD+R W QC +IS I DQ CG
Sbjct: 59 DARI--LLGAMHEDEELRKKRRPTVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCG 116
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y+V G
Sbjct: 117 SCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCC-TECGLGCQGGFPGAAWDYWVEDG 175
Query: 183 VVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKH 227
+VT C PY +TG +P C E Y TPKC +KC K + + K+
Sbjct: 176 IVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYGKDKY 234
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y +Y + ++ I EI +GPVE +FTV+ DF +YKSG+YK++TG +GGHAV++IG
Sbjct: 235 YGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIG 294
Query: 288 WGTSDDGEDYWV 299
WG + YW+
Sbjct: 295 WGV-EKKTPYWL 305
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 105/224 (46%), Positives = 135/224 (60%), Gaps = 22/224 (9%)
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 154
+P FDAR WP C +I I DQ CGSCWA A E +SDR CI + +N+ +S DL
Sbjct: 74 NIPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISDRTCIASNGEVNVLISAEDL 133
Query: 155 LACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSH 199
L+CC G+ CGDGC+GGYPI AWRY+VH+G+VT C PY + G +
Sbjct: 134 LSCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVNGVTW 193
Query: 200 PGCEP-AYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
P C TP+CV++C K+ + KHY SAY I + I EI +NGPVEV
Sbjct: 194 PKCAADEVATPECVKQCTSKSDYAVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGPVEVG 253
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VY DF YKSG+YKH+ G +GGHAVK++GWG ++G YW+
Sbjct: 254 FLVYSDFYQYKSGIYKHVAGRELGGHAVKILGWGV-ENGTPYWL 296
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 120/279 (43%), Positives = 157/279 (56%), Gaps = 25/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSL 96
L II VN WKA N F TV K L GV P L P+K H+ +
Sbjct: 24 LTQEIIDYVN-TIDTTWKAGWN--FQGATVSYVKGLCGVIRDPNNHKL--PLKLHELNAQ 78
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDL 154
+P +FD+R+ W C TI + DQG CGSCWA AVEA+SDR C+ G ++ +S DL
Sbjct: 79 DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISAEDL 138
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 201
+CC CG+GC+GG+P +AW Y+ G+VT + C PY + C H P
Sbjct: 139 NSCCKS-CGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPY-EIKPCEHHINGSRPA 196
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C PTP+C + C N + KHY+ +AY ++S + I EI NGPVE +FTVY
Sbjct: 197 CGKLEPTPRCKKSCESGYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYA 256
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF HYKSGVY+H +G +GGHAVK+IGWGT + YW+
Sbjct: 257 DFPHYKSGVYQHESGAELGGHAVKMIGWGT-EGSTPYWL 294
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 173/316 (54%), Gaps = 27/316 (8%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
MAS L T +L+ + + + +D L D I +N + WKA RN
Sbjct: 1 MASYEYLLLTAMLLFSCMQFTSSVPPPEPSVLVDP--LSDDFIDHIN-SLNTTWKAHRN- 56
Query: 61 QFSN-YTVGQFKHLLGVKPTPKGLLLGVPVKT-HDKSLKLPKSFDARSAWPQCSTISRIL 118
F N + + K L+GV+ + + L P K+ D +++P+ FD R WP+C T+ I
Sbjct: 57 -FGNDIPLREIKKLMGVRRSLENFRL--PEKSMEDIDIEIPEEFDPREQWPECPTLKEIR 113
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 176
DQG CGSCWAFGAVEA+SDR CIH + S DLL CC CG GC+GG P +AW
Sbjct: 114 DQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCCSS-CGFGCNGGEPGAAWD 172
Query: 177 YFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-----TPKCVRKCVKKNQL-WR 223
Y+V G+V+ + C PY C H P TP+CV++C + + +
Sbjct: 173 YWVSTGIVSGGSYNSHQGCQPYAIEP-CEHHVNGTRKPCGEGDTPRCVKRCEEGYDVPYG 231
Query: 224 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
+H+ SAY + + I E+ NGP E + TVY+DF HY++GVY+H++G +GGHAV
Sbjct: 232 KDRHFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAV 291
Query: 284 KLIGWGTSDDGEDYWV 299
+L+GWG +DG YW+
Sbjct: 292 RLLGWGV-EDGTPYWL 306
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 165/309 (53%), Gaps = 34/309 (11%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
F+ LLI G S+ + + L D I +N + W+A RN F+ T
Sbjct: 4 FILFSLLICGTFSA-----------SIPTDPLSDEFIDYIN-TLQTTWRAGRN--FAPNT 49
Query: 67 VGQF-KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
++ K L GV +P + + +P FDAR WP C +I+ I DQG CGS
Sbjct: 50 PKKYLKSLAGVHKNANNAFT-LPKRKVSLDVTIPDEFDARKQWPNCPSITDIRDQGSCGS 108
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CWAFGAVEA+SDR CIH + + LS +L++CC CG GCDGG+P SAW Y+ + G+
Sbjct: 109 CWAFGAVEAMSDRICIHSNGKLQVHLSAENLVSCCD-SCGYGCDGGFPASAWDYWQNEGI 167
Query: 184 VT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI 230
V+ + C PY + C H P C TP C +C + + + + HY
Sbjct: 168 VSGGNYGSKQGCQPYSIAP-CEHHVPGSRPACSGGGDTPDCRNQCDEGSGISYDQDHYYG 226
Query: 231 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
+ + I AEI KNGPVE +FTVYED +YK GVY+H+ G+ +GGHA+K++GWG
Sbjct: 227 ETVYTLDEAKQIQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGV 286
Query: 291 SDDGEDYWV 299
+D YW+
Sbjct: 287 END-TPYWL 294
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 121/301 (40%), Positives = 165/301 (54%), Gaps = 36/301 (11%)
Query: 28 VSKLKLDSH---ILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTP--- 80
++ L L+ H IL D ++ V + K W RN F T + ++ L+GV P
Sbjct: 10 LALLALNVHGDDILSDKFMEIVRQKAKT-WTVGRN--FHKLTPMSHYRQLMGVHPDAHNY 66
Query: 81 ----KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
K ++L + +PK FD+R WP C TI I DQG CGSCWAFGAVEA+S
Sbjct: 67 ALPDKRMVLREEELVGLGNNMIPKDFDSRKQWPHCPTIWEIRDQGSCGSCWAFGAVEAMS 126
Query: 137 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS 194
DR CIH +N S +DL++CC CG GC+GG+P +AW Y+V G+V+ PY S
Sbjct: 127 DRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGIVSG--GPYGSS 183
Query: 195 TGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 238
GC + P CE Y TP+C KC ++ ++ KH+ AY I+ +
Sbjct: 184 QGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKN 243
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
DI EI +GPVE +FTVYED YK GVY+H+ G +GGHA+++IGWG D YW
Sbjct: 244 VHDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-IPYW 302
Query: 299 V 299
+
Sbjct: 303 L 303
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 200 bits (509), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 170/313 (54%), Gaps = 28/313 (8%)
Query: 6 LFLTTCLLILGVIS--SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 63
+ + CLL+ VI + + + +D+ D +I+ +N W+A RN +
Sbjct: 1 MVKSVCLLLAFVIGVWGDVLEDRYLVPVDMDN--FPDKMIEYINY-LNTTWQAGRNLGYE 57
Query: 64 NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 123
+ + LLGV P L ++ ++++P FD+R W C TI I DQG C
Sbjct: 58 DPRY--VRTLLGVHPNNHKYRL-PEIEIDTSNVQIPDHFDSRHRWHDCPTIREIRDQGSC 114
Query: 124 GSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWAFGAVEA+SDR CIH G + L+ +D+L+CC CG GC+GG+P +AW Y+VH
Sbjct: 115 GSCWAFGAVEAMSDRHCIHSGAKNIVHLAADDVLSCC-MSCGSGCNGGFPGAAWSYWVHK 173
Query: 182 GVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKK-NQLWRNSK 226
G+VT E C PY C H P + PTP+CVR C K N + + K
Sbjct: 174 GIVTGGNYDSDEGCMPY-PIKACDHHVNGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDK 232
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
HY +Y + S+ I EI NGPVE FTVY DF YKSGVY+ T +GGHA++L+
Sbjct: 233 HYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLL 292
Query: 287 GWGTSDDGEDYWV 299
GWG + G YW+
Sbjct: 293 GWGV-EKGVPYWL 304
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 112/278 (40%), Positives = 157/278 (56%), Gaps = 21/278 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L I ++N W+A RN + + + L+GV + V + D+
Sbjct: 25 LSGKFIDQINAKATT-WRAGRNFH-PDTPMSYIRGLMGVHKDADKFMPPVMLHDLDEGDD 82
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP++FDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH ++ +S DL+
Sbjct: 83 LPENFDAREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAEDLV 142
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGCSHPGC 202
+CC CG GC+GG+P +AW Y+V G+V+ + C PY S G P C
Sbjct: 143 SCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGP-C 200
Query: 203 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TPKCV+KC N + K + S+Y I S + I E++ NGPVE +FTVYED
Sbjct: 201 NGEGKTPKCVKKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAFTVYED 260
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YK GVY+H G ++GGHA++++GWG +D + +W+
Sbjct: 261 LLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTK-FWL 297
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 165/311 (53%), Gaps = 29/311 (9%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
C +++ V + +E ++ K + + D ++ VN+ + A +P+FS Y
Sbjct: 8 CTVLVAVAAFVPQSERILGK---NVELTGDDLVDYVNKAQNL-FTAKLSPRFSEYPTAIK 63
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ L+G K V THD +P SFD+R+ WP C +I I DQ CGSCWA
Sbjct: 64 RRLMGSKYVAIPSKYRVNEVTHDDIDDSAIPSSFDSRTQWPNCPSIKSIRDQSSCGSCWA 123
Query: 129 FGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
FGA EA++DR CI + ++S +DLL+CC CG GCDGG+P +AW Y+V G+V+
Sbjct: 124 FGAAEAMTDRICIASKGAIQFTVSADDLLSCCD-ECGFGCDGGFPYAAWNYWVEKGIVSG 182
Query: 187 ECDPYFDSTGCS----------------HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYS 229
Y +GC HP + YPT C KC + N K Y
Sbjct: 183 --GSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKCQSGYATAYTNDKRYG 240
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
AY + + + I EI +GPVEV++ VYEDF HY G+YKH G +GGHAVK+IGWG
Sbjct: 241 AKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVKMIGWG 300
Query: 290 TSDDGEDYWVC 300
T ++G YW+C
Sbjct: 301 T-ENGIPYWIC 310
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 120/300 (40%), Positives = 164/300 (54%), Gaps = 45/300 (15%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L ++ +N+ + W A N F N K L G KG L + ++ + +K
Sbjct: 25 LSSEMVNYINK-LNSTWTAGHN--FHNVDYSYVKKLCGT--LLKGPKLPLMIR-YAGDIK 78
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LPK FD+R WP C T+ I DQG CGSCWAFGA EA+SDR CIH +S LS DLL
Sbjct: 79 LPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLL 138
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------------------ECDPYFDSTG 196
CC CG GC+GGYP SAW ++V G+V+ D F S G
Sbjct: 139 TCCNS-CGMGCNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPG 197
Query: 197 C--------------SHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPE 240
C S P C TP+C+ +C + ++ KH+ ++Y ++S+ +
Sbjct: 198 CRPYTIPPCEHHVNGSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYSVSSEED 257
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+I EIYKNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG ++G YW+C
Sbjct: 258 EIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSALGGHAIKMLGWG-EENGVPYWLC 316
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 116/279 (41%), Positives = 155/279 (55%), Gaps = 23/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
L D I +N + WKA RN F +T K L GV P L +
Sbjct: 26 LSDDFINLINTKQNS-WKAGRN--FPEHTPFAHIKKLAGVLPDYHLSKLSKVEHEDELIA 82
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
LP++FD R WP C T++ + DQG CGSCWAFGAVEA++DR+C + + S DL
Sbjct: 83 SLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDL 142
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG---- 201
L+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C H PG
Sbjct: 143 LSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMP 200
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C TPKC + C N +R K Y + ++S + I AE++KNGPVE +FTVY
Sbjct: 201 CNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYS 260
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
D +YK+GVYKH GD +GGHAVK++GWG ++G YW+
Sbjct: 261 DLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWL 298
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 115/262 (43%), Positives = 152/262 (58%), Gaps = 23/262 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
W A N F + K L G KG L V V+ + + LKLPK+FDAR WP C T
Sbjct: 40 WTAGHN--FRDVDYSYVKRLCGT--FLKGPKLPVMVQ-YTEGLKLPKNFDAREQWPNCPT 94
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 171
+ I DQG CGSCWAFGA EA+SDR CI +S+ ++ DLL CC CG GC+GGYP
Sbjct: 95 LKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCD-SCGMGCNGGYP 153
Query: 172 ISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 218
+AW ++ G+VT C PY G P TP C KC
Sbjct: 154 SAAWDFWTTDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPG 213
Query: 219 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
+ L++ KH+ ++Y + S+ IMAE++KNGPVE +FTVYEDF YKSGVY+H++G
Sbjct: 214 YSPLYKEDKHFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSA 273
Query: 278 MGGHAVKLIGWGTSDDGEDYWV 299
+GGHA+K++GWG ++G YW+
Sbjct: 274 LGGHAIKILGWG-EENGVPYWL 294
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 116/279 (41%), Positives = 155/279 (55%), Gaps = 23/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
L D I +N + WKA RN F +T K L GV P L +
Sbjct: 26 LSDDFINLINTKQNS-WKAGRN--FPEHTPFAHIKRLAGVLPDYHLSKLSKVEHEDELIA 82
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
LP++FD R WP C T++ + DQG CGSCWAFGAVEA++DR+C + + S DL
Sbjct: 83 SLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDL 142
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG---- 201
L+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C H PG
Sbjct: 143 LSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMP 200
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C TPKC + C N +R K Y + ++S + I AE++KNGPVE +FTVY
Sbjct: 201 CNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYS 260
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
D +YK+GVYKH GD +GGHAVK++GWG ++G YW+
Sbjct: 261 DLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWL 298
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 119/279 (42%), Positives = 155/279 (55%), Gaps = 25/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSL 96
L II VN + WKA N F TV K L GV P L P+K H+ +
Sbjct: 24 LTQEIIDYVN-SIDTTWKAGWN--FQGATVSYVKGLCGVIRDPNNHKL--PLKLHELNAQ 78
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 154
+P +FD+R+ W C TI + DQG CGSCWA A EA+SDR C+ + + + LS +L
Sbjct: 79 DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSSENL 138
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 201
+ACC CG GC GG+P +AW Y+ G+VT + C PY + C H P
Sbjct: 139 MACCE-TCGMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPY-EIAPCEHHINGSRPA 196
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C PTP+C + C N + KHY+ SAY ++S + I EI NGPVE +FTVY
Sbjct: 197 CGKIEPTPRCKKTCESGYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYA 256
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF HYKSGVY+H +G +GGHAVK+IGWG + YW+
Sbjct: 257 DFPHYKSGVYQHESGAELGGHAVKMIGWGM-EGSTPYWL 294
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 117/295 (39%), Positives = 164/295 (55%), Gaps = 27/295 (9%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGL 83
+VSK+ ++ L + + WKA N +F NY+ L+GV + + K
Sbjct: 8 IVSKISHEAEKLTGYALANYVNRKQNLWKAKFNNKFRNYSDRVKYGLMGVNNVRLSVKAK 67
Query: 84 LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-- 141
P + +D + +P++FDAR W QC+++ I DQ CGSCWAFGAVEA+SDR CI
Sbjct: 68 KNLSPTRFYD--IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIAS 125
Query: 142 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS 194
+ + +SLS +DLL+CC CG GCDGG P++AW+Y+V G+VT + C PY
Sbjct: 126 NGKIQVSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PF 183
Query: 195 TGCSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMA 244
C H P YPTPKC +KC + + + K + +AY + D I
Sbjct: 184 PPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQK 243
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI +GPVEV+F VYEDF Y G+Y H G + GGHAVK++GWG + G YW+
Sbjct: 244 EILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWL 297
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 117/262 (44%), Positives = 150/262 (57%), Gaps = 23/262 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA N F + K L G KG L V V+ D LKLP +FDAR WP C T
Sbjct: 40 WKAGHN--FHDVDYSYVKRLCGT--LLKGPRLPVMVQYAD-DLKLPTNFDAREQWPNCPT 94
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 171
+ I DQG CGSCWAFGA EA+SDR CIH +S +S DLL CC CG GC+GGYP
Sbjct: 95 LKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISAQDLLTCCDG-CGMGCNGGYP 153
Query: 172 ISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 218
+AW ++ G+VT C PY G P TP C C
Sbjct: 154 SAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMSCEPG 213
Query: 219 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
+ ++ KH+ ++Y + S+ +DIM E+YKNGPVE +FTVYEDF YKSGVY+H++G
Sbjct: 214 YSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPA 273
Query: 278 MGGHAVKLIGWGTSDDGEDYWV 299
+GGHA+K++GWG ++G YW+
Sbjct: 274 LGGHAIKILGWG-EENGVPYWL 294
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 117/299 (39%), Positives = 164/299 (54%), Gaps = 18/299 (6%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 68
CL+ + + T G+ +K D +L S + E+N + W A+ + + S ++
Sbjct: 10 CLVAVFAVLLATTVSGLYAKPS-DFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLE 68
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+ FDA WP C TIS I DQ +CGSCWA
Sbjct: 69 EVRKLMGVTDMSTEAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 128
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++V G+ TE
Sbjct: 129 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 187
Query: 188 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
C PY CSH G YP TPKC C K K+ ++Y + +
Sbjct: 188 CQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 244
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E +M E+ NGP+EV+ VY DF YKSGVYKH++GD++GGHAVKL+GWGT G YW
Sbjct: 245 E-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYW 301
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 110/268 (41%), Positives = 151/268 (56%), Gaps = 18/268 (6%)
Query: 39 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 98
Q +++EVN W A NP F++ T+ F+ L G + TP + + V T + L
Sbjct: 18 QQKLVREVNSRNDVNWVAGINPHFADATIEDFRRLNGARQTPLSDRVYMDVSTVPVA-NL 76
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLA 156
P FD+R+ WP C I +I DQGHCGSCWA + E L DRFCI LS L +
Sbjct: 77 PDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHLTS 136
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR-KC 215
C GC+GG+ +A+ + +G++ E+C PY C HPGC +PTPKC + KC
Sbjct: 137 CTPGC--SGCNGGWMSTAFGFMQSNGILGEDCIPY-QMGKCKHPGCS-TWPTPKCNKTKC 192
Query: 216 ----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
K +LW ++ S+Y + S+ DI EIY+NGPV SF VYED + Y+SGVY+
Sbjct: 193 YPNDTKSTELW-----HAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQ 247
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
H+TG G HA+K++GWG DG YW
Sbjct: 248 HVTGGFEGLHAIKVVGWGIL-DGVKYWT 274
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 166/317 (52%), Gaps = 40/317 (12%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L T LL L ++ T +E H+L D I E+ ++ W RN +
Sbjct: 4 LIATVSLLALVAMTKATESE---------PHMLSDEFI-ELVKSKATTWTPGRNFD-AAV 52
Query: 66 TVGQFKHLLGVKPT-------PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
+ + L+GV P K LLG + D LP+ FD+ WP C TI I
Sbjct: 53 SEHHIRALMGVHPDSHKFTLPEKRELLGADGEDKD----LPEEFDSSKNWPNCPTIREIR 108
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 176
DQG CGSCWAFGAVEA+SDR CIH +N S +DL+ CC CG GC+GG+P +AW
Sbjct: 109 DQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVTCC-HTCGFGCNGGFPGAAWS 167
Query: 177 YFVHHGVV-------TEECDPYFDSTGCSHPGCEPAYP-----TPKCVRKCVKKNQL-WR 223
Y+ G+V TE C PY + C H P P TP C +C + +
Sbjct: 168 YWTTRGIVSGGSYNSTEGCRPY-EVEPCEHHVDGPRPPCHSGSTPHCKHQCQPNYSVDYE 226
Query: 224 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
KH+ S+Y IN +P +I EI NGPVE +FTVYED YK+GVY+H+ G +GGHA+
Sbjct: 227 KDKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAI 286
Query: 284 KLIGWGTSDDGE-DYWV 299
++IGWG + + YW+
Sbjct: 287 RIIGWGVWGESKVPYWL 303
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 110/270 (40%), Positives = 152/270 (56%), Gaps = 21/270 (7%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP---VKTHDKSLKLPKSFDAR 105
N + WKA RNP F + ++GV+ + K +P + +++P FD+R
Sbjct: 50 NLQTTWKAGRNPYFETVPSHVIQGMMGVRRSSKLETNSIPLPVISYEHIDMEIPVEFDSR 109
Query: 106 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 163
WP C TI I DQ +CGSCWAFGAVEA+SDR CI +S DLL+CC +CG
Sbjct: 110 KQWPYCPTIGEIRDQSNCGSCWAFGAVEAISDRICIATDGRQKPHISSTDLLSCCK-ICG 168
Query: 164 DGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPK 210
GC GG P AW ++V +G+VT + C PY S G P PTP
Sbjct: 169 FGCQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGPCSHDLEPTPV 228
Query: 211 CVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C + C ++ N K+Y + AY +++ D+ E+ NGP+EV+F VYEDF YK+GV
Sbjct: 229 CKKACQSTYKIQYNKDKYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGV 288
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+H TG V+GGHAV+L+GWG ++G YW+
Sbjct: 289 YQHHTGSVLGGHAVRLLGWG-EENGVPYWL 317
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 112/271 (41%), Positives = 157/271 (57%), Gaps = 22/271 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKTHDKS 95
L D +I +N+ P WKA R +F+ ++ K ++GV + L + +D +
Sbjct: 32 LSDELITFINKQPNIEWKADRTTRFT--SIHHAKSMMGVLLNRVDQHKLHHPIIHHNDIN 89
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+KLPK FD+R W CS+I I DQ CGSCWAFGAVE++SDR CIH +++ LS +
Sbjct: 90 IKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVN 149
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 200
LL+CC CG GC+GG P AW Y+ G+VT C PY ST +H
Sbjct: 150 LLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHS 208
Query: 201 GCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
CE Y TP+C + C + + N K+Y S+Y + SD IM EI NGPVE +F V
Sbjct: 209 SCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYV 268
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
Y+DF +YK+GVYK++TG ++GGHA+++ G
Sbjct: 269 YDDFLNYKTGVYKYVTGSLLGGHAIRITWLG 299
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 117/299 (39%), Positives = 164/299 (54%), Gaps = 18/299 (6%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 68
CL+ + + T G+ +K D +L S + E+N + W A+ + + S ++
Sbjct: 15 CLVAVFAVLLATTVSGLYAKPS-DFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLE 73
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+ FDA WP C TIS I DQ +CGSCWA
Sbjct: 74 EVRKLMGVTDMSTEAVPPRNFSVVEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 133
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++V G+ TE
Sbjct: 134 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 192
Query: 188 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
C PY CSH G YP TPKC C K K+ ++Y + +
Sbjct: 193 CQPYPFGP-CSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 249
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E +M E+ NGP+EV+ VY DF YKSGVYKH++GD++GGHAVKL+GWGT G YW
Sbjct: 250 E-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYW 306
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 106/226 (46%), Positives = 133/226 (58%), Gaps = 22/226 (9%)
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
S +P FDAR WP C +I I DQ CGSCWAF A EA+SDR CI + +N LS
Sbjct: 79 SDAIPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSE 138
Query: 153 DLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGC 197
DLL+CC F CG+GC+GGYPI AW+++ HG+VT C PY + G
Sbjct: 139 DLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGV 198
Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
+ P C E PTPKCV C + + KH+ +AY + E I EI KNGP+E
Sbjct: 199 TWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIE 258
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW+
Sbjct: 259 VAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWL 303
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 158/303 (52%), Gaps = 20/303 (6%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL S+ A G + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLL-----STALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + + L G + L V +LP+SFD+ WP C TI I DQ CG
Sbjct: 57 ITFAEARRLTGARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACG 116
Query: 125 SCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
SCWA A+SDR C G+ L +S LL+CC CG GCDGGYP +AWRY+V HG+
Sbjct: 117 SCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCK-DCGYGCDGGYPDAAWRYYVSHGL 175
Query: 184 VTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRI 235
+ C PY C H G + P TPKC C K K+ +Y +
Sbjct: 176 ASSYCQPY-PFPHCDHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNHSYEV 232
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
+ + ED E+Y NGP V+F VY DF YK+GVY+H++GDV+GGHAV+++GWG +G
Sbjct: 233 HGE-EDYKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGT 290
Query: 296 DYW 298
YW
Sbjct: 291 PYW 293
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 100/225 (44%), Positives = 137/225 (60%), Gaps = 20/225 (8%)
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+D S LP++FDAR WP C TI + DQG CGSCWAFGAVEA+SDR CIH N
Sbjct: 20 NDASTDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHF 79
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------ 197
S +L++CC + CG GC+GG+P +AW Y+ G+V+ PY + GC
Sbjct: 80 SAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHV 136
Query: 198 --SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
+ C+ TP CV+KC + ++ + H+ SAY I +D + I EIY NGPVE
Sbjct: 137 NGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEG 196
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW+
Sbjct: 197 AFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWL 241
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 116/299 (38%), Positives = 164/299 (54%), Gaps = 18/299 (6%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 68
CL+ + + T G+ +K D +L S + E+N + W A+ + + + ++
Sbjct: 10 CLVAVFAVLLATTVSGLYAKPS-DFPLLGKSFVAEINSKARGQWTASADNGYLVTGKSLE 68
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+ FDA WP C TIS I DQ +CGSCWA
Sbjct: 69 EVRKLMGVTDMSTEAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 128
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++V G+ TE
Sbjct: 129 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 187
Query: 188 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
C PY CSH G YP TPKC C K K+ ++Y + +
Sbjct: 188 CQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 244
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E +M E+ NGP+EV+ VY DF YKSGVYKH++GD++GGHAVKL+GWGT G YW
Sbjct: 245 E-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYW 301
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 113/257 (43%), Positives = 140/257 (54%), Gaps = 24/257 (9%)
Query: 52 AGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWP 109
A W + R P+ + H+ G K + P HD +++LPK+FDAR WP
Sbjct: 40 ARWISGRRPK--RFESDDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLPKNFDARKTWP 97
Query: 110 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCD 167
CS+IS I DQ CGSCWAFGAVEA+SDR CIH N SLS DLL+CC CG GC
Sbjct: 98 HCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKD-CGFGCR 156
Query: 168 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCV 212
GGYP AW Y+ HG+VT D +GC P CE YPTP+CV
Sbjct: 157 GGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECV 214
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
++C + + K + +Y I + IM EI GPVE FT+YEDF Y SGVY H
Sbjct: 215 QQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFH 274
Query: 273 ITGDVMGGHAVKLIGWG 289
G M GHAV+++GWG
Sbjct: 275 ALGAPMSGHAVRILGWG 291
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 106/226 (46%), Positives = 133/226 (58%), Gaps = 22/226 (9%)
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
S +P FDAR WP C +I I DQ CGSCWAF A EA+SDR CI + +N LS
Sbjct: 79 SDAIPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSE 138
Query: 153 DLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGC 197
DLL+CC F CG+GC+GGYPI AW+++ HG+VT C PY + G
Sbjct: 139 DLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGV 198
Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
+ P C E PTPKCV C + + KH+ +AY + E I EI KNGP+E
Sbjct: 199 TWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIE 258
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW+
Sbjct: 259 VAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWL 303
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 117/280 (41%), Positives = 156/280 (55%), Gaps = 25/280 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 96
L D I +N + WKA RN N + K L GV L +P HD L
Sbjct: 29 LTDEFINLINTKQNS-WKAGRNFPV-NTPLTHIKKLTGVLVDTH--LSKLPKVEHDADLI 84
Query: 97 -KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP++FD R WP C T++ + DQG CGSCWAFGAVEA++DR+C + + S D
Sbjct: 85 ADLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAED 144
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG--- 201
LL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C H PG
Sbjct: 145 LLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRM 202
Query: 202 -CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKC + C N + K Y Y ++S + I AE+YKNGPVE +FTVY
Sbjct: 203 PCNGDSKTPKCHKTCESSYNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAFTVY 262
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
D +YK+GVYKH G+ +GGHA+K++GWG ++G YW+
Sbjct: 263 SDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGNKYWL 301
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 122/294 (41%), Positives = 162/294 (55%), Gaps = 40/294 (13%)
Query: 36 HILQDSIIKEVNENPKAGWK---AARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK-- 90
H L D ++ VN+ W+ A + F N V K L G LG P
Sbjct: 24 HPLSDELVNYVNKR-NTTWQVGCGAASYNFYNVDVSYLKRLCGT-------FLGGPKPPQ 75
Query: 91 --THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC--W-----AFGAVEALSDRFCI 141
T + L LP+SF AR WPQC TI Q G W AFGAVEA+SDR CI
Sbjct: 76 RVTFTEDLNLPESFYAREQWPQCPTIXXXRAQPGRGGLTRWGSFLQAFGAVEAISDRICI 135
Query: 142 HFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF 192
H ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 136 HTNAHISVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPY- 194
Query: 193 DSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAE 245
C H P C TPKC + C + ++ KHY ++Y +++ +DIMAE
Sbjct: 195 SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAE 254
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
IYKNGPVE +F+VY DF YKSGVY+HITG++MGGHA++++GWG ++G YW+
Sbjct: 255 IYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGV-ENGTPYWL 307
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 119/299 (39%), Positives = 160/299 (53%), Gaps = 18/299 (6%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQ--FSNYTVG 68
CL+ + V+ T + +K D +L S + E N K W A+ + + ++
Sbjct: 10 CLVAVFVVLLATTVSALYAKPS-DIPLLGKSFVAETNSKAKGQWTASADNGHLVTGKSLE 68
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+SFDA WP C TI I DQ +CGSCWA
Sbjct: 69 EVRKLMGVTSMSTEAVPPRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWA 128
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S +LL+CC F+CG GC GG P AW ++V GV TE
Sbjct: 129 IAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTEL 187
Query: 188 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
C PY CSH G YP TPKC C N K+ +S+Y I +
Sbjct: 188 CQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGER 244
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E + E+ NGP+EV+ VY DF YKSGVYKH++GD +GGHAVKL+GWG DG YW
Sbjct: 245 E-LDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVK-DGIPYW 301
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 122/312 (39%), Positives = 163/312 (52%), Gaps = 35/312 (11%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL+L I A V + + +L D I+ V K WK RN S T G +
Sbjct: 3 LLLLVAI-----AASVAALTSGEPSLLSDEFIEVVRSKAKT-WKVGRNFDAS-VTEGHIR 55
Query: 72 HLLGVKPTPKGLLLGVPVKTH-------DKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
L+GV P L P K + +LP+ FD+R WP C TI I DQG CG
Sbjct: 56 RLMGVHPDAHKFAL--PDKREVLGDLYMNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCG 113
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y+ G
Sbjct: 114 SCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKG 172
Query: 183 VVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHY 228
+V+ + C PY + + C H P C TPKC C + + KH+
Sbjct: 173 IVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGGTPKCSHVCQSSYTVDYAKDKHF 231
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 288
+Y + + +I EI NGPVE +FTVYED YK GVY+H G +GGHA++++GW
Sbjct: 232 GSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGW 291
Query: 289 GT-SDDGEDYWV 299
G D+ YW+
Sbjct: 292 GVWGDEKIPYWL 303
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 109/278 (39%), Positives = 154/278 (55%), Gaps = 22/278 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L + I ++N W A RN + + F+ L+GV + V + D+
Sbjct: 25 LSEKFIDQINAKATT-WHAGRNFH-PDTPLSYFRGLMGVHKDADKFMPPVMLHDLDEGDD 82
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLL 155
LP++FD+R WP C TI I DQG CGSCWAFGAVEA+SDR CIH + +S DLL
Sbjct: 83 LPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKVLFRVSAEDLL 142
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------- 207
CC CG GCDGG P + W++++ G+V+ P+ GC EP
Sbjct: 143 TCCTN-CGHGCDGGAPGAGWKHWIEKGLVSG--GPFGSDQGCRPYTIEPCVHVENGAQSP 199
Query: 208 -----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TPKC++KC+ N + K + S Y I +D I EI+ NGPVE +FTV++D
Sbjct: 200 CKDSITPKCIKKCLPGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPVEATFTVFDD 259
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FA YK G+Y+H +G++ G HAV+++GWG ++G YW+
Sbjct: 260 FASYKHGIYQHTSGNLAGEHAVRILGWGV-ENGTKYWL 296
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 116/274 (42%), Positives = 154/274 (56%), Gaps = 19/274 (6%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL-PK 100
+ + V+ A W + R P+ + + H ++ + V+ D KL PK
Sbjct: 30 VREHVHPTAGARWISVRYPK-PFESDNKLHHFGAIREPVEQRAQRSTVRHEDFDSKLIPK 88
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACC 158
SFDAR+ WP C +IS I DQ CGSCWAFGAVEA+SDR CIH N SLS DLL+CC
Sbjct: 89 SFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAVDLLSCC 148
Query: 159 GFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPY------FDSTGCSHPGCEPA 205
CGDGCDGG+P AW ++ HG+VT EE C PY S G P
Sbjct: 149 K-DCGDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHSQGHYPPCPRRI 207
Query: 206 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
YPTPKCV+ C ++ K + ++Y ++ IM EI NGPVE +F V+EDF Y
Sbjct: 208 YPTPKCVKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEY 267
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KSG+Y H G +GGHA++++GWG ++G YW+
Sbjct: 268 KSGIYFHAWGGSVGGHAIRILGWG-EENGVPYWL 300
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 116/280 (41%), Positives = 156/280 (55%), Gaps = 27/280 (9%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVG---QFKHLLGVKPTPKGLLLGVPVK--THDKSL 96
II VN +P W+A+ +N G F L+GV P P+K D+S
Sbjct: 32 IIDSVNADPGNTWRASD----TNVIPGDGKNFNQLMGVLPRNFNSFRFAPIKKSAEDESN 87
Query: 97 K-LPKSFDARSAWPQCSTI-SRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP++FDAR WP+CS++ I DQ +CGSCWA A SDR CI G + +LS
Sbjct: 88 EALPENFDARERWPECSSLLGSIKDQSNCGSCWAVSAASVFSDRLCIATGGAVARNLSAE 147
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPGCEP 204
L CC + CG+GCDGG P SAW +F+ HG+VT + C PY G C
Sbjct: 148 QLNTCC-YRCGNGCDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIE 206
Query: 205 AYP-TPKC-VRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
P TP C ++ C N + +R HY + Y ++ EDIM ++YKNGPV+ +F VY
Sbjct: 207 DDPDTPDCSIKTCTNSNYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYT 266
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
DF +YKSGVY + G + GGHA+K++GWG DDG YW+C
Sbjct: 267 DFMYYKSGVYSYTRGQIEGGHAIKILGWGV-DDGTKYWLC 305
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 119/304 (39%), Positives = 167/304 (54%), Gaps = 25/304 (8%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN-YTVGQF 70
LL++G++++ F + K H L D +I +N+ WKA N F ++
Sbjct: 5 LLVMGLLAAVCFGREIHPK---KWHPLSDQMINYINK-INTTWKAGSN--FDKCISMSYI 58
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
+ LLGV P + L V + LP+SFDAR+ W C +I I DQ CGSCWAFG
Sbjct: 59 RGLLGVHPKSEEYRLAEFVHE-EIPDDLPESFDARAKWSHCDSIHLIRDQSTCGSCWAFG 117
Query: 131 AVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 185
A EA+SDR CIH M +++S DLL CC CG GC GG+P +AW ++ G+V+
Sbjct: 118 ATEAMSDRICIHSKGKMQVNISAEDLLDCCD-TCGHGCKGGFPAAAWEHWKERGIVSGGL 176
Query: 186 ----EECDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 235
+ C PY + T C P C P TP+CV C K ++ ++ KH+ Y I
Sbjct: 177 YGTPDGCKPYSLAPCEYHTKCRIPNCIPIVHTPECVHHCRKGYDKDYQEDKHFGQKVYSI 236
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
+ D + I EI+ NGPVE F VY DF YKSGVY+ + D G HA++++GWGT ++G
Sbjct: 237 SRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGT-ENGT 295
Query: 296 DYWV 299
YW+
Sbjct: 296 PYWL 299
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 111/268 (41%), Positives = 147/268 (54%), Gaps = 26/268 (9%)
Query: 51 KAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQ 110
K W A R +F ++ + L G TP+ L P+K + +P +FD+R+ WP
Sbjct: 36 KTTWVAERPTRFGSFD--EVARLCGALETPEDQRL--PLKVAPIAEAIPDTFDSRTNWPA 91
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 168
C TI + DQ CGSCWAFGAVE++SDR CI + LS +DLL+CC CGDGCDG
Sbjct: 92 CPTIKEVRDQSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLLSCC-TSCGDGCDG 150
Query: 169 GYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAYP--------TPKCVR 213
G +W Y+ + G+VT C PY D C+H P YP TPKC +
Sbjct: 151 GQLGPSWDYYKNKGIVTGYLYNTTGYCKPY-DFPACAHHEASPDYPDCPSTDYSTPKCTK 209
Query: 214 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
CV + HY S+Y + I EI +GPVE +FTVY DF Y+SGVYK
Sbjct: 210 SCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYK 269
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
H +G V+GGHA+ ++GWGT + G YW+
Sbjct: 270 HTSGSVLGGHAISIVGWGT-ESGSPYWL 296
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 114/257 (44%), Positives = 141/257 (54%), Gaps = 24/257 (9%)
Query: 52 AGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWP 109
A W + R P+ + G H+ G K + P HD +++LPK+FDAR WP
Sbjct: 40 ARWISGRLPK--RFESGDLIHMFGAKRETREQKAQRPTLRHDGFDNMRLPKNFDARKTWP 97
Query: 110 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCD 167
CS+IS I DQ CGSCWAFGAVEA+SDR CIH N SLS DLL+CC CG GC
Sbjct: 98 HCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCK-DCGFGCR 156
Query: 168 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCV 212
GGYP AW Y+ HG+VT D +GC P CE YPTP+CV
Sbjct: 157 GGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECV 214
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
++C + + K + +Y I + IM EI GPVE FT+YEDF Y SGVY H
Sbjct: 215 QQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFH 274
Query: 273 ITGDVMGGHAVKLIGWG 289
G M GHAV+++GWG
Sbjct: 275 ALGAPMSGHAVRILGWG 291
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 164/310 (52%), Gaps = 31/310 (10%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL+L I A V + + +L D I+ V K WK RN S T G +
Sbjct: 3 LLLLVAI-----AASVAALTSGEPSLLSDEFIEVVRSKAKT-WKVGRNFDAS-VTEGHIR 55
Query: 72 HLLGVKPTPKGLLL----GVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSC 126
L+GV P L V + SL +LP+ FD+R WP C TI I DQG CGSC
Sbjct: 56 RLMGVHPDAHKFALPDKREVLGDLYMNSLDELPEEFDSRKQWPNCPTIGEIRDQGSCGSC 115
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y+ G+V
Sbjct: 116 WAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIV 174
Query: 185 T-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 230
+ + C PY + + C H P C TPKC C + + KH+
Sbjct: 175 SGGPYGSNQGCRPY-EISPCEHHVNGTRPPCANGSGTPKCSHVCQSSYTVDYAKDKHFGS 233
Query: 231 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
+Y + + +I EI NGPVE +FTVYED YK GVY+H G +GGHA++++GWG
Sbjct: 234 KSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGV 293
Query: 291 -SDDGEDYWV 299
++ YW+
Sbjct: 294 WGNEKIPYWL 303
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 116/282 (41%), Positives = 154/282 (54%), Gaps = 25/282 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGLLLGVPVKTHDKSL 96
D +I +NE A WKAA + +F N + FK LG+ + TP+ P ++ S
Sbjct: 16 FSDELIHYINEKSGASWKAAPSSRFIN--IEHFKQHLGLLEETPEERQTRRPTVRYNVSD 73
Query: 97 K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+SFDAR WP C +I +I DQ CGSCWA V A+SDR CIH M LS D
Sbjct: 74 NDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAID 133
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP-- 204
L++CC + CG+GC GG P +AW Y+ +G+VT C PY C HPG
Sbjct: 134 LVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQL 191
Query: 205 ------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
YPTP C C ++ + K Y ++Y ++ IM EI KNGPVE F
Sbjct: 192 NPCPRYTYPTPSCYPYCQAGYDKTYEKDKVYGKTSYNVDRHEYTIMEEIMKNGPVEAGFI 251
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DFA YKSG+Y H++G G HA+++IGWG ++G YW+
Sbjct: 252 VYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVKYWL 292
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 115/298 (38%), Positives = 156/298 (52%), Gaps = 26/298 (8%)
Query: 24 AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 83
A V + + +L D I+ V K W RN S T G + L+GV P
Sbjct: 10 AASVAALTAGEPSLLSDEFIELVRSKAKT-WTVGRNFDAS-VTEGHIRRLMGVHPDAHKF 67
Query: 84 LLGVPVKT-----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 138
L + + ++P+ FD+R WP C TI I DQG CGSCWAFGAVEA+SDR
Sbjct: 68 ALADKREVLGDLYMNSVDEIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDR 127
Query: 139 FCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECD 189
CIH G +N S +DL++CC CG GC+GG+P +AW Y+ G+V+ + C
Sbjct: 128 VCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCR 186
Query: 190 PYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDI 242
PY + + C H P C TPKC C + + KH+ +Y + + DI
Sbjct: 187 PY-EISPCEHHVNGTRPPCAHGGATPKCSHVCQSSYTVDYAKDKHFGSKSYSVRRNVRDI 245
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
EI NGPVE +FTVYED YK GVY+H G +GGHA++++GWG D+ YW+
Sbjct: 246 QEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWL 303
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 134/218 (61%), Gaps = 19/218 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLL 155
+P FD+R WP C TI + DQG CGSCWAFGAVEA+SDR+CI + +S DLL
Sbjct: 4 VPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLL 63
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGC 202
+CC CG GC+GGYP SAW ++ G+VT + C PY C H C
Sbjct: 64 SCC-ETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPY-KIAACDHHVVGKLKPC 121
Query: 203 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
+ PTPKC RKC N + + KH+ SAY + SDP +I EI NGPVE +FTVY D
Sbjct: 122 KGDSPTPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVYAD 181
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F YKSGVY+H +G +GGHA+K++GWG ++G YW+
Sbjct: 182 FPTYKSGVYQHTSGSALGGHAIKILGWG-EENGTPYWL 218
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 117/290 (40%), Positives = 163/290 (56%), Gaps = 32/290 (11%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVP 88
KL + L + + ++ N WKA N +F NY+ L+GV + + K P
Sbjct: 59 KLTGYALANYVNRKQNL-----WKAKFNNKFRNYSDRVKYGLMGVNNVRLSVKAKKNLSP 113
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
+ +D + +P++FDAR W QC+++ I DQ CGSCWAFGAVEA+SDR CI + +
Sbjct: 114 TRFYD--IYIPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQ 171
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 199
+SLS +DLL+CC CG GCDGG P++AW+Y+V G+VT + C PY C H
Sbjct: 172 VSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PFPPCEH 229
Query: 200 --------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
P YPTPKC +KC + + + K + +AY + D I EI +
Sbjct: 230 HSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTH 289
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GPVEV+F VYEDF Y G+Y H G + GGHAVK++GWG + G YW+
Sbjct: 290 GPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWL 338
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 147/254 (57%), Gaps = 24/254 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQCS 112
W + QF N VGQ LLG K +P L +K++D +++P SF+A++ WP C+
Sbjct: 39 WVEEQTDQFDNIKVGQ---LLGFKRSPNRPKL--QIKSYDPLGVQIPTSFNAQTNWPNCT 93
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 172
TIS+I +Q CGSCWAFGA E+ +DR CIH N+ LS D++ C +GC+GG
Sbjct: 94 TISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAF 151
Query: 173 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRN 224
SAW + G V+EEC PY + P C PA TP C ++C + L +
Sbjct: 152 SAWNWLRKQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQ 205
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
KH Y +SD E IM EI NGPVE FTV+EDF YKSGVY H TG +GGH VK
Sbjct: 206 DKHKMAKIYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVK 264
Query: 285 LIGWGTSDDGEDYW 298
L+G+GT +G DY+
Sbjct: 265 LVGFGTL-NGVDYY 277
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 196 bits (498), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 154/303 (50%), Gaps = 19/303 (6%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL S+ A G + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLL-----STALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + + L G L V +LP+SFD+ WP C TI I DQ CG
Sbjct: 57 ITFAEARRLTGAFRRKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACG 116
Query: 125 SCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
SCWA A+SDR C G+ L +S LL+CC CGDGCDGGYP SAW Y+V HG+
Sbjct: 117 SCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCK-DCGDGCDGGYPDSAWEYYVSHGL 175
Query: 184 VTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRI 235
+ C PY C H G + P TPKC C K K+ +Y +
Sbjct: 176 ASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNDSYVL 232
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
+D E+Y NGP V+F VY DF YK+GVY+H++GD +GGHAV+++GWG +G
Sbjct: 233 LHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGT 291
Query: 296 DYW 298
YW
Sbjct: 292 PYW 294
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 116/299 (38%), Positives = 163/299 (54%), Gaps = 18/299 (6%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVG 68
CL+ + + T G+ +K D +L S + E+N + W A+ + + S ++
Sbjct: 10 CLVAVFAVLLATTVSGLYAKPS-DFPLLGKSFVAEINSKARGQWTASADNGYLVSGKSLE 68
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
+ + L+GV + + LP+ FDA WP C TIS I DQ +CGSCWA
Sbjct: 69 EVRKLMGVTDMSTEAVPPRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWA 128
Query: 129 FGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++V G+ TE
Sbjct: 129 IAAVEAISDRYCTLGGVPDRRISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEV 187
Query: 188 CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 239
C PY CSH G YP TPKC C K K+ ++Y + +
Sbjct: 188 CQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEK 244
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E +M E+ NGP+EV+ VY DF YKSG YKH++GD++GGHAVKL+GWGT G YW
Sbjct: 245 E-LMIELMTNGPLEVTMQVYSDFVGYKSGGYKHVSGDLLGGHAVKLVGWGT-QGGVPYW 301
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 156/306 (50%), Gaps = 26/306 (8%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL S+ A G + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLL-----STALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + + L G + L V +LP+SFD+ WP C TI I DQ CG
Sbjct: 57 ITFAEARRLTGARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACG 116
Query: 125 SCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
SCWA A+SDR+C G+ L +S LL+CC CG GCDGGYP +AW Y+V HG+
Sbjct: 117 SCWAVSTASAISDRYCTVGGVQQLRISAAHLLSCCKD-CGYGCDGGYPGTAWEYYVSHGL 175
Query: 184 VTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKHYSISA 232
+ C PY C H G + P TPKC C K +R + Y +
Sbjct: 176 ASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPLIKYRGNHSYGLDG 234
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
+D E+Y NGP V+F VY DF YK+GVY+H++GDV+GGHAV+++GWG
Sbjct: 235 ------EDDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL- 287
Query: 293 DGEDYW 298
+G YW
Sbjct: 288 NGTPYW 293
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/275 (41%), Positives = 150/275 (54%), Gaps = 18/275 (6%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ IL D ++ VN W A R + + T + LLG +L +
Sbjct: 28 DAPILTDEFLEHVNSLNGGKWTAGRTSRTKHLTRREASRLLGTFLGNTSILAPRQFSEAE 87
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVN 152
++L FDA AWP C TI+ I DQ CGSCWA A A+SDR+C G+ +L +S
Sbjct: 88 LRVRLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAG 147
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEP 204
DL++CC +CG GC+GG+P AW ++V HG+V+E C PY F S C+H C
Sbjct: 148 DLMSCCD-VCGYGCNGGFPEVAWVFYVVHGLVSEYCQPYPFPS--CAHHVNSSDLAPCSG 204
Query: 205 AYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
Y TPKC C KK L R ++S + S E E+ NGP EV+F VY DF
Sbjct: 205 DYKTPKCNSTCTEKKIPLIRYRGNHSY----VLSGEEHFKRELLLNGPFEVAFEVYADFM 260
Query: 264 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
Y GVYKH+ GD++GGHAV+L+GWG +GE YW
Sbjct: 261 AYTGGVYKHVAGDLLGGHAVRLVGWGEL-NGEPYW 294
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/262 (43%), Positives = 152/262 (58%), Gaps = 23/262 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
W A N F + K L G KG L V V+ + + LKLPK+FDAR WP C T
Sbjct: 40 WTAGHN--FRDVDYSYVKKLCGT--FLKGPKLPVMVQ-YTEGLKLPKNFDAREQWPNCPT 94
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 171
+ I DQG CGSCWAFGA EA+SDR CIH +S+ ++ DLL CC CG GC+GGYP
Sbjct: 95 LKEIRDQGSCGSCWAFGAAEAISDRVCIHSDAKVSVEISSQDLLTCCD-SCGMGCNGGYP 153
Query: 172 ISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 218
+AW ++ G+VT C PY G P TP C KC
Sbjct: 154 SAAWDFWATEGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPG 213
Query: 219 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
+ ++ KH+ ++Y + S+ IMAE++KNGPVE +FTVYEDF YKSGVY+H++G
Sbjct: 214 YSPSYKQDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSP 273
Query: 278 MGGHAVKLIGWGTSDDGEDYWV 299
+GGHA+K++GWG ++G YW+
Sbjct: 274 VGGHAIKILGWG-EENGVPYWL 294
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 116/280 (41%), Positives = 151/280 (53%), Gaps = 21/280 (7%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
H L D I +N W A RN T+ K L+G L D
Sbjct: 22 HPLSDKFIDLINSKQNT-WIAGRNFDIGR-TLKSIKKLMGALEDKYLHKLYTVEHDDDTI 79
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP++FD R WP C T++ I DQG CGSCWAFGAVEA++DR+C + + S D
Sbjct: 80 NNLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAED 139
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG--- 201
LL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C H PG
Sbjct: 140 LLSCCP-VCGLGCNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPY-EIPPCEHHVPGNRI 197
Query: 202 -CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKC R C K+ +++ K Y Y + E I AEI+KNGPVE +FTVY
Sbjct: 198 PCNGETSTPKCHRSCRKEYTNSYKSDKKYGKHVYSVGGGEEHIKAEIFKNGPVEGAFTVY 257
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
D YKSGVYKH G+ +GGHA+K++GWG ++G YW+
Sbjct: 258 ADLLTYKSGVYKHTEGEALGGHAIKIMGWGV-ENGNKYWL 296
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 165/313 (52%), Gaps = 35/313 (11%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
+ LT LL+ ++S + ++ + + + I + EN GW A R +F +
Sbjct: 1 MKLTALLLVCALLSIN------AAHIESNYYPFEKEIYEVNREN--LGWVAGRQKRFEGH 52
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
T L GVK + L +PV +P FD+R+ WP C TI I DQ +CGS
Sbjct: 53 TEEYIAGLCGVKGSIPLPLSDLPVLE-----DIPDMFDSRTQWPDCKTIGLIEDQSNCGS 107
Query: 126 CWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
CWAFGA E++SDR+CIH M+L +S +L+ CC CG+GC+GG+ +AW Y+ G+VT
Sbjct: 108 CWAFGATESMSDRYCIHMKMHLLISAANLMECCRN-CGNGCEGGFLGAAWNYWKQEGLVT 166
Query: 186 -----------EECDPYFDSTGCSH--PGCEPAYP-----TPKCVRKC-VKKNQLWRNSK 226
+ C PY C H G +PA P TP+CV C +
Sbjct: 167 GGLYNPSATESDTCQPY-PLPSCEHHINGSKPACPSKIAKTPECVHTCHAGYPTSYEQDL 225
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
HY SAY + +I EI NGPVE +FTVY DF YKSGVYK + +GGHAVK+I
Sbjct: 226 HYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMI 285
Query: 287 GWGTSDDGEDYWV 299
GWG +DG YW+
Sbjct: 286 GWG-EEDGIPYWL 297
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 99/224 (44%), Positives = 136/224 (60%), Gaps = 20/224 (8%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D + LP++FDAR WP C TI + DQG CGSCWAFGAVEA+SDR CIH N S
Sbjct: 23 DAPIDLPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFS 82
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------- 197
+L++CC + CG GC+GG+P +AW Y+ G+V+ PY + GC
Sbjct: 83 AENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVN 139
Query: 198 -SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
+ C+ TPKCV+KC ++ + H SAY +++D + I EIY NGPVE +
Sbjct: 140 GTRGPCKEGGKTPKCVKKCEDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGA 199
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW+
Sbjct: 200 FTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWL 243
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 117/273 (42%), Positives = 152/273 (55%), Gaps = 39/273 (14%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK--------LPKSFDAR 105
WKA N +F+ Y+ LLGV K + H K+L +P+SFDAR
Sbjct: 77 WKAGMN-KFNLYSDTVKYGLLGVNNRKKSV-------EHKKNLSPIRHSNIFIPESFDAR 128
Query: 106 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCG 163
WP+C+++ I DQ CGSCWA AVEA+SDR CI + LS +DLL+CC CG
Sbjct: 129 KNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCG 187
Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYP 207
GC GG P++AW+Y+V G+VT Y + +GC P CE YP
Sbjct: 188 FGCFGGEPMAAWKYWVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYP 245
Query: 208 TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
TPKC ++C K + ++ K+Y AY + +D E I EI GPVE SF VY DF HY
Sbjct: 246 TPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYT 305
Query: 267 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SG+YKH+ G V GGHAVK++GWG D G YW+
Sbjct: 306 SGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWL 337
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 114/265 (43%), Positives = 145/265 (54%), Gaps = 18/265 (6%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
ILQ +I ++N N GW A NP+F+ T K LLG K PKG L
Sbjct: 21 ILQQEMIDQIN-NANVGWTAGVNPRFAGKTREDIKGLLGTKLLPKGTKLREFPVVDTIVD 79
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 154
+P SFDAR+ WP ++I I DQ CGSCWAFGA EALSDR I + +N+ LS DL
Sbjct: 80 AIPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQDL 137
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 214
++C GCDGGYPI+AW Y GVVT+ C PY G S TP C
Sbjct: 138 VSCDS--TDYGCDGGYPINAWHYMQSLGVVTDTCYPYTSGNGDSGTCQITGKKTPACATA 195
Query: 215 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
K + +AY++ ++ I +EI NGPVE +F+VY+DF Y SGVY H +
Sbjct: 196 TFYKAK----------TAYQVANNMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQS 245
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G + GGHAVK++GWG D YW+
Sbjct: 246 GALDGGHAVKIVGWGV-DGTTPYWI 269
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 117/289 (40%), Positives = 157/289 (54%), Gaps = 34/289 (11%)
Query: 30 KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV 89
+L SH + D I K WKA P F N K L G LL G +
Sbjct: 21 RLPPLSHQMVDYINKA-----NTTWKAG--PNFHNVDYSYVKRLCGT------LLKGPKL 67
Query: 90 KT---HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
T + ++LP +FD R WP C T+ I DQG CGSCWAFGA EA+SDR CIH
Sbjct: 68 PTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAK 127
Query: 147 LSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 191
+S+ ++ DLL+CC CG GC+GGYP +AW ++ G+VT C PY
Sbjct: 128 VSVEISSEDLLSCCDS-CGMGCNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCE 186
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 250
G P TP+C +C ++ KH+ ++Y + S+ + IMAE+ KNG
Sbjct: 187 HHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNG 246
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
PVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG + G YW+
Sbjct: 247 PVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWG-EEGGTPYWL 294
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 104/227 (45%), Positives = 136/227 (59%), Gaps = 32/227 (14%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P SFD+R WP+C +I+ I DQ CGSCWAFGAVEA+SDR CI G N+ LS D
Sbjct: 1 VEIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVD 60
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------- 204
LL+CC CG GC+GG AW Y+V G+VT S+ +H GCEP
Sbjct: 61 LLSCC-ESCGLGCEGGILGPAWDYWVKEGIVT-------GSSKENHAGCEPYPFPKCEHH 112
Query: 205 -----------AYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
Y TP+C + C KK + + KH S+Y + +D + I EI K GPV
Sbjct: 113 TKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPV 172
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E FTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG + YW+
Sbjct: 173 EAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKA-PYWL 218
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 155/303 (51%), Gaps = 19/303 (6%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL S+ A G + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLL-----STALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + + L G L V +LP+SFD+ WP C TI I DQ CG
Sbjct: 57 ITFAEARRLTGAFRRKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACG 116
Query: 125 SCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
SCWA A+SDR+C G+ L +S L++CC CGDGC GG P SAW Y+V HG+
Sbjct: 117 SCWAVSTASAISDRYCTVGGVQQLRISAAHLMSCCED-CGDGCKGGAPDSAWEYYVSHGL 175
Query: 184 VTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRI 235
+ C PY C H G + P TPKC C K K+ ++Y +
Sbjct: 176 ASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNNSYML 232
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
+ +D E+Y NGP V F VY DF YK+GVY+H++GDV+GGHAV+++GWG +G
Sbjct: 233 LNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGT 291
Query: 296 DYW 298
YW
Sbjct: 292 PYW 294
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 17/205 (8%)
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFL 161
+R WP C TI I DQG CGSCWAFGAVEA+SDR CIH +N+ +S DLL+CC
Sbjct: 1 SREQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLE 60
Query: 162 CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPT 208
CG+GC+GGYP AW ++ + G+V+ C PY S C H P C T
Sbjct: 61 CGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISP-CEHHVNGSRPKCSGEIET 119
Query: 209 PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
P+C R+C + + KHY +++Y I SD +IM EIYKNGPVE + V++DF YKS
Sbjct: 120 PRCSRRCEAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKS 179
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSD 292
GVY+H TG +GGHA+K++GWG +
Sbjct: 180 GVYQHKTGGSIGGHAIKILGWGEEN 204
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 103/227 (45%), Positives = 138/227 (60%), Gaps = 18/227 (7%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 146
V HD ++++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G +
Sbjct: 43 VDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQS 102
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-----FDS 194
LS DL++CC CGDGC GG+P AW Y+V G+VT EE C PY
Sbjct: 103 AELSALDLISCCKD-CGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHL 161
Query: 195 TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
T +P C Y TP+C + C K + + KHY Y + S+ + I EI GPV
Sbjct: 162 TKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPV 221
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW+
Sbjct: 222 EAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWL 267
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 129/211 (61%), Gaps = 12/211 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P +FD+R+ W +C +I I DQ CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 202
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + + KH+ +SAY + + I AEIY NGPVE +F+VYEDF YKSG
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH G +GGHA+K+IGWGT + G YW+
Sbjct: 263 VYKHTAGKYLGGHAIKIIGWGT-ESGSPYWL 292
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 154/288 (53%), Gaps = 27/288 (9%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
LD H L D I +NE WKA +N + ++ + K GV P L H
Sbjct: 21 LDLHPLSDEYIASINEKATT-WKAGKNFEVDDWERVK-KIAAGVLPRKAALRFVTQNNPH 78
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLS 150
D+S ++P+SFDAR WP+C ++ +I DQ CGSCWAFGAVEA+SDR CIH + + +S
Sbjct: 79 DESEEVPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVS 138
Query: 151 VNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--- 205
DL +CC F CG GCDGGY W Y+ G+VT Y S GC EP
Sbjct: 139 AEDLNSCCFGLFACGLGCDGGYVAEPWDYWRTDGIVTG--GAYNSSQGCKDYSLEPCEHH 196
Query: 206 -------------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
+ TP+CVR C + + + S + ++ + + EI KNGP+
Sbjct: 197 VEVGSRPQCSSLNFDTPECVRSCYESSLDYTESLTFGQQVSTFTNEKQ-MQLEILKNGPI 255
Query: 253 EVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGTSDDGEDYWV 299
E +FTVY DF YKSGVY+ D +GGHA+K++GWG ++G YW+
Sbjct: 256 EAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGV-EEGTKYWL 302
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 116/283 (40%), Positives = 160/283 (56%), Gaps = 24/283 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLL-LGVPVKTH-DKS 95
L D ++ VN A WKAA++ +F T+ + + +LG + + P +H D +
Sbjct: 26 LSDELVDYVNSQVDATWKAAKSERFK--TLEEIRSVLGTMREDQNVKEFRRPTISHEDIT 83
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVN 152
L+LP FDAR WP+C TI +I DQ CGSCWAF AV A+SDR CIH +N+ LS
Sbjct: 84 LELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSAT 143
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-------FDSTGCS 198
DLLACC CG GC GG+ AW Y+ +G+VT C PY + G
Sbjct: 144 DLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSE 202
Query: 199 HPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+P C E Y TP+CV +C K + + K + ++Y + I EI+ GPVE +
Sbjct: 203 YPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRSVTTIQKEIWMRGPVEATM 262
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DFA+Y GVYKH TG+++GGHA++L+GWG +DG YW+
Sbjct: 263 NVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWL 305
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 112/271 (41%), Positives = 148/271 (54%), Gaps = 21/271 (7%)
Query: 46 VNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFD 103
V+ A W A P+ + G + + P+ P +H+ +PK+FD
Sbjct: 28 VDSETGAKWIYAEPPE--TFRQGNLQLMFRAIREPEEQRSKRPTVSHESLGDENIPKTFD 85
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFL 161
AR WP C TI +I DQ CGSCWAFGAVEA+SDR CIH SLS DL++CCG+
Sbjct: 86 AREQWPHCPTIGQIRDQSSCGSCWAFGAVEAMSDRLCIHSNGTFTKSLSSIDLVSCCGY- 144
Query: 162 CGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPT 208
CG GC GGYP +AW ++ +G+VT + DP + CSH G + Y T
Sbjct: 145 CGFGCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHGSKKYPPCPHRIYDT 204
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
PKCV KC N + K + Y + IM EI NGPVE +F VYEDF YK G
Sbjct: 205 PKCVPKCDTPNIDYETDKTRANITYNVQRSQMAIMKEIMINGPVEAAFEVYEDFFGYKQG 264
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY H TG+ +GGHA++++GWG ++G YW+
Sbjct: 265 VYFHSTGEFIGGHAIRILGWG-EENGTPYWL 294
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 117/273 (42%), Positives = 152/273 (55%), Gaps = 39/273 (14%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK--------LPKSFDAR 105
WKA N +F+ Y+ LLGV K + H K+L +P+SFDAR
Sbjct: 33 WKAGMN-KFNLYSDTVKYGLLGVNNRKKSV-------EHKKNLSPIRHSNIFIPESFDAR 84
Query: 106 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCG 163
WP+C+++ I DQ CGSCWA AVEA+SDR CI + LS +DLL+CC CG
Sbjct: 85 KNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCG 143
Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYP 207
GC GG P++AW+Y+V G+VT Y + +GC P CE YP
Sbjct: 144 FGCFGGEPMAAWKYWVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYP 201
Query: 208 TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
TPKC ++C K + ++ K+Y AY + +D E I EI GPVE SF VY DF HY
Sbjct: 202 TPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYT 261
Query: 267 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SG+YKH+ G V GGHAVK++GWG D G YW+
Sbjct: 262 SGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWL 293
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 116/278 (41%), Positives = 154/278 (55%), Gaps = 27/278 (9%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK-THDKSL-KL 98
+II EVN AGW A N T+ + LG P K HD + +
Sbjct: 40 AIIDEVN-TANAGWTAGENFH-EQTTLEDVRSWLGAWSNKD---YDWPQKYPHDDLVGDI 94
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLA 156
P +FD+RS W CS I +I DQG CGSCWAFGA EA+SDR CI ++ + D+L+
Sbjct: 95 PATFDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVLS 154
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCE 203
CC CG+GC+GGYP++A YFV G+VT + C PY C H P C
Sbjct: 155 CC-LTCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPY-TLEACEHHVPGDRPPCT 212
Query: 204 PAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TPKC +C+ + +++ K + AY + +D I EI GPVE +FTVY D
Sbjct: 213 EGGGTPKCSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYSD 272
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F YKSGVY+H +G +GGHA+K+IGWGT + G+DYW+
Sbjct: 273 FPSYKSGVYRHTSGSELGGHAIKIIGWGT-EGGDDYWL 309
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 116/300 (38%), Positives = 157/300 (52%), Gaps = 30/300 (10%)
Query: 24 AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGL 83
A V + + +L D I+ V K W RN S T G + L+GV P
Sbjct: 10 AASVAALTSGEPSLLSDEFIEVVRSKAKT-WTVGRNFDAS-VTEGHIRRLMGVHPDAHKF 67
Query: 84 LLGVPVKTH-------DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
L P K + +LP+ FD+R WP C TI I DQG CGSCWAFGAVEA+S
Sbjct: 68 AL--PDKREVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMS 125
Query: 137 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EE 187
DR CIH G +N S +DL++CC CG GC+GG+P +AW Y+ G+V+ +
Sbjct: 126 DRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184
Query: 188 CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE 240
C PY + + C H P C TPKC C + + KH+ +Y + +
Sbjct: 185 CRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNVR 243
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
+I EI NGPVE +FTVYED YK GVY+H G +GGHA++++GWG ++ YW+
Sbjct: 244 EIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWL 303
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 161/310 (51%), Gaps = 31/310 (10%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL+L I++ V + + L D I+ V K W RN S+ T G +
Sbjct: 3 LLLLVAIAAS-----VAALTSGEPSFLSDEFIELVRSKAKT-WTVGRNFD-SSVTEGYIR 55
Query: 72 HLLGVKPTPKGLLLGVPVKT-----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
L+GV P L + + ++P+ FD+R WP C TI I DQG CGSC
Sbjct: 56 RLMGVHPDAHKFALADKREVLGDLYMNTVDQIPEEFDSRKQWPNCPTIGEIRDQGECGSC 115
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y+ G+V
Sbjct: 116 WAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIV 174
Query: 185 T-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 230
+ + C PY + C H P C TPKC C + + KH+
Sbjct: 175 SGGPYGSNQGCRPY-EIAPCEHHVNGTRPPCGHGGGTPKCSHVCESGYTVDYAKDKHFGS 233
Query: 231 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
+Y + + DI EI NGPVE +FTVYED YK GVY+H G +GGHA++++GWG
Sbjct: 234 KSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRILGWGV 293
Query: 291 -SDDGEDYWV 299
++ YW+
Sbjct: 294 WGEEKIPYWL 303
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 116/283 (40%), Positives = 160/283 (56%), Gaps = 24/283 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLL-LGVPVKTH-DKS 95
L D ++ VN A WKAA++ +F T+ + + +LG + + P +H D +
Sbjct: 26 LSDELVDYVNSQVDATWKAAKSERFK--TLEEIRSVLGTMREDQNVKEFRRPTISHEDIT 83
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVN 152
L+LP FDAR WP+C TI +I DQ CGSCWAF AV A+SDR CIH +N+ LS
Sbjct: 84 LELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSAT 143
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-------FDSTGCS 198
DLLACC CG GC GG+ AW Y+ +G+VT C PY + G
Sbjct: 144 DLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSE 202
Query: 199 HPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+P C E Y TP+CV +C K + + K + ++Y + I EI+ GPVE +
Sbjct: 203 YPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRSVTAIQKEIWMRGPVEATM 262
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DFA+Y GVYKH TG+++GGHA++L+GWG +DG YW+
Sbjct: 263 NVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWL 305
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 104/224 (46%), Positives = 131/224 (58%), Gaps = 20/224 (8%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
++ K+P SFDAR WP C +IS I DQ CGSCWAFG+ EA+SDR CI H + LS
Sbjct: 89 EEGFKIPDSFDARVQWPHCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELS 148
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE 203
+D+L+CC + CGDGCDGGYPISAW YFV GVVT + C PY + C H E
Sbjct: 149 ADDILSCC-YDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPY-EIPPCGHHRNE 206
Query: 204 PAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
Y TP CV C + + + K + +Y I S I EI GPV +
Sbjct: 207 TFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAA 266
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF HY G+YKH++G GGHAV+++GWG + G YW+
Sbjct: 267 FIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWG-EEKGTAYWL 309
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 114/291 (39%), Positives = 163/291 (56%), Gaps = 25/291 (8%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
+K+ ++ L D + + + + WKA N +F+ Y+ LLGV + +
Sbjct: 54 TKIAPEAENLSDQELIDYVNSHQTLWKAEMN-KFNLYSNTVKYGLLGVNNMKQSVDGKKN 112
Query: 89 VK-THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--M 145
+ T ++ +P+SFDAR WP+C+++ + DQ CGSCWA AVEA+SDR CI
Sbjct: 113 LSPTRHSTIFIPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKK 172
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 202
++LS +DLL+CC CG GC GG P++AW+Y+V G+VT Y + +GC P C
Sbjct: 173 QVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLRGIVTG--SEYTNHSGCRPYPFPPC 229
Query: 203 E-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
E YPTPKCV+KC K + ++ K+Y Y + S+ E I EI
Sbjct: 230 EHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESNVESIQKEIMT 289
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GPVE SF VY DF +Y G+YKH+ G + GGHAVK++GWG D G YW+
Sbjct: 290 LGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGI-DQGVPYWL 339
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 116/281 (41%), Positives = 155/281 (55%), Gaps = 25/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-KPTPKGL-LLGVPVKTHDKS 95
D +I +NE A WKAA + +F+N + Q K LGV + TP+ V+
Sbjct: 3 FSDELIHYINEESGASWKAAPSTRFNN--IDQVKQNLGVLEETPEDRNTQRQTVRYSVSE 60
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+SFDAR WP C +IS I DQ C SCWA + A++DR CIH LS D
Sbjct: 61 NDLPESFDARQKWPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAID 120
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGC 202
+++CC + CG GC+GG P +W Y+ GVVT C PY CSH PG
Sbjct: 121 IVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGL 178
Query: 203 EPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P YPTPKC +KC N+ + K S+Y + DIM EI KNGPV+ F
Sbjct: 179 PPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFY 238
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
++EDF YKSG+Y + TG ++GGHA+++IGWG ++G +YW
Sbjct: 239 MFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVNYW 278
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 117/282 (41%), Positives = 156/282 (55%), Gaps = 27/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGVKPTPKGLLLGVPVKTHDKSL 96
D +I+ VNE A WKAAR+ +F+N + QFK HL ++ TP+ P + S
Sbjct: 26 FSDELIRYVNEESGASWKAARSTRFNN--IEQFKKHLGALEETPEERNTRRPTVRYSVSE 83
Query: 97 K-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+SFDAR WP CS+IS I DQ C SCWA G A++DR CIH LS D
Sbjct: 84 NDLPESFDAREKWPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKKPRLSAVD 143
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGC 202
L++CC + CG GC+GGYP AW Y+ HG+V+ C PY CSH PG
Sbjct: 144 LVSCCPY-CGYGCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPY-PFPKCSHLEETPGL 201
Query: 203 EPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
P Y TPKC ++C ++ K S+Y + DIM EI NGPV +
Sbjct: 202 APCPRELYATPKCEKQCQAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPVSTIYY 261
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
++EDF YKSG+Y++ +G +MGGH + IGWG ++G YW+
Sbjct: 262 IFEDFTVYKSGIYQYTSGSLMGGHGI--IGWGV-ENGVKYWL 300
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 105/219 (47%), Positives = 131/219 (59%), Gaps = 19/219 (8%)
Query: 98 LPKSFDARSAW-PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDL 154
LP S+D R W C + + I DQG CGSCWAFGAVEA +DR CI N +S DL
Sbjct: 77 LPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEAFTDRICIQSNGAKNPHISAEDL 136
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPG 201
L CCGF CG GC+GG AW +F + G VT E C PY ++G P
Sbjct: 137 LTCCGFWCGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP- 195
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
CE + PTPKC R C + N + + KH S Y I +D E I EIY NGPVE +FTVY
Sbjct: 196 CEGSEPTPKCKRSCREGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLNGPVEAAFTVYS 255
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF +YKSGVYK+ TG+ +GGHA+K++GWG ++ YW+
Sbjct: 256 DFPNYKSGVYKYTTGNALGGHAIKILGWGVENN-VPYWL 293
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 154/303 (50%), Gaps = 19/303 (6%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL S+ A G + D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLL-----STALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + + L G L V +LP+SFD+ WP C TI I DQ CG
Sbjct: 57 ITFAEARRLTGAFRRKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACG 116
Query: 125 SCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
SCWA A+SDR C G+ L +S LL+CC CGDGCDGGYP +AWRY+V HG+
Sbjct: 117 SCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCK-DCGDGCDGGYPDAAWRYYVSHGL 175
Query: 184 VTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRI 235
+ C PY C H G + P TPKC C K ++ +Y +
Sbjct: 176 ASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAIPL--IEYRGNDSYVL 232
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
+D E+Y NGP V+F V+ DF YK+GVY+H++GD +GGHAV+++GWG +G
Sbjct: 233 LHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGT 291
Query: 296 DYW 298
YW
Sbjct: 292 PYW 294
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 167/311 (53%), Gaps = 24/311 (7%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+ T L I+ +S ++ + ++ L D +I +N++P AGW A+R+ +F +
Sbjct: 1 MMNTVLCIISFMS--ILTAHILPENEIQFEPLSDEMIAYINQHPDAGWTASRSDRFKSLK 58
Query: 67 VGQFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ LLG + L V D SL++P SFD+R WPQC +IS I DQ CG
Sbjct: 59 DARI--LLGAMREDEELRKKRRPTVDHQDVSLEIPTSFDSRKEWPQCKSISNIRDQSRCG 116
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
+ WAF AV+A+SDR CI ++ LS DLL+CC CG GC G+P AW Y+V G
Sbjct: 117 AGWAFAAVQAMSDRICIESKGKKSVELSAVDLLSCC-IECGLGCQMGFPGIAWDYWVQEG 175
Query: 183 VVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHY 228
+VT C PY T +P C E Y PKC +KC K + + K+Y
Sbjct: 176 IVTGGSKENHTGCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYEKDKYY 235
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 288
+Y + + + I EI +GPVE SF V+ DF +YKSG+YKH+TG +G H V++IGW
Sbjct: 236 GKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGW 295
Query: 289 GTSDDGEDYWV 299
G + YW+
Sbjct: 296 GVEKE-TPYWL 305
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/232 (44%), Positives = 134/232 (57%), Gaps = 22/232 (9%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
+K + + +P S+D R WPQC +++ I DQ HCGSCWA A EA+SDR CI + +N
Sbjct: 64 IKLAETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVN 123
Query: 147 LSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS--- 194
LS D+L CC F CGDGC+GGYPI AWRY+V +G+VT C PY +
Sbjct: 124 TLLSAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCG 183
Query: 195 ---TGCSHPGCEPAYP-TPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 247
G + P C TPKC C N + KH+ SAY I + I EI
Sbjct: 184 ETIDGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEIL 243
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+GPVEV F VYEDF YK+G+Y H+ G +GGHAVK++GWG D+G YW+
Sbjct: 244 AHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGV-DNGTPYWL 294
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 114/281 (40%), Positives = 157/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLK 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSRCGSSWAVSAVGAISDRICIQSGGKQSVELSAID 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 200
L++CC CG GCDGG+P AW Y+V HG+VT C PY C H P
Sbjct: 148 LISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYP 205
Query: 201 GC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C + Y TP+C RKC K + + KHY A + + I EI GPVE +
Sbjct: 206 SCGDKMYKTPQCKRKCQKGYTTPYEHDKHYGGIAINVIKNELAIQKEIMMYGPVEAYLLI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+EDF +YKSG+YK+ TG +G H V++IGWG ++G YW+
Sbjct: 266 FEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGI-ENGTAYWL 305
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 157/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSL 96
L D +I +NE+P AGWKA ++ +F + +F L G K P P V HD ++
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRFHSVDDARFL-LGGRKEDPNLREKRRPTVDHHDLNV 88
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS DL
Sbjct: 89 EIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDL 148
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------- 203
++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 149 ISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYR 205
Query: 204 ----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
Y TP+C + C K N + KHY +Y + S I +I +GPVE +
Sbjct: 206 ACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 266 YEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 110/264 (41%), Positives = 151/264 (57%), Gaps = 25/264 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQC 111
W A RN F +T F H+ ++ + + V THD L LP+ FD R WP+C
Sbjct: 1 WSAGRN--FPTHT--SFAHIKILREHERRYYMEVAYVTHDVELIATLPEIFDPRDKWPEC 56
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDGG 169
T++ I DQG CGSCWAFGAVEA++DR CI+ + S DL++CC +CG GC+GG
Sbjct: 57 LTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGG 115
Query: 170 YPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCV 216
P AW Y+ H G+V+ + C PY + C H PG C TPKC + C
Sbjct: 116 MPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCE 174
Query: 217 KK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 275
N ++ K Y Y ++ + I AE++KNGPVE +FTVY D YK+GVYKH G
Sbjct: 175 SSYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEG 234
Query: 276 DVMGGHAVKLIGWGTSDDGEDYWV 299
+ +GGHA+K+IGWG ++ + YW+
Sbjct: 235 NALGGHAIKIIGWGVENNNK-YWL 257
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 133/217 (61%), Gaps = 20/217 (9%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D LP++FDAR WP C TI + DQG CGSCWAFGAVEA+SDR CIH N S
Sbjct: 23 DAPTDLPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFS 82
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------------- 197
+L++CC + CG GC+GG+P +AW Y+ G+V+ PY + GC
Sbjct: 83 AENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEVAPCEHHVN 139
Query: 198 -SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
+ C+ TPKCV+KC ++ + H+ SAY +++D + I EIY NGPVE +
Sbjct: 140 GTRGPCKEGGKTPKCVKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGA 199
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
FTVYEDF Y++GVYKH+ G +GGHA++++GWG +
Sbjct: 200 FTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQN 236
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/302 (38%), Positives = 163/302 (53%), Gaps = 28/302 (9%)
Query: 14 ILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 73
+ ++ + A V L+ + L D I +N + WKA RN N + K L
Sbjct: 8 FVALVCALALASANVEDLQ---NPLTDEFINLINSKQNS-WKAGRNFPV-NTPLTHIKKL 62
Query: 74 LGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
GV L +P HD L LP++FD R WP C T++ + DQG CGSCWAFGA
Sbjct: 63 TGVLVDTH--LSKLPKAEHDMDLIASLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGA 120
Query: 132 VEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---- 185
VEA++DR+C + + S DLL+CC +CG GC+GG P AW Y+ H G+V+
Sbjct: 121 VEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSY 179
Query: 186 ---EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 235
+ C PY + C H PG C TPKC + C + + K Y Y +
Sbjct: 180 NSGQGCRPY-EIPPCEHHVPGNRVPCNGDSKTPKCHKTCEASYSVDYHKDKRYGKHVYSV 238
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
+S + I AE++KNGPVE +FTVY D +YK+GVYKH G+ +GGHA+K++GWG ++G
Sbjct: 239 SSKEDHIKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGN 297
Query: 296 DY 297
Y
Sbjct: 298 KY 299
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 153/312 (49%), Gaps = 38/312 (12%)
Query: 12 LLILGVI---SSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVG 68
+LILG + S+ F G +S I+ EVN NP + WKAAR P F T
Sbjct: 8 ILILGCLFSTSANCFKFGEMSPF----------IVFEVNSNPNSTWKAARYPHFEKMTRE 57
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLK---LPKSFDARSAWPQCSTISRILDQGHCGS 125
Q LG P + L P K D + +P+ FDAR WP C +I I DQ CGS
Sbjct: 58 QLLGHLGSLDEPDWVKL--PTKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGS 115
Query: 126 CWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CWAF A E SDR CI L S+S DLL CC CG GC GGYP +AW Y GV
Sbjct: 116 CWAFAATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCKGGYPSAAWGYMKRQGV 175
Query: 184 VT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY 228
T C PY TG P C P PTP+CV++C + + H+
Sbjct: 176 STGGLYGDDTSCKPYIFPPCDHHVTGQYQP-CGPIQPTPQCVKECNSEYTQNTYEKDLHF 234
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIG 287
+ Y I + + I EI +GPV+ SF V DF YKSGVY ++ GGH+VK+IG
Sbjct: 235 ASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIG 294
Query: 288 WGTSDDGEDYWV 299
WG + YW+
Sbjct: 295 WG-KEGNTPYWL 305
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 157/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSL 96
L D +I +NE+P AGWKA ++ +F + +F L G K P P V HD ++
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRFHSVDDARFL-LGGRKEDPNLREKRRPTVDHHDLNV 88
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS DL
Sbjct: 89 EIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDL 148
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------- 203
++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 149 ISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYR 205
Query: 204 ----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
Y TP+C + C K N + KHY +Y + S I +I +GPVE +
Sbjct: 206 ACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 266 YEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 111/303 (36%), Positives = 156/303 (51%), Gaps = 20/303 (6%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL S+ A G + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLL-----STALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + + L G + L V +LP+SFD+ WP C TI I DQ CG
Sbjct: 57 ITFAEARRLTGARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACG 116
Query: 125 SCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
SCWA A+SDR C G+ L +S L++CC CGDGCDGGYP ++W Y+V HG+
Sbjct: 117 SCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCE-DCGDGCDGGYPGTSWEYYVSHGL 175
Query: 184 VTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRI 235
+ C PY C H G + P TPKC C K K+ +Y +
Sbjct: 176 ASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEV 232
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
+ + +D E+Y NGP V F VY DF YK+GVY+H++GD +GGHAV+++GWG +G
Sbjct: 233 HGE-DDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGT 290
Query: 296 DYW 298
YW
Sbjct: 291 PYW 293
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 166/304 (54%), Gaps = 27/304 (8%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S T E V ++ L D +I +NE+P AGWKA ++ +F ++V + LLG
Sbjct: 8 IVSLSTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ L V HD ++++P FD+R WP+C +IS+I DQ CGS WA AV
Sbjct: 66 GRREDPNLREKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVG 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y+V G+VT
Sbjct: 126 AMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE- 183
Query: 192 FDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 235
+ TGC P C+ Y TP+C + C K N + KHY +Y +
Sbjct: 184 -NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNV 242
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
S I +I +GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G
Sbjct: 243 LSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGT 301
Query: 296 DYWV 299
YW+
Sbjct: 302 AYWL 305
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 103/214 (48%), Positives = 134/214 (62%), Gaps = 19/214 (8%)
Query: 103 DARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGF 160
D+R WP C +IS I DQG CGSCWAFGAVEA+SDR CIH + + +S DLL+CC
Sbjct: 1 DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCS- 59
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYP 207
CG GCDGG+P SAW ++V G+ T C PY + C H P C
Sbjct: 60 SCGMGCDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPY-EIPACEHHTTGDRPPCSDIVD 118
Query: 208 TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
TPKCV C K N +R+ KH+ +Y I S + I EI+KNGPVE +F+VY DF +YK
Sbjct: 119 TPKCVHLCEKGYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYK 178
Query: 267 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
SGVY+H +G+ +GGHA++++GWG +D YW+C
Sbjct: 179 SGVYQHHSGESLGGHAIRVLGWGYEND-VPYWLC 211
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/283 (39%), Positives = 152/283 (53%), Gaps = 32/283 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
L D+ I +N WKA RN F + + + LLGV + +K +
Sbjct: 27 LSDAEIFYINHVANTTWKAGRN--FHPAEIKRARALLGVNMAENKAYNRIHLKYKQVQPR 84
Query: 98 --LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 155
LP +FD R+ WP C++++ I DQ +CGSCWAFG+ EA++DR CI N+ +S D+
Sbjct: 85 NDLPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGSAEAMTDRICIAGKGNIHISAEDIN 144
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGC 202
CC CG GC+GGYP +AW ++V GVV+ E C PY +TG P C
Sbjct: 145 DCCK-SCGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQP-C 202
Query: 203 EPAYPTPKCVRKCVK------KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
PTPKC +KC+ N R K Y + + IM E+ NGPV +F
Sbjct: 203 PAVVPTPKCEKKCLTGYPKSYSNDKTRGKKSYGVRGV------QSIMQELVDNGPVTAAF 256
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DF YK+GVY+H TG GGHAVK+IG+GT + G+DYW+
Sbjct: 257 DVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGT-ESGQDYWL 298
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 165/304 (54%), Gaps = 27/304 (8%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S T E V ++ L D +I +NE+P AGWKA ++ +F ++V + LLG
Sbjct: 8 IVSLSTLLEAHVTTRNNQRIEPLSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ L V HD +++P FD+R WP+C +IS+I DQ CGS WA AV
Sbjct: 66 GRREDPNLREKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVG 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y+V G+VT
Sbjct: 126 AMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE- 183
Query: 192 FDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 235
+ TGC P C+ Y TP+C + C K N + KHY +Y +
Sbjct: 184 -NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNV 242
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
S I +I +GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G
Sbjct: 243 LSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGT 301
Query: 296 DYWV 299
YW+
Sbjct: 302 AYWL 305
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 169/305 (55%), Gaps = 29/305 (9%)
Query: 17 VISSQTFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 74
++S T E V+K + + I L D +I +N++P AGWKA ++ +F ++V + LL
Sbjct: 8 IVSLFTLLEAHVTK-RNNQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSVDDARILL 64
Query: 75 GVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
G + L V HD ++++P FD+R WP+C +IS+I DQ C S WA +V
Sbjct: 65 GGRKEDSNLRQKRRPTVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSSV 124
Query: 133 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR CI G ++ LS DL++CC CG GCDGGY + +W Y+V HG+VT
Sbjct: 125 GAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGYFLPSWDYWVSHGIVTGGSKE 183
Query: 191 YFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYR 234
+ TGC P C+ Y TP+C + C K N + KHY +Y
Sbjct: 184 --NHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYN 241
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
+ S I +I +GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G
Sbjct: 242 VLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENG 300
Query: 295 EDYWV 299
YW+
Sbjct: 301 TAYWL 305
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/228 (43%), Positives = 133/228 (58%), Gaps = 20/228 (8%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMN 146
V D LP++FDAR WP C TI + DQG CGSCWAFGAVEA+SDR CIH N
Sbjct: 15 VSYTDTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKN 74
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------- 197
S +L++CC + CG GC+GG+P +AW Y+ G+V+ PY GC
Sbjct: 75 FHFSAENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAPCE 131
Query: 198 -----SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
+ C+ TP CV+KC ++ + H SAY + +D + I EIY NGP
Sbjct: 132 HHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGP 191
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW+
Sbjct: 192 VEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWL 239
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 136/216 (62%), Gaps = 18/216 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LP +FDAR WP C ++ I +Q CGSCWAFGA E +SDR CI +S D+L
Sbjct: 95 LPDTFDARDQWPDCKSLKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDIL 154
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCEPA----YPT 208
+CCG CG GC GGY I A +Y+++ GVVT ++ GC S P C+ + + T
Sbjct: 155 SCCGSTCGKGCQGGYTIEAMKYWMNSGVVT---GGDYNGAGCMPYSFPPCKKSPCVEFST 211
Query: 209 PKCVRKCVKKNQL--WRNSKHYSISAYRINSDPE---DIMAEIYKNGPVEVSFTVYEDFA 263
P C C +K ++N KH++ SAY++++ I EIY NGPVE S+ V+EDF
Sbjct: 212 PSCKTTCQEKYTTADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPVEASYRVFEDFY 271
Query: 264 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVY H++G+++GGHAVK+IGWGT ++G DYW+
Sbjct: 272 QYKSGVYHHVSGNLVGGHAVKIIGWGT-ENGVDYWL 306
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 157/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSL 96
L D +I +NE+P AGWKA ++ +F + +F L G K P P V HD ++
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRFHSVDDARFL-LGGRKEDPNLREKRRPTVDHHDLNV 88
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS DL
Sbjct: 89 EIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDL 148
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------- 203
++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 149 ISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYR 205
Query: 204 ----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
Y TP+C + C K N + KHY +Y + S I +I +GPVE +
Sbjct: 206 ACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 266 YEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/231 (45%), Positives = 136/231 (58%), Gaps = 16/231 (6%)
Query: 78 PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSD 137
P P + V T ++ P++FDAR+ WP+C +I I +Q +CGSCWAFGA E +SD
Sbjct: 69 PPPSDEIRATEVNTVLATI--PETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISD 126
Query: 138 RFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECD 189
R CI +S D++ CCG CG GCDGGY I A R++V GVVT + C
Sbjct: 127 RICIATKGARQPVISPMDMVDCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQGDGCK 186
Query: 190 PYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
PY C+ GC P TP+C C K N + K++ SAY + I +I
Sbjct: 187 PY---QFCNSAGC-PDAVTPECALSCQSKYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMT 242
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPVE SF VYEDF YKSGVYK+I G ++GGHA+K+IGWGT ++G YW+
Sbjct: 243 NGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHAIKIIGWGT-ENGTAYWL 292
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 158/281 (56%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 200
L++CC + CG GCDGG+ +W Y+V G+VT C PY C H
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPY-PFPKCDHFVKGKYR 205
Query: 201 GC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C + Y TP+C + C K N + KHY +Y + S I +I +GPVE +
Sbjct: 206 ACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 266 YEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 113/278 (40%), Positives = 154/278 (55%), Gaps = 22/278 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDKSL 96
L D I +N + + W+A RN F+ T ++ K L GV +P + +
Sbjct: 24 LSDEFIDYIN-SLQTTWRAGRN--FAPNTPKKYLKSLAGVHKDANNAFT-LPKRQVSVDV 79
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
+P FDAR WP CS+I+ I DQG CGSCWAFGAVEA+SDR CIH + + LS +L
Sbjct: 80 TVPDEFDARKHWPNCSSITEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENL 139
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHPGC 202
L+CC CG GC GG +AW Y+ G+V+ + C PY S S P C
Sbjct: 140 LSCCDS-CGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIPGSRPAC 198
Query: 203 EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
E TPKC ++C K + + + Y Y I +D + I AEI KNGP+ S VYED
Sbjct: 199 EGVRDTPKCKKQCEKGYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPIVASILVYED 258
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YK+GVY+H+ G+V+GGH +K++GWG +D YW+
Sbjct: 259 LFSYKAGVYQHVAGEVLGGHVIKILGWGVEND-TPYWL 295
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 112/276 (40%), Positives = 149/276 (53%), Gaps = 16/276 (5%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 91
D+ +L + + VN + WKA + N T+ + K L GV K +L
Sbjct: 28 DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 87
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ LP SFD+ AWP C TI +I DQ CGSCWA A A+SDRFC G+ ++ +S
Sbjct: 88 EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 147
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 207
DLLACC CGDGC+GG P AW YF G+V++ C PY H + YP
Sbjct: 148 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 206
Query: 208 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TPKC C N + S ++Y + + +D M E++ GP EV+F VYEDF
Sbjct: 207 QFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 263
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
Y SGVY H++G +GGHAV+L+GWGTS +G YW
Sbjct: 264 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYW 298
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 112/276 (40%), Positives = 149/276 (53%), Gaps = 16/276 (5%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 91
D+ +L + + VN + WKA + N T+ + K L GV K +L
Sbjct: 28 DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 87
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ LP SFD+ AWP C TI +I DQ CGSCWA A A+SDRFC G+ ++ +S
Sbjct: 88 EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 147
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 207
DLLACC CGDGC+GG P AW YF G+V++ C PY H + YP
Sbjct: 148 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 206
Query: 208 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TPKC C N + S ++Y + + +D M E++ GP EV+F VYEDF
Sbjct: 207 QFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 263
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
Y SGVY H++G +GGHAV+L+GWGTS +G YW
Sbjct: 264 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYW 298
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 111/282 (39%), Positives = 158/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 157/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSL 96
L D +I +NE+P AGWKA ++ +F + +F L G K P P V HD ++
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRFHSVDDARFL-LGGRKEDPNLREKRRPTVDHHDLNV 88
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS DL
Sbjct: 89 EIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDL 148
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------- 203
++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 149 ISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYR 205
Query: 204 ----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
Y TP+C + C K N + KHY +Y + S I +I +GPVE +
Sbjct: 206 ACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 266 YEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 112/276 (40%), Positives = 149/276 (53%), Gaps = 16/276 (5%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 91
D+ +L + + VN + WKA + N T+ + K L GV K +L
Sbjct: 5 DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 64
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ LP SFD+ AWP C TI +I DQ CGSCWA A A+SDRFC G+ ++ +S
Sbjct: 65 EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 124
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 207
DLLACC CGDGC+GG P AW YF G+V++ C PY H + YP
Sbjct: 125 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 183
Query: 208 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TPKC C N + S ++Y + + +D M E++ GP EV+F VYEDF
Sbjct: 184 QFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 240
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
Y SGVY H++G +GGHAV+L+GWGTS +G YW
Sbjct: 241 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYW 275
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 169/305 (55%), Gaps = 29/305 (9%)
Query: 17 VISSQTFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLL 74
++S T E V+K +++ I L D +I +N++P AGWKA ++ +F ++V + LL
Sbjct: 8 IVSLFTLLEAHVTK-RINQRIEPLSDEMISFINKHPNAGWKADKSDRF--HSVDDARILL 64
Query: 75 GVKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
G + L V HD +++P FD+R WP+C +IS+I DQ CGS WA AV
Sbjct: 65 GGRKEDPNLRQKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124
Query: 133 EALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y+V G+VT
Sbjct: 125 GAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE 183
Query: 191 YFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYR 234
+ TGC P C+ Y TP+C + C K N + KHY +Y
Sbjct: 184 --NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYN 241
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
+ S I +I +GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G
Sbjct: 242 VLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENG 300
Query: 295 EDYWV 299
YW+
Sbjct: 301 TAYWL 305
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/271 (39%), Positives = 149/271 (54%), Gaps = 21/271 (7%)
Query: 46 VNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFD 103
V+ A W A P+ + G F+ + G P+ P +H+ +PK+FD
Sbjct: 28 VDSKSGARWIYAEPPE--RFQPGNFQLMFGALREPEEQRSKRPTVSHESFSDEHIPKAFD 85
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFL 161
AR WP C TI I DQ CGSCWAFGAVEA+SDR CIH + +S DL++CCG+
Sbjct: 86 ARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY- 144
Query: 162 CGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPT 208
CG GC GG+P +AW ++ G+VT + +P + CSH G + Y T
Sbjct: 145 CGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDT 204
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
P CV+KC + + K + Y + + IM EI NGPVE +F VYEDF YKSG
Sbjct: 205 PNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG 264
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY H G ++GGHA++++GWG ++G YW+
Sbjct: 265 VYFHSDGTLLGGHAIRILGWG-EENGVAYWL 294
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 112/276 (40%), Positives = 149/276 (53%), Gaps = 16/276 (5%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKT 91
D+ +L + + VN + WKA + N T+ + K L GV K +L
Sbjct: 6 DAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTE 65
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ LP SFD+ AWP C TI +I DQ CGSCWA A A+SDRFC G+ ++ +S
Sbjct: 66 EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHIS 125
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--- 207
DLLACC CGDGC+GG P AW YF G+V++ C PY H + YP
Sbjct: 126 AGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCS 184
Query: 208 -----TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TPKC C N + S ++Y + + +D M E++ GP EV+F VYEDF
Sbjct: 185 QFNFDTPKCDYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDF 241
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
Y SGVY H++G +GGHAV+L+GWGTS +G YW
Sbjct: 242 IAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYW 276
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 166/304 (54%), Gaps = 27/304 (8%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S T E V ++ L D +I +NE+P AGWKA ++ +F ++V + LLG
Sbjct: 8 IVSLSTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ L + HD ++++P FD+R WP+C +IS+I DQ CGS WA AV
Sbjct: 66 GRREDPNLREKRRPTIDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVG 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y+V G+VT
Sbjct: 126 AMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE- 183
Query: 192 FDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 235
+ TGC P C+ Y TP+C + C K N + KHY +Y +
Sbjct: 184 -NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNV 242
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
S I +I +GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G
Sbjct: 243 LSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGT 301
Query: 296 DYWV 299
YW+
Sbjct: 302 AYWL 305
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 114/279 (40%), Positives = 149/279 (53%), Gaps = 26/279 (9%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ IL D ++ VN W A R + T LLG +L P + +
Sbjct: 28 DAPILTDEFLELVNRLNGGKWTAGRTSRTKYLTRRGASRLLGTFLRNTSIL--PPRQFSE 85
Query: 94 KSLKLP--KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ L++P FDA AWP+C TI+ I DQ CGSCWA A A+SDR+C G+ +L +S
Sbjct: 86 EELRVPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGVRDLRIS 145
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGC 202
DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H C
Sbjct: 146 AGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPC 202
Query: 203 EPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
Y TP C C K +R + Y I S E E+ NGP EVSF+VY
Sbjct: 203 SGEYDTPTCNSTCTDKKIPLIKYRGNTSY------ILSGEESFKRELLLNGPFEVSFSVY 256
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
DF Y GVYKH+TG +GGHAV+++GWG +GE YW
Sbjct: 257 ADFVAYTGGVYKHVTGVFLGGHAVRIVGWGEL-NGEPYW 294
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 159/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V ++LLG + L V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARNLLGGRREDPNLRQKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 116/279 (41%), Positives = 164/279 (58%), Gaps = 20/279 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL- 154
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S ++ L
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-FDS----TGCSHPGC 202
L C CG GC GG+P AW Y+V G+VT EE C PY F T +P C
Sbjct: 148 LISCCKDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPAC 207
Query: 203 -EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
Y TP+C + C K + + KHY Y + S+ + I EI GPVE +F VYE
Sbjct: 208 GTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYE 267
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF +YKSG+Y+H+TG ++GGHA+++IGWG + G+ YW+
Sbjct: 268 DFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGKPYWL 305
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 122/308 (39%), Positives = 169/308 (54%), Gaps = 39/308 (12%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
L+LG S ++A + L SH + + I K WKA N F N K
Sbjct: 6 FLVLGSGLSISWARPHLPPL---SHEMVNFINKA-----NTTWKAGHN--FHNVDYSYVK 55
Query: 72 HLLGVKPTPKGLLLGVPVKT---HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
L G LL G + T + + ++LPK+FD R WP C T+ + DQG CGSCWA
Sbjct: 56 RLCGT------LLKGPKLSTMVQYTEDMELPKNFDPRLQWPNCPTLKEVRDQGSCGSCWA 109
Query: 129 FGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
FGA EA+SDR CIH +S+ ++ DLL+CC CG GC+GGYP +A ++ G+V+
Sbjct: 110 FGAAEAISDRVCIHSNAKVSVEISSEDLLSCCES-CGMGCNGGYPSAACDFWTKEGLVSG 168
Query: 187 E-------CDPYFDSTGCSH------PGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSIS 231
C PY C H P C+ TP+C +C ++ KH+
Sbjct: 169 GLYDSHIGCRPY-SIPPCEHHVNGTRPPCKGEEGDTPQCTNQCEPGYTPGYKQDKHFGKR 227
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y + SD ++IM E+YKNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG
Sbjct: 228 SYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVLGWG-E 286
Query: 292 DDGEDYWV 299
+ G YW+
Sbjct: 287 EGGIPYWL 294
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 111/282 (39%), Positives = 158/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 156/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSL 96
L D +I +NE+P AGWKA ++ +F + +F L G K P P V HD ++
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRFHSVDDARFL-LGGRKEDPNLREKRRPTVDHHDLNV 88
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS DL
Sbjct: 89 EIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDL 148
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------- 203
++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 149 ISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYR 205
Query: 204 ----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
Y TP+C + C K N + KHY +Y + S I +I +GP E +
Sbjct: 206 ACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 266 YEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 120/283 (42%), Positives = 164/283 (57%), Gaps = 30/283 (10%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-- 93
H L D ++ +N+ W+A N F N + + L G LG P H
Sbjct: 24 HPLSDDLVNYINKQ-NTTWQAGHN--FRNADMSYVRKLCGT-------FLGGPKLPHRIK 73
Query: 94 --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP+SFDAR W C TI I DQG CGSCWAFGAVE++SDR CIH +N+ +
Sbjct: 74 FAEDMNLPESFDAREQWSSCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 197
S D+L CCG CG+GC+GGYP +AW ++ G+V+ C PY
Sbjct: 134 SAEDMLTCCGGQCGEGCNGGYPSAAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNG 193
Query: 198 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
S P C TPKC + C + ++ KHY S+Y + ++IMAEIYKNGPVE +F
Sbjct: 194 SRPPCTGEGDTPKCSKSCEPGYSSSYKEDKHYGYSSYSVPGIEKEIMAEIYKNGPVEGAF 253
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+VY DF YKSGVY+H+TG++MGGHA++++GWGT ++G YW+
Sbjct: 254 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGT-ENGTPYWL 295
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 113/296 (38%), Positives = 155/296 (52%), Gaps = 29/296 (9%)
Query: 26 GVVSKLKLDSH--ILQDSIIKEVNENPKAGWKAARNPQFSNY----TVGQFKHLLGVKPT 79
G + D H ++ + N WKA F N + K L G P
Sbjct: 11 GAAWSYRFDFHDDYFSEAFVNYHNSRDDVSWKATTE-NFKNVPYKGRMDYVKSLCGANPA 69
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
P + PVK + LP +FDAR+ WP C ++ + DQG CGSCWAFG VEA +DR
Sbjct: 70 PPEMKF--PVKEIEVPKDLPDTFDARTQWPDCPSLKEVRDQGACGSCWAFGCVEAATDRL 127
Query: 140 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDP 190
CI +N LS DL +CC CG+GC+GG+ AW Y G+VT + C P
Sbjct: 128 CIQSKGIVNAHLSAEDLTSCC-RTCGNGCNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLP 186
Query: 191 YFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIM 243
Y + C H C+ PTP+C ++C N + +H++ + + + E IM
Sbjct: 187 Y-EIKACDHHVVGKLQPCKGDGPTPRCKKECESGYNNTYSKDEHHAKTVHAVEG-VEQIM 244
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI NGPVE +FTVY DF YKSGVY+H +G +GGHA+K +GWG ++DG+DYW+
Sbjct: 245 TEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLGWG-NEDGKDYWL 299
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 190 bits (483), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 125/212 (58%), Gaps = 13/212 (6%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LP +FD+R WP+C +I I +Q CGSCWAFGA E +SDR CI + +SV D+L
Sbjct: 97 LPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDIL 156
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG GC GGY I A R++ G VT C PY C C TP
Sbjct: 157 SCCGVSCGKGCQGGYSIEALRFWKSSGAVTGGDYNGAGCMPY-SFAPCKKDSCAQG-TTP 214
Query: 210 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
C C K + KH+ +AY+I + I EIY NGPVE SF VYEDF YKS
Sbjct: 215 SCKTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKS 274
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY++ +G ++GGHAVK+IGWGT ++G DYW+
Sbjct: 275 GVYQYTSGKLVGGHAVKIIGWGT-ENGVDYWL 305
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 190 bits (483), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 111/265 (41%), Positives = 153/265 (57%), Gaps = 26/265 (9%)
Query: 54 WKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQ 110
W+A RN F +T K L+G +L +P THD L LP++FD R WP
Sbjct: 1 WRAGRN--FPIHTPFAHIKKLMGSLKDDN--ILKLPKVTHDADLIASLPENFDPRDKWPD 56
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDG 168
C T++ I DQG CGSCWAFGAVEA++DR CI+ + S DL++CC +CG GC+G
Sbjct: 57 CPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNG 115
Query: 169 GYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKC 215
G P AW Y+ H G+V+ + C PY + C H PG C TPKC + C
Sbjct: 116 GMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCEKTC 174
Query: 216 VKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
+ ++ K Y Y ++ ++I AE++KNGPVE +FTVY D YKSGVY+H
Sbjct: 175 ESSYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTH 234
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G+ +GGHA+K++GWG ++G YW+
Sbjct: 235 GNALGGHAIKILGWGV-ENGSKYWL 258
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 152/287 (52%), Gaps = 31/287 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTH--- 92
+L D I+ V + W+A RN F ++ + L+GV P L P K
Sbjct: 25 LLSDEFIELVKTKTRT-WQAGRN--FDEGVSEEYIRGLMGVHPDAYKFAL--PDKQEVLG 79
Query: 93 ---DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 147
K +PK FDAR WP C TI+ I DQG CGSCWAFGAVEA+SDR CIH +N
Sbjct: 80 YLSQKVDDIPKEFDAREKWPNCPTINEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGNVNF 139
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH- 199
S +DL++CC CG GC+GG+P +AW Y+ G+V+ C PY + C H
Sbjct: 140 RFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPY-EIAPCEHH 197
Query: 200 -----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
C TPKC +C N + KH+ +Y + + DI EI NGPVE
Sbjct: 198 VNGTRAPCNHDSKTPKCQHQCEAGYNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTNGPVE 257
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWV 299
+FTVYED YKSGVY+H G +GGHA++++GWG E YW+
Sbjct: 258 GAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGKEEVPYWL 304
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 111/282 (39%), Positives = 158/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 172/308 (55%), Gaps = 41/308 (13%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ + H L D ++ +N+ + W+A N F N +
Sbjct: 10 CLLVLTSAWSKPYF-----------HPLSDELVNFINKQ-NSTWQAGHN--FRNVDMSYL 55
Query: 71 KHLLGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
K L G LG P K + LPKSFDAR W C TI I DQG CGSC
Sbjct: 56 KRLCGS-------FLGGPKLPQRVKFAKDMNLPKSFDAREQWSHCPTIKEIRDQGSCGSC 108
Query: 127 WAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAFGAVE++SDR CIH ++S+ V+ DLL CCG CGDGC+GGYP AW ++ G+V
Sbjct: 109 WAFGAVESISDRICIHTNGHVSVEVSAEDLLTCCGGQCGDGCNGGYPAEAWNFWTRKGLV 168
Query: 185 TEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
+ C PY S P C TPKC + C + ++ KH+ +
Sbjct: 169 SGGLYESHVGCRPYSIPPCEHHVNGSRPACTGEGDTPKCSKTCEPGYSPTYKEDKHFGYT 228
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y + ++ +IMAEIYKNGPVE +F+VY DF YKSGVY+H+TGD+MGGHA++++GWG
Sbjct: 229 SYSLPTNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWG-E 287
Query: 292 DDGEDYWV 299
++G YW+
Sbjct: 288 ENGVPYWL 295
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 162/301 (53%), Gaps = 29/301 (9%)
Query: 23 FAEGVVSKLKLDS----HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-K 77
FA VV++ K + D +I +NE A WKAA + +F+N + Q K LGV +
Sbjct: 7 FAAIVVAQAKPNYKRQFEPFSDELIHYINEESGASWKAAPSTRFNN--IDQVKQNLGVLE 64
Query: 78 PTPKGL-LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
TP+ V+ LP+SFDAR W C +IS I DQ C SCWA + A++
Sbjct: 65 ETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAIT 124
Query: 137 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EE 187
DR CIH LS D+++CC + CG GC+GG P +W Y+ GVVT
Sbjct: 125 DRICIHSNGQKKPRLSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTG 183
Query: 188 CDPYFDSTGCSH----PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSD 238
C PY CSH PG P YPTPKC +KC N+ + K S+Y +
Sbjct: 184 CLPY-PFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGGQ 242
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
DIM EI KNGPV+ F ++EDF YKSG+Y + TG ++GGHA+++IGWG ++G YW
Sbjct: 243 ETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVKYW 301
Query: 299 V 299
+
Sbjct: 302 L 302
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 118/284 (41%), Positives = 153/284 (53%), Gaps = 31/284 (10%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLL-----LGVPVKTH 92
L D +I +N+ WKA +N + + Q L VK L L +PV+
Sbjct: 54 LSDEMIWFINK-VNTSWKAGQN----FHHIKQEDRLDHVKIMCGTYLDVPPHLQLPVRDI 108
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
+ LP +FDAR+ W C TI I DQG CGSCWAFGAVE++SDR CI N +S
Sbjct: 109 EPRKDLPDTFDARTQWSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHIS 168
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH---- 199
DL +CC CG+GC+GG+ AW Y+ G+VT + C PY C H
Sbjct: 169 AEDLTSCC-RSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPY-TVKACDHHVVG 226
Query: 200 ---PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
P + TP C +C N + KHY +AY + + IM EI NGPVE +
Sbjct: 227 KLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTNGPVEGA 285
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FTVY DF YKSGVYKH TG +GGHA+K++GWGT + G+DYW+
Sbjct: 286 FTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWGT-EGGDDYWL 328
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 111/282 (39%), Positives = 157/282 (55%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG + L V HD
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLK 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 110/247 (44%), Positives = 143/247 (57%), Gaps = 22/247 (8%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA N F+N + L G KG L V V+ + +KLPK+FD+R WP C T
Sbjct: 40 WKAGHN--FNNVDYSYVQKLCGT--MLKGPKLPVLVQ-YSGDMKLPKNFDSREQWPNCPT 94
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 171
+ I DQG CGSCWAFGA EA+SDR CIH +S+ ++ DLL CC CG GC+GGYP
Sbjct: 95 LKEIRDQGSCGSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLTCCDS-CGMGCNGGYP 153
Query: 172 ISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 218
+AW ++ G+V+ C PY G P TP+C+ +C
Sbjct: 154 SAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESG 213
Query: 219 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 277
++ KHY S+Y + SD E I +EIYKNGPVE +FTVYEDF YK+GVY+H+TG
Sbjct: 214 YTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSA 273
Query: 278 MGGHAVK 284
+GGHA+K
Sbjct: 274 VGGHAIK 280
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 156/294 (53%), Gaps = 27/294 (9%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
+S + H+L D I+ V W RN S + + L+GV P L
Sbjct: 15 LSMFEAKDHLLSDEFIELVRGKANT-WTVGRNFHES-VSEKYIRGLMGVHPDADKFALPD 72
Query: 88 PVKT-----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
++ D +P FDAR W C TI I DQG CGSCWAFGAVEA+SDR CIH
Sbjct: 73 KMEVLGKLVEDSDSDIPTEFDAREKWSNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIH 132
Query: 143 F--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+N LS +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY +
Sbjct: 133 SQGKVNFHLSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPY-E 190
Query: 194 STGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEI 246
C H P C TP+C C ++ ++ K++ +Y I ++ DI EI
Sbjct: 191 IEPCEHHVNGTRPPCSSG-STPRCQHVCESSYKVDYKKDKNFGSKSYSIKNNVLDIQKEI 249
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
NGPVE +FTVYED YKSGVY+H+ G +GGHA++++GWG D+ YW+
Sbjct: 250 MNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWL 303
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 189 bits (481), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 157/282 (55%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRKEDPNLRQKRRPTVDHHDLK 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 157/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD
Sbjct: 4 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLRQKRRPTVDHHDLK 61
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ C S WA AV A+SDR CI G ++ LS D
Sbjct: 62 VEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAID 121
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 200
L++CC CG GCDGG+P AW Y+V HG+VT C PY C H P
Sbjct: 122 LISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKCEHHSKGKYP 179
Query: 201 GC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C + Y TP+C RKC K + + + KHY + + + I EI GPVE +
Sbjct: 180 SCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLI 239
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+EDF +YKSG+Y++ TG +G H V++IGWG ++G YW+
Sbjct: 240 FEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWL 279
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 156/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGGKEDAEMKWKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAID 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 200
L++CC CG GCDGG+P AW Y+V HG+VT C PY C H P
Sbjct: 148 LISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYP 205
Query: 201 GC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C + Y TP+C RKC K + + KHY + + + I EI GPVE +
Sbjct: 206 SCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+EDF +YKSG+Y++ TG +G H V++IGWG ++G YW+
Sbjct: 266 FEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWL 305
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 158/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLRQKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 156/281 (55%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGGKEDAEMKWKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAID 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------P 200
L++CC CG GCDGG+P AW Y+V HG+VT C PY C H P
Sbjct: 148 LISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYP 205
Query: 201 GC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C + Y TP+C RKC K + + KHY + + + I EI GPVE +
Sbjct: 206 SCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQNEIMMYGPVEAYLLI 265
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+EDF +YKSG+Y++ TG +G H V++IGWG ++G YW+
Sbjct: 266 FEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWL 305
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 113/304 (37%), Positives = 165/304 (54%), Gaps = 27/304 (8%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S T E V ++ L D +I +NE+P AGWKA ++ +F ++V + LLG
Sbjct: 8 IVSLSTLLEAHVTTRNNERIEPLSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ L + HD ++++P FD+R WP+C +IS+I DQ CGS WA AV
Sbjct: 66 GRREDPNLREKRRPTIDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVG 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y+V G+VT
Sbjct: 126 AMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE- 183
Query: 192 FDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 235
+ TGC P C+ Y TP+C + C K N + KHY +Y +
Sbjct: 184 -NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNV 242
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
I +I +GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G
Sbjct: 243 LGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGT 301
Query: 296 DYWV 299
YW+
Sbjct: 302 AYWL 305
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 157/282 (55%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/271 (39%), Positives = 148/271 (54%), Gaps = 21/271 (7%)
Query: 46 VNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFD 103
V+ A W A P+ + G F+ + G P+ P +H+ +PK+FD
Sbjct: 28 VDSKSGARWIYAEPPE--RFQPGNFQLMFGALREPEEQRSKRPTVSHESFSDEHIPKAFD 85
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFL 161
AR WP C TI I DQ CGSCWAFGAVEA+SDR CIH + +S DL++CCG+
Sbjct: 86 ARKQWPHCPTIGEIRDQSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY- 144
Query: 162 CGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPT 208
CG GC GG+P AW ++ G+VT + +P + CSH G + Y T
Sbjct: 145 CGFGCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDT 204
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
P CV+KC + + K + Y + + IM EI NGPVE +F VYEDF YKSG
Sbjct: 205 PNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG 264
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY H G ++GGHA++++GWG ++G YW+
Sbjct: 265 VYFHSDGTLLGGHAIRILGWG-EENGVAYWL 294
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 150/281 (53%), Gaps = 25/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVPVKTHDKS 95
D +I +NE A WKA + +F N + FK LG+ + + V+ +
Sbjct: 3 FSDELIHYINEKSGASWKAGPSSRFIN--IEHFKQHLGLLEETPEERETRRPTVRYNVSE 60
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+SFDAR WP C +I +I DQ CGSCWA V A+SDR CIH M LS D
Sbjct: 61 NDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAID 120
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA- 205
L++CC + CG+GC GG P +AW Y+ +G+VT C PY C HPG
Sbjct: 121 LVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQL 178
Query: 206 -------YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
YPTP C C ++ + K Y ++Y ++ IM EI KNGPVE F
Sbjct: 179 NPCPGYIYPTPSCYPYCQAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFI 238
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
VY DFA YKSG+Y H++G G HA+++IGWG ++G +YW
Sbjct: 239 VYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVNYW 278
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 108/278 (38%), Positives = 144/278 (51%), Gaps = 13/278 (4%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L +H+ +++ +N + W A N + + P P+ + V
Sbjct: 29 LTTHLTGKALVDHIN-TAQTSWLAEHNVISDSEMKFKVMDERFADPLPEEESGEILVSGE 87
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 150
+P +FDAR WP C +I I +Q CGSCWAFGA E +SDR CI +S
Sbjct: 88 IVPEPIPDTFDARENWPDCKSIKLIRNQATCGSCWAFGAAEVISDRICIQSNGTQQPIIS 147
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEP 204
V D+L+CCG CG GC GGY I A R++ +G VT C PY + P E
Sbjct: 148 VEDILSCCGTTCGKGCQGGYSIEAMRFWKSNGAVTGGDYNGNGCMPYSFAPCQKSPCVES 207
Query: 205 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRI---NSDPEDIMAEIYKNGPVEVSFTVYED 261
PT K + + KHY SAYR+ N+ I EIY NGPVE S+ VYED
Sbjct: 208 TTPTCKTTCQSSYTTANYTTDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEASYKVYED 267
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F YKSGVY +++G ++GGHAVK+IGWGT +D DYW+
Sbjct: 268 FYQYKSGVYHYVSGKLVGGHAVKIIGWGTEND-VDYWL 304
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 157/287 (54%), Gaps = 31/287 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT----- 91
+L D I E+ + + W+ RN + S + + L+GV P L P K
Sbjct: 22 MLSDEFI-ELVRSKASTWQVGRNFKES-VSEEYIRGLMGVHPDAHKFAL--PEKRIVLGD 77
Query: 92 --HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 147
D + +P+ FDAR AWP C TI I DQG CGSCWAFGAVEA+SDR CIH +N
Sbjct: 78 LYADDGIDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNF 137
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH- 199
LS +DL++CC +CG GC+GG+P +AW Y+ G+V T+ C PY + C H
Sbjct: 138 HLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCEHH 195
Query: 200 -----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
P C TP C KC + + K++ +Y + + +I EI NGPVE
Sbjct: 196 VNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVE 254
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWV 299
+FTVYED YKSGVY+H G +GGHA++++GWG + + YW+
Sbjct: 255 GAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWL 301
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 157/287 (54%), Gaps = 31/287 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT----- 91
+L D I E+ + + W+ RN + S + + L+GV P L P K
Sbjct: 22 MLSDEFI-ELVRSKASTWQVGRNFKES-VSEEYIRGLMGVHPDAHKFAL--PEKRIVLGD 77
Query: 92 --HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 147
D + +P+ FDAR AWP C TI I DQG CGSCWAFGAVEA+SDR CIH +N
Sbjct: 78 LYADDGVDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSEGKVNF 137
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCSH- 199
LS +DL++CC +CG GC+GG+P +AW Y+ G+V T+ C PY + C H
Sbjct: 138 HLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCEHH 195
Query: 200 -----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
P C TP C KC + + K++ +Y + + +I EI NGPVE
Sbjct: 196 VNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVE 254
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWV 299
+FTVYED YKSGVY+H G +GGHA++++GWG + + YW+
Sbjct: 255 GAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWL 301
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 112/282 (39%), Positives = 158/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVP-VKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG K P P V HD +
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRKEDPNLRQRRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA A+ A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAIGAMSDRICIQSGGKQSVKLSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCEN-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 106/246 (43%), Positives = 140/246 (56%), Gaps = 19/246 (7%)
Query: 71 KHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
KHL + P+ H D ++++P +FD+R WP C +I+ I DQ CGS WAF
Sbjct: 39 KHLDARREESDLRRKRRPIVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSSWAF 98
Query: 130 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 185
GAVEA+SDR CI G N+ LS DLL+CC CGDG +GG+P AW Y+V G+VT
Sbjct: 99 GAVEAMSDRSCIQSGGKQNVELSAVDLLSCCEH-CGDGFEGGFPALAWDYWVKEGIVTGS 157
Query: 186 -----EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAY 233
C PY T +P C E Y TP C C K + + KH S Y
Sbjct: 158 SKENHTSCQPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSRY 217
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+ +D + I EI K GPVE +F VYEDF +YKSG+YKHITG ++ HA+++IGWG ++
Sbjct: 218 NVKNDEKAIQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGV-EN 276
Query: 294 GEDYWV 299
YW+
Sbjct: 277 NTPYWL 282
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 107/275 (38%), Positives = 149/275 (54%), Gaps = 15/275 (5%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQ--FSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
++ +L + + E+N K W A+ + S + + + L+GV L
Sbjct: 32 NTPLLSNRFVAEINLKAKGQWTASADNGHLVSGKSDEELRKLMGVLNMSTAALSPRIFSA 91
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ + +LP SFD+ WP+C TIS I DQ +CGSCWA AVEA+SDR+C G+ +L +S
Sbjct: 92 EELAQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVAGITDLRVS 151
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY------FDSTGCSHPGCEP 204
LL+CC F+CG GC GG P AW ++V G+ +E C PY + G +P C
Sbjct: 152 TGHLLSCC-FVCGMGCQGGIPTMAWLWWVWVGLTSEVCQPYPFPPCGHHTDGGKYPACPS 210
Query: 205 A-YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
Y TP C C + +KH +Y + + E M E+ GP EV+F VY DF
Sbjct: 211 TIYDTPTCNSTCADSHTAL--TKHKGEKSYSLRGERE-YMIELMTYGPFEVAFDVYADFV 267
Query: 264 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
YKSGVY H TG+ +GGHAVKL+GWG +G YW
Sbjct: 268 SYKSGVYSHTTGERLGGHAVKLVGWGV-QNGTPYW 301
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/279 (41%), Positives = 166/279 (59%), Gaps = 20/279 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL- 154
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S ++ L
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-FDS----TGCSHPGC 202
L C CG GC GG+P AW Y+V G+VT EE C PY F T +P C
Sbjct: 148 LISCCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPAC 207
Query: 203 -EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
Y TP+C + C K + ++ KHY +Y + S+ + I EI GPVE +F VYE
Sbjct: 208 GTKIYKTPQCKQTCQKGYKTPYKQDKHYGDESYNVISNEKAIQKEIMMYGPVEAAFDVYE 267
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF +YKSG+Y+H+TG ++GGHA+++IGWG + G+ YW+
Sbjct: 268 DFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGKPYWL 305
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/211 (47%), Positives = 123/211 (58%), Gaps = 12/211 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P SFD+R+ W +C +I I DQ CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 122 IPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 181
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY C+ C P TP
Sbjct: 182 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCTSGNC-PESKTP 239
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + KH+ SAY + I EI NGPVE +FTVYEDF YKSG
Sbjct: 240 SCSLSCQSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYEDFYKYKSG 299
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH G +GGHA+K+IGWGT + G YW+
Sbjct: 300 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWL 329
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 158/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 158/282 (56%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/279 (39%), Positives = 149/279 (53%), Gaps = 26/279 (9%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ IL D ++ VN W A R + + T +LG +L P + +
Sbjct: 28 DAPILTDEFLEHVNRLNGGKWTAGRTSRTKHLTRRGASRMLGTFLRNTSIL--PPRQFSE 85
Query: 94 KSLKLP--KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ L++P FDA AWP+C T++ I DQ CGSCWA A A+SDR+C G+ +L +S
Sbjct: 86 EELRVPLQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRIS 145
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGC 202
DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H C
Sbjct: 146 AGDLMSCCD-VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPC 202
Query: 203 EPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
Y TP C C K +R + Y +S E E+ NGP EVSF+VY
Sbjct: 203 SGEYDTPTCNSTCTDKKIPLIKYRGNTSYVLSG------EEPFKRELILNGPFEVSFSVY 256
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
DF Y GVYKH+ G +GGHAV+++GWG +GE YW
Sbjct: 257 ADFVAYTGGVYKHVAGIFLGGHAVRIVGWGEL-NGEPYW 294
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/232 (47%), Positives = 141/232 (60%), Gaps = 21/232 (9%)
Query: 87 VPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 143
V V HD + +P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI +
Sbjct: 69 VEVVEHDIQEDTIPATFDARTQWPNCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNG 128
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF---- 192
+N LS D+L+CC CG GCDGGYPI+AW+Y V G T C PY
Sbjct: 129 AVNTLLSAEDVLSCCSN-CGYGCDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPC 187
Query: 193 -DSTG-CSHPGC-EPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIY 247
++ G + P C + Y TP CV KC K N +++ KH+ +AY + I AEI
Sbjct: 188 GETVGNVTWPDCPDDGYNTPACVNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEII 247
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+GPVE +FTVYEDF YKSGVY H TG +GGHA++++GWGT D+G YW+
Sbjct: 248 AHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWL 298
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 112/279 (40%), Positives = 148/279 (53%), Gaps = 26/279 (9%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ IL D ++ VN W A R + + T LLG +L P + +
Sbjct: 28 DAPILTDEFLELVNRLNGGKWTAGRTSRTKHLTRRGASRLLGTFLRNTSIL--PPRQFSE 85
Query: 94 KSLKLP--KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLS 150
+ L+ P FDA AWP+C TI+ I DQ CGSCWA A A+SDR+C G+ +L +S
Sbjct: 86 EELREPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRIS 145
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGC 202
DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H C
Sbjct: 146 AGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPC 202
Query: 203 EPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
Y TP C C K +R + Y +S E E+ NGP EVSF+VY
Sbjct: 203 SGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPFEVSFSVY 256
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
DF Y GVYKH+ G +GGHAV+++GWG +GE YW
Sbjct: 257 ADFLAYTGGVYKHVAGTFLGGHAVRIVGWGEL-NGEPYW 294
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/301 (39%), Positives = 161/301 (53%), Gaps = 29/301 (9%)
Query: 23 FAEGVVSKLKLDS----HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV-K 77
FA VV++ K + D +I +NE A WKAA + +F+N + Q K LGV +
Sbjct: 7 FAAIVVAQAKPNYKRQFEPFSDELIHYINEESGASWKAAPSTRFNN--IDQVKQNLGVLE 64
Query: 78 PTPKGL-LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALS 136
TP+ V+ LP+SFDAR W C +IS I DQ C SCWA + A++
Sbjct: 65 ETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAIT 124
Query: 137 DRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EE 187
DR CIH LS D+++CC + CG GC+GG P +W Y+ GVVT
Sbjct: 125 DRICIHSNGQKKPRLSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTG 183
Query: 188 CDPYFDSTGCSH----PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSD 238
C PY CSH PG P YPTPKC +KC N+ + K S+Y +
Sbjct: 184 CLPY-PFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQ 242
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
D M EI KNGPV+ F ++EDF YKSG+Y + TG ++GGHA+++IGWG ++G YW
Sbjct: 243 ETDFMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVKYW 301
Query: 299 V 299
+
Sbjct: 302 L 302
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 155/302 (51%), Gaps = 20/302 (6%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
+++ CLL S+ A G + L D+ +L + + +N+ WKA N + N
Sbjct: 3 VYVALCLL-----STALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNI 57
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
T + + L G + L V +LP+SFD+ WP C TI I DQ CGS
Sbjct: 58 TFAEARRLTGARIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGS 117
Query: 126 CWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
CWA A+SDR C G+ L +S L++CC CG GCDGGYP ++W Y+V HG+
Sbjct: 118 CWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCE-DCGYGCDGGYPGTSWEYYVSHGLA 176
Query: 185 TEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRIN 236
+ C PY C H G + P TPKC C K K+ +Y ++
Sbjct: 177 SSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEVH 233
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
+ +D E+Y NGP V F VY DF YK+GVY+H++GD +GGHAV+++GWG +G
Sbjct: 234 GE-DDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTP 291
Query: 297 YW 298
YW
Sbjct: 292 YW 293
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 174/313 (55%), Gaps = 41/313 (13%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CLL+L S + L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLLVLTSARSSLYFP-----------PLSDELVNFVNKQ-NTTWKAGHN--FYNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
+ K L G +L G + D + LP+SFDAR WP C TI I DQG
Sbjct: 51 DLSYVKKLCGT------ILGGPKLPQRDAFAADVVLPESFDARKQWPNCPTIKEIRDQGS 104
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDGC+GG+P AW ++
Sbjct: 105 CGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTK 164
Query: 181 HGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSK 226
G+V+ C PY C H P C TPKC + C + ++ K
Sbjct: 165 KGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDK 223
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
H+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++
Sbjct: 224 HFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRIL 283
Query: 287 GWGTSDDGEDYWV 299
GWG ++G YW+
Sbjct: 284 GWGV-ENGTPYWL 295
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 173/312 (55%), Gaps = 39/312 (12%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L +CLL+L S + L D ++ VN+ WKA N F N
Sbjct: 5 LATLSCLLVLTSARSSLYFP-----------PLSDELVNFVNKQ-NTTWKAGHN--FYNV 50
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SLKLPKSFDARSAWPQCSTISRILDQGH 122
+ K L G +L G + D + LP+SFDAR WP C TI I DQG
Sbjct: 51 DLSYVKKLCGA------ILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGS 104
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDGC+GG+P AW ++
Sbjct: 105 CGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTK 164
Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
G+V+ C PY S P C TPKC + C + ++ KH
Sbjct: 165 KGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKH 224
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGHA++++G
Sbjct: 225 FGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILG 284
Query: 288 WGTSDDGEDYWV 299
WG ++G YW+
Sbjct: 285 WGV-ENGTPYWL 295
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 132/223 (59%), Gaps = 22/223 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
+P +D R + QC +++ I DQ HCGSCWA A EA+SDR CI +N LS D+L
Sbjct: 81 IPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDIL 140
Query: 156 ACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHP 200
CC + CGDGC+GGYPI AW+Y+V +G+VT C PY + G + P
Sbjct: 141 TCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWP 200
Query: 201 GCEPA-YPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
C + TPKCV C + + KHY +AY ++ + I +EI KNGPVEV F
Sbjct: 201 KCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGF 260
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
TVY DF YKSGVY H+ G +GGHAVKL+GWG D+G YW+
Sbjct: 261 TVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGV-DNGTPYWL 302
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/296 (35%), Positives = 151/296 (51%), Gaps = 15/296 (5%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
+++L ++ A G + L D+ +L + + +N+ WKA + + N T + K
Sbjct: 5 VVVLSSFAATLVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYDGKMQNLTFSEAK 64
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
L G L V +LP+SFDA WP C TI I DQ C + WA
Sbjct: 65 RLTGAFSRKTSSLPPVRFTEEQLRTELPESFDAAEHWPHCPTIREIADQSACRASWAVAT 124
Query: 132 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR+C + G L +S DL+ACC CG GC+GGYP +AW Y+V HG+ + +C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGITSSQCQP 183
Query: 191 YFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
Y C H G + P TP+C C K+ K+ +Y + + ED
Sbjct: 184 Y-PFPRCEHRGAQGKKPPCSKYKFVTPQCNATCTDKSVPL--IKYRGNHSYEVRGE-EDY 239
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP V F V+ DF YKSGVY+H+ G+ +GG AV+++GWG +G YW
Sbjct: 240 KRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKL-NGTPYW 294
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 157/282 (55%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLK 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 305
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 104/257 (40%), Positives = 141/257 (54%), Gaps = 28/257 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPT---PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQ 110
W +NP FS G +G K + PK + +P ++ LP +FDA WPQ
Sbjct: 32 WVELKNPIFS----GDNLPRMGFKKSLDRPKKIYKTLP-----HNVNLPTNFDAAQQWPQ 82
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGY 170
C TI I +Q CGSCWAFGA+E++SDRFCIH ++ LS DL+ C +GC+GG
Sbjct: 83 CPTIGAIQNQAECGSCWAFGAIESISDRFCIHKNESVQLSFQDLITCDN--QDNGCEGGD 140
Query: 171 PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQLWR 223
P +A++Y +GVVT C PY + P C PA TP C KC + ++
Sbjct: 141 PYTAYKYVQKNGVVTSNCQPY------TIPTCPPAQQPCMNFVNTPPCSAKCANSSVNFQ 194
Query: 224 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
H+ + Y + + I EI NGPVE F VYEDF YKSGVY H +G +GGH +
Sbjct: 195 QDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCI 254
Query: 284 KLIGWGTSDDGEDYWVC 300
K++G+G S +G YW+C
Sbjct: 255 KIVGFGVS-NGTPYWIC 270
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 136/227 (59%), Gaps = 18/227 (7%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 148
V H+ ++++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S
Sbjct: 18 VDHHNLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQS 77
Query: 149 --LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDS 194
LS DL++CC CG GC GG+P AW Y+V G+VT C PY
Sbjct: 78 AELSALDLISCCE-DCGQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHH 136
Query: 195 TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
T +P C Y TP+C + C K + + KHY +Y + ++ + I +I GPV
Sbjct: 137 TKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPV 196
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW+
Sbjct: 197 EAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWL 242
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 99/211 (46%), Positives = 125/211 (59%), Gaps = 12/211 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P SFD+R+ W +C +I I +Q CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 86 IPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLL 145
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTP 203
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + + KH+ SAY + I EI NGPVE +FTVYEDF YKSG
Sbjct: 204 ACSLSCQSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYKSG 263
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH G +GGHA+K+IGWGT + G YW+
Sbjct: 264 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWL 293
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 115/279 (41%), Positives = 163/279 (58%), Gaps = 20/279 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRNRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL- 154
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S ++ L
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-----FDSTGCSHPGC 202
L C CG GC GG+P AW Y+V G+VT EE C PY T +P C
Sbjct: 148 LISCCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPAC 207
Query: 203 -EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
Y TP+C + C K + + KHY Y + S+ + I EI GPVE +F VYE
Sbjct: 208 GTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYE 267
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF +YKSG+Y+H+TG ++GGHA+++IGWG + G+ YW+
Sbjct: 268 DFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGKPYWL 305
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 99/211 (46%), Positives = 124/211 (58%), Gaps = 12/211 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P SFD+R+ W +C +I I +Q CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 86 IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTP 203
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + KH+ SAY + I EI NGPVE +FTVYEDF YKSG
Sbjct: 204 ACSLSCQSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSG 263
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH G +GGHA+K+IGWGT + G YW+
Sbjct: 264 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWL 293
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 109/282 (38%), Positives = 157/282 (55%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 154/282 (54%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRKEDPNLRQKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ C S WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAID 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC CG GCDGG +W Y+V HG+VT + TGC P C+
Sbjct: 148 LISCCEN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + I EI GPVE
Sbjct: 205 RACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWL 305
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 101/214 (47%), Positives = 126/214 (58%), Gaps = 16/214 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LP +FDAR WP C+TI I +Q CGSCWAFGA E +SDR CI +SV D+L
Sbjct: 92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG GC GGY I A R++ G VT C PY S C P TP
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 208
Query: 210 KCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 265
C C K + ++ KHY SAY++ + +I EIY GPVE S+ VYEDF HY
Sbjct: 209 SCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 268
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KSGVY + +G ++GGHAVK+IGWG ++G DYW+
Sbjct: 269 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWL 301
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 99/211 (46%), Positives = 124/211 (58%), Gaps = 12/211 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P SFD+R+ W +C +I I +Q CGSCWAFGA E +SDR CI +S +DLL
Sbjct: 86 IPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 145
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG+GC+GGYPI A R++ GVVT C PY + C+ C P TP
Sbjct: 146 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTP 203
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + KH+ SAY + I EI NGPVE +FTVYEDF YKSG
Sbjct: 204 ACSLSCQPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSG 263
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH G +GGHA+K+IGWGT + G YW+
Sbjct: 264 VYKHTAGKALGGHAIKIIGWGT-ESGSPYWL 293
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 154/282 (54%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMILFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLRQKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ C S WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAID 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC CG GCDGG +W Y+V HG+VT + TGC P C+
Sbjct: 148 LISCCKN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + I EI GPVE
Sbjct: 205 RACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLQ 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTSYWL 305
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 120/280 (42%), Positives = 158/280 (56%), Gaps = 25/280 (8%)
Query: 39 QDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD-KSLK 97
Q++I + VN ++ WKA P+ + T+ Q K L V V HD
Sbjct: 25 QEAITEYVNSK-QSLWKA-EIPK--DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINEDT 80
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI + +N LS D+L
Sbjct: 81 IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTG-CSHPGC 202
+CC CG GC+GGYPI+AW+Y V G T C PY ++ G + P C
Sbjct: 141 SCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199
Query: 203 -EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ Y TP CV KC KN + KH+ +AY + I AEI +GPVE +FTVY
Sbjct: 200 PDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVY 259
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDF YK+GVY H TG +GGHA++++GWGT D+G YW+
Sbjct: 260 EDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWL 298
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 112/289 (38%), Positives = 157/289 (54%), Gaps = 21/289 (7%)
Query: 23 FAEGVVSKLKLDS-HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK 81
FA G+ S L + H L D I ++N + ++ WKA RN Y + FK L P+
Sbjct: 9 FALGLSSALPSNKPHPLSDEYIAQIN-SKQSTWKAGRNFAIDEYEL--FKSLASGVKKPQ 65
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTI-SRILDQGHCGSCWAFGAVEALSDRFC 140
GL + + + ++P+SFD+R+AWP+C+ I I DQ CGSCWAF AVEA+SDR C
Sbjct: 66 GLKTAQKL-VREITEEIPESFDSRTAWPECTQIIGMIRDQSRCGSCWAFAAVEAMSDRIC 124
Query: 141 IHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH-------HGVVTEECDPY 191
IH L +S DLL C GC+GG+P AW + + +G + + C Y
Sbjct: 125 IHSNATKKLLVSSQDLLTCG---TAGGCNGGWPAVAWSDWTNGIVTGGLYGALEQGCKSY 181
Query: 192 FDSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 250
F HP C TP CV +C + + ++ + Y + Y I + E I EI NG
Sbjct: 182 FLEGCDDHPNKCRNYVSTPACVEQCDEPSLYYKAQETYGQTPYEIQGE-EQIQYEIMTNG 240
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
PVE + VY DFA Y+SG+Y+ T + GGHAVK++GWG +DG YW+
Sbjct: 241 PVEATMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGV-EDGVKYWL 288
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 186 bits (473), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 118/308 (38%), Positives = 163/308 (52%), Gaps = 26/308 (8%)
Query: 8 LTTCLLILGVISSQTFAE-GVVSKLKLDSHILQDSIIKEVNE-NPKAG-WKAARNPQFSN 64
+ +L+L V+ FA+ G S + S ++ I + P A W NP N
Sbjct: 1 MFRAILVLAVVGQAAFAQYGRPSGSQSGSFPPYEATISIAEKVRPLATTWTPGANPLPPN 60
Query: 65 -YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 123
Y G + L P G+L+ VK H + LP+ FDAR WP+C+++ +I +QG C
Sbjct: 61 LYRTGAKREDLEKHRLPLGILV---VKDH---IVLPERFDARDRWPECTSLKQIRNQGCC 114
Query: 124 GSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWA A E +DR+CIH S DLL+CC CGDGC GG AW+++V
Sbjct: 115 GSCWAISAAETFTDRWCIHSEDKDQFSFGAYDLLSCC-HSCGDGCQGGNLGPAWQFWVQR 173
Query: 182 GVVTEECDPYFDSTGCSHP-------GCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISA 232
GV + PY GC HP + TPKC RKC + + + + A
Sbjct: 174 GVSSG--GPYNSRQGC-HPYPVDVCHSADEDADTPKCTRKCQSMYNVTNVSDDRRFGRVA 230
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y ++ D E I EI++NGPV+ SF VY DF YK+GVY+H+ G + GGHAVK+IGWG +
Sbjct: 231 YSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGV-E 289
Query: 293 DGEDYWVC 300
+G YW+C
Sbjct: 290 NGTKYWLC 297
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 121/317 (38%), Positives = 160/317 (50%), Gaps = 28/317 (8%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVN-----ENPKAGWK 55
M + L L C L + SS + V +S Q + N N W
Sbjct: 1 MIRALLLLVCCQAALSIDSSSFIKQAQVPGQNQNSVQQQAASRASANIAAMVRNRTNSWT 60
Query: 56 AA--RNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
A R P S+Y VG L K G+L+ + + LP+ FDAR WPQC +
Sbjct: 61 AGAPRQP-LSSYRVGVNMEELESKRLKPGILI------LKEDIDLPEQFDARDKWPQCPS 113
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 171
+ I +QG CGSCWA A EA +DR+CIH + + S DL++CC CGDGC GG
Sbjct: 114 LREIRNQGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLISCC-HSCGDGCQGGVL 172
Query: 172 ISAWRYFVHHGVVTEECDPYFDSTGC-SHPGCEPAYP-----TPKCVRKCVKKNQLWRNS 225
AW Y+V GV + PY GC S+P P PKC RKC + S
Sbjct: 173 GPAWDYWVQKGVSSG--GPYNSKQGCHSYPFDTCHSPDEDDDAPKCSRKCQSSYSVQDVS 230
Query: 226 K--HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
K + AY + +D IM EI+ NGPV+ +F VY DF YKSGVY+H+TG + GGHA+
Sbjct: 231 KDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAI 290
Query: 284 KLIGWGTSDDGEDYWVC 300
K++GWG ++G YW+C
Sbjct: 291 KILGWGV-ENGTKYWLC 306
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/261 (40%), Positives = 144/261 (55%), Gaps = 28/261 (10%)
Query: 63 SNYTVGQFKHLLGVKPTPKGLLLGVPVKTH-------DKSLKLPKSFDARSAWPQCSTIS 115
++ T G + L+GV P L P K + +LP+ FD+R WP C TI
Sbjct: 37 ASVTEGHIRRLMGVHPDAHKFAL--PDKREVLGDLYVNSVDELPEEFDSRKQWPNCPTIG 94
Query: 116 RILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPIS 173
I DQG CGSCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +
Sbjct: 95 EIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGA 153
Query: 174 AWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQ 220
AW Y+ G+V+ + C PY + + C H P C TPKC C
Sbjct: 154 AWSYWTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYT 212
Query: 221 L-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 279
+ + KH+ +Y + + +I EI NGPVE +FTVYED YK GVY+H G +G
Sbjct: 213 VDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELG 272
Query: 280 GHAVKLIGWGT-SDDGEDYWV 299
GHA++++GWG ++ YW+
Sbjct: 273 GHAIRILGWGVWGEEKIPYWL 293
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 155/313 (49%), Gaps = 35/313 (11%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
LF++ +L+ V S Q G + + L H V+ A W + R+ + +
Sbjct: 4 LFISYAILVF-VNSFQDAQCGELEDVGLREH---------VHSVTGARWISGRHSK--GF 51
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHC 123
H G K P H +LPK+FDARS WP CS++S I DQ C
Sbjct: 52 ESDHLIHTFGAKMETAEQKAQRPTVKHVGFDDTRLPKNFDARSKWPHCSSVSEIRDQSSC 111
Query: 124 GSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWAFGAVEA+SDR CIH N SLS DLL+CC CG GC GGYP AW Y+ H
Sbjct: 112 GSCWAFGAVEAMSDRLCIHSNGSFNKSLSAVDLLSCCK-DCGFGCRGGYPAVAWDYWRTH 170
Query: 182 GVVTEECDPYFDSTGCS--------------HPGC-EPAYPTPKCVRKCVKKNQLWRNSK 226
G+VT D +GC +P C YPTP+CV+ C + K
Sbjct: 171 GIVTGGSKE--DPSGCRSYPFPKCDHHVQGHYPPCPRQIYPTPECVQDCDTPELGYLEDK 228
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
+ +Y I + IM EI GPVE FTVYEDF YKS VY H G M GHA++++
Sbjct: 229 TRANISYNIYASEISIMKEIMLRGPVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIRIL 288
Query: 287 GWGTSDDGEDYWV 299
GWG D YW+
Sbjct: 289 GWGEEGD-VPYWL 300
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 111/281 (39%), Positives = 149/281 (53%), Gaps = 24/281 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 94
H L D +I +N+ WKA RN N K L+GV + +P H
Sbjct: 25 HPLSDEMIDFINK-LNTTWKAGRNFD-KNVPFSYIKGLMGVA---RNKTRRLPTLMHSSI 79
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP+SFDAR W +C++I I DQ CG+CWAFGAVEA+SDR CIH + +++S
Sbjct: 80 PDNLPESFDARQHWRKCNSIHVIRDQSSCGACWAFGAVEAISDRICIHTKGSVQVNISAQ 139
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSH 199
DLL CC + C GC GG P AW ++ G+VT + C PY + +TG
Sbjct: 140 DLLTCCDY-CRTGCKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLP 198
Query: 200 PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P P P C R+C K + + KHY Y ++ D I EI+KNGPVE F V
Sbjct: 199 PPINDLSPMPPCKRECRKSYGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAV 258
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y DF YKSGVY+ + G HA++++GWGT ++G YW+
Sbjct: 259 YADFYSYKSGVYQAHSRVRCGSHAIRILGWGT-ENGVPYWL 298
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/288 (45%), Positives = 168/288 (58%), Gaps = 34/288 (11%)
Query: 34 DSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP--- 88
DSH+ L D ++ +N+ W+A N F N V K L G LG P
Sbjct: 20 DSHLHPLSDELVNFINKQ-NTTWQAGHN--FFNVEVSYLKKLCGT-------FLGGPKLP 69
Query: 89 --VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
V+ D +KLP+SFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH
Sbjct: 70 RRVEFAD-DIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGH 128
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF----- 192
+N+ +S D+L CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 129 VNVEVSAEDMLTCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCE 188
Query: 193 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
S P C TP+C + C + ++ KHY S+Y ++SD +I AEIYKNGP
Sbjct: 189 HHVNGSRPACTGEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGP 248
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VE +FTVY DF YKSGVY+H TGD+MGGHA++++GWG ++G YW+
Sbjct: 249 VEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWG-EENGVPYWL 295
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 154/282 (54%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMILFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ C S WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAID 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------- 203
L++CC CG GCDGG +W Y+V HG+VT + TGC P C+
Sbjct: 148 LISCCKN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 204 -----PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
Y TP+C + C K N + KHY +Y + I EI GPVE
Sbjct: 205 RACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGEFSYNVIGVESVIQKEIMMYGPVEAYLH 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTSYWL 305
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 108/232 (46%), Positives = 140/232 (60%), Gaps = 21/232 (9%)
Query: 87 VPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 143
V V HD + +P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI +
Sbjct: 69 VEVIKHDIQEDTIPDTFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNG 128
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF---- 192
+N LS D+L+CC CG GC+GGYPI+AW+Y V G T C PY
Sbjct: 129 AVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPC 187
Query: 193 -DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIY 247
++ G + P C + Y TP CV KC N +++ KH+ +AY + I AEI
Sbjct: 188 GETVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEIL 247
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+GPVE +FTVYEDF YKSGVY H TG +GGHA++++GWGT D+G YW+
Sbjct: 248 AHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWL 298
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 161/310 (51%), Gaps = 26/310 (8%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
+LG +S F + + L+ + ++ +N+ K + A +P+F+N
Sbjct: 7 FAVLGTAASAAFLQHTENVLREAEQLSGSDLVNYINKAQKL-FTAKLSPRFANLPRDIKH 65
Query: 72 HLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
L+G K + KTH+ + +PKSFDAR+ WP+C+++ + DQ CGS WA
Sbjct: 66 RLMGSKYVALPAKYRMNEKTHNDIDNSTIPKSFDARTNWPKCASLRTVRDQSACGSGWAV 125
Query: 130 GAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AV A+ DR CI + LS +D+L+CC CG GC+GG AW Y+ G+VT
Sbjct: 126 AAVGAIMDRICIASEGKQQVILSADDILSCCT-ECGYGCEGGDTYKAWNYWTTDGIVTGS 184
Query: 188 CDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL-WRNSKHYSI 230
Y +GC +P CE YPT C KC + + KHY
Sbjct: 185 --NYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTISYDEDKHYGA 242
Query: 231 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
Y + D I EI +GPVEV+F VYEDF HY SG+YKH+ G+ +G HAVK++GWGT
Sbjct: 243 YPYVLVGDASFIQQEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGT 302
Query: 291 SDDGEDYWVC 300
++G DYW+C
Sbjct: 303 -ENGVDYWIC 311
>gi|19880041|gb|AAM00234.1|AF359422_1 cathepsin B-like cysteine proteinase [Nicotiana tabacum]
Length = 110
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 86/110 (78%), Positives = 96/110 (87%)
Query: 56 AARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTIS 115
AA NP+FSN+TV QFK LLGVKPT KG L G+P+ TH K L+LP+ FDAR AWP CSTI
Sbjct: 1 AALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLLELPQEFDARVAWPNCSTIG 60
Query: 116 RILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG 165
RILDQGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDLLACCGFLCGDG
Sbjct: 61 RILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDG 110
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 109/271 (40%), Positives = 147/271 (54%), Gaps = 17/271 (6%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFS-NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
IL I +N+ W A P F N + L G + P K
Sbjct: 21 ILSQQFINAINQK-HPSWLAG--PNFPPNTPHSHLRSLNGARDDP-AFFTDTETKNVTIP 76
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVND 153
++P++FDAR WPQC +I +I +QG CGSCWAFGAVE +SDR CI + S D
Sbjct: 77 EQIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQD 136
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPK 210
LLACC CG GC GGY AW+Y+V G+V+ + S GC HP A+ TP
Sbjct: 137 LLACCK-ECGHGCGGGYSSRAWQYWVTDGIVSG--GDFNTSQGC-HPYSVQAFRDSTTPN 192
Query: 211 CVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C K + + K Y +YRI + E I AEI +GPV+ S+ VY+DF Y++G
Sbjct: 193 CSSFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNG 252
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY+H+ G+V G H+VK++GWG ++G DYW+
Sbjct: 253 VYQHVLGNVSGRHSVKILGWG-RENGTDYWL 282
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 114/279 (40%), Positives = 162/279 (58%), Gaps = 20/279 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL- 154
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S ++ L
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-----FDSTGCSHPGC 202
L C CG GC GG+P AW Y+V G+VT EE C PY T +P C
Sbjct: 148 LISCCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPAC 207
Query: 203 -EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
Y TP+C + C K + + KHY Y + S+ + I EI GPVE +F VYE
Sbjct: 208 GTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYE 267
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF +YKSG+Y+H+ G ++GGHA+++IGWG + G+ YW+
Sbjct: 268 DFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGKPYWL 305
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 108/284 (38%), Positives = 153/284 (53%), Gaps = 36/284 (12%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L +++ + + EVN +P P F + ++ +K + L L V +
Sbjct: 38 LVAYLRRSQNLFEVNSDP--------TPDFE-------QKIMSIKYKHQKLNLMVK-EDP 81
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI KNGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVAS 259
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWL 302
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 156/282 (55%), Gaps = 26/282 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F ++V + LLG + L V HD
Sbjct: 30 LSDEMISFINEHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLK 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC-------- 202
L++CC + CG GCDGG+ +W Y+V G+VT + TGC P C
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKY 204
Query: 203 ----EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
+ Y TP+C + C K N + KHY +Y + S I +I +GPVE
Sbjct: 205 RACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLE 264
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF +YKSG+Y++ TG + GHAV+LIG G ++G YW+
Sbjct: 265 IYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGV-ENGTAYWL 305
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 106/229 (46%), Positives = 138/229 (60%), Gaps = 20/229 (8%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
VK + +P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI + +N
Sbjct: 72 VKHDIQEDTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVN 131
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DS 194
LS D+L+CC CG GC+GGYPI+AW+Y V G T C PY ++
Sbjct: 132 TLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGET 190
Query: 195 TG-CSHPGCEP-AYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKNG 250
G + P C Y TP CV KC N +++ KH+ +AY + I AEI +G
Sbjct: 191 VGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHG 250
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
PVE +FTVYEDF YKSGVY H TG+ +GGHA++++GWGT D+G YW+
Sbjct: 251 PVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRILGWGT-DNGTPYWL 298
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 105/296 (35%), Positives = 145/296 (48%), Gaps = 14/296 (4%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
+++L ++ A G + L D+ +L + + +N+ WKA N + N T + K
Sbjct: 5 VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYNGKMQNITFSEAK 64
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
L G + L KLP++FDA WP C TI I DQ C + WA
Sbjct: 65 RLTGARIQKSSALPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124
Query: 132 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+V +G+ + C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183
Query: 191 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
Y C H G + + TPKC C K K+ + Y + ED
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDY 240
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP F VY D YKSGVY+H+ GD +GG AVK++GWG +G YW
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYW 295
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 156/287 (54%), Gaps = 31/287 (10%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
S L + IL D I +N ++ W A RN P+ + + K L G TP L+G
Sbjct: 15 SALSAQNPILSDEFINSINAQ-QSTWTAGRNFPE--DTPIEHLKRLNGALITPD--LVG- 68
Query: 88 PVKTHDKSL---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--H 142
+TH ++ +P++FD R+ W QC ++ I +QG+CGSCWAFG+VE ++DR CI
Sbjct: 69 KNQTHVINVIPEAIPETFDGRTHWSQCPSLKNIRNQGNCGSCWAFGSVEVMTDRLCIASK 128
Query: 143 FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 195
S +DLLACC CG GCDGG P A+ Y+V G+V+ E C PY S
Sbjct: 129 GKTKFEFSADDLLACCT-ACGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNEGCQPYEGSA 187
Query: 196 GCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPV 252
+ TPKC KC+ K + KHY Y + + +I EI NGPV
Sbjct: 188 FLNSV-------TPKCSTKCLNSKYTTPYAKDKHYGTDFIYMTSKNVAEIQTEIMNNGPV 240
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYEDF YKSGVY+H++G+ MGGHAVK+IGWGT + G YW+
Sbjct: 241 VTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGT-EKGVPYWL 286
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 114/279 (40%), Positives = 162/279 (58%), Gaps = 20/279 (7%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMILFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL- 154
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S ++ L
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPY-----FDSTGCSHPGC 202
L C CG GC GG+P AW Y+V G+VT EE C PY T +P C
Sbjct: 148 LISCCEDCGGGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPAC 207
Query: 203 -EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
Y TP+C + C K + + KHY Y + S+ + I EI GPVE +F VYE
Sbjct: 208 GTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYE 267
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF +YKSG+Y+H+ G ++GGHA+++IGWG + G+ YW+
Sbjct: 268 DFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGKPYWL 305
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 91/190 (47%), Positives = 124/190 (65%), Gaps = 14/190 (7%)
Query: 123 CGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
CGSCWAFGAVEA+SDR CIH +++ +S DLL CCG +CGDGC+GGYP AW ++ G
Sbjct: 1 CGSCWAFGAVEAISDRICIHTNVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG 60
Query: 183 VVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYS 229
+V+ C PY S P C TPKC + C + ++ KHY
Sbjct: 61 LVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG 120
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
+Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG
Sbjct: 121 YDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWG 180
Query: 290 TSDDGEDYWV 299
++G YW+
Sbjct: 181 V-ENGTPYWL 189
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 108/278 (38%), Positives = 156/278 (56%), Gaps = 26/278 (9%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLP 99
+I +N++P AGWKA ++ +F ++V + LLG + L V HD ++++P
Sbjct: 1 MISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLNVEIP 58
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 157
FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS DL++C
Sbjct: 59 SHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISC 118
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE----------- 203
C + CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 119 CKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACG 175
Query: 204 -PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
Y TP+C + C K N + KHY +Y + S I +I +GPVE +YED
Sbjct: 176 DKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYED 235
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 236 FLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 272
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 105/296 (35%), Positives = 145/296 (48%), Gaps = 14/296 (4%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
+++L ++ A G + L D+ +L + + +N+ WKA N + N T + K
Sbjct: 5 VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWKAVYNGKMQNITFSEAK 64
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
L G + L KLP++FDA WP C TI I DQ C + WA
Sbjct: 65 RLTGARIQKSSGLQPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124
Query: 132 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+V +G+ + C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183
Query: 191 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
Y C H G + + TPKC C K K+ + Y + ED
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDY 240
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP F VY D YKSGVY+H+ GD +GG AVK++GWG +G YW
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYW 295
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 96/224 (42%), Positives = 130/224 (58%), Gaps = 20/224 (8%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C RKC +++R K Y AY + + I +EI +NGPV S
Sbjct: 200 TYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWL 302
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 104/296 (35%), Positives = 145/296 (48%), Gaps = 14/296 (4%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
+++L ++ A G + L D+ +L + + +N+ W+A N + N T + K
Sbjct: 5 VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYNGKMQNITFSEAK 64
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
L G + L KLP++FDA WP C TI I DQ C + WA
Sbjct: 65 RLTGARIQKSSALPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124
Query: 132 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+V +G+ + C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183
Query: 191 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
Y C H G + + TPKC C K K+ + Y + ED
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDY 240
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP F VY D YKSGVY+H+ GD +GG AVK++GWG +G YW
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYW 295
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 112/294 (38%), Positives = 164/294 (55%), Gaps = 32/294 (10%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
L+L + + TFA+ LD + ++I+++N + GW AA PQF+ T+ +
Sbjct: 4 LLLALAAVSTFAQ----LSTLDRPVHDHTLIQKINADSSIGWTAAAYPQFAGMTLRDARK 59
Query: 73 LLG---VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
LLG V P + +P KT +LK SFDAR+ W +C + I DQ CGSCWAF
Sbjct: 60 LLGTVLVHP-----INNLPKKTMPANLKAASSFDARTKWGKC--VHPIRDQQQCGSCWAF 112
Query: 130 GAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
A E LSDRFCI + +++ LS +L C GCDGGY +AW + G+ +++
Sbjct: 113 SASEVLSDRFCIASNGSVDVVLSPEYMLQCDS--TDYGCDGGYLNNAWAFLAGTGIPSDK 170
Query: 188 CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 247
CDPY ++G G P T K K +K S++ S +DI +I
Sbjct: 171 CDPY--TSGNGDVGSCPTSCTDGSAIKLYK-------AKSSSVAQL---SSIDDIQKDIQ 218
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED--YWV 299
NGPV+ +F+VY+DF YKSGVY+H++G + GGHA+K++GWG + DG+D YW+
Sbjct: 219 ANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWI 272
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 104/301 (34%), Positives = 153/301 (50%), Gaps = 25/301 (8%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
+++L ++ A G + L D+ +L + + +N+ WKA + + N T + K
Sbjct: 5 VVVLSSFAAALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYDGKMQNLTFSEAK 64
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLK--LPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
L G L P + ++ L+ LP+SFDA WP C TI I DQ C + WA
Sbjct: 65 RLTGAFSRKTSTL--PPARFTEEQLRTDLPESFDAAEHWPHCPTIREIADQSACRASWAV 122
Query: 130 GAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 188
A+SDR+C + G L +S DL+ACC CG GC+GGYP +AW Y+V HG+ + +C
Sbjct: 123 ATASAISDRYCTVGKGKQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGIASSQC 181
Query: 189 DPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINS 237
PY C H G + + TP+C C K +R + Y +
Sbjct: 182 QPY-PFPRCEHRGAQGKKTPCSKYKFVTPQCNATCTDKTIPLIKYRGNHSYEVRG----- 235
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
ED E+Y NGP V F V+ DF YK+GVY+H+ G+ +GG AV+++GWG +G Y
Sbjct: 236 -EEDYKRELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKL-NGTPY 293
Query: 298 W 298
W
Sbjct: 294 W 294
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 114/274 (41%), Positives = 146/274 (53%), Gaps = 29/274 (10%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
+ KEVN K W A +YT LG K L P K LP+
Sbjct: 22 EVAKEVNAM-KTTWLANEAIPTRDYT-----QYLGALRGGKQL----PEKNIAIRGDLPE 71
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
SFD WP+C ++ I DQ CGSCWAFGA EA +DR CI + LS DLL CC
Sbjct: 72 SFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLLTCC 131
Query: 159 GFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPA 205
CG GC+GG+P AW +F GV T + C+ Y + C H P C
Sbjct: 132 E-SCGFGCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAY-EFPKCDHHVEGKYPPCGET 189
Query: 206 YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
PTP+CV KC + + ++ KH+ AY + S+ E I E+ NGP+EV F+VYEDF
Sbjct: 190 QPTPECVEKCQEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMT 249
Query: 265 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
YKSG+Y+H+ G +GGHAVKL+GWG +DG +YW
Sbjct: 250 YKSGIYQHVAGKYLGGHAVKLVGWGV-EDGVEYW 282
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 101/295 (34%), Positives = 144/295 (48%), Gaps = 12/295 (4%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
++L ++ A G + D +L + + +N+ WKA N + N T + K
Sbjct: 4 FVVLSSFAATLVALGTSALRAKDGPVLTQTFVDRINQLNGGMWKAVYNGKMQNITFSEAK 63
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
L G + L KLP++FDA WP C TI I DQ C + WA
Sbjct: 64 RLTGARIQKSRTLPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 123
Query: 132 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR+C + G L +S DL+ACC CGDGC GG+P AW Y+V +G+ + +C P
Sbjct: 124 ASAISDRYCTVGGGKQLRISAADLMACCK-QCGDGCKGGFPGFAWLYYVEYGITSSQCQP 182
Query: 191 Y-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 243
Y + G P + + TPKC C K+ K+ + Y + ED
Sbjct: 183 YPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYK 240
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP F VY D YKSGVY+++ GD +GG AV+++GWG +G YW
Sbjct: 241 RELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYW 294
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 96/222 (43%), Positives = 129/222 (58%), Gaps = 23/222 (10%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+S+D R W +C ++ I DQ CGSCWA A E +SDR CI + +N +S DLL
Sbjct: 78 IPESYDVRDHWSKCISVDNIRDQSDCGSCWAVAAAETISDRLCIASNGSINTFVSAEDLL 137
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGCSHPGC 202
+CC CGDGCDGGYP+ AWRY+V G+V+ C PY + G + P C
Sbjct: 138 SCCT-SCGDGCDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKC 196
Query: 203 EPAY--PTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
PA TP+C C K+ + KHY +SAY + I EI ++GPVE F
Sbjct: 197 -PAQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYPVGRKEAQIQTEILQHGPVEAGFL 255
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY DF YKSG+Y H++G +GGHAVK++GWG ++G YW+
Sbjct: 256 VYSDFYRYKSGIYTHVSGQELGGHAVKILGWGV-ENGTKYWL 296
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 103/296 (34%), Positives = 146/296 (49%), Gaps = 14/296 (4%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
+++L ++ A G + L D+ +L + + +N+ W+A N + N T + K
Sbjct: 5 VVVLSSFAATLVALGASALLAKDAPVLTKTFVDHINQLNGGMWRAVYNGKMQNITFSEAK 64
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
L G + L KLP++FDA WP C TI I DQ C + WA
Sbjct: 65 RLTGARIQKSSALPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVST 124
Query: 132 VEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 190
A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+V +G+ + C P
Sbjct: 125 ASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQP 183
Query: 191 YFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
Y C H G + + TPKC C K+ K+ + Y + ED
Sbjct: 184 Y-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKSVPL--IKYRGNATYLLLHGEEDY 240
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP F VY D YKSGVY+++ GD +GG AVK++GWG +G YW
Sbjct: 241 KRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKL-NGTPYW 295
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 107/281 (38%), Positives = 148/281 (52%), Gaps = 23/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
+ ++E N+ + W+AAR +F + LG + L +P+K +++
Sbjct: 27 FSEKFVEEFNKRYNSTWRAARYQKFEEMDPETLQGHLGAL-IDEPLWAKLPIKNVEQTND 85
Query: 98 -LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDL 154
+P+SFD+R WP C++I I DQ CGSCWAF A E SDR CI L S+S DL
Sbjct: 86 PIPESFDSREQWPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSISSEDL 145
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 201
L CC CG+GC GGYP +AW+Y GV T C PY C H P
Sbjct: 146 LECCA-TCGNGCQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPP-CDHHVVGQYPP 203
Query: 202 CEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C P PTPKCV++C + + ++ H+ Y++ ++ E I EI +GPV+ SF V
Sbjct: 204 CGPIKPTPKCVKQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIMAHGPVQASFRVA 263
Query: 260 EDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY + GGH+VK+IGWG + G YW+
Sbjct: 264 SDFLTYKSGVYIRDPKLKYEGGHSVKIIGWGV-EQGTPYWL 303
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 107/300 (35%), Positives = 152/300 (50%), Gaps = 19/300 (6%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+T L + +++ + E +KL + I+E N+ + +N F +
Sbjct: 1 MITVWLFFIFTLTNAAYYEETYNKLLKE--------IQEKNDLEGLPYTFGKNAYFEGAS 52
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
+ K LLG K K + S+ LP DAR WPQC I + DQ +CGSC
Sbjct: 53 IETVKRLLGFKGKLLSHTSISSSKNANLSVDLPFEMDARKRWPQCKYIGFVRDQANCGSC 112
Query: 127 WAFGAVEALSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WA + ++DR CI LS +L++CC +CG GCDGGYP A+ Y+ G+
Sbjct: 113 WAVSSASVMTDRICIESIAAKQPLLSEEELVSCCK-ICGYGCDGGYPDKAFIYWATRGIP 171
Query: 185 TEECDPYFDSTGCS----HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 239
T PY + GC E TP C R+C+ + +H+ Y +NS+
Sbjct: 172 TG--GPYGSTKGCKPYSIGSNSEDEAETPLCTRQCINEYPYNLSQDRHFGEKPYWVNSNE 229
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E IM E+YKNGPV V+F VYEDF +Y GVY+H G +GGHAVKLIGWG ++ + YW+
Sbjct: 230 EQIMQELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGI-ENSKKYWL 288
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 93/219 (42%), Positives = 132/219 (60%), Gaps = 20/219 (9%)
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 156
P SFDAR+ WP+C +I I DQ CGSCWA + EA+SD C+ + + +S +D+L+
Sbjct: 90 PASFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILS 149
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--- 206
CCG CG GC GG+PI A+++ GVVT + C PY C H +P Y
Sbjct: 150 CCGISCGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPY-AFYPCGHHQNDPYYGPC 208
Query: 207 -----PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
PTPKC + C +K N+ ++ KH++ AY + ++ +I EIYKNGPV +F VY+
Sbjct: 209 PGGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNERNIRQEIYKNGPVVAAFRVYQ 268
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF++YK G+Y H G G HAVK++GWG ++ DYW+
Sbjct: 269 DFSYYKKGIYVHKWGGQTGAHAVKVVGWG-RENATDYWL 306
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 112/300 (37%), Positives = 161/300 (53%), Gaps = 35/300 (11%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQ 69
L+L IS VS+ ++D I I +N+ ++ W A RN +N + +
Sbjct: 6 FLLLASIS--------VSRAEID--IQSQDFIDSINQK-QSHWVARRNFPENTTNEYLYK 54
Query: 70 FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
LG+ P P + +K + +PK+FDAR WP+C +++RI DQG CGSCWAF
Sbjct: 55 LNGFLGLHPDPN--YMPEKIKHNFNPQDIPKTFDARKKWPKCDSLNRIRDQGSCGSCWAF 112
Query: 130 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 185
AVE +SDR CIH S DLL+CC CG C GGY ++A+ +++ GVV+
Sbjct: 113 AAVETMSDRICIHSSGAKKFFFSAEDLLSCCT-ACG-SCSGGYMMAAFDFYIKQGVVSGG 170
Query: 186 -----EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 239
E C PY T +H TP C + C K + + KHY Y +++
Sbjct: 171 DLNSNEGCRPY---TADAHDKG----VTPSCTKSCRKGYPTSYSSDKHYGSKDYIVDAGV 223
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+I EI NGP+ VSF VY+DF +Y SGVY H++G+ G H VK++GWGT + +DYW+
Sbjct: 224 SNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKE-QDYWL 282
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 99/225 (44%), Positives = 130/225 (57%), Gaps = 19/225 (8%)
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 149
+DK +P+SFDAR+ WP+CS++ I DQ +CGSCWA ALSDR CI + + +
Sbjct: 84 NDKGEDIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHV 143
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG- 201
S D+L+CCG CG GC+GG+PI A+ YF G VT C PY C H G
Sbjct: 144 SATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPCGHHGK 202
Query: 202 ------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
C TPKCVRKC + ++ + AY + + + I EI KNGPV
Sbjct: 203 DTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVG 262
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTVYEDF++YK G+YKH G GGHA+K+IGWG + G YW+
Sbjct: 263 AFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWL 306
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 110/255 (43%), Positives = 141/255 (55%), Gaps = 29/255 (11%)
Query: 51 KAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHD-KSLKLPKSFDARSAW 108
+AGW F ++ K L G + P LL +PVK HD +++PKSFDAR W
Sbjct: 1 QAGWN-----DFGEASMSDLKVLCGTILDDPD--LLNLPVKQHDLTDMEIPKSFDARMEW 53
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGC 166
C +I DQGHCGSCWAF + E LSDR CI N+ LS DLL+C G GC
Sbjct: 54 STCVRSHKIHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLLSC--DKAGRGC 111
Query: 167 -DGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
DGG AWRY GVV C PY +TG P+C+ KC + ++
Sbjct: 112 SDGGRLSEAWRYMQKKGVVANRCKPYTSGATGF----------IPECMSKCTGEGHAYQ- 160
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
K Y + Y ++ + + I EI NGPVE +FTVY D HYKSGVY H +G +GGHAVK
Sbjct: 161 -KFYGLYLYTVSGENQ-IKVEIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVK 218
Query: 285 LIGWGTSDDGEDYWV 299
++GWG D+ E+YW+
Sbjct: 219 VLGWGVEDE-EEYWL 232
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 105/302 (34%), Positives = 147/302 (48%), Gaps = 17/302 (5%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL S+ A G + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLL-----STALVALGASALLAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + K L G L KLP++FDA WP C TI I DQ C
Sbjct: 57 ITFAEAKRLTGAWIQKSSTLPPARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSACR 116
Query: 125 SCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
+ WA A+SDR+C + G L +S DLL+CC CGDGC GG+P AW Y+V +G+
Sbjct: 117 ASWAVSTASAISDRYCTVGGGKQLRISAADLLSCCK-QCGDGCKGGFPGFAWLYYVEYGI 175
Query: 184 VTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 236
+ C PY + G P + + TPKC C K+ K+ + Y +
Sbjct: 176 ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLL 233
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
ED E+Y NGP F VY D YKSGVY+++ GD +GG AV+++GWG +G
Sbjct: 234 HGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTP 292
Query: 297 YW 298
YW
Sbjct: 293 YW 294
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 106/284 (37%), Positives = 152/284 (53%), Gaps = 36/284 (12%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L S++ + + EVN +P P F + ++ +K + L L V +
Sbjct: 38 LVSYLRRSQSLFEVNSDP--------TPNFE-------QKIMDIKYNHQRLNLMVK-EDP 81
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D + +P S+D R W C+T I DQ +CGSCWA A+SDR CI +++S
Sbjct: 82 DPEVDIPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNIS 140
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPG-- 201
D++ CC CGDGC+GG+PI AW+YF++ GVV+ C PY C H G
Sbjct: 141 ATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPY-PIHPCGHHGND 199
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C PTP C ++C +++R K Y AY + + I +EI +NGPV S
Sbjct: 200 TYYGECRGTAPTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVAS 259
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W+
Sbjct: 260 FAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWL 302
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 180 bits (456), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 102/259 (39%), Positives = 142/259 (54%), Gaps = 12/259 (4%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL--GVPVKTHDKSLKLP 99
++ E+N GW A NP F ++ +F+ L + P L VK D+ +P
Sbjct: 15 MVHEINNRNDVGWTARVNPHFKSFNQKKFRSLNSAQHNPSFSLQFKNEFVKIEDE---IP 71
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLAC 157
+SFDAR+ WP C TI I DQGHCGSCWA + E L DRFCIH + LS D+ +C
Sbjct: 72 ESFDARTNWPNCPTIGHIYDQGHCGSCWAMCSFEVLQDRFCIHSNGSEKPWLSGQDITSC 131
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-V 216
GC+GG+ +A+ Y GV TEEC PY C HPGC ++ TP C ++C
Sbjct: 132 DSR--SHGCNGGWTETAFEYAKKAGVPTEECVPYLMGK-CHHPGCS-SWQTPTCKKECSS 187
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
N + ++++Y+ +Y I + E I E+ +NGPV FT Y+D A Y GVY H+ G
Sbjct: 188 LSNYNYSSNRYYASKSYSIQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYNHVMGS 247
Query: 277 VMGGHAVKLIGWGTSDDGE 295
G HA+K++GWG + E
Sbjct: 248 EQGLHAIKIVGWGVWRESE 266
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 107/278 (38%), Positives = 153/278 (55%), Gaps = 26/278 (9%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLP 99
+I +N++P AGWKA ++ +F ++V + LLG + L V HD ++++P
Sbjct: 1 MISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLNVEIP 58
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 157
FD+R WP+C +IS+I DQ C S WA AV A+SDR CI G ++ LS DL++C
Sbjct: 59 SHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISC 118
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE----------- 203
C CG GCDGG +W Y+V HG+VT + TGC P C+
Sbjct: 119 CKN-CGSGCDGGVTGYSWDYWVSHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACG 175
Query: 204 -PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
Y TP+C + C K N + KHY +Y + S I +I +G VE +YED
Sbjct: 176 DKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYED 235
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 236 FLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 272
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 104/217 (47%), Positives = 141/217 (64%), Gaps = 16/217 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP+SFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH +N+ +S D+L
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 203
CCG CGDGC+GG+P AW ++ G+V+ C PY S P C
Sbjct: 61 TCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 204 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF
Sbjct: 121 GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDF 180
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVY+H++G++MGGHA++++GWG ++G YW+
Sbjct: 181 LLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWL 216
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 93/210 (44%), Positives = 122/210 (58%), Gaps = 16/210 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC 157
+P +F++ W CS IS I +Q CGSCWAFGAVE++SDRFCIH G ++ LS DL+ C
Sbjct: 70 VPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVLLSFQDLVTC 129
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPK 210
+GC GG +A ++ G+V+ +C PY + P C PA TP+
Sbjct: 130 --DQSDNGCQGGDAYTAMKFIQKKGIVSNDCLPY------TIPTCAPAQQPCLNFVDTPQ 181
Query: 211 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
CV KC + + H+ Y +N I EI NGPVE F VYEDF YKSGVY
Sbjct: 182 CVEKCSNASYTYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSGVY 241
Query: 271 KHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+H TG +GGH VK+IGWGT ++ E YW+C
Sbjct: 242 QHTTGKDLGGHCVKMIGWGTQNN-ELYWIC 270
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 107/231 (46%), Positives = 129/231 (55%), Gaps = 23/231 (9%)
Query: 88 PVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-- 143
P TH +++LPK+FDAR+ WP C +IS I DQ CGSCWAFGAVEA+SDR CIH
Sbjct: 74 PTVTHVGFDAMRLPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCIHSNG 133
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HP 200
N SLS DLL+CC CG GC GGYP AW Y+ HG+VT D +GC P
Sbjct: 134 AFNKSLSAVDLLSCCEN-CGYGCSGGYPAVAWDYWGAHGIVTGGSKE--DPSGCRSYPFP 190
Query: 201 GCE------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
CE YPTP+CV+ C + K + +Y I S IM EI
Sbjct: 191 KCEHHVQGHYPPCPHQYYPTPECVQHCDTPGIDYVKDKTRANMSYNIYSSEILIMKEIML 250
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GPVE FTVYEDF YK GVY H G + HA++++GWG D YW+
Sbjct: 251 RGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGD-VPYWL 300
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 113/274 (41%), Positives = 145/274 (52%), Gaps = 29/274 (10%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
+ KEVN K W A +YT LGV + L P KT LP+
Sbjct: 22 EVAKEVNAM-KTTWIANEAIPTRDYT-----QYLGVLFGDRQL----PSKTIVARGDLPE 71
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
SFD WP+C ++ I DQ CGSCWAFGA EA +DR CI + LS DLL CC
Sbjct: 72 SFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDLLTCC 131
Query: 159 GFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPA 205
CG GCDGG+ AWR+F GV T + C+ Y C H P C +
Sbjct: 132 D-SCGFGCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAY-SFPKCEHHAEGKYPPCGES 189
Query: 206 YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
TP+CV++C + + + KH+ AY + + I E+ NGP+EVSF VYEDF
Sbjct: 190 QETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLT 249
Query: 265 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
YKSG+Y+H+ G +GGHAVKL+GWG +DG +YW
Sbjct: 250 YKSGIYQHVAGKYLGGHAVKLVGWGV-EDGIEYW 282
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 152/290 (52%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I ++N N K WKA N P+ S + F LLG K + KT
Sbjct: 18 AYFLEEDYINQINTNAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQTSPDMFKT 73
Query: 92 HDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD++ ++P +FDAR W +CSTI + DQGHCGSCWAFG A +DR CI
Sbjct: 74 HDEAYNSLPNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW +F HG+VT E C PY
Sbjct: 134 FNELLSAEELAFCC-HKCGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C R C +L ++ H++ AY + I ++ G
Sbjct: 193 LDEYGNNTCRGKPAEKNHRCTRMCYGNQELDFKEDHHWTRDAYYLTYTT--IQKDVMAYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF +YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 113/290 (38%), Positives = 153/290 (52%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I ++N N K WKA N P+ S + F LLG K + KT
Sbjct: 18 AYFLEEDYINQINANAKT-WKAGANFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73
Query: 92 HDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD++ ++P +FDAR W +CST+ ++ DQG+CG+CWAFG A +DR CI
Sbjct: 74 HDEAYNSLPNRIPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 134 FNELLSAEELAFCC-HKCGSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
FD G + +PA +C R C L ++ Y+ AY +N + I ++ G
Sbjct: 193 FDEYGNNTCRGKPAEKNHRCTRMCYGNQNLDFKEDHRYTRDAYYLNY--QIIQNDLMTYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E S+ VY+DF +YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 102/250 (40%), Positives = 141/250 (56%), Gaps = 21/250 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVND 153
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSHPG 201
L++CC CGDGC GG+P AW Y+V G+VT C PY T +P
Sbjct: 148 LISCCE-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPA 206
Query: 202 C-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C Y TP+C + C K + + KHY +Y + S+ + I EI GPVE +F VY
Sbjct: 207 CGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNEKAIQREIMMYGPVEAAFDVY 266
Query: 260 EDFAHYKSGV 269
EDF +YKSG+
Sbjct: 267 EDFLNYKSGI 276
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/217 (47%), Positives = 140/217 (64%), Gaps = 16/217 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP+SFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH +N+ +S D+L
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCE 203
CCG CGDGC+GG P AW ++ G+V+ C PY S P C
Sbjct: 61 TCCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCT 120
Query: 204 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF
Sbjct: 121 GEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDF 180
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVY+H++G++MGGHA++++GWG ++G YW+
Sbjct: 181 LLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWL 216
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/289 (36%), Positives = 154/289 (53%), Gaps = 23/289 (7%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG 86
+V K + L + I +N + ++ W A +N N ++ + K+LLG K KG L
Sbjct: 13 IVLSYKGSPNPLSNDFINYIN-SKQSTWVAGKNFD-ENLSIQEIKNLLGAK---KGKLGV 67
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HF 143
TH + +++P SFDAR W +CS IS ++DQ CGSCWA A A+SDR CI
Sbjct: 68 AKEFTHSEDIQVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQG 127
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY----- 191
+ + +S +LL+CC CG GC+GGYP AW Y++ G+ T + C PY
Sbjct: 128 KLKVPVSAENLLSCCDS-CGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPC 186
Query: 192 -FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 250
+ G Y TP C KC +++ + + R +I EI NG
Sbjct: 187 EHHTEGNKVQCSTLDYDTPSCKHKCDDSALNYKSELTFGSGSVRNFYSVANIQKEILTNG 246
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
PVE +F VY DF +YKSGVY+H+ G+ +GGHAV+++GWG + G YW+
Sbjct: 247 PVEAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWG-EESGVPYWL 294
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 148/290 (51%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I ++N N K WKA N P+ S + F LLG K + KT
Sbjct: 18 AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P SFDAR W +CSTI + DQG CGSCWAFG A +DR CI
Sbjct: 74 HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 134 FNELLSAEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C R C L ++ HY+ AY + I +I G
Sbjct: 193 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 109/277 (39%), Positives = 148/277 (53%), Gaps = 24/277 (8%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT-PKGLLLG 86
+SK K+ S L D I GW+A PQF N T K +LG + P+G L
Sbjct: 19 ISKEKVISRDLVDKI-----NTLNVGWEATLYPQFENLTFESAKSMLGSRGAWPEGSL-- 71
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 144
P + +P++FDAR WP +I I +QG CGSCWAFGA E LSDRF I
Sbjct: 72 PPEIEVRVAENIPENFDARKQWP--GSIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQ 129
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC-DPYFDSTGCSHPGCE 203
+ ++LS L+ C L GC GG+PI+AW Y V G++TE+C PY+ C
Sbjct: 130 IYVTLSAQQLVDCD--LDNSGCSGGWPINAWNYMVKTGLLTEQCYGPYY----AKQYTCR 183
Query: 204 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
T C + K + + Y + A + E I +I NGPVE FT+++DF
Sbjct: 184 LTANTTDCPWQPGVKARFYHAKSAYKLPAKNV----EAIQTDIMNNGPVEADFTIFQDFY 239
Query: 264 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
Y+SG+Y H TG +GGHA+K++GWGT D+ DYW+C
Sbjct: 240 AYRSGIYVHATGKQLGGHAIKILGWGTEDN-VDYWLC 275
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 110/299 (36%), Positives = 158/299 (52%), Gaps = 36/299 (12%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
F++T L+ L V + V + L L+ +L D I N N A W A RNP+F +
Sbjct: 3 FISTLLIALTVFA-------VCNALDLNKPVLDDKFIHNHNAN-GASWVAGRNPRFEGQS 54
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
+G LLG K P+ P + + +P SFD+R+ WP C + +L+QG CGSC
Sbjct: 55 IGDILGLLGTK-KPRN----TPEEVSVSKVAVPNSFDSRTNWPGC--VHAVLNQGQCGSC 107
Query: 127 WAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAF A E+LSDR CI +N++LS L++ C GC+GG P AW Y HG+
Sbjct: 108 WAFAASESLSDRLCIASQGAINVTLSPQALVS-CDIEFNQGCNGGIPQMAWEYLELHGIP 166
Query: 185 TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDI 242
T+ C PY G + P C ++C K QL++ K +++ + S I
Sbjct: 167 TDSCFPYTSGNGTA----------PDCQKECSDGSKYQLYK-GKTFTL---KTCSSVAAI 212
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGT-SDDGEDYWV 299
A ++ GP+E + VY+DF Y SGVY G ++GGHA+K++GWGT S G DYW+
Sbjct: 213 QANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIKIVGWGTDSTSGLDYWI 271
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 117/290 (40%), Positives = 149/290 (51%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLG-VPVKT 91
++ L++ I ++N N K WKA N P+ S + F LLG K V KT
Sbjct: 18 AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPVMFKT 73
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P SFDAR W +CSTI + DQG+CGSCWAFG A +DR CI
Sbjct: 74 HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C + C L ++ HY+ AY + I ++ G
Sbjct: 193 LDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 129/220 (58%), Gaps = 21/220 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+SFDAR+ WP+CS++ I DQ +CGSCWA ALSDR CI + + +S D+L
Sbjct: 2 IPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPG------ 201
+CCG CG GC+GG+PI A+ YF G VT C PY F C H G
Sbjct: 62 SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHP--CGHHGKDTYYG 119
Query: 202 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKCVRKC + ++ + AY + + + I EI KNGPV +FTVY
Sbjct: 120 ECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVY 179
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDF++YK G+YKH G GGHA+K+IGWG ++G YW+
Sbjct: 180 EDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KENGVPYWL 218
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 148/290 (51%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I ++N N K WKA N P+ S + F LLG K + KT
Sbjct: 21 AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 76
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P SFDAR W +CSTI + DQG CGSCWAFG A +DR CI
Sbjct: 77 HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 136
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 137 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 195
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C R C L ++ HY+ AY + I +I G
Sbjct: 196 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 253
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 254 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 302
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/210 (45%), Positives = 120/210 (57%), Gaps = 12/210 (5%)
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLA 156
P +FDAR+ WPQC ++ I +Q +CGSCWAF E +SDR CI +S DLL
Sbjct: 84 PLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLT 143
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPK 210
CCG CG+GCDGG+P A++++ GVVT C PY C+ C TP
Sbjct: 144 CCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGTGCKPY-PIRPCNSDNCV-NLQTPP 201
Query: 211 CVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C C + N K+Y SAY + I A+IY NGPV +F VYEDF YKSG+
Sbjct: 202 CRLSCQPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSGI 261
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+HI G GGHAVKLIGWGT + G YW+
Sbjct: 262 YRHIAGRSKGGHAVKLIGWGT-ERGTPYWL 290
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 107/298 (35%), Positives = 142/298 (47%), Gaps = 21/298 (7%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+IL +S A + + ++ +L + VN W A + + N TV + K
Sbjct: 5 VILCSVSVVLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTAVYDGRMQNTTVSEAKR 64
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
L P +L V + LP++FDA WP C TI+ I DQ CGSCWA A
Sbjct: 65 LNRATRKPVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAA 124
Query: 133 EALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
+++DR+C IH L +S DLLACCG CG GC GG P AW YF G+ + C PY
Sbjct: 125 TSMTDRYCTIHGVRGLRISAADLLACCG-DCGYGCLGGDPDMAWAYFSSEGIASGRCQPY 183
Query: 192 FDSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPE 240
CSH YP TP C C + +R K YS+S E
Sbjct: 184 -PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSLSG------EE 236
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
D E+Y GP + F V+ D YK GVYKH+ G +G HAV+++GWG + G YW
Sbjct: 237 DFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYW 293
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 128/220 (58%), Gaps = 21/220 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+SFDAR+ WP+CS++ I DQ +CGSCWA ALSDR CI + + +S D+L
Sbjct: 2 IPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPG------ 201
+CCG CG GC+GG+PI A+ YF G VT C PY F C H G
Sbjct: 62 SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHP--CGHHGKDTYYG 119
Query: 202 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKCVRKC + ++ + AY + + + I EI KNGPV +FTVY
Sbjct: 120 ECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVY 179
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDF++YK G+YKH G GGHA+K+IGWG + G YW+
Sbjct: 180 EDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWL 218
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 121/213 (56%), Gaps = 21/213 (9%)
Query: 105 RSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLC 162
RS WPQC TIS I DQ CGSCWA A A+SDR CIH M L+ D L+CC + C
Sbjct: 1 RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-C 59
Query: 163 GDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYP 207
G GC GGYP AW Y++ G+VT C P+ T C H G YP
Sbjct: 60 GQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYP 118
Query: 208 TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
TP C R C N+ + K Y S+Y + IM EI KNGPVEV+F +++DF Y+
Sbjct: 119 TPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYR 178
Query: 267 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SG+Y H+ G +G HAV++IGWG ++G +YW+
Sbjct: 179 SGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWL 210
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 148/290 (51%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I ++N N K WKA N P+ S + F LLG K + KT
Sbjct: 18 AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P SFDAR W +CSTI + DQG CGSCWAFG A +DR CI
Sbjct: 74 HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C R C L ++ HY+ AY + I +I G
Sbjct: 193 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 147/279 (52%), Gaps = 33/279 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSL 96
L D +I +N++P AGWKA ++ +F + +F L G K P P V HD ++
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRFHSVDDARFL-LGGRKEDPNLRQKRRPTVDHHDLNV 88
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G S
Sbjct: 89 EIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAISDRICIQSGGKQSY------- 141
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE---------- 203
CG GCDGG+ +W Y+V G+VT + TGC P C+
Sbjct: 142 -----CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRAC 194
Query: 204 --PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
Y TP+C + C K N + KHY +Y + S I +I +GPVE +YE
Sbjct: 195 GDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYE 254
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+
Sbjct: 255 DFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWL 292
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/306 (37%), Positives = 159/306 (51%), Gaps = 36/306 (11%)
Query: 23 FAEGVVSKLKLDSHILQDSIIKE---VNENPKAGWKAARNPQF----SNYTVGQFKHLLG 75
F V+ L D ILQD++ KE + + A + F S + K+LL
Sbjct: 4 FLTLFVAILAADEKILQDAVKKESKALTGHALAEFLRTLQSLFEVKKSEEVPVRMKYLL- 62
Query: 76 VKPTPKGLLLGVPVKTHDKSLKL----PKSFDARSAWPQC-STISRILDQGHCGSCWAFG 130
PK ++ P + ++L P+ FDAR AWP C I + DQ CGSCWA
Sbjct: 63 ----PKHFMVK-PKEEDRTKIQLDKEPPEKFDARDAWPYCREIIGHVRDQSRCGSCWAVS 117
Query: 131 AVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE- 187
A +SDR C+ + L V+D +LACCG CGDGC GG+P AW + +GV T
Sbjct: 118 AASVMSDRLCVQSNGKIKLHVSDTDILACCGEFCGDGCSGGWPFQAWEWVRKYGVCTGGD 177
Query: 188 ------CDPYFDSTGCSHP-----GCEP--AYPTPKCVRKCVKKN-QLWRNSKHYSISAY 233
C PY +H G P ++PTP+C + C + + ++ K Y+ +Y
Sbjct: 178 YRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRCEKFCQRGYIKPYKKDKFYAKKSY 237
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+ +D ++I +I KNGPV+ +F VYEDF YK G+YKH G GGHAVK+IGWG D+
Sbjct: 238 WLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWG-KDN 296
Query: 294 GEDYWV 299
G DYW+
Sbjct: 297 GTDYWL 302
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 107/298 (35%), Positives = 141/298 (47%), Gaps = 21/298 (7%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+IL +S A + + ++ +L + VN W A + + N TV + K
Sbjct: 5 VILCSVSVVLLAMNTSALVAREAPLLTKEFVDTVNRLSGGMWTAVYDGRMQNTTVSEAKR 64
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
L P +L V + LP++FDA WP C TI+ I DQ CGSCWA A
Sbjct: 65 LNRATRKPVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAA 124
Query: 133 EALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
+++DR+C IH L +S DLLACCG CG GC GG P AW YF G+ + C PY
Sbjct: 125 TSMTDRYCTIHGVRGLRISAADLLACCG-DCGYGCLGGDPDMAWAYFSSEGIASGRCQPY 183
Query: 192 FDSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPE 240
CSH YP TP C C + +R K YS S E
Sbjct: 184 -PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSFSG------EE 236
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
D E+Y GP + F V+ D YK GVYKH+ G +G HAV+++GWG + G YW
Sbjct: 237 DFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYW 293
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 108/281 (38%), Positives = 142/281 (50%), Gaps = 24/281 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK--S 95
L D ++E +P G + + + G HL G L P H+ +
Sbjct: 25 LTDLGVQEY-AHPSMGARWIAGGRLERFETGNSLHLFGAMRETAEQRLQRPTVRHEDFDN 83
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVND 153
LP+SFDAR+ WP C +IS I DQ CGSCWAFGAVEA+SDR CIH N SLS D
Sbjct: 84 QHLPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVD 143
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH---PGCE------- 203
L++CC CG GC GGY AW + HG+VT TGC P CE
Sbjct: 144 LVSCC-TECGCGCRGGYSPIAWDLWKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQY 200
Query: 204 -----PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
YPTP+C+++C K + K + +Y + + +M EI GPV V
Sbjct: 201 PPCPHQLYPTPECIKRCDTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHV 260
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YED YKSGVY H+ G +G H ++++GWG +DG YW+
Sbjct: 261 YEDLLDYKSGVYFHVWGGHLGEHGIRILGWG-EEDGVPYWL 300
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/270 (36%), Positives = 142/270 (52%), Gaps = 10/270 (3%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
+ S I ++ I+ +NE W A +N F T Q K L V + + +PV H
Sbjct: 22 VPSQIDTEAFIQSINEKATT-WTARKN--FEGRTPEQLKALADVIGINRDPNVTLPVVFH 78
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
+ +P SFDAR WP C +I I D+G CGSCWAF AVE +SDR C+ S
Sbjct: 79 EAISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFS 138
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
++++CC CG GC GG+ ++Y+V +G+ + Y GC + TP+
Sbjct: 139 AEEVVSCC-TACGGGCRGGFLNEPYKYWVTNGIPSG--GDYGSKLGCKPYTAAVSGETPQ 195
Query: 211 CVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C + CV + W ++ SAY++N I EI NGPV VYEDF Y +G+
Sbjct: 196 CQKACVSGYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTGI 255
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+H +G +GGHAVK+IGWG+ +D YW+
Sbjct: 256 YQHTSGSFVGGHAVKIIGWGSEND-VPYWI 284
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 98/249 (39%), Positives = 138/249 (55%), Gaps = 47/249 (18%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ + +P SFDAR WP C +I I +Q +CG+CWAFGA E +SDR CI G +SV
Sbjct: 72 QGVYVPISFDARDHWPNCKSIKLIRNQAYCGACWAFGAAEIISDRICIQSGGAHQPIISV 131
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH------PGCEPA 205
D+L+CCG CG+GC GGYP+ +++++ GVVT ++ TGC CE +
Sbjct: 132 EDILSCCGSSCGEGCKGGYPLEGLKFWMNSGVVT---GGDYNGTGCQPYTFPPCSSCEAS 188
Query: 206 YPTPKCVRKC--------VKKNQLWRNSKH---------YSI--------SAYRINSDPE 240
TP C +KC K ++ + N + Y + SAYR+++
Sbjct: 189 KSTPSCQKKCQTGYLEATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLSTTTS 248
Query: 241 D----------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
I EIY NGPVEVS+ V+EDF YKSGVY +++G + G HAVK+IGWGT
Sbjct: 249 SNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGT 308
Query: 291 SDDGEDYWV 299
++ DYW+
Sbjct: 309 -ENKVDYWL 316
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 103/287 (35%), Positives = 159/287 (55%), Gaps = 27/287 (9%)
Query: 26 GVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGL 83
V+S + +L I +N ++ W A RN +N + + +G+ P P
Sbjct: 12 AVLSASLAEIDVLSSEFIDSINR-IQSSWVAGRNFPENTTNEYLYKLNGFIGLHPDPN-- 68
Query: 84 LLGVPVKTHDKSLK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
PV H + + +P+SFDAR+ WP C +++RI DQG CGSCWAF ++E++SDR CIH
Sbjct: 69 -YKPPVLVHTFNARDVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIH 127
Query: 143 F--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
S DLL+CC CGD C GGY +SA ++++ G+V+ E C PY
Sbjct: 128 SSGSAQFMFSPEDLLSCCT-SCGD-CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY-- 183
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
T +H + TP C + C + + KHY + Y ++S + I E+ NGP+
Sbjct: 184 -TADAHDQGQ----TPACTKSCRNGYSTSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPI 238
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+F V++DF +Y SGVY+H++G+ +G H VK++GWG ++G YW+
Sbjct: 239 IVNFEVFQDFYNYVSGVYRHVSGESVGFHVVKIVGWGV-ENGVPYWL 284
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/230 (43%), Positives = 126/230 (54%), Gaps = 32/230 (13%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
LP +FDAR WP C+TI I +Q CGSCWAFGA E +SDR CI +SV D+L
Sbjct: 30 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 89
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG GC GGY I A R++ G VT C PY S C P TP
Sbjct: 90 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTP 146
Query: 210 KCVRKCVK--KNQLWRNSKHYS----------------ISAYRINSDPE--DIMAEIYKN 249
C C K + ++ KHY SAY++ + +I EIY
Sbjct: 147 SCKTTCQSSYKTEEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHY 206
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GPVE S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG ++G DYW+
Sbjct: 207 GPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWL 255
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 102/250 (40%), Positives = 146/250 (58%), Gaps = 24/250 (9%)
Query: 68 GQFKHLLGVKPTPKGLLLGVPVKT-HDKSLK---LPKSFDARSAWPQCSTISRILDQGHC 123
G+F+ + G+ +P L +P K H SL +P FDAR WP C +I + +QG C
Sbjct: 59 GEFRSIKGIYESP--LDFTLPSKRLHASSLDEVVIPDRFDAREKWPFCQSIHSVRNQGTC 116
Query: 124 GSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVH 180
GSCWA V +SDR CIH +NL L+ DL+ CC CG+GC+GG+ +A++Y+V
Sbjct: 117 GSCWAVATVSVMSDRLCIHSDGEVNLELATEDLMGCCK-DCGNGCNGGFLDGTAFQYWVD 175
Query: 181 HGVVT-------EECDPY-FDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYS 229
G+V+ E C PY F+ CS+P GC PKC+ C+ ++ +R K +
Sbjct: 176 AGLVSGAPYNSSEGCKPYPFEP--CSYPFVGCHHEKKNPKCLHHCINGYDRKYRKDKFFG 233
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
+AY+I +D I EI NGPV F V+EDF Y SGVYKH+ G +G HA++++GWG
Sbjct: 234 ATAYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWG 293
Query: 290 TSDDGEDYWV 299
T ++G YW+
Sbjct: 294 T-ENGTPYWL 302
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/210 (44%), Positives = 122/210 (58%), Gaps = 12/210 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P SFD+R+ W C++I I DQ CGSCWAF E +SDR CI ++S D+L
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS-HPGCE----PAYPTPK 210
ACCG CGDGC GGYPI A+R++ GVVT F +GC +P P TP
Sbjct: 141 ACCGNSCGDGCKGGYPIQAFRWWNSRGVVT---GGDFRGSGCRPYPFAPCISCPEEKTPT 197
Query: 211 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C C + + K + +SAY + + I EI NGPV +FT+YED YKSGV
Sbjct: 198 CSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGV 257
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+H G ++GGHA+K+IGWGT +G YW+
Sbjct: 258 YRHTAGRLLGGHAIKIIGWGT-QNGIPYWL 286
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 147/290 (50%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L+ I ++N N K WKA N P+ S + F LLG K + KT
Sbjct: 21 AYFLEKDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 76
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P SFDAR W +CSTI + DQG CGSCWAFG A +DR CI
Sbjct: 77 HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGE 136
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 137 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCP 195
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C R C L ++ HY+ AY + I +I G
Sbjct: 196 LDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYG 253
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 254 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 302
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 133/232 (57%), Gaps = 28/232 (12%)
Query: 85 LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
L + + + ++LP+SFDAR W QC +++ I +QG CGSCWA A A++DR+CI
Sbjct: 74 LAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSK 133
Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 202
S D+LACC CGDGC GGY AW+++V GV + PY GC HP
Sbjct: 134 GKEQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP-- 187
Query: 203 EPAYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 247
YP TPKC ++C +W++ + Y AY I +D + IM EIY
Sbjct: 188 ---YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIY 243
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPV+ +F Y+D YKSGVY+H+ G + GGHAVKL+GWG ++G YW+
Sbjct: 244 INGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWL 294
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 117/290 (40%), Positives = 148/290 (51%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLG-VPVKT 91
++ L+ I ++N N K WKA N P+ S + F LLG K V KT
Sbjct: 18 AYFLEVDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASLVMFKT 73
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P SFDAR W +CSTI + DQG+CGSCWAFG A +DR CI
Sbjct: 74 HDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C + C L ++ HY+ AY + I ++ G
Sbjct: 193 LDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 133/232 (57%), Gaps = 28/232 (12%)
Query: 85 LGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
L + + + ++LP+SFDAR W QC +++ I +QG CGSCWA A A++DR+CI
Sbjct: 74 LAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSK 133
Query: 145 --MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 202
S D+LACC CGDGC GGY AW+++V GV + PY GC HP
Sbjct: 134 GKEQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP-- 187
Query: 203 EPAYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 247
YP TPKC ++C +W++ + Y AY I +D + IM EIY
Sbjct: 188 ---YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIY 243
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPV+ +F Y+D YKSGVY+H+ G + GGHAVKL+GWG ++G YW+
Sbjct: 244 INGPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWL 294
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 110/306 (35%), Positives = 159/306 (51%), Gaps = 30/306 (9%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
MA + + L CL I G +S + S +Q++++ + + W A
Sbjct: 1 MAFTKILLVVCLAI-----------GTISGFSI-SDQMQNALVSAIRSRTRT-WVAQVYD 47
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARSAWPQCSTISRILD 119
Q + V LG++P + + VP+ + +S++ LP+SFD+R WP C ++++I D
Sbjct: 48 QREKFGVMN----LGLRPN-ESVANAVPLLENQRSVRSLPESFDSRQKWPNCPSLNQIRD 102
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
QG CGSC+ A++DR+CIH G + D LACC CDGGY W+Y
Sbjct: 103 QGCCGSCYVVSTAAAITDRYCIHSGGQKQFTFGATDYLACCTDCFK--CDGGYVGKTWQY 160
Query: 178 FVHHGVVTEECDPYFDSTGC-SHPGCEPAY--PTPKCVRKCVKKNQL-WRNSKHYSISAY 233
+V G+ +E PY GC S+P P P C R C L + Y SAY
Sbjct: 161 WVDSGLTSE--GPYKSGQGCNSYPFGSYCVNDPLPTCSRTCQAGYPLTYSQDLKYGGSAY 218
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
R+ + IM EIY+NGPV V F V+ DF YKSGVY+H+TG G HAV++IGWG ++
Sbjct: 219 RVMWNENAIMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGV-EN 277
Query: 294 GEDYWV 299
G YW+
Sbjct: 278 GVKYWL 283
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 125/219 (57%), Gaps = 18/219 (8%)
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 154
K+P SFDAR WP C +IS I DQ CGSCWAF + E +SDR CI H + LS +D+
Sbjct: 65 KIPDSFDARVTWPHCPSISYIRDQSQCGSCWAFSSAEVMSDRVCIASHGHKKVELSADDI 124
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPG 201
L+CC G GCDGG+P+SAW+YFV GVVT + C PY +
Sbjct: 125 LSCC-TDGGYGCDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSN 183
Query: 202 CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C TP C C + + + K Y +AY +++ I EI GPV +FTVY+
Sbjct: 184 CTQEIDTPDCKTTCQAGYPISYDDDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYD 243
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF HYK+G+YKH++G GGHAV+++GWG G YW+
Sbjct: 244 DFFHYKTGIYKHVSGAEAGGHAVRILGWG-QQGGVPYWL 281
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/295 (38%), Positives = 147/295 (49%), Gaps = 25/295 (8%)
Query: 26 GVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLL 85
G+ + +L DS+ +N+ K +++ +F +V K L G L
Sbjct: 55 GLSGLFSMSRPMLMDSLADALNQGQKTWVASSKQERFKGASVFDVKALCGTILNGPSKLP 114
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
P LP FDAR + C+T I + DQ CGSCWAF EA SDR CI
Sbjct: 115 KKPASESTALSNLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSS 174
Query: 145 MNLSL---SVNDLLACCGFLCG---DGCDGGYPISAWRYFVHHGVVTE---ECDPYFDST 195
L S ACC G GCDGG P SAWR+F HGVV+E C PY +
Sbjct: 175 GEFDLVPLSAGHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELDSGCWPY-NFP 233
Query: 196 GCSH----PGCEPAY---PTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDIMA 244
CSH G EP P+P C C +N ++ S +H++ + ++I
Sbjct: 234 ECSHHVETKGMEPCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEIKK 291
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI NGPV +FTVYEDF +YKSGVYKH+ G +GGHAVK+IGWGT D E YW+
Sbjct: 292 EIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGT-DQNEQYWL 345
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 116/290 (40%), Positives = 148/290 (51%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I ++N N K WKA N P+ S + F LLG K + KT
Sbjct: 18 AYFLEEDYINQINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P +FDAR W +CSTI + DQGHCGSCWAFG A +DR CI
Sbjct: 74 HDEAYNNWSNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GG PI AW F HG+VT E C PY
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +PA +C R C L ++ HY+ AY + I ++ G
Sbjct: 193 LDEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQYDVLAYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 148/289 (51%), Gaps = 32/289 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGL-----LLGV 87
++ L++ IK++N N K W+A N P+ S + F +LLG K +
Sbjct: 18 AYFLEEDYIKQINANAKT-WEAGVNFDPKLS---IDSFVNLLGSKGVQAAKKASPDMFKT 73
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 145
K ++ + ++P +FDAR W +C +I + DQGHCGSCWAFG A +DR CI
Sbjct: 74 GDKAYNLAQRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWAFGTSSAFADRLCIATEGEF 133
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 192
N LS +L CC CG GC+GGYPI AW F HG+VT E C PY
Sbjct: 134 NELLSAEELTFCC-HKCGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPL 192
Query: 193 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
D G + +P +C R C L + N HY+ AY + I ++ GP
Sbjct: 193 DEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFNNDHHYTRDAYYLTYGT--IQNDVLTYGP 250
Query: 252 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 IEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWG-EEYGVPYWL 298
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 98/223 (43%), Positives = 133/223 (59%), Gaps = 17/223 (7%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D S ++P SFDAR WP+C++I I DQ HCGSCWA + E +SDR C+ + + LS
Sbjct: 85 DFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLS 144
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY--FDSTGCSHPG 201
D+LACC CG GC GG+ I AW YF + GV T + C PY + S+
Sbjct: 145 DTDILACCPN-CGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYGK 203
Query: 202 C-EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C + ++PTPKC + C K ++ + + K+Y+ SAYRI + I EI +NGPV SF +Y
Sbjct: 204 CPKDSFPTPKCRKICQYKYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFRIY 263
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGED--YWV 299
DF Y+ GVY G +GGHA+K+IGWGT +G D YW+
Sbjct: 264 PDFGFYEKGVYVTSGGRELGGHAIKIIGWGTEKVNGTDLPYWL 306
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 159/320 (49%), Gaps = 37/320 (11%)
Query: 6 LFLTTCLL--ILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 63
+ + CLL ++ IS+ E V ++ + I +N NPK+ WKA N
Sbjct: 1 MRFSICLLFAVVSAISALPDQENTVREI-------ANKWIDAINNNPKSTWKAGHNFH-P 52
Query: 64 NYTVGQFKHLLGVKPTPKGL-----LLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRIL 118
+ + + LLGV L + +K +K+PK FDAR W +C ++ I
Sbjct: 53 DTPMSYLQGLLGVSELESNLADLDKYEEMEENEENKKIKVPKYFDARKKWKKCKSLREIR 112
Query: 119 DQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 176
DQG+CGSCWA A +DR CI + N +S +L++CC + CG GC+GG+P +AW
Sbjct: 113 DQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELMSCCSY-CGFGCEGGFPDAAWV 171
Query: 177 YFVHHGVVT-------EECDPYFDSTGCSH------PGC--EPAYPTPKCVRKCVKKNQL 221
+ HG+VT + C PY C H P C P PTP C C + L
Sbjct: 172 FIKRHGLVTGGDYHSHDGCQPY-PIAPCEHHMEGSKPNCSASPTEPTPACETTCTHGSSL 230
Query: 222 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HITGDVMG 279
++ + SAY + + EI+KNGP+ +F VYEDF YKSGVYK H G
Sbjct: 231 AYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRG 290
Query: 280 GHAVKLIGWGTSDDGEDYWV 299
HAVK+IGWG +G YW+
Sbjct: 291 RHAVKVIGWG-EQNGLPYWL 309
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 89/192 (46%), Positives = 123/192 (64%), Gaps = 16/192 (8%)
Query: 123 CGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVH 180
C WAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++
Sbjct: 11 CRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 70
Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
G+V+ C PY S P C TPKC + C + ++ KH
Sbjct: 71 KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKH 130
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++G
Sbjct: 131 YGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILG 190
Query: 288 WGTSDDGEDYWV 299
WG ++G YW+
Sbjct: 191 WGV-ENGTPYWL 201
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 106/289 (36%), Positives = 153/289 (52%), Gaps = 26/289 (8%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
K+ L++ +L+ + + + ++AA PQ N+ +K K ++ V
Sbjct: 27 EKIPLEAQLLRGEELINYLKTNQNFFEAAITPQSYNFKRNLMDRRF-IKHNRKPIVEDV- 84
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
+D +P+SFDAR+ WP CS+++ I DQ CGSCWA ALSDR CI
Sbjct: 85 ---NDDGDDIPESFDARTHWPNCSSLTHIRDQADCGSCWAVSTASALSDRICIASKGAKQ 141
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 199
+ +S D+L+CC CGDGCDGGY I A+++F G VT + C PY C H
Sbjct: 142 VYVSATDILSCC-HSCGDGCDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPY-PFHPCGH 199
Query: 200 PGCEPAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAYRIN-SDPEDIMAEIYKNG 250
G E Y TP+CVRKC + + + + AYR+ + I EI +NG
Sbjct: 200 HGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAYRLPIGSVKAIQKEIMRNG 259
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
PV +F V++DF+ Y+ G+Y H+ G GGHAVK+IGWGT + G YW+
Sbjct: 260 PVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGT-EHGVPYWI 307
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 113/289 (39%), Positives = 150/289 (51%), Gaps = 31/289 (10%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-K 90
+++ L++ I ++NEN K WKA N P+ S V F LLG K + K
Sbjct: 17 EAYFLEEDYINQINENAKT-WKAGINFDPKLS---VENFVKLLGSKGVQAAKKASPDMFK 72
Query: 91 THDKSL---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GM 145
T DK+ ++PK FDAR W +CSTI + DQG CGSCWAFG A +DR CI
Sbjct: 73 TDDKTYENQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGDF 132
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 192
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 133 NELLSAEELTFCC-HTCGYGCHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPL 191
Query: 193 DSTGCSHPGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
D G + +PA +C R C +++ ++ ++ AY + I ++ GP
Sbjct: 192 DEYGNNTCRGKPAEKNHRCTRMCYGDQDRDFKEDHRFTRDAYYLTYGT--IQKDVMTYGP 249
Query: 252 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+E S+ VY+DF YKSGVY + +GGHAVKLIGWG + G YW+
Sbjct: 250 IEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWL 297
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 121/210 (57%), Gaps = 12/210 (5%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
+P SFD+R+ W C++I I DQ CGSCWAF E +SDR CI ++S D+L
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS-HPGCE----PAYPTPK 210
ACCG CGDGC G YPI A+R++ GVVT F +GC +P P TP
Sbjct: 141 ACCGNSCGDGCKGRYPIQAFRWWNSRGVVT---GGDFRGSGCRPYPFAPCISCPEEKTPT 197
Query: 211 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C C + + K + +SAY + + I EI NGPV +FT+YED YKSGV
Sbjct: 198 CSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGV 257
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+H G ++GGHA+K+IGWGT +G YW+
Sbjct: 258 YRHTAGRLLGGHAIKIIGWGT-QNGIPYWL 286
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 123/211 (58%), Gaps = 13/211 (6%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-HFGMNLSL-SVNDLL 155
+P FDAR+ WP C +I I +Q CGSCWAFGA E +SDR CI G + S DLL
Sbjct: 75 IPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQPIISPTDLL 134
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
+CCG CG GC G P+ A+R++ GVVT C PY C+ C + TP
Sbjct: 135 SCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRGSGCKPY-PFAPCTALPCTKS-ETP 192
Query: 210 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
+C C ++ + K++ AY + D I EI NGPVE +F VY+DF HY+SG
Sbjct: 193 RCSLNCQPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEI-TNGPVEAAFIVYDDFNHYRSG 251
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY+H+ G ++GGHAVK+IGWG +G YW+
Sbjct: 252 VYRHVAGKLVGGHAVKIIGWGI-QNGAPYWL 281
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/224 (42%), Positives = 126/224 (56%), Gaps = 20/224 (8%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D+ +P+SFDAR+ WP C++I I DQ +CGSCWA ALSDR CI + +S
Sbjct: 89 DEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHIS 148
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D ++CC CG GCDGG+PI A+ ++ + G VT + C PY C H G
Sbjct: 149 SIDFVSCCE-SCGYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGND 206
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C TPKC R+C + + + K Y AY + + I EI KNGPV +
Sbjct: 207 TYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGA 266
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FTVYEDF++YK G+YKH G GGHA+K+IGWG +D YW+
Sbjct: 267 FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWL 309
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/296 (38%), Positives = 161/296 (54%), Gaps = 32/296 (10%)
Query: 27 VVSKLKLDSHILQ----------DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV 76
+++KL L +H+LQ S++ VN + WKA + S + +FK +
Sbjct: 1 MLAKLFLIAHLLQYTFSQQTLSGKSLVNHVN-TIQTLWKAEY-FEISEEEM-KFKVMDSK 57
Query: 77 KPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 135
P+ + P + SL + P SFDAR WP C +I I DQ +CGSCWAFGA E +
Sbjct: 58 FAFPEEQISSEPNNSLPGSLSRAPTSFDARDYWPNCKSIKMIRDQAYCGSCWAFGAAEVI 117
Query: 136 SDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EE 187
SDR CI +S D+L CC GC GG+ + A +++ GVVT +
Sbjct: 118 SDRICIQSNGTDQPIISPEDILTCC--TNSHGCQGGFVLEAMKFWKSKGVVTGGDFQGDG 175
Query: 188 CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDP--EDIM 243
C PY CS C A TPKC +C K ++ K+Y SAYR+++ I
Sbjct: 176 CIPY-SYGSCSD--CHTAQTTPKCKNECQVKYTKNEYKEDKYYGSSAYRLSTSNAVRTIQ 232
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+EI +NGPVE ++ VYEDF +YKSGVY++I+G MGGHAVK+IGWG ++ +YW+
Sbjct: 233 SEILRNGPVEATYQVYEDFYYYKSGVYEYISGRHMGGHAVKIIGWGV-EENVNYWL 287
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/278 (38%), Positives = 143/278 (51%), Gaps = 36/278 (12%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS------ 95
+I +N P A W+A PQF ++ +LLG + L G V D S
Sbjct: 54 MISNINSQPSASWQAVEYPQFKGKSLADMTNLLGALNVNENDLKG-EVMDKDNSTNTPLS 112
Query: 96 -------LKL---PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG- 144
L+L P FDAR WPQC I I +Q +CGSCWAF A L+DRFCI G
Sbjct: 113 DSRYLTILRLRDFPTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGG 170
Query: 145 -MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 203
+N+ LS +++C G +GC+GG+ + WR+ V G V+E C PY S G + P C
Sbjct: 171 KVNVDLSPQFMVSCSG--QNNGCNGGFFDATWRFLVSVGTVSEACVPYV-SFGGAVPACN 227
Query: 204 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
V+ C Q S Y + R DIMA++ NGP++V+ VY DF
Sbjct: 228 --------VKSCGVPGQ---KSPFYRAGSARKLEGMLDIMADLKANGPIQVAMGVYRDFY 276
Query: 264 HYKSGVYKHITGDVMGGHAVKLIGWG-TSDDGEDYWVC 300
YKSGVY H++G +GGHAVK++GWG S YW+C
Sbjct: 277 SYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWIC 314
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 145/290 (50%), Gaps = 33/290 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I +N N K WKA N P+ S + F LLG K + KT
Sbjct: 18 AYFLEEDYINHINANAKT-WKAGVNFDPKLS---IDSFVKLLGSKGVQAAKQASPDMFKT 73
Query: 92 HDK-----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
HD+ S ++P FDAR W +C TI + DQGHCGSCWAFG A +DR CI
Sbjct: 74 HDEAYNNWSNRIPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGE 133
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 134 FNELLSPEELAFCC-HKCGFGCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCP 192
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
D G + +P +C R C L ++ HY+ AY + I ++ G
Sbjct: 193 LDEYGNNTCSGKPTEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDVLAYG 250
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+
Sbjct: 251 PIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWL 299
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/299 (37%), Positives = 161/299 (53%), Gaps = 32/299 (10%)
Query: 22 TFAEGVV---SKLKLDSHILQDSIIKEVNENPKAGWKAAR---NPQFSNYTVGQFKHLLG 75
FA GVV +L D + +V + K A +F N F+++ G
Sbjct: 8 VFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFEN-----FQNMKG 62
Query: 76 VKPTPKGLLLGVPVKTHDKS--LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ + G L P K HD + + +P+ FDAR WP C +IS I +QG CG+CWA AV
Sbjct: 63 IFESKIGFRL--PTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVS 120
Query: 134 ALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV------ 184
+SDR CIH ++ L+ DL+ CC CG+GC+GG+ ++++Y+V G+V
Sbjct: 121 VMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAAYN 179
Query: 185 -TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPE 240
T+ C PY C +P GC P TP C C + + +R K+Y +AY++ +D
Sbjct: 180 STDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDER 237
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
I EI NGPVE F+VY+D YK+GVY+H+ G +G HAV+LIGWG + G YW+
Sbjct: 238 MIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYWL 295
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 114/288 (39%), Positives = 147/288 (51%), Gaps = 31/288 (10%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KT 91
++ L++ I ++NEN K WKA N P+ S + F LLG K + KT
Sbjct: 18 AYFLEEDYINQINENAKT-WKAGINFDPKLS---IENFVKLLGSKGVQAAKKASPDMFKT 73
Query: 92 HDKSL---KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 146
DK+ K+PK FDAR W +C TI + DQG CGSCWAFG A +DR CI N
Sbjct: 74 IDKAYENQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSSAFADRLCIATNGEFN 133
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FD 193
LS +L CC CG GC GGYPI AW F HG+VT E C PY D
Sbjct: 134 ELLSAEELTFCC-HKCGFGCHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLD 192
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
G + +PA +C R C L ++ H++ AY + I ++ GP+
Sbjct: 193 EYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKKDHHFTRDAYYLTFGI--IQRDVMAYGPI 250
Query: 253 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E S+ VY+DF YKSGVY + +GGHAVKLIGWG + G YW+
Sbjct: 251 EASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWL 297
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/192 (46%), Positives = 121/192 (63%), Gaps = 16/192 (8%)
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDGC+GGYP AW ++
Sbjct: 1 CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTK 60
Query: 181 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKH 227
G+V+ C PY S P TP+C + C + ++ KH
Sbjct: 61 KGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDTPRCNKSCEAGYSPSYKEDKH 120
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
+ ++Y +++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH GD+MGGHA++++G
Sbjct: 121 FGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILG 180
Query: 288 WGTSDDGEDYWV 299
WG ++G YW+
Sbjct: 181 WGV-ENGVPYWL 191
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/312 (34%), Positives = 159/312 (50%), Gaps = 26/312 (8%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKL-KLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+ + T LLI + S T E + + + + + + + VN++ ++ +KA +P
Sbjct: 2 ITIITLLLIASTVKSLTVEEYLARPVPEYATKLTGQAYVDYVNQH-QSFYKAEYSPLVEQ 60
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
Y + KP + VK D ++ LP++FDAR WP C++I I DQ +CG
Sbjct: 61 YAKAVMRSEFMTKPNQNYV-----VKDVDLNINLPETFDAREKWPNCTSIRTIRDQSNCG 115
Query: 125 SCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA A +SDR CI + S D+L+CC + CG GCDGG P +A+ + + +G
Sbjct: 116 SCWAVSAASVMSDRLCIQSNGTIQSWASDTDILSCC-WNCGMGCDGGRPFAAFFFAIDNG 174
Query: 183 VVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKH 227
V T C PY H P + +PTPKC + C +K N +++ K
Sbjct: 175 VCTGGPFREPNVCKPYAFYPCGRHQNQKYFGPCPKELWPTPKCRKMCQLKYNVAYKDDKI 234
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y AY + ++ IM EI+ NGPV SF+V+ DFA YK GVY G HAVK+IG
Sbjct: 235 YGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKGVYVSNGIQQNGAHAVKIIG 294
Query: 288 WGTSDDGEDYWV 299
WG DG YW+
Sbjct: 295 WGVQ-DGLKYWL 305
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 124/215 (57%), Gaps = 16/215 (7%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVND 153
++LP +FD+R WP C++I I DQ +CGSCWAF A E +SDR CI +S D
Sbjct: 83 IQLPTAFDSRVQWPNCNSIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPIISPED 142
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYP 207
+L+CCG C +GC GGY I A +Y+++ GVVT C PY CS C+
Sbjct: 143 ILSCCGSSCNNGCQGGYTIEAMKYWMNSGVVTGGDYQGAGCIPY-SFRPCST--CKEPKD 199
Query: 208 TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
P C C K +R S +A N+ + I EIY NGPVEV++ VY+DF H
Sbjct: 200 APSCKTTCQASYKAKSAYRLPTTTSSNAIVANA-VQMIQTEIYNNGPVEVAYQVYDDFYH 258
Query: 265 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVY H+ GD GHAVK+IGWGT + DYW+
Sbjct: 259 YKSGVYYHVYGDKPSGHAVKIIGWGT-EKKVDYWL 292
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 98/207 (47%), Positives = 123/207 (59%), Gaps = 18/207 (8%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 160
FDA AWP+C TI+ I DQ CGSCWA A A+SDR+C G+ +L +S DL++CC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD- 59
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 212
+CG GC+GGYP AW Y+ HG+V+E C PY F S C+H C Y TP C
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117
Query: 213 RKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
C KK L + + S I S E E+ NGP EVSF+VY DF Y GVYK
Sbjct: 118 STCTDKKIPLIKYRGNTSC----ILSGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYK 173
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYW 298
H+TG +GGHAV+++GWG +GE YW
Sbjct: 174 HVTGVFLGGHAVRIVGWGEL-NGEPYW 199
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 95/209 (45%), Positives = 121/209 (57%), Gaps = 22/209 (10%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 160
FDA AWP+C TI+ I DQ CGSCWA A A+SDR+C G+ +L +S DL++CC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD- 59
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 212
+CG GC+GGYP AW Y+ HG+V+E C PY F S C+H C Y TP C
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117
Query: 213 RKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C K +R + Y +S E E+ NGP EVSF+VY DF Y GV
Sbjct: 118 STCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGV 171
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
YKH+ G +GGHAV+++GWG +GE YW
Sbjct: 172 YKHVAGTFLGGHAVRIVGWGEL-NGEPYW 199
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 106/259 (40%), Positives = 136/259 (52%), Gaps = 40/259 (15%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL LG S+ H L D ++ VN+ W+A N F N V
Sbjct: 10 CLLALGDARSRP-----------SFHPLSDELVNYVNKQ-NTTWQAGHN--FYNVDVSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP+SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 110 AFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVS 169
Query: 186 E-------ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 231
C PY C H P C TPKC + C + ++ KHY +
Sbjct: 170 GGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYN 228
Query: 232 AYRINSDPEDIMAEIYKNG 250
+Y +++ +DIMAEIYKNG
Sbjct: 229 SYSVSNSEKDIMAEIYKNG 247
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 97/200 (48%), Positives = 120/200 (60%), Gaps = 23/200 (11%)
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
Q CGSCWA GAVEA++DR CI N +++S +DLL+CC CG GCDG P +AW Y
Sbjct: 2 QSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGRDPYAAWSY 60
Query: 178 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL 221
+V +G+VT Y +GC +P CE YPT C KC +
Sbjct: 61 WVSNGIVTGS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSI 118
Query: 222 WRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
NS KHY S Y + D I EI NGPVEV+F VYEDF HY SG+YKH TGD +GG
Sbjct: 119 SYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGG 178
Query: 281 HAVKLIGWGTSDDGEDYWVC 300
HAVK++GWGT ++G DYW+C
Sbjct: 179 HAVKMLGWGT-ENGTDYWIC 197
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 94/224 (41%), Positives = 125/224 (55%), Gaps = 20/224 (8%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D+ +P+SFDAR+ WP C++I I DQ +CGSCWA ALSDR CI + +S
Sbjct: 89 DEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHIS 148
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-- 201
D ++CC C GCDGG+PI A+ ++ + G VT + C PY C H G
Sbjct: 149 SIDFVSCCE-SCSYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGND 206
Query: 202 -----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
C TPKC R+C + + + K Y AY + + I EI KNGPV +
Sbjct: 207 TYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGA 266
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FTVYEDF++YK G+YKH G GGHA+K+IGWG +D YW+
Sbjct: 267 FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWL 309
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 110/299 (36%), Positives = 160/299 (53%), Gaps = 32/299 (10%)
Query: 22 TFAEGVV---SKLKLDSHILQDSIIKEVNENPKAGWKAAR---NPQFSNYTVGQFKHLLG 75
FA GVV +L D + +V + K A +F N F+++ G
Sbjct: 8 VFAIGVVVIARSERLGDDPFNDGFLAQVQRHAKTWTPDATFRDGIRFEN-----FQNMKG 62
Query: 76 VKPTPKGLLLGVPVKTHDKS--LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ + G L P K HD + + +P+ FDAR WP C +IS I +QG CG+CWA V
Sbjct: 63 IFESKIGFRL--PTKRHDVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVATVS 120
Query: 134 ALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV------ 184
+SDR CIH ++ L+ DL+ CC CG+GC+GG+ ++++Y+V G+V
Sbjct: 121 VMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAAYN 179
Query: 185 -TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPE 240
T+ C PY C +P GC P TP C C + + +R K+Y +AY++ +D
Sbjct: 180 NTDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDER 237
Query: 241 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
I EI NGPVE F+VY+D YK+GVY+H+ G +G HAV+LIGWG + G YW+
Sbjct: 238 MIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYWL 295
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 110/278 (39%), Positives = 141/278 (50%), Gaps = 36/278 (12%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYT--VGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL 98
I K VN+ + W A N +Y+ +G K+ KP P + +P+K +L
Sbjct: 22 EIAKRVNKQ-QNSWVANENTPLRDYSSFIGTLKNK---KPLP---IRSIPIKR-----EL 69
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMN-LSLSVNDLLA 156
PK FD+ WP+C +I + DQ C SCWAFG VE +DR CI G N + LS D+L
Sbjct: 70 PKEFDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVLE 129
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-- 207
CC CG C GGY AW Y GVVT E C Y CSH G E YP
Sbjct: 130 CCK-DCGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSY-PFPPCSH-GIEGQYPQC 186
Query: 208 ------TPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
PKC C + + Y S Y++ ++ + I EI +NGPV+ SF VYE
Sbjct: 187 STKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYE 246
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
DF YKSG+Y H+ G M H VK+IGWG ++GE YW
Sbjct: 247 DFMTYKSGIYHHVEGKFMNLHTVKIIGWG-EENGEAYW 283
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/218 (42%), Positives = 127/218 (58%), Gaps = 19/218 (8%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+S+D R W CS++ I DQ +CGSCWA + A+SDR CI + +S D++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-- 206
+CC + CGDGC+GG+PISA+R+ GVVT C PY + C H G E Y
Sbjct: 151 SCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGE 208
Query: 207 -----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TP+C R+C+ S Y AY++ + + I +I KNGPV ++TVYED
Sbjct: 209 CVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYED 268
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FAHY+SG+YKH G G HAVK+IGWG + G YW+
Sbjct: 269 FAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWI 305
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/209 (45%), Positives = 121/209 (57%), Gaps = 22/209 (10%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 160
FDA AWP+C TI+ I DQ CGSCWA A A+SDR+C G+ +L +S DL++CC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD- 59
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 212
+CG GC+GGYP AW Y+ HG+V+E C PY F S C+H C Y TP C
Sbjct: 60 VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117
Query: 213 RKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C K +R + Y +S E E+ NGP EVSF+VY DF Y GV
Sbjct: 118 STCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGV 171
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
YKH+ G +GGHAV+++GWG +GE YW
Sbjct: 172 YKHVAGIFLGGHAVRIVGWGEL-NGEPYW 199
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 111/290 (38%), Positives = 149/290 (51%), Gaps = 29/290 (10%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVK---PTPKGLLLG 86
L +H L S + ++NE K WKA +N P++ T Q LLG K PK L+
Sbjct: 17 LTEQAHFLSKSYVDKINEVAKT-WKAKQNFPEY--MTKEQIVRLLGSKNLTSVPKSLIKE 73
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 144
+ + S ++P FDAR W C TI + +QG+CGSCWA G A +DR CI +
Sbjct: 74 NDSEYINDS-EIPNFFDARIQWSHCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATNGD 132
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPY------ 191
N +S +L CC CG GC+GG P+ AW+YF HGVV T+ C PY
Sbjct: 133 FNELISAEELTFCC-HRCGFGCNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCV 191
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI-SAYRINSDPEDIMAEIYKNG 250
D G + +P P KC R C HY +AY +N D + + G
Sbjct: 192 KDEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKNAYYLNIDT--MQKDTIAYG 249
Query: 251 PVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF +Y+SGVY+ +GGHAVK+IGWG +DG YW+
Sbjct: 250 PIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWG-EEDGTPYWL 298
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 108/305 (35%), Positives = 165/305 (54%), Gaps = 26/305 (8%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP---QFSNYTVG 68
L+++ + +Q +A+ ++ K + + + +++ VN + ++ +K +P QF +
Sbjct: 7 LVVVLLAINQLYADELLHKQESEHGLSGQALVDYVNSH-QSLFKTEYSPTNEQFVKARIM 65
Query: 69 QFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
K++ P K + +++LP+ FDAR WP C++I I D CGSCWA
Sbjct: 66 DIKYMTEASHK-------YPRKGINLNVELPERFDAREKWPHCASIGLIRDHSACGSCWA 118
Query: 129 FGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 185
A +SDR CI G N LS D+LACCG CG GC+GGYPI A+ Y + GV +
Sbjct: 119 VSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYPIQAYFYLENTGVCSG 178
Query: 186 ------EECDPY-FDSTGCSHPGC--EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRI 235
C PY F ++ C E A+ TPKC + C + + + K + +++ +
Sbjct: 179 GEYREKNVCKPYPFYPCDGNYGPCPKEGAFDTPKCRKICQFRYPVPYEEDKVFGKNSHIL 238
Query: 236 NSDPE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
D E I EI+ NGPV +F V+EDF HYK G+YK G +G HA+KLIGWGT ++G
Sbjct: 239 LQDNEARIRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGT-ENG 297
Query: 295 EDYWV 299
DYW+
Sbjct: 298 TDYWL 302
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 154/303 (50%), Gaps = 46/303 (15%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
L + L I +IS E +S L D II +NE+P AGW+A ++ +F +
Sbjct: 1 MLISVLYIASLIS---HLEAHISIKNEKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLD 57
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+F+ L + P P H D ++++P SFD+R WP+C +I+ I DQ CGS
Sbjct: 58 DARFQ-LGARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGS 116
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD------GCDGGYPISAWRY 177
C AFGAVEA+S+R CI G N+ LS DL G + G GC+ YP +
Sbjct: 117 CCAFGAVEAMSERSCIQSGGKQNVELSAVDLE---GIVTGSSKENNTGCEP-YPFPKCEH 172
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 236
F T +P C Y TP+C C K R Y+ +R
Sbjct: 173 F----------------TKGQYPPCGSKIYKTPRCKTTCQK-----RYKTSYAQDKHRA- 210
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++
Sbjct: 211 -----IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV-ENKTP 264
Query: 297 YWV 299
YW+
Sbjct: 265 YWL 267
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 103/244 (42%), Positives = 128/244 (52%), Gaps = 39/244 (15%)
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
V K H+K LP SF A+ WP C +I I DQG+CGSCWA A +SDR CI G
Sbjct: 60 VEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQT 119
Query: 147 --LSLSVNDLLACCGFLC----GDGCDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+S DLL+CCG C GCDGGYP AW+Y G+VT C PY
Sbjct: 120 DKRQISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPY-S 178
Query: 194 STGCSHPG-------CEPAY-----PTPKCVRKCVKKNQLWRNSKHYSI-------SAYR 234
CSH CE + TP C +KC + S+ Y + + Y+
Sbjct: 179 FPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQF-----SRTYDVDKIRSRENPYK 233
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 294
+ D E I EIY NGPV+ FTV++DF +YKSGVY+ TG G HAVK+IGWGT ++G
Sbjct: 234 LIKDQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGT-ENG 292
Query: 295 EDYW 298
YW
Sbjct: 293 VPYW 296
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/209 (44%), Positives = 121/209 (57%), Gaps = 22/209 (10%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 160
FDA AWP+C T++ I DQ CGSCWA A A+SDR+C G+ +L +S DL++CC
Sbjct: 1 FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD- 59
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCV 212
+CG GC+GGYP AW Y+ HG+V+E C PY F S C+H C Y TP C
Sbjct: 60 VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCN 117
Query: 213 RKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C K +R + Y +S E E+ NGP EVSF+VY DF Y GV
Sbjct: 118 STCTDKKIPLIKYRGNTSYVLSG------EEPFKRELILNGPFEVSFSVYADFVAYTGGV 171
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
YKH+ G +GGHAV+++GWG +GE YW
Sbjct: 172 YKHVAGIFLGGHAVRIVGWGEL-NGEPYW 199
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/236 (41%), Positives = 130/236 (55%), Gaps = 30/236 (12%)
Query: 92 HDKSLKL--PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS- 148
D+SL L P SFD RS W CS ++ I DQ CGSCWA A E +SDR C+ ++
Sbjct: 76 EDRSLALSIPPSFDVRSLWHVCS-LNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKA 134
Query: 149 -LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY--------- 191
+S D+L+CCG CG GC+GG+PI AWR+F G T C PY
Sbjct: 135 CISDTDILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRH 194
Query: 192 ---FDSTGCSHPG----CEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIM 243
D C + C TP+C R+C+ + + + ++Y SAY + + I
Sbjct: 195 LKRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQ 254
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI KNGPV SF VYEDF HYKSG+YKH G++ G HAVK+IGWG ++ D+W+
Sbjct: 255 REIMKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKIIGWG-KENNTDFWL 309
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 106/291 (36%), Positives = 150/291 (51%), Gaps = 38/291 (13%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAA-RNPQFSNYTVGQFKHLLGVKPTPKGLLL 85
+S L +++ ++ + EVN P G+K + +F N Q +L+ VK P
Sbjct: 31 TLSGEPLVAYLRKNQNLFEVNSTPTPGFKQKIMDIKFRN----QNPNLI-VKDDP----- 80
Query: 86 GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 143
+ +P+ +D R W C++ I DQ +CGSCWA A+SDR CI
Sbjct: 81 -------EPEDDIPEEYDPRKIWSNCTSFY-IRDQANCGSCWAVSTAAAISDRICIATKA 132
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 196
+++S DL+ CC CG GCDGG+ I AW YF + G+V+ C PY
Sbjct: 133 RKQVNISATDLVTCCTPTCGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPY-PIHP 191
Query: 197 CSHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
C H G C TP C +KC +L+R K Y A+++ E I E+ K
Sbjct: 192 CGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLK 251
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPV SF VYEDF+ YKSG+Y+H G++ G HAVK+IGWGT ++ DYW+
Sbjct: 252 NGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGT-ENRTDYWL 301
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 162/320 (50%), Gaps = 41/320 (12%)
Query: 19 SSQTFAEGVVSKLKLDSHILQDSIIKE-VNENPKAGWKAARNPQFSNYTVGQFKHLLGV- 76
S + G + L++ L+ ++ ++ W+A +P+F +++ K +G
Sbjct: 147 SRPAVSNGALQHLRVKMQRLKLQAAEQGLDPEQAVTWEAEVSPRFKYHSIKDAKRHMGTY 206
Query: 77 --------KP-TPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSC 126
KP P G L V V + + FDAR A+PQC+ I + DQG CGSC
Sbjct: 207 LSFYSDPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSC 266
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAWRYFVHHG 182
WAF + EAL+DRFCI G +LS +CC L GC GG P AWR+F + G
Sbjct: 267 WAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDG 326
Query: 183 VVT----------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC-----VKKNQ 220
VVT + C PY + C H P CE P PKC + C K +
Sbjct: 327 VVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVK 385
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+++ H++ SAY + + I E+ +NG + +F VYEDF YK GVY H+TG MGG
Sbjct: 386 PFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG 444
Query: 281 HAVKLIGWGTSDDGEDYWVC 300
HAVK+IG+G ++DG DYW+
Sbjct: 445 HAVKVIGFG-NEDGRDYWLA 463
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 162/320 (50%), Gaps = 41/320 (12%)
Query: 19 SSQTFAEGVVSKLKLDSHILQDSIIKE-VNENPKAGWKAARNPQFSNYTVGQFKHLLGV- 76
S + G + L++ L+ ++ ++ W+A +P+F +++ K +G
Sbjct: 147 SRPAVSNGALQHLRVKMQRLKLQAAEQGLDPEQAVTWEAEVSPRFKYHSIKDAKRHMGTY 206
Query: 77 --------KP-TPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSC 126
KP P G L V V + + FDAR A+PQC+ I + DQG CGSC
Sbjct: 207 LSFYSDPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRDQGDCGSC 266
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAWRYFVHHG 182
WAF + EAL+DRFCI G +LS +CC L GC GG P AWR+F + G
Sbjct: 267 WAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDG 326
Query: 183 VVT----------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC-----VKKNQ 220
VVT + C PY + C H P CE P PKC + C K +
Sbjct: 327 VVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVK 385
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+++ H++ SAY + + I E+ +NG + +F VYEDF YK GVY H+TG MGG
Sbjct: 386 PFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG 444
Query: 281 HAVKLIGWGTSDDGEDYWVC 300
HAVK+IG+G ++DG DYW+
Sbjct: 445 HAVKVIGFG-NEDGRDYWLA 463
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 104/265 (39%), Positives = 136/265 (51%), Gaps = 21/265 (7%)
Query: 52 AGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS--LKLPKSFDARSAWP 109
A W + +P+ + H G P H+ S +LPKSFDAR+ WP
Sbjct: 5 ARWISGGHPR--RFESASLLHTFGALRESAEQRARRPTVKHEVSDEKELPKSFDARTKWP 62
Query: 110 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCD 167
C +IS I DQ C S WAFGAVE++SDR CIH N SLS DLL+CC CG GC
Sbjct: 63 HCPSISEIRDQSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSATDLLSCCED-CGLGCG 121
Query: 168 GGYPISAWRYFVHHGVVT----EE---CDPY-FDSTGCSHPGCEPA-----YPTPKCVRK 214
G+ AW ++ HG+VT EE C + F G G P YPTP+C+++
Sbjct: 122 AGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQ 181
Query: 215 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
C + + K + +Y + IM EI NGPVE SF +Y DF Y GVY H
Sbjct: 182 CDEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCW 241
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G + HA++++GWG DDG YW+
Sbjct: 242 GGPISRHAIRILGWG-EDDGVPYWL 265
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 141/315 (44%), Gaps = 80/315 (25%)
Query: 61 QFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHD-KSLKLPKSFDARSAWPQCSTISRIL 118
+ + G HL G ++ T + L V+ D + LP+SFDAR+ WP C +IS I
Sbjct: 600 RLERFETGNSLHLFGAIRETAEQRLQRPTVRHEDFDNQHLPESFDARANWPHCPSISEIR 659
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 176
DQ CGSCWAFGAVEA+SDR CIH N SLS DL++CC CG GC GGY AW
Sbjct: 660 DQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLVSCCT-ECGCGCRGGYSPIAWD 718
Query: 177 YFVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQL 221
++ HG+VT TGC P CE YPTP+C+++C K
Sbjct: 719 FWKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTKEID 776
Query: 222 WRNSK----------------------------------------HYSIS---------- 231
+ K H+SI
Sbjct: 777 YEKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTLHLTCLNFMHHSIDLLSSRLEKAV 836
Query: 232 -------AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+Y + + +M EI GPV VYED YKSGVY H+ G +G H ++
Sbjct: 837 LRSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIR 896
Query: 285 LIGWGTSDDGEDYWV 299
++GWG +DG YW+
Sbjct: 897 ILGWG-EEDGVPYWL 910
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 92/226 (40%), Positives = 129/226 (57%), Gaps = 19/226 (8%)
Query: 90 KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS- 148
+ + + +P+SFDAR+ WP C +IS I DQ CGSCWAF E++SDR CI N +
Sbjct: 85 ENEEDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWAFAVGESISDRVCIATDANKTA 144
Query: 149 -LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHP 200
SV D+L CC CG GCDGG+P +AW YFV GVVT C PY S +HP
Sbjct: 145 EFSVEDILTCCD-ECGFGCDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHP 203
Query: 201 GCEPAY------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
E Y TP C C K + +++ K +Y + + I +I K+GP+
Sbjct: 204 N-ETFYRNCTGVSTPSCKTSCQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLV 262
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+F+VYEDF +YK G+Y++ G GGHAV+++GWG ++ + YW+
Sbjct: 263 ATFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVK-YWI 307
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 160/306 (52%), Gaps = 26/306 (8%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
L L T +L G++SS V + D D ++ V + WK N Q SN
Sbjct: 6 LILLTVVLANGLVSS-------VDRHGQDP--FNDDFLRRVLARART-WKPDTNFQ-SNV 54
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
F+ L G+ + G + + + + +P+SFDAR+ WP C ++ I +QG CGS
Sbjct: 55 HFHAFRSLKGIGESRTGFKVPIRRYEYVYDVDIPESFDARNHWPNCESLRAIRNQGTCGS 114
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHG 182
CWA A +SDR CIH +N++L+ DL+ CC CG+GC+GG+ ++++Y+V G
Sbjct: 115 CWAVAAASVMSDRVCIHSNGTINVALAAEDLMGCC-VDCGNGCNGGFLDGTSFQYWVDAG 173
Query: 183 VV-------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAY 233
+V T+ C PY C +P + +PKC C ++ + K + AY
Sbjct: 174 LVSGGAYNSTDGCKPY-PFKPCEYPFNDCHVEISPKCTHHCRDGVDRHYSKDKLFGKVAY 232
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+ D I EI NGPVE F VYED YKSGVY+H+ G+ +G HAV++IGWG D
Sbjct: 233 SVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWG-RDG 291
Query: 294 GEDYWV 299
G YW+
Sbjct: 292 GIPYWL 297
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 122/225 (54%), Gaps = 20/225 (8%)
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+D+ +P+SFDAR+ WP CS+++ I DQ +CGSCWA ALSDR CI +++
Sbjct: 88 NDEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWAVSTAAALSDRICISTNGTKQVNI 147
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCE 203
S D+L CC + CG GC GG+PI AW Y G VT + C C H G E
Sbjct: 148 SATDILTCC-YKCGYGCQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHHGNE 206
Query: 204 PAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
Y TPKC C KN + + K AY + + + I EI KNGPV
Sbjct: 207 TYYGECGGRARTPKCRTSCTPGYKNS-YSDDKIRGKDAYELPNSVKAIQREIMKNGPVVA 265
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTVY DF++YK G+YKH G G HAVK+IGWG D YW+
Sbjct: 266 AFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEEGD-VPYWI 309
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 115/313 (36%), Positives = 159/313 (50%), Gaps = 45/313 (14%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQF 70
+LI V+ S F E +H L I ++NE K WKA +N P+ N Q
Sbjct: 6 ILISVVLLSVYFTE--------QAHFLSKDYINKINEVAKT-WKAKQNFPE--NTPKEQI 54
Query: 71 KHLLGVKPTPKGLLLGV---PVKTHDK----SLKLPKSFDARSAWPQCSTISRILDQGHC 123
LLG K LLGV P+K +D+ + ++P+ FD+R W C TI + +QG+C
Sbjct: 55 VRLLGSK-----RLLGVSKSPIKENDELYMDNSEVPEFFDSRLEWDYCETIGHVRNQGNC 109
Query: 124 GSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWA G A +DR C+ N +S +L CC CG GC+GGYP+ AW+YF H
Sbjct: 110 GSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCC-HRCGFGCNGGYPLKAWQYFKRH 168
Query: 182 GVV-------TEECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
GVV T+ C PY D G + +P KC +KC + + HY
Sbjct: 169 GVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHY 228
Query: 229 SI-SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLI 286
AY + + +Y GP+E SF VY+DF +Y+SGVY+ +GGHAVK+I
Sbjct: 229 KTKDAYYLKNTTMQKDTMVY--GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMI 286
Query: 287 GWGTSDDGEDYWV 299
GWG ++G YW+
Sbjct: 287 GWGV-EEGTPYWL 298
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 152/306 (49%), Gaps = 29/306 (9%)
Query: 17 VISSQTFA---EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 73
V SSQ F E + + ++ L + E N ++ +KA +P+ V + +
Sbjct: 13 VSSSQKFTRLEEFLAQPITKEAEQLTGEALVEYVNNRQSFFKAKYSPE----VVKKRRQF 68
Query: 74 LGVKPTPKGLLLG----VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
L +KP +PV + +P+SFD+R W C ++ I DQ +CGSCWA
Sbjct: 69 L-LKPQFIERSYNQENVLPVANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAV 127
Query: 130 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE- 186
A + +SDR CIH + LS D+LACCG CG GCDGGY AW++ GVVT
Sbjct: 128 SAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGG 187
Query: 187 ------ECDPYFDSTGCSHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAY 233
C PY +H G P++P TP C C + + N K + + Y
Sbjct: 188 AYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWY 247
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+ +D I EI K GPV +F +YEDF HY GVY H G + GGH++K+IGWG D
Sbjct: 248 WLPNDERTIQLEIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DK 306
Query: 294 GEDYWV 299
G YW+
Sbjct: 307 GVKYWL 312
>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
Length = 301
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 114/273 (41%), Positives = 140/273 (51%), Gaps = 27/273 (9%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL + V++S + G V KL H L D I E+N + WKA RN N + +
Sbjct: 5 LLCIVVLASVALSYGGV---KL--HPLSDEFINEINSK-QTTWKAGRNFDV-NTPISHVR 57
Query: 72 HLLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARSAWPQC-STISRILDQGHCGSCWAF 129
LLGV P K +PVKTH +L +P+SFDAR AWP+C S I I DQ CGSCWAF
Sbjct: 58 RLLGVLPK-KANAPKLPVKTHAVNLDAIPESFDAREAWPECTSIIGEIRDQASCGSCWAF 116
Query: 130 GAVEALSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 185
GAVEA+SDR CIH + + +S DL CC + CGDGC+GG+P AW Y+ G+VT
Sbjct: 117 GAVEAMSDRICIHSDASVKVRISAEDLNDCC-YDCGDGCNGGWPDLAWSYWSSTGIVTGG 175
Query: 186 -----EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 234
E C Y C H C TP C + C + L S SAY
Sbjct: 176 LYGVDEGCKAY-SIKPCDHHVDGNLGPCGDIQRTPACKKSCDSTSDLEYKSDLRRGSAYS 234
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
I I EI NGPVE + VY DF YK+
Sbjct: 235 IPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 101/230 (43%), Positives = 135/230 (58%), Gaps = 32/230 (13%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
+P +FDAR+ WP+C++I + DQ +CGSCWAFGA E +SDR CIH +S D+L
Sbjct: 70 IPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISDRICIHSNGKEQPVISAEDIL 129
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTP 209
CCG CG+GC GG + A +++ +G VT + C PY CS+ C + TP
Sbjct: 130 TCCGKSCGNGCQGGQGLEAMKFWTTYGAVTGGDYKGDGCKPY-SFAPCSN--CVESKTTP 186
Query: 210 KCVRKCVKKNQL--WRNSKHYS---------------ISAYRINSDPED---IMAEIYKN 249
C KC + ++ KHY SAYR+++ I EIY+N
Sbjct: 187 SCQSKCQSTYTVTNYKGDKHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPIIQNEIYQN 246
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GPVEV++TVY+DF HYKSGVY H+TG GGHAVK+IGWGT + G DYW+
Sbjct: 247 GPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGT-EKGVDYWL 295
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 155/320 (48%), Gaps = 37/320 (11%)
Query: 6 LFLTTCLLILGVISSQ---TFAEGVVSKLKLDSHILQ-DSIIKEVNENPKAGWKAARNPQ 61
LFL + + SSQ T E + + DS L +++++ VN ++
Sbjct: 2 LFLLIFSVFFAIASSQEVHTIEELLAQQTSDDSDTLTGEALVEYVN----------KHQS 51
Query: 62 FSNYTVG----QFKHLLGVKPTPKGLLLGVPVKTHDKSLK--LPKSFDARSAWPQCSTIS 115
FS + HL+ L K +++ +P+SFD+R W CS+I+
Sbjct: 52 FSRLNTSKAEERMAHLMKTDYIRNARKLYKVKKAEEQTTNEDIPESFDSRIVWKNCSSIT 111
Query: 116 RILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPIS 173
+ DQ CGSCWA A +SDR C+ L LS D+L+CCG +CGDGC+GGY
Sbjct: 112 YVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDILSCCGRMCGDGCEGGYDHL 171
Query: 174 AWRYFVHHGVVTEE-------CDPY-FDSTGCSHPG-----CEPAYPTPKCVRKC-VKKN 219
AW + GVVT C PY F G H + ++ TP C C
Sbjct: 172 AWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACKPYCQFGYG 231
Query: 220 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 279
+ + K + S Y +++D + I E+ KNGPV+ +F YEDF+ YK G+Y H+ G G
Sbjct: 232 KRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYVHVKGRERG 291
Query: 280 GHAVKLIGWGTSDDGEDYWV 299
HAVKLIGWG ++G YW
Sbjct: 292 AHAVKLIGWGV-ENGTKYWT 310
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 107/287 (37%), Positives = 145/287 (50%), Gaps = 28/287 (9%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 93
++ L++S I+ +N+ WKA N S F +LG K + KTHD
Sbjct: 21 AYFLEESYIEMINDVATT-WKAGVNFDPSTPET-DFIKMLGSKGVEAAKNASAHMFKTHD 78
Query: 94 KSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 147
+ +P++FDAR W C TI + DQGHCGSCWAFG A +DR C+ N
Sbjct: 79 VAYNKFSYIPRTFDARKRWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNE 138
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDS 194
LS +L CC CG GC+GGYPI AW+YF HG+VT + C+PY +
Sbjct: 139 LLSAEELTFCC-HACGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPRNE 197
Query: 195 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVE 253
G S +P +C R C L + H ++ Y + I ++ GP+E
Sbjct: 198 DGKSSCAGKPKEKNHRCTRMCYGNQDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGPIE 255
Query: 254 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
SF VY+DF YKSGVY+ +GGHAVKLIGWG ++G YW+
Sbjct: 256 ASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWL 301
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 143/286 (50%), Gaps = 26/286 (9%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 93
++ L++S I+ +N+ W A N S K +LG K + KTHD
Sbjct: 21 AYFLEESYIEMINDVATT-WTAGVNFDPSTPEKDLIK-MLGSKGVEAAKNASAHMFKTHD 78
Query: 94 KSLK----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 147
+ +P++FDAR W C TI + DQG+CGSCWAFG A +DR C+ N
Sbjct: 79 VAYNNNGYIPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTSSAFADRLCVATDGDFNE 138
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDS 194
LS +L CC CG+GC+GGYPI AW+YF HG+VT E C+PY +
Sbjct: 139 LLSAEELTFCC-HTCGNGCNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNE 197
Query: 195 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
G S +P +C R C L N H Y + I ++ GP+E
Sbjct: 198 DGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPIEA 256
Query: 255 SFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
SF VY+DF YKSGVY+ +GGHAVKLIGWG ++G YW+
Sbjct: 257 SFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGIPYWL 301
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 106/288 (36%), Positives = 147/288 (51%), Gaps = 45/288 (15%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-------VKPTPKGLL---------- 84
+I E+N +P + WKA N + TV + K LLG V+ + + +
Sbjct: 7 MINEINSDPSSTWKAGVNRNLAGKTVAEMKRLLGFAKKEGQVRYSEEQMTTIKHYNEAKA 66
Query: 85 -----LGVPVKTHD-KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 138
+GV + K+L LP +FD+R W +C I I +Q CGSCWAF A E+LSDR
Sbjct: 67 SAVKSVGVEEASKQFKTLGLPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDR 124
Query: 139 FCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG 196
FCI + +++ LS D+++C GCDGG +AW + + G+V + C PY G
Sbjct: 125 FCIASNGKVDVILSPQDMVSCD--YNDMGCDGGNLDNAWWWMKNKGIVPDSCMPYVSGGG 182
Query: 197 CSHPGCEPAYPTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
P C C N QL+ IS + DI EIY NGP
Sbjct: 183 ----------NVPACPSNCNGTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGP 232
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+ F+VY+DF +YKSGVY H TG +GGHA+K+IGWG + G DYW+
Sbjct: 233 VQGGFSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGV-EGGVDYWL 279
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 106/293 (36%), Positives = 142/293 (48%), Gaps = 77/293 (26%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+ H + D ++ VN+ W+A N F N +
Sbjct: 10 CLLVLANARSRP-----------SFHPVSDELVNYVNKR-NTTWQAGHN--FYNVDMSYL 55
Query: 71 KHLLGV---KPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
K L G P P ++ + LKLP SFDAR WPQC TI I DQG CGSCW
Sbjct: 56 KRLCGTFLGGPKPPQRVM------FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCW 109
Query: 128 AFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
AFGAVEA+SDR C + VN
Sbjct: 110 AFGAVEAISDRIC--------IHVNG---------------------------------- 127
Query: 188 CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEI 246
S P C TPKC + C + ++ KHY ++Y +++ +DIMAEI
Sbjct: 128 ----------SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 177
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 178 YKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 229
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 127/225 (56%), Gaps = 22/225 (9%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
DK +P+SFDAR+ WP C++I I DQ +CGSCWA LSDR CI + ++
Sbjct: 89 DKGDDIPESFDARTKWPNCTSIKHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHIS 148
Query: 153 --DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE 203
D ++CC CG GC+GG+PI A+ Y+ + GVVT C PY C H G E
Sbjct: 149 SIDFVSCCD-SCGFGCEGGWPIDAFEYYSYQGVVTGGDYGSKTGCRPY-PFHPCGHHGNE 206
Query: 204 PAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
Y TP+CV++C K KN +R K + Y + + + I EI ++GPV
Sbjct: 207 TYYGECPKEESTPECVKQCQKGYKNS-YRRDKTWGEDYYEVENSVKAIQREIMRSGPVVS 265
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SFTVY+DF++Y G+YKH G G HA+K+IGWGT + YW+
Sbjct: 266 SFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGT-EKNVPYWI 309
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/316 (37%), Positives = 163/316 (51%), Gaps = 43/316 (13%)
Query: 10 TCLLILGVISSQTFAEGVVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
L +LGV++S EG +L + +++ L D ++ +N WKA N +
Sbjct: 5 VALFLLGVLASVRAEEG---RLMVPAYLAPLSDKMVDYIN-FINTTWKAGHNEGHRDLET 60
Query: 68 GQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKLPKSFDARSAWPQC-------STISRILD 119
+ K LGV L P HD + +P FD+R W T R
Sbjct: 61 VRRK--LGVHRDNHKYRL--PELVHDTLEMDIPAQFDSRQQWQDWPHHPGDPGTKERADP 116
Query: 120 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
GH FGAVE++SDR CIH G + L+ +D+L+CC + CG GC+GG+P +AW Y
Sbjct: 117 VGH------FGAVESMSDRHCIHSGAKNIVHLAADDVLSCC-WGCGSGCNGGFPAAAWSY 169
Query: 178 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 223
+V G+VT E C PY C H C PTPKCVR C K N ++
Sbjct: 170 WVDKGIVTGGNYDTDEGCMPY-PVPSCDHHVNGTLGPCGQDPPTPKCVRLCRKGYNVDFK 228
Query: 224 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
+ KHY S+Y + S+ I EI KNGPVE +FTVY DF YKSGVYK + D +GGHA+
Sbjct: 229 DDKHYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAI 288
Query: 284 KLIGWGTSDDGEDYWV 299
+++GWG +D YW+
Sbjct: 289 RILGWGVEND-VPYWL 303
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 100/278 (35%), Positives = 148/278 (53%), Gaps = 33/278 (11%)
Query: 39 QDSIIKEVNENPKAGWKAARNPQFSN-YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
Q + ++ +N N WKA NPQ ++ Y G +L + L LG +K ++
Sbjct: 78 QAAFVEAIN-NRSTTWKAGVNPQRNDQYRTG----VLSDESMKFQLPLGFVLKKDEQ--P 130
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLL 155
LP SFDAR W C +++ + +QG C S +A AV ++DR+C+H + D+L
Sbjct: 131 LPMSFDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVHSEGKAQFNFGAYDVL 190
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------- 207
+CC CG GCDGG P + W Y+V +G+ + SH GC+ +YP
Sbjct: 191 SCC-HRCGFGCDGGVPSAVWHYWVENGITS-------GGAFGSHEGCQ-SYPFDVCKKSG 241
Query: 208 ----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
TP+C+R C N + KHY AY + D E IM E++ GP + +FT+Y DF
Sbjct: 242 DSNDTPRCLRFCQPGYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDF 301
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
YKSGVY+H G +G H+VK++GWG +D + YW+C
Sbjct: 302 VQYKSGVYRHTFGVRVGTHSVKVMGWGVENDVK-YWLC 338
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/219 (41%), Positives = 124/219 (56%), Gaps = 21/219 (9%)
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 156
P SFDAR+ WP+C +I I DQ CGSCWA + EA+SD C+ + + +S D+L+
Sbjct: 89 PDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTDILS 148
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--- 206
CCG CG GC GG+PI A+R+ GVVT + C PY C P Y
Sbjct: 149 CCGLDCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPY-SFYPCGQHKDVPYYGPC 207
Query: 207 -----PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
PTPKC + +K N+ ++ KH++ +Y + ++ I EIYKNGPV +F VYE
Sbjct: 208 PGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYE 267
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
D++ G+Y H G G HA K+IGWG ++G DYW+
Sbjct: 268 DYSS-TGGIYVHKWGIQTGAHADKVIGWG-RENGTDYWL 304
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 153/306 (50%), Gaps = 29/306 (9%)
Query: 17 VISSQTFA---EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHL 73
V SSQ F E + + ++ L + E N ++ +KA +P+ V + +
Sbjct: 13 VSSSQKFTRLEEFLAQPITKEAEQLTGEALVEYVNNRQSFFKAKYSPE----VVKKRRQF 68
Query: 74 LGVKPTPKGLLLG----VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF 129
L +KP +P+ + +P+SFD+R W C ++ I DQ +CGSCWA
Sbjct: 69 L-LKPQFIERSYNQENVLPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAV 127
Query: 130 GAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE- 186
A + +SDR CIH + LS D+LACCG CG GCDGGY AW++ GVVT
Sbjct: 128 SAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGG 187
Query: 187 ------ECDPYFDSTGCSHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAY 233
C PY +H G P++P TP C C + + N K + + Y
Sbjct: 188 AYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWY 247
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+ +D I EI + GPV +F +YEDF HY+ GVY H G + GGH++K+IGWG D
Sbjct: 248 WLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGV-DK 306
Query: 294 GEDYWV 299
G YW+
Sbjct: 307 GVKYWL 312
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/218 (40%), Positives = 126/218 (57%), Gaps = 19/218 (8%)
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 156
P SFDAR+ WP+C +I I DQ CGSCWA + EA+SD+ C+ + +S D+L+
Sbjct: 88 PDSFDARAHWPECRSIGTIRDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILS 147
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGC 202
CCG CG GC+ PI A+R+ VVT + C PY +H P
Sbjct: 148 CCGISCGYGCEV-LPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGPCP 206
Query: 203 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
+PTPKC + C +K N+ + K+++ +Y + S+ I EIYKNGPV +F VY+D
Sbjct: 207 RGLWPTPKCRKACQRKYNKSYNEDKYFATRSYYLPSNERSIREEIYKNGPVVAAFKVYQD 266
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F++Y+ G+Y H G G HAVK++GWG ++G DYW+
Sbjct: 267 FSYYRGGIYVHKWGGQTGAHAVKVVGWG-RENGTDYWL 303
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 140/268 (52%), Gaps = 27/268 (10%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L + +L +SI++ VN +P + W A P S T +F LG T +
Sbjct: 5 LFASVLAESIVETVNNDPSSTWVAVEYPA-SVITRAKFLARLGTYVTK------YEETSF 57
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
D LP++FD+R WP I + DQ CGSCWAF E + DR I +S
Sbjct: 58 DLDNALPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIKGCDFGDMSPQ 115
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
DL++C GC+GGY AW + HG+ TE+C PY +G P C
Sbjct: 116 DLVSC--DTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSG----------RVPACP 163
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
KCV + + RN S+S ++N+ + +M E+Y+NGP+ V+FTVY DF +YKSGVY H
Sbjct: 164 AKCVNGSAIVRNK---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSGVYVH 218
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
TG + GGHAV +GWG D+ YW+C
Sbjct: 219 KTGGIAGGHAVLCVGWGVEDN-TPYWLC 245
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/296 (36%), Positives = 141/296 (47%), Gaps = 51/296 (17%)
Query: 54 WKAARNPQFSNYTVGQF--------KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDAR 105
WK AR GQ ++ + P G PV +P +FDAR
Sbjct: 228 WKDARRIAGGTVMRGQVGFEELPRRRYTKEIAPAVPGRRRLTPVAQSSSDEDIPANFDAR 287
Query: 106 SAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCI---------------HFGMNLSL 149
A+P+C S I R+ DQ CGSCWAF + EA +DR CI L L
Sbjct: 288 EAFPECASIIGRVRDQSDCGSCWAFASTEAFNDRRCIAGIGKEDAAGAEGEATADQLLVL 347
Query: 150 SVNDLLACC-GFLCG--DGCDGGYPISAWRYFVHHGVVT----------EECDPY----- 191
S D ACC GF CG GC+GG P SAW++F GVVT C PY
Sbjct: 348 SAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPC 407
Query: 192 ---FDSTGCSHPGC-EPAYPTPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIMA 244
D +P C + YPTP+C+ +C + N + K + AY + + E+I
Sbjct: 408 AHHVDPGASGYPACPDGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL-AGIENIQR 466
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWV 299
++ K G V +F+V+ DF Y GVY H +G MGGHAVK+IGWGT + GEDYW+
Sbjct: 467 DMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWL 522
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 140/268 (52%), Gaps = 27/268 (10%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L + +L +SI++ VN +P + W A P S T +F LG T +
Sbjct: 5 LFASVLAESIVETVNNDPSSTWVAVEYPA-SVITRAKFLARLGTYVTK------YEETSF 57
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
D LP++FD+R WP I + DQ CGSCWAF E + DR I ++
Sbjct: 58 DLDNALPENFDSREQWP--GKILPVRDQASCGSCWAFSVAETMGDRLSIKGCDYGDMAPQ 115
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
DL++C GC+GGY AW + HGV TE+C PY +G P C
Sbjct: 116 DLVSC--DTTDMGCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSG----------RVPACP 163
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
KCV + + RN S+S ++N+ + +M E+Y+NGP+ V+FTVY DF +YKSGVY H
Sbjct: 164 AKCVNGSAIVRNK---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSGVYVH 218
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
TG + GGHAV +GWG D+ YW+C
Sbjct: 219 KTGGIAGGHAVLCVGWGVEDN-TPYWLC 245
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 153/310 (49%), Gaps = 39/310 (12%)
Query: 6 LFLTT--CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFS 63
LFL + C+ +L V + A G VS D +L +I+++N + + W A F
Sbjct: 2 LFLRSLICICLLAVATGIPVA-GAVSHG--DDPVLDKDMIEQINSDKDSLWTAGETEIFK 58
Query: 64 NYTVGQFKH-LLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQ 120
T+ +F+ +LG++ VPVK H + LP+SF+ WP + + I DQ
Sbjct: 59 GMTMKEFRSSMLGLRLDRD--YSEVPVKVHSSTALKDLPESFNCYENWP--NYMHPIRDQ 114
Query: 121 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRY 177
CGSCWAF A E LSDRF I + +N LS DL++C GD GC GGY AW Y
Sbjct: 115 ARCGSCWAFAASEVLSDRFAIASNGTVNKILSPEDLVSCDK---GDMGCQGGYLDKAWDY 171
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
+G+VTE C PY G + P C CV K Y S Y +
Sbjct: 172 LKTNGIVTESCFPYAAQKGVA----------PSCRISCVDGEPY----KKYKASDYYQLT 217
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-GGHAVKLIGWGTS----- 291
EDIM EIY NGPVE F VY F YKSGVY H D+M GGHA+K++GWG
Sbjct: 218 TEEDIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRF 277
Query: 292 -DDGEDYWVC 300
YW+C
Sbjct: 278 WQKPTKYWIC 287
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 150/300 (50%), Gaps = 44/300 (14%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH-LLGVKPTPKGLLLGVPVKTHDK 94
H L D I+ +N N W A RN F T ++ + L+G + L T +
Sbjct: 24 HPLSDEFIESINFNQNT-WIAGRN--FPKKTPLKYIYNLMGTLSDSRMDNLPQRNYTFSR 80
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
K P FDAR W C T+ I DQG CGSCWA AV A++DR CI + S+
Sbjct: 81 KTKYPNQFDAREHWKNCPTLKDIRDQGGCGSCWAVAAVSAMTDRMCILSKGKEHFYFSIK 140
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------ 199
D+L+CCG+ CG+GC+GG AW Y+ G+V+ + C PY C+H
Sbjct: 141 DVLSCCGY-CGNGCEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPY-TIPPCNHLVWGEI 198
Query: 200 ---------PGCE--PAYP--------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 239
P C+ P P TP+C +KC K ++ + KH S YR+
Sbjct: 199 EQCKNIPMTPKCKNIPVIPEQCKYIPITPECEKKCNKNYKVCYSKDKHRGKSVYRVKKS- 257
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+I EIY+ GPV FTVYEDF +YK G+Y + +G +G H+VK+IGWG + G YW+
Sbjct: 258 -EIFKEIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLHSVKIIGWG-EERGIKYWL 315
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 100/264 (37%), Positives = 137/264 (51%), Gaps = 26/264 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKL-----PKSFDARSAW 108
WK RN F N ++G+ K LLG + PK + + + L L P FD+R W
Sbjct: 240 WKFGRNAYFKNKSIGEIKKLLGYRMLPKTVKERNEMPMPEDLLNLENFNYPVEFDSRKHW 299
Query: 109 PQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 165
PQC IS I DQ +CGSCWA + +SDR CI + LS +LL+CC CG G
Sbjct: 300 PQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAELLSCCT-SCGYG 358
Query: 166 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE--PAYPTPKCVRKCV 216
C+GGYP ++Y+V+ G+ T + C PY P C TPKC + C+
Sbjct: 359 CNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPY------PIPPCSNCSETRTPKCSKSCI 412
Query: 217 KKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 275
L N +HY + Y+ + +M +I GP+ +VYEDF HYK GVY +G
Sbjct: 413 STYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESG 472
Query: 276 DVMGGHAVKLIGWGTSDDGEDYWV 299
+GGHAV++IGWG D+ YW+
Sbjct: 473 IFLGGHAVRIIGWGEQDN-IPYWL 495
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 126/220 (57%), Gaps = 20/220 (9%)
Query: 87 VPVKTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG- 144
+P++ H++ LP+SFDAR AW C +I I DQ CGSC AFGA EA+SDR CIH
Sbjct: 13 LPIRLHEEIPEDLPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKG 72
Query: 145 -MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 196
+ +++S DLL CC CG GC GGYP +AW Y+ G+VT + C PY+
Sbjct: 73 RVQVNISAQDLLTCC-HQCGMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPP- 130
Query: 197 CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
C H P C PTPKC++ C K + + K+++ + Y ++SD I EIYKN
Sbjct: 131 CEHHTKGPLPNCTDTKPTPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKN 190
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
GPVE F+VY DF YKSGVY+ + ++ L GW
Sbjct: 191 GPVEADFSVYTDFLAYKSGVYQRHSYELWEARHQNL-GWA 229
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 150/310 (48%), Gaps = 35/310 (11%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
F+ LLI G S+ + + L D I +N + W+A RN F+ T
Sbjct: 4 FILFSLLICGTFSAS-----------IPTDPLSDEFIDYIN-TLQTTWRAGRN--FAPNT 49
Query: 67 VGQF-KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
++ K L GV +P + + +P FDAR WP C +I+ I DQG CGS
Sbjct: 50 PKKYLKSLAGVHKNANNAFT-LPKRKVSLDVTIPDEFDARKQWPNCPSITDIRDQGSCGS 108
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CWA + F H + + LS +L+ CCG CG GC GG P SAW Y+ G+
Sbjct: 109 CWALELLRLCLIVFVSHSNGKLQVHLSAENLVTCCG-SCGAGCFGGDPGSAWEYWRDVGI 167
Query: 184 VT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYS 229
V+ E C PY C H P C T C ++C K + + HY+
Sbjct: 168 VSGGNYGSKEGCQPY-SIAPCEHHIPGSRPPCRGEGHTADCRKQCEKGYSIPYDKDLHYA 226
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
Y D ++I EI KNGPVE +F VYED YK GVYKH+ G +GGHA+K++GWG
Sbjct: 227 EFVYSTERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWG 286
Query: 290 TSDDGEDYWV 299
++G YW+
Sbjct: 287 V-ENGTPYWL 295
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 158/323 (48%), Gaps = 43/323 (13%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M + ++ +L+LGV ++ ++ L++ I +NE K WKA N
Sbjct: 1 MGARMWISSSVILLLGVCVTE------------QAYFLEEDFIDSINEKAKT-WKAGIN- 46
Query: 61 QFSNYTVGQF-KHLLGVK--PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCS 112
F T ++ LLG K P L L + KT D++ ++PK FDAR W +C
Sbjct: 47 -FDPNTPKEYIVKLLGSKGVQVPHKLNLKM-YKTDDEAYVNLFGRIPKKFDARKEWRRCI 104
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGY 170
TI ++ DQG+CGSCWA A +DR CI ++ N LS +L CC LCG C GGY
Sbjct: 105 TIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELTFCC-HLCGFACHGGY 163
Query: 171 PISAWRYFVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVK 217
PI AW YF HG+VT E C PY + G + +P +C R C
Sbjct: 164 PIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYG 223
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGD 276
++ + H Y + I ++ GP+E S VY+DF YKSGVY K
Sbjct: 224 DQEIDYDDDHRFTRDYYYLT-YASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENAT 282
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
+GGHAVKLIGWG +DG YW+
Sbjct: 283 YLGGHAVKLIGWG-EEDGVPYWL 304
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 158/323 (48%), Gaps = 43/323 (13%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M + ++ +L+LGV ++ ++ L++ I +NE K WKA N
Sbjct: 1 MGARMWISSSVILLLGVCVTE------------QAYFLEEDFIDSINEKAKT-WKAGIN- 46
Query: 61 QFSNYTVGQF-KHLLGVK--PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCS 112
F T ++ LLG K P L L + KT D++ ++PK FDAR W +C
Sbjct: 47 -FDPNTPKEYIVKLLGSKGVQVPHKLNLKM-YKTDDEAYVNLFGRIPKKFDARKEWRRCI 104
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGY 170
TI ++ DQG+CGSCWA A +DR CI ++ N LS +L CC LCG C GGY
Sbjct: 105 TIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELTFCC-HLCGFACHGGY 163
Query: 171 PISAWRYFVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVK 217
PI AW YF HG+VT E C PY + G + +P +C R C
Sbjct: 164 PIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYG 223
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGD 276
++ + H Y + I ++ GP+E S VY+DF YKSGVY K
Sbjct: 224 DQEIDYDDDHRFTRDYYYLTYAS-IQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENAT 282
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
+GGHAVKLIGWG +DG YW+
Sbjct: 283 YLGGHAVKLIGWG-EEDGVPYWL 304
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 106/287 (36%), Positives = 141/287 (49%), Gaps = 27/287 (9%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 93
++ L++S I+ +N+ W A N S F +LG K + KTHD
Sbjct: 21 AYFLEESYIEMINDVATT-WTAGVNFDPST-PEKDFIKMLGSKGVEAAKNASAHMFKTHD 78
Query: 94 -----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
+ +P++FDAR W C TI + DQGHCGSCWA A +DR C+ + N
Sbjct: 79 VANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFN 138
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------D 193
LS ++ CC CG GC+GGYPI AW+YF HG+VT E C+PY D
Sbjct: 139 ELLSAEEITFCC-HTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQD 197
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
G S +P +C R C L N H Y + I ++ GP+E
Sbjct: 198 EEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPIE 256
Query: 254 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
SF VY+DF YKSGVY+ +GGHAVKLIGWG ++G YW+
Sbjct: 257 ASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWL 302
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 138/259 (53%), Gaps = 25/259 (9%)
Query: 54 WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 112
W+A NP+ + Y G L P G++ V + L LP +FDAR WP+C
Sbjct: 86 WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGY 170
++ I DQG CGSCWA A A++DR+C+ DLL+CC CG GC GG
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198
Query: 171 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 220
AW+++V G+ + + C PY C PG + TPKC KC
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDE--DTPKCSNKCRSGYNVTD 255
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+W++ +HY AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVKL+GWG ++G YW+
Sbjct: 315 HAVKLLGWGV-ENGVKYWL 332
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 105/281 (37%), Positives = 141/281 (50%), Gaps = 32/281 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
+ S++ E+N + +F N ++ K L G L+ G ++DK++K
Sbjct: 82 IMQSLVDEINSKQNTWTASTGQKRFKNLSLRDAKMLCGT------LMRG----SNDKAVK 131
Query: 98 ----------LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
LP FDAR+A+P CS I I DQ CGSCWAFG EA +DR CI
Sbjct: 132 KGYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGA 191
Query: 147 LS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHP--G 201
+ LS ++ AC F GC GG P SAW + G+ T E P S + P
Sbjct: 192 FTELLSAGEMNACTLFF---GCGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIA 248
Query: 202 CEPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ YPTP CV +C K R+ +H+ + + + D I +GPV SFTVY
Sbjct: 249 YQDIYPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVY 308
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
EDF YKSGVYKH +G +GGHAVK+IGWG G+ YW+
Sbjct: 309 EDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EKSGQAYWLA 348
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 138/259 (53%), Gaps = 25/259 (9%)
Query: 54 WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 112
W+A NP+ + Y G L P G++ V + L LP +FDAR WP+C
Sbjct: 86 WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGY 170
++ I DQG CGSCWA A A++DR+C+ DLL+CC CG GC GG
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198
Query: 171 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 220
AW+++V G+ + + C PY C PG + TPKC KC
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTD 255
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+W++ +HY AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVKL+GWG ++G YW+
Sbjct: 315 HAVKLLGWGV-ENGVKYWL 332
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 113/313 (36%), Positives = 158/313 (50%), Gaps = 45/313 (14%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQF 70
+LI ++ S F E +H L I ++NE K WKA +N P+ N Q
Sbjct: 6 ILISVILLSVYFTE--------QAHFLSKDYINKINEVAKT-WKAKQNFPE--NTPKEQI 54
Query: 71 KHLLGVKPTPKGLLLGV---PVKTHDK----SLKLPKSFDARSAWPQCSTISRILDQGHC 123
LLG K LLGV P+K +D+ + ++P+ FD+R W C TI + +QG+C
Sbjct: 55 VRLLGSK-----RLLGVSKSPIKENDELYMDNSEVPEFFDSRLEWDYCETIGHVRNQGNC 109
Query: 124 GSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWA G A +DR C+ N +S +L CC C GC+GGYP+ AW+YF H
Sbjct: 110 GSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCC-HRCVFGCNGGYPLKAWQYFKRH 168
Query: 182 GVV-------TEECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
GVV T+ C PY D G + +P KC +KC + + HY
Sbjct: 169 GVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHY 228
Query: 229 SI-SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLI 286
AY + + +Y GP+E SF VY+DF +Y+SGVY+ +GGHAVK+I
Sbjct: 229 KTKDAYYLKNTTMQKDTMVY--GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMI 286
Query: 287 GWGTSDDGEDYWV 299
GWG ++G YW+
Sbjct: 287 GWGV-EEGTPYWL 298
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 138/259 (53%), Gaps = 25/259 (9%)
Query: 54 WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 112
W+A NP+ + Y G L P G++ V + L LP +FDAR WP+C
Sbjct: 86 WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGY 170
++ I DQG CGSCWA A A++DR+C+ DLL+CC CG GC GG
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198
Query: 171 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 220
AW+++V G+ + + C PY C PG + TPKC KC
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTD 255
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+W++ +HY AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVKL+GWG ++G YW+
Sbjct: 315 HAVKLLGWGV-ENGVKYWL 332
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 102/290 (35%), Positives = 154/290 (53%), Gaps = 45/290 (15%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSN----------YTVGQFKHLLGVKPTPKGLLLGVPV 89
D I+ +N +P +G KA+++ +F+ Y QF+H + +P+
Sbjct: 27 DEQIRFLNNHPSSGLKASKHNRFTAISDVYSALEYYGEKQFRHHI------------LPI 74
Query: 90 KTHDK-SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 146
+HD ++ LP FD+R W C +I RI DQ C S WA +V A+SDR CI +
Sbjct: 75 ISHDDDNILLPDYFDSREQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVK 134
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------- 197
+ LS +L++CC C GC+ GY SAW Y+V +G+VT E + +++GC
Sbjct: 135 VELSAIELVSCCS-KCAVGCNFGYSESAWYYWVENGLVTGESNG--NNSGCLPYPFPKCD 191
Query: 198 -----SHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 250
S+P C Y P C C + + + KH+ SAY++ + DI EI G
Sbjct: 192 HGSSDSYPMCGYVVYTPPVCNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYG 251
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
PVE S +Y+DF YKSGVYKH+TG ++ +V++IGWG ++G YW+C
Sbjct: 252 PVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRIIGWGI-ENGIPYWLC 300
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 99/241 (41%), Positives = 133/241 (55%), Gaps = 23/241 (9%)
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+F+NYT Q K LLG + + G+ T + LP SFD+R+ W C + I DQ
Sbjct: 45 KFANYTEAQLKGLLGTVLSHQS---GISAFTQINAA-LPDSFDSRTQWKDC--VHPIRDQ 98
Query: 121 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
CGSCWAF A E+LSDRFCI +NL LS D+++C GC GGY AW+Y
Sbjct: 99 AQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMVSC--DTSNFGCFGGYLDQAWQYL 156
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
GV ++ C+PY S G +P+ PT + +KK + S + A
Sbjct: 157 EQQGVSSDSCEPYK-----SGNGDQPSCPTKCSNGQAIKKYKCKAGSTKQAKGA------ 205
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E + I ++GPVE FTVY+DF +Y SGVY H+TGD GGHAVK++GWG E+YW
Sbjct: 206 -EATKSLIQESGPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWGKQGL-ENYW 263
Query: 299 V 299
+
Sbjct: 264 I 264
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/222 (40%), Positives = 117/222 (52%), Gaps = 24/222 (10%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLL 155
+P FDAR WP C TI I +QG C SCWA + +SDR CIH G + LS +LL
Sbjct: 113 IPAEFDARLRWPNCPTIGEIFEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLL 172
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY-PTPK---- 210
+CC LCG GC GG+P AW ++ HG+VT Y GC P Y P K
Sbjct: 173 SCCK-LCGKGCKGGFPGGAWMHWSKHGIVTG--GSYSSDYGCQKYQFFPCYQPRTKGSIK 229
Query: 211 ------------CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
C C N+ ++ +Y S YRI +D I EI +NGPV+ +
Sbjct: 230 NKCPKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQANLR 289
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+YEDF HYK GVY+H+ G + HAVK+ GWGT + G YW+
Sbjct: 290 IYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGT-EGGTPYWL 330
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 124/220 (56%), Gaps = 21/220 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P+S +R+ WP+CS++ I DQ +CGSCWA ALSDR CI + + +S D+L
Sbjct: 2 IPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDIL 61
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPG------ 201
+CCG CG GC+GG+PI A+ YF G VT C PY F C H G
Sbjct: 62 SCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHP--CGHHGKDTYYG 119
Query: 202 -CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C TPKCVRKC + ++ + AY + + EI KNGPV +FTVY
Sbjct: 120 ECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAEKATQREIMKNGPVVGAFTVY 179
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EDF++YK G+YKH G GGHA+K+IGWG + G YW+
Sbjct: 180 EDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWL 218
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 96/217 (44%), Positives = 120/217 (55%), Gaps = 13/217 (5%)
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
D P+SF R W CS+I I DQ CGSCWAF A E++SDR CIH + +++
Sbjct: 81 EDSRWTCPESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNI 140
Query: 150 SVNDLLACCGFLCGDGCDG-----GYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHPGCE 203
S DLLACC CG GCDG I R V V TE+ C PY S P C
Sbjct: 141 SAEDLLACC-HTCGHGCDGRCHCSSVAILQGRRLVPEPVRTEDGCQPY--SLPPCVPNCT 197
Query: 204 PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
PTPKC C K + + KH++ + YR+ + I +IYKNGPVE +F VY DF
Sbjct: 198 HPEPTPKCQHVCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADF 257
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVY+ MG HA+K++GWGT +DG YW+
Sbjct: 258 PSYKSGVYQQHMIKFMGVHAIKILGWGT-EDGVPYWL 293
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/244 (40%), Positives = 130/244 (53%), Gaps = 29/244 (11%)
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+F+NYT Q K LLG + + G+ T + LP SFD+R+ W C + I DQ
Sbjct: 45 KFANYTEAQLKGLLGTVLSHQS---GISAFTQINA-ALPDSFDSRTQWKDC--VHPIRDQ 98
Query: 121 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC-CGFLCGDGCDGGYPISAWRY 177
CGSCWAF AVE+LSDRFCI +NL LS D+L+C C C GGY +AW+Y
Sbjct: 99 AKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDMLSCDASNFC---CFGGYLDTAWQY 155
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA--YRI 235
GV ++ C+PY G P C KC + K Y A +
Sbjct: 156 LEQQGVGSDSCEPYKSGNG----------DQPSCPSKCSNGQAI----KKYKCKAGSTKQ 201
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
E + I ++GPVE FT+YEDF +Y SG+Y H+TG MGGHAVK++GWG E
Sbjct: 202 AKGAEATKSLIQQSGPVETGFTIYEDFLNYNSGIYHHVTGGNMGGHAVKILGWGKQGL-E 260
Query: 296 DYWV 299
+YW+
Sbjct: 261 NYWI 264
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/288 (37%), Positives = 140/288 (48%), Gaps = 29/288 (10%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV-PVKTHD 93
++ L+ S I +NE W A N S F +LG K KT+D
Sbjct: 21 AYFLEKSYIDMINEVATT-WTAGVNFDPS-IPEDHFIKMLGSKGVESAKQASAHEFKTND 78
Query: 94 KSLK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMN 146
+ +P++FDAR W C TI + DQGHCGSCWAFG A +DR C+ N
Sbjct: 79 VAYDNHFGHIPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFN 138
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FD 193
LS ++ CC CG GC GGYPI AW+YF HG+VT E C+PY D
Sbjct: 139 ELLSAEEITFCC-HTCGFGCHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRD 197
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPV 252
G + +P +C R C L N H ++ Y + I ++ GP+
Sbjct: 198 DKGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYGS--IQKDVMTYGPI 255
Query: 253 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
E SF VY+DF YKSGVY K +GGHAVKLIGWG ++G YW+
Sbjct: 256 EASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGV-EEGTPYWL 302
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/284 (36%), Positives = 150/284 (52%), Gaps = 44/284 (15%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVK---------PTPKGLLLGVPVKTHDKSLK-LPKSFD 103
W+ + +F ++ K L+G PTPKG+ L P K + + + +P FD
Sbjct: 222 WEPEVSLRFRYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPL--PAKEFENATEPVPAHFD 279
Query: 104 ARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGF 160
AR+A+P C + + DQG CGSCWAF + EA +DR CI + LS +CC
Sbjct: 280 ARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNA 339
Query: 161 L-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH------PGC 202
+ C GC+GG P AWR+F GVVT C PY + C+H P C
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDC 398
Query: 203 EPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
+ TPKC + C ++ + H + SAY + S +D+ ++ +GPV
Sbjct: 399 DATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSG 457
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+F VYEDF YKSGVYKH++G +GGHA+K+IGWGT ++GE+YW
Sbjct: 458 AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYW 500
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/284 (36%), Positives = 150/284 (52%), Gaps = 44/284 (15%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVK---------PTPKGLLLGVPVKTHDKSLK-LPKSFD 103
W+ + +F ++ K L+G PTPKG+ L P K + + + +P FD
Sbjct: 225 WEPEVSLRFRYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPL--PAKEFENATEPVPAHFD 282
Query: 104 ARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGF 160
AR+A+P C + + DQG CGSCWAF + EA +DR CI + LS +CC
Sbjct: 283 ARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHTTSCCNA 342
Query: 161 L-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH------PGC 202
+ C GC+GG P AWR+F GVVT C PY + C+H P C
Sbjct: 343 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDC 401
Query: 203 EPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
+ TPKC + C ++ + H + SAY + S +D+ ++ +GPV
Sbjct: 402 DATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSG 460
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+F VYEDF YKSGVYKH++G +GGHA+K+IGWGT ++GE+YW
Sbjct: 461 AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYW 503
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/282 (38%), Positives = 143/282 (50%), Gaps = 26/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 94
L D IK +NE K WKA R +N + F LLG + K V +K +D
Sbjct: 23 QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEVEIKKYDPL 79
Query: 95 --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
PK FD+R W C I I DQG+CGSCW+F A +DR C+ G N LS
Sbjct: 80 YVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 198
+L CC CG GC GGYPI AW+YF GV T E C PY +D G +
Sbjct: 140 PEELAFCC-MDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKN 198
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
G +P +C + C K + ++ + + Y INS E I ++ GPVE SF V
Sbjct: 199 TCGGKPMERNHQCPKTCYGKTTV--QDRYKTKNEYVINS-IETIEQDLMTYGPVEASFDV 255
Query: 259 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+DF+ YKSG+Y+ GGH++K+IGWG ++G YW+
Sbjct: 256 YDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWG-EENGTPYWL 296
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 122/229 (53%), Gaps = 17/229 (7%)
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
+P+ + +P+SFD+R W C ++ I DQ +CGSCWA A + +SDR CIH
Sbjct: 85 LPIANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGR 144
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 197
+ LS D+LACCG CG GCDGGY AW++ GVVT C PY
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204
Query: 198 SHPGCE----PAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 250
+H G P++P RK + + + N K + + Y + +D I EI + G
Sbjct: 205 AHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLEIMQKG 264
Query: 251 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
PV +F +YEDF HY GVY H G + GGH++K+IGWG D G YW+
Sbjct: 265 PVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWL 312
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/295 (36%), Positives = 149/295 (50%), Gaps = 39/295 (13%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG--VKPTPKGLLLGVPVKTHDKS 95
+ S++ EVN + +F ++G K L G + T + L V ++
Sbjct: 41 IMQSLVDEVNSKQNLWTASTEQGRFYGRSLGDAKKLCGTFLNGTEE---LEEKVYPAEEL 97
Query: 96 LKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ +P SFDAR A+ +C I + DQ CGSCWAFG VEA + R CI G +N LS
Sbjct: 98 VDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAA 157
Query: 153 DLLACCG---FLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTG 196
D+LACC F GC GG PI++W + +G+V+ + C PY +
Sbjct: 158 DMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPY-NFPK 216
Query: 197 CSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINSDPEDIMA 244
C+H P + Y TP C C K + +HY+ S + R S I
Sbjct: 217 CAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS-TSSIKK 275
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI NGP +F+VYEDF YKSGVYKH +G +GGHAV++IGWGT + G DYW+
Sbjct: 276 EIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDYWL 329
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/284 (36%), Positives = 150/284 (52%), Gaps = 44/284 (15%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVK---------PTPKGLLLGVPVKTHDKSLK-LPKSFD 103
W+ + +F ++ K L+G PTPKG+ L P K + + + +P FD
Sbjct: 222 WEPEVSLRFRYLSLKDAKKLMGTFLVNTKVEGFPTPKGMPL--PAKEFENATEPVPAHFD 279
Query: 104 ARSAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGF 160
AR+A+P C + + DQG CGSCWAF + EA +DR CI + LS +CC
Sbjct: 280 ARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNA 339
Query: 161 L-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH------PGC 202
+ C GC+GG P AWR+F GVVT C PY + C+H P C
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDC 398
Query: 203 EPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
+ TPKC + C ++ + H + SAY + S +D+ ++ +GPV
Sbjct: 399 DATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSG 457
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+F VYEDF YKSGVYKH++G +GGHA+K+IGWGT ++GE+YW
Sbjct: 458 AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYW 500
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/318 (34%), Positives = 155/318 (48%), Gaps = 42/318 (13%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI + + ++ LQ I +N N WKA N N
Sbjct: 1 MARALMLLSVIFVSVY-------VTEQAYFLQKDFIDNIN-NHATTWKAGVNFD-PNTPK 51
Query: 68 GQFKHLLGVK----PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRIL 118
F +LG K P + + KTHD + ++PK FDAR W +C TI ++
Sbjct: 52 EYFLKMLGSKGVQIPDKHNIHM---YKTHDAAYDNLFGRIPKHFDARKKWKRCHTIGKVR 108
Query: 119 DQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 176
DQG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 109 DQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCCS-SCGYGCNGGYPIKAWE 167
Query: 177 YFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR 223
F + G+VT E C+PY +D+ G + +P +C R C L
Sbjct: 168 SFNNRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDY 227
Query: 224 NSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGH 281
N H ++ +Y + I ++ + GP+E SF +Y+DF YKSGVY + +GGH
Sbjct: 228 NDDHRFTRDSYYLTY--SSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGH 285
Query: 282 AVKLIGWGTSDDGEDYWV 299
AVKLIGWG + G YW+
Sbjct: 286 AVKLIGWG-EEHGVLYWL 302
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/273 (36%), Positives = 147/273 (53%), Gaps = 32/273 (11%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKG------LLLGVPVKTHD 93
+ ++K+VNE K W A P+ S+ ++ K L+G+K G LLG K+
Sbjct: 43 EDMVKKVNE-AKTTWTAEELPRISSMSLNAKKGLMGLKAFHDGGFQKHKQLLGARPKSAS 101
Query: 94 K--SLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 148
K + KLP+ FD+R + +C+ I I DQ +CGSCWA + + DR CI + +
Sbjct: 102 KLDATKLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVH 161
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP---- 204
+S D+L+C GC+GGYP A+ ++ GVVT S ++ GC+P
Sbjct: 162 ISAQDILSCATDRS-QGCNGGYPDEAFEHYAQSGVVT-------GSGNSANQGCKPYPFL 213
Query: 205 -----AYPTPKCVRKC--VKKNQLWRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSF 256
Y TP+C +KC + + ++ KH+ +S Y + SDP DI EI NGPVE +
Sbjct: 214 PHTTVEYSTPECSKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMNNGPVEANM 273
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
VY DF YKSGVY+ + +GGHAV+++GWG
Sbjct: 274 IVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWG 306
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 115/205 (56%), Gaps = 15/205 (7%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 160
FDA AWP C TI+ I DQ CGSCWA A A+SDR+C G+ +L +S DLL+CC
Sbjct: 1 FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYCTRGGVRDLRISAGDLLSCCN- 59
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------PGCEPAYPTPKCVR 213
CG GC+GG P AW Y+V G+V+E C PY C+H C Y TP C
Sbjct: 60 ACGLGCNGGDPDWAWLYYVETGIVSEFCQPY-PFPPCAHHVNSTHYTPCSVEYDTPFCNI 118
Query: 214 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
C + S S S ED E++ GP EV+FTVYEDF Y GVYKH
Sbjct: 119 TCTNTIPPIKYKGRISYSL----SGEEDYKRELFLYGPFEVAFTVYEDFVAYSDGVYKHF 174
Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYW 298
+G+ +GGHAV+L+GWG +G YW
Sbjct: 175 SGNALGGHAVRLVGWGNL-NGTPYW 198
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 149/278 (53%), Gaps = 18/278 (6%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYT-VGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
+IL D +I+ +N P AGWKA++ +F + + V G++ KG+L + D+
Sbjct: 23 NILSDELIQYINNYPSAGWKASKQNRFKSISDVYNTFGYYGIRHFRKGIL--STISHEDE 80
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+++LP FD+R W C +I+ I DQ C S WA + ++SDR CI M + LS
Sbjct: 81 NIQLPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAI 140
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---CDPY-----FDSTGCSHPGC-E 203
+L++C G C G+ +W Y++ +G+VT + C PY + S+P C
Sbjct: 141 ELISCSKNKLG--CQIGFSEFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGY 198
Query: 204 PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
Y P C + C + ++ KHY Y + + DI EI NGPVE V+ DF
Sbjct: 199 ITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDF 258
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+YKSGVY+HITG ++ H+V++IGWG +D YW+C
Sbjct: 259 LNYKSGVYRHITGQLVTIHSVRIIGWGIENDIP-YWLC 295
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/271 (36%), Positives = 139/271 (51%), Gaps = 27/271 (9%)
Query: 29 SKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP 88
++L L + +L +SI + +N NP + W A P S + + + LG + TP
Sbjct: 1 TRLLLIAAVLAESIPETINRNPNSTWVAIDYPA-SVISHEKLRSKLGARFTPHR------ 53
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 148
V+ + S K+P +FDAR WP I + DQG CGSCWAF E + DR +
Sbjct: 54 VRPYRDSNKVPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDRLGVLGCSRGD 111
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 208
++ DL++C F DGCDGG+ AW + +G+ TEEC PY G P
Sbjct: 112 IAPEDLVSCDIF--DDGCDGGFIDMAWDWCQENGLTTEECIPYKAGEGVPSP-------- 161
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C + ++R I +YR D +DI EIY+ GPV + F VY DF YKSG
Sbjct: 162 --CPETCEDGSAIYRTP----IESYRY-IDADDIQGEIYEYGPVSMGFIVYSDFMSYKSG 214
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY H G + GGHAV ++GWG D+ YW+
Sbjct: 215 VYVHQAGYIEGGHAVLIVGWGVEDE-VPYWL 244
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 98/259 (37%), Positives = 137/259 (52%), Gaps = 25/259 (9%)
Query: 54 WKAARNPQF-SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 112
W+A NP+ + Y G L P G++ V + L LP +FDAR WP+C
Sbjct: 86 WRAGSNPKPPAGYRSGVNMADLERTKLPLGIMADV------EDLDLPDTFDAREKWPECP 139
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGY 170
++ I DQG CGSCWA A A++DR+C+ DLL+CC CG GC GG
Sbjct: 140 SLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGT 198
Query: 171 PISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKKNQ 220
AW+++V G+ + + C PY C PG + TPKC KC
Sbjct: 199 LGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDE--DTPKCSNKCRSGYNVTD 255
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 280
+W++ +H AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GG
Sbjct: 256 VWQD-RHIGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGG 314
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVKL+GWG ++G YW+
Sbjct: 315 HAVKLLGWGV-ENGVKYWL 332
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 162/304 (53%), Gaps = 27/304 (8%)
Query: 10 TCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ 69
T L++LG+ A V + + + D+ ++ V ++ WK N + SN
Sbjct: 12 TVLILLGL------ACFVQATDRQGQNPFNDAFLRRVLARARS-WKPDTNFR-SNIHYHT 63
Query: 70 FKHLLGVKPTPKGLLLGVPVKTHDK--SLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
F+ L G+ + G VP+K +D + +P+SFD+R WP C ++ I +QG CGSCW
Sbjct: 64 FRSLKGIGESRTGFK--VPIKHYDYVYDIDIPESFDSRDRWPNCDSLREIRNQGTCGSCW 121
Query: 128 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV 184
A A +SDR CIH N++++ DL+ CC CG+GC+GG+ ++++Y+V G+V
Sbjct: 122 AVAAASVMSDRVCIHTNGTRNVAIAAEDLMGCCA-DCGNGCEGGFLDGTSFQYWVDAGLV 180
Query: 185 -------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 235
TE C PY C +P + +PKC C ++ + K + AY +
Sbjct: 181 SGGAYNSTEGCKPY-PFKPCLYPFTDCHREESPKCKHHCQHGVDKRYARDKVFGSVAYSV 239
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 295
D I EI NGPVE F VYED YKSGVY+H+ G+ +G HAV++IGWG + G
Sbjct: 240 PRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWG-REGGI 298
Query: 296 DYWV 299
YW+
Sbjct: 299 PYWL 302
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 101/268 (37%), Positives = 137/268 (51%), Gaps = 27/268 (10%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L + +L +SI++ VN +P + W A P S T +F LG + +T+
Sbjct: 5 LFASVLAESIVETVNNDPSSTWVAVEYPA-SVITRAKFLARLGTH------VEEYEERTY 57
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ LP++FDAR WP+ I + DQ CGSCWAF E + DR I +S
Sbjct: 58 ESDNALPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGHMSPQ 115
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
DL++C GC+GGY AW + HGV EEC PY G P C
Sbjct: 116 DLVSC--DTTDMGCNGGYMDKAWAWTKSHGVTNEECMPYQSGGG----------RVPACP 163
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
KCV + + R +K S + + + + E+Y+NGP+ V+FTVY DF +YKSGVY H
Sbjct: 164 AKCVNGSTIVR-TKSQSFTHFTAS----QMQQELYENGPLSVAFTVYYDFMNYKSGVYVH 218
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
TG V GGHAV IGWG D+ YW+C
Sbjct: 219 KTGGVAGGHAVLCIGWGVEDN-TPYWLC 245
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/294 (36%), Positives = 148/294 (50%), Gaps = 37/294 (12%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLLGV-- 87
L +H L + ++NE K WKA +N P+ N LLG K LLG+
Sbjct: 17 LTEQAHFLSKEYVNKINEVAKT-WKAKQNFPE--NTPREDIVRLLGSK-----RLLGLNK 68
Query: 88 -PVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
P+K +D + ++P+ FD+R W C TI + +QG+CGSCWA G A +DR CI
Sbjct: 69 SPIKENDILYVDNGEVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIA 128
Query: 143 FG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPY-- 191
N +S +L CC CG GC+GG P+ AW+YF HGVV T+ C PY
Sbjct: 129 TDGEFNELISAEELTFCC-HTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRV 187
Query: 192 ----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEI 246
D G + +P KC +KC + HY AY +++ +
Sbjct: 188 PPCVRDDEGHNSCSGQPTERNHKCSKKCYGDETINYKKNHYKTKDAYYLSNTTMQKDTMV 247
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y GP+E SF VY+DF Y+SGVY+ +GGHAVK+IGWG ++G YW+
Sbjct: 248 Y--GPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGV-EEGTPYWL 298
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 109/308 (35%), Positives = 150/308 (48%), Gaps = 30/308 (9%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQ--FSNY 65
+ + L + S TFA+ +LD L D I+++N + WKA RN + S Y
Sbjct: 1 MKLAFIALAAVVSCTFAQP-----ELD--FLSDEYIEQLN-SKNLPWKAGRNFERDTSLY 52
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P + + D LP+ FDAR W +C +I I DQ CGS
Sbjct: 53 NIQRLLSVGTINPPSEF----ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGS 108
Query: 126 CWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHH 181
CWA + +SDR CI L +S D++ CC DGC GG P + +
Sbjct: 109 CWAVSSASVMSDRICIQSDQKNQLRISAADMIECCESCTFSVDGCHGGIPSFTFTEWKDS 168
Query: 182 GVVTEECDPYFDSTGCS-------HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAY 233
G V+ Y + GC +P C+ Y P C ++C K + L + KHY+ AY
Sbjct: 169 GFVSG--GEYNSTNGCMSYPLPRCNPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAY 226
Query: 234 RINSDPE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTS 291
RI S E I EI KNGPV SFTVY DF HY SGVYK ++GGHAV++IGWG
Sbjct: 227 RIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIE 286
Query: 292 DDGEDYWV 299
+ YW+
Sbjct: 287 NGTYPYWL 294
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 148/284 (52%), Gaps = 26/284 (9%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ + ++++K VNE + ++A +P+ + HL+ + L + +
Sbjct: 34 DNRLTGEALVKYVNER-QPFFEAKYSPEAEQ----RLNHLMDTEFVRNVRKLH-KIPRAE 87
Query: 94 KSLK---LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 150
K++ +P+SFD+R W CS+I+ I DQ +CGSCWA A E +SDR C+ +
Sbjct: 88 KAISNDDIPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKM 147
Query: 151 VND--LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG 201
++D +LACCG CG GC+GG AW Y GVVT +E C PY +H G
Sbjct: 148 ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGG 207
Query: 202 ------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
+ ++ TP C + C + + K Y S Y ++ D + I E+ KNGPV+
Sbjct: 208 KFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQA 267
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+F YEDF+ Y G+Y H G G HAVK++GWG ++G YW
Sbjct: 268 AFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYW 310
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 94/229 (41%), Positives = 120/229 (52%), Gaps = 24/229 (10%)
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSL 149
+D LP+++D R W CS+ I DQ +CGSCWA A+SDR CI +
Sbjct: 83 NDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAISDRICIATKGKKQVYA 142
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----HP----- 200
S D+L CCG CG GC GG+PI AW++F + GVV+ PY CS HP
Sbjct: 143 SDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSG--GPYLGKGCCSPYPLHPCGRHG 200
Query: 201 ------GCEPAYPTPKCVRKCVKKNQ-LWRNSKHYSI--SAYRINSDPEDIMAEIYKNGP 251
C PTP C RKC + ++R K Y Y + I +I + G
Sbjct: 201 NDTFYGNCVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGS 260
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWV 299
V F VYEDF+HY+SG+YKH G GG HAVK+IGWG D+G DYW+
Sbjct: 261 VVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWG-KDNGTDYWL 308
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/288 (36%), Positives = 144/288 (50%), Gaps = 37/288 (12%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK---PTPKGLLLGVPVKTHDKSLK 97
S++ E+N + +F N ++ K L G + K + G + ++
Sbjct: 3 SLVDEINSKQTTWTASTGQKRFKNLSLRDAKMLCGTRMRGSNDKVIRKGYAI---EELQD 59
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP FDAR+A+P CS I I DQ CGSCWAFG EA +DR C+ + LS ++
Sbjct: 60 LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEM 119
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCSH-- 199
AC GCDGGYP SAW + G+ T + C PY D C+H
Sbjct: 120 NACAPSY---GCDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPY-DFPPCAHHI 175
Query: 200 -----PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
P C + +Y TP CV +C K + +N +HY + + + I +GP
Sbjct: 176 NDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQYSVNNAKNAIRTDGP 235
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V S+ VYEDF YKSGVYKH +G +GGHAVK+IGWG ++GE YW+
Sbjct: 236 VSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EENGEAYWL 282
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/284 (38%), Positives = 140/284 (49%), Gaps = 27/284 (9%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD- 93
S + D I+ +N+ K WKA R +N + LLG + K L V +K D
Sbjct: 22 SQFISDERIEYINKIAKT-WKAERYFP-ANMSKEYIMGLLGSRGY-KNYLNEVEIKKDDP 78
Query: 94 ---KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLS 148
K+ K FDAR W C I + DQG+CGSCWAFG A +DR C+ G N
Sbjct: 79 LYTKNNDTIKHFDAREDWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQ 138
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTG 196
LS L CC + CG GC GG PI AW+YF HG+ T E C PY +D G
Sbjct: 139 LSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPCYDDQG 197
Query: 197 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+P KC R C + + Y + + + + I +I K GPVE SF
Sbjct: 198 EFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVKSIYVLDSSKTIEQDIRKYGPVEASF 254
Query: 257 TVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWV 299
VY+DF YKSG+Y+ +GGH+VKLIGWG +DG YW+
Sbjct: 255 DVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWG-EEDGIPYWL 297
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/282 (37%), Positives = 142/282 (50%), Gaps = 26/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 94
L D IK +NE K WKA R +N + F LLG + K +K +D
Sbjct: 23 QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEAEIKKYDPL 79
Query: 95 --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
P+ FD+R W C I I DQG+CGSCW+F A +DR C+ G N LS
Sbjct: 80 YVENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNELLS 139
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 198
+L CC CG+GC+GGYPI AWRYF GV T E C PY ++ G +
Sbjct: 140 PEELAFCCK-DCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKN 198
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
G +P +C + C K ++ + S Y INS + I +I GPVE SF V
Sbjct: 199 TCGGKPMERNHQCPKTCYGKTT--DQKRYKTKSEYVINS-IKTIEQDIKTYGPVEASFDV 255
Query: 259 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+DF+ YKSG+Y+ GH+VK+IGWG ++G YW+
Sbjct: 256 YDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWG-QENGTPYWL 296
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/286 (34%), Positives = 143/286 (50%), Gaps = 26/286 (9%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
+ S++ E+N +A +F ++ K L G + V ++
Sbjct: 80 IMQSLVDEINAKQNTWTASAEQEKFKTSSLRDAKMLCGTLTRDSNDKVVEKVYAIEELKD 139
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP FDAR+A+P+CS I + DQ CG CWAFG EA +DR CI + LS ++
Sbjct: 140 LPTDFDARTAFPKCSKVIGHVRDQSACGDCWAFGVTEAFNDRLCIKSNGTFTKLLSAGEM 199
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCSHPG 201
AC L GC GG+P SAW + G+ T + C PY D C+H
Sbjct: 200 NACAPSLKDPGCRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPY-DFPPCAHFF 258
Query: 202 CEPAYPT-PKCVR---KCVKKNQ----LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
+P YP PK R +CV K + ++ + +++ + + + +D I +GPV
Sbjct: 259 KDPKYPACPKFARVNLRCVSKLRHMMVVYFSDRYFMVESVPYHFSADDAKNAIRTDGPVS 318
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+F VYEDF YKSGVYKH +G ++G HAVK+IGWG D GE YW+
Sbjct: 319 ATFYVYEDFLAYKSGVYKHTSGSLLGAHAVKIIGWG-EDGGEAYWL 363
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/217 (39%), Positives = 119/217 (54%), Gaps = 17/217 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LL 155
+P+SFD+R W CS+I+ I DQ +CGSCWA A E +SDR C+ + ++D +L
Sbjct: 95 IPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG------C 202
ACCG CG GC+GG AW Y GVVT +E C PY +H G
Sbjct: 155 ACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPR 214
Query: 203 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
+ ++ TP C + C + + K Y S Y ++ D + I E+ KNGPV+ +F YED
Sbjct: 215 DHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED 274
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
F+ Y G+Y H G G HAVK++GWG ++G YW
Sbjct: 275 FSFYTKGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYW 310
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/241 (39%), Positives = 131/241 (54%), Gaps = 23/241 (9%)
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+F+NYT Q K LLG + +P T + +P SFD+R+ W C + I DQ
Sbjct: 45 KFANYTEAQIKGLLGTVLSHSS---DIPAFTQINA-AVPDSFDSRTQWQGC--VHPIRDQ 98
Query: 121 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
CGSCWAF A E+LSDRFCI +N+ LS D+++C GCDGGY AW+Y
Sbjct: 99 AQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYL 156
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
GV ++ C+PY ++G + P C KC Q + K + S + N
Sbjct: 157 EKKGVASDSCEPYKSASGTA----------PSCPSKCAN-GQAIKKYKCQAGSTKQANGA 205
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ I ++GPVE FTVY DF +YKSG+Y H++G GGHAVK++GWG E+YW
Sbjct: 206 AA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYW 263
Query: 299 V 299
+
Sbjct: 264 I 264
>gi|290977636|ref|XP_002671543.1| predicted protein [Naegleria gruberi]
gi|284085113|gb|EFC38799.1| predicted protein [Naegleria gruberi]
Length = 268
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/292 (38%), Positives = 153/292 (52%), Gaps = 30/292 (10%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M S F+ LLI + TF G S L H L S+I+++N + WKA
Sbjct: 1 MQQSIRFVLCFLLI-----ATTFVCGQFSALDKPVHEL--SLIQKINSDSSIRWKATTYK 53
Query: 61 QFSNYTVGQFKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILD 119
+F T+ + + LG V +P + +P K K+LK FDAR W C I I +
Sbjct: 54 KFEGMTLREARKYLGTVIISP---INNLPKKKMPKNLKAASHFDAREKWEDC--IHEIRN 108
Query: 120 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
Q CGSCWAF A EA SDR CI + +N+ LS +++C GCDGGY +AW +
Sbjct: 109 QEECGSCWAFSASEAFSDRLCIATNGSVNIVLSPQYMVSCDA--TDYGCDGGYLNNAWNF 166
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI-N 236
+ G+ ++EC PY +G H P C K KK Q + K Y +S I N
Sbjct: 167 LANTGIPSDECVPY--QSGSGH--------VPSC-SKLNKKCQDGSDIKLYKVSKKSIAN 215
Query: 237 SDP-EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
D EDI +I +NG ++ F+VY+DF YKSGVY H+TG + GGHA+K+IG
Sbjct: 216 LDSIEDIQKDIQENGSIQSGFSVYKDFFSYKSGVYHHVTGSLAGGHAIKVIG 267
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 149/289 (51%), Gaps = 37/289 (12%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ + ++++K VNE + ++A +P+ + HL+ + L + +
Sbjct: 34 DNRLTGEALVKYVNER-QPFFEAKYSPEAEQ----RLNHLMDTEFVRNVRKLH-KIPRAE 87
Query: 94 KSLK---LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 150
K++ +P+SFD+R W CS+I+ I DQ + GSCWA A E +SDR C+ +
Sbjct: 88 KAISNEDIPESFDSREVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKM 147
Query: 151 VND--LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG 201
++D +LACCG CG GC+GG AW Y GVVT +E C PY HP
Sbjct: 148 ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYH-----LHP- 201
Query: 202 CE-----------PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
CE ++ TP C + C + + K Y S Y ++ D + I E+ KN
Sbjct: 202 CEITGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKN 261
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
GPV+ +FT YEDF+ Y+ G+Y H G G HAVK++GWG ++G YW
Sbjct: 262 GPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVVGWGV-ENGTKYW 309
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 154/319 (48%), Gaps = 44/319 (13%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI + L ++ LQ I +NE WKA N F T
Sbjct: 1 MARVLMLLSVIFVSFY-------LTEQAYFLQKDFIDNINERATT-WKAGVN--FDPDTP 50
Query: 68 GQ-FKHLLGVK----PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRI 117
+ F +LG K P + + KTHD + ++P+ FDAR W +C TI +
Sbjct: 51 KEHFLKMLGSKGVQIPNKHNIHM---YKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAV 107
Query: 118 LDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAW 175
DQG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 108 RDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAW 166
Query: 176 RYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 222
F G+VT E C+PY +D+ G + +P +C R C L
Sbjct: 167 ERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLD 226
Query: 223 RNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGG 280
+ H Y+ +Y + I ++ GP+E SF VY+DF YKSGVY K +GG
Sbjct: 227 FDEDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGG 284
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVKLIGWG + G YW+
Sbjct: 285 HAVKLIGWG-EEYGVPYWL 302
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/217 (39%), Positives = 119/217 (54%), Gaps = 17/217 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LL 155
+P+SFD+R W CS+I+ I DQ +CGSCWA A E +SDR C+ + ++D +L
Sbjct: 95 IPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDIL 154
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG------C 202
ACCG CG GC+GG AW Y GVVT +E C PY +H G
Sbjct: 155 ACCGSECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPR 214
Query: 203 EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
+ ++ TP C + C + + K Y S Y ++ D + I E+ KNGPV+ +F YED
Sbjct: 215 DHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED 274
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
F+ Y G+Y H G G HAVK++GWG ++G YW
Sbjct: 275 FSFYTKGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYW 310
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 95/241 (39%), Positives = 131/241 (54%), Gaps = 23/241 (9%)
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+F+NYT Q K LLG + +P T + +P SFD+R+ W C + I DQ
Sbjct: 45 KFANYTEAQIKGLLGTVLSHSS---DIPAFTQINAA-VPDSFDSRTQWQGC--VHPIRDQ 98
Query: 121 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
CGSCWAF A E+LSDRFCI +N+ LS D+++C GCDGGY AW+Y
Sbjct: 99 AQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYL 156
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
GV ++ C+PY ++G + P C KC Q + K + S + N
Sbjct: 157 EKKGVASDSCEPYKSASGTA----------PSCPSKC-SNGQAIKKYKCKAGSTKQANGA 205
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ I ++GPVE FTVY DF +YKSG+Y H++G GGHAVK++GWG E+YW
Sbjct: 206 AA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYW 263
Query: 299 V 299
+
Sbjct: 264 I 264
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 98/265 (36%), Positives = 144/265 (54%), Gaps = 31/265 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK--THDK 94
I + ++ +N NP A W A ++S + + + L + P G PV+ T +
Sbjct: 10 ISGEPLVNIINRNPAATWSAH---EYSRDIITRARLTL-LAPLAIG-----PVEKFTIED 60
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 154
S +P+SFDAR WP + I + DQ CGSCWAF E+L DRF I LS DL
Sbjct: 61 SFYVPESFDARDEWP--NAILPVRDQEKCGSCWAFSIAESLGDRFGILGCGKGHLSPQDL 118
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 214
++C G C+GGY ++W + + G+ TE C PY +G P C +
Sbjct: 119 ISCDSNDLG--CNGGYQENSWTWVLTTGITTESCWPYRSGSG----------RIPSCPHR 166
Query: 215 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
CV + L RN+ I+ YR D ++ E+Y NGP++V++ VYEDF +Y G+YKH++
Sbjct: 167 CVNGSVLQRNT----INNYR-RLDSSELQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLS 221
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G+ +GGHAV L+GWG +DG YW+
Sbjct: 222 GNKVGGHAVVLMGWGI-EDGVKYWL 245
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 154/319 (48%), Gaps = 44/319 (13%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI + + ++ LQ I +N N WKA N F T
Sbjct: 1 MARALMLLSVIFVSVY-------VTEQTYFLQKDFIDNIN-NQATTWKAGVN--FDPDTP 50
Query: 68 GQ-FKHLLGVK----PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRI 117
+ F +LG K P + + KTHD + ++P+ FDAR W +C TI +
Sbjct: 51 KEHFLKMLGSKGVQIPNKHNIHM---YKTHDAAYDKLFGRIPRHFDARRKWRRCHTIGAV 107
Query: 118 LDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAW 175
DQG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 108 RDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAW 166
Query: 176 RYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 222
F G+VT E C+PY +D+ G + +P +C R C L
Sbjct: 167 ERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLD 226
Query: 223 RNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGG 280
+ H Y+ +Y + I ++ GP+E SF VY+DF YKSGVY K +GG
Sbjct: 227 FDEDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGG 284
Query: 281 HAVKLIGWGTSDDGEDYWV 299
HAVKLIGWG + G YW+
Sbjct: 285 HAVKLIGWG-EEYGVPYWL 302
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 145/289 (50%), Gaps = 31/289 (10%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
++ L++ I ++NE WKA N P+ + + GV+ K L K+
Sbjct: 21 AYFLEEDYINKINEQATT-WKAGVNFDPKTPKEHILKLLGSKGVQIPSK--LNHKMYKSE 77
Query: 93 DKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 145
D++ ++P+ FDAR W C TI I DQG+CGSCWA A +DR C+ +
Sbjct: 78 DENYDNLFGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCVVSNEDF 137
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 192
N LS +L CC CG GC+GGYPI AW +F HG+VT E C+PY +
Sbjct: 138 NQLLSAEELTFCC-HKCGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY 196
Query: 193 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 251
D +G + +P +C R C L + H Y+ +Y + I ++ GP
Sbjct: 197 DESGNNTCAGKPMEANHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYG--SIQKDVLTYGP 254
Query: 252 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VE SF VY+DF YKSGVY + +GGHA KLIGWG + G YW+
Sbjct: 255 VEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWG-EEYGVPYWL 302
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 106/312 (33%), Positives = 147/312 (47%), Gaps = 32/312 (10%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN--PQFSNY 65
+ L++L VI + + ++ L+ I +N WKA N P+ S
Sbjct: 1 MARVLMLLSVIFVSVY-------MTEQAYFLEKDFIDNINAQATT-WKAGVNFDPKTSKE 52
Query: 66 TVGQFKHLLGVK-PTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
+ + GV+ P + L + +P+ FDAR W CSTI R+ DQG+CG
Sbjct: 53 HIMKLLGSRGVQIPNKNNMNLYKSEDAEYDNTYIPRFFDARRKWRHCSTIGRVRDQGNCG 112
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW+ F G
Sbjct: 113 SCWAVATSSAFADRLCVATNADFNELLSAEEITFCC-HTCGFGCNGGYPIKAWKRFSKKG 171
Query: 183 VVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-Y 228
+VT E C+PY D G + +P +C R C L + H Y
Sbjct: 172 LVTGGDYKSGEGCEPYRVPPCPNDDQGNNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRY 231
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIG 287
+ Y + I ++ GP+E SF VY+DF YKSGVY K +GGHAVKLIG
Sbjct: 232 TRDYYYLTYGS--IQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIG 289
Query: 288 WGTSDDGEDYWV 299
WG + G YW+
Sbjct: 290 WG-EEYGVPYWL 300
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 148/284 (52%), Gaps = 26/284 (9%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ + ++++K VNE + ++A +P+ + HL+ + L + +
Sbjct: 34 DNRLTGEALVKYVNER-QPFFEAKYSPEAEQ----RLNHLMDTEFVRNVRKLH-KIPRAE 87
Query: 94 KSLK---LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLS 150
K++ +P+SFD+R W CS+I+ I DQ +CGSCWA A E +SDR C+ +
Sbjct: 88 KAISNDDIPESFDSRVVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKM 147
Query: 151 VND--LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPG 201
++D +LACCG CG GC+GG AW Y GVVT +E C PY +H G
Sbjct: 148 ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGG 207
Query: 202 ------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
+ ++ TP C + C + + K Y S Y ++ D + I E+ KNGPV+
Sbjct: 208 KFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQA 267
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ YEDF+ Y+ G+Y H G G HAVK++GWG ++G YW
Sbjct: 268 ASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGV-ENGTKYW 310
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 122/224 (54%), Gaps = 22/224 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
L SFDAR WP+C +I +I D C + WAF A E++SDR CI+ G N LS +LL
Sbjct: 76 LSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTILSAEELL 135
Query: 156 ACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF------DSTGCSHP 200
+CC F CG+GC+GG P AW+Y HG+ T C PY ++P
Sbjct: 136 SCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYP 195
Query: 201 GC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
C PTP C +KC + +HY +S ++ + +I +++ NGP++ +F
Sbjct: 196 ACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATF 255
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VY+DF Y +G+Y H+TG+ G +V++IGWG G YW+C
Sbjct: 256 EVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWGVW-QGVPYWLC 298
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 111/313 (35%), Positives = 151/313 (48%), Gaps = 35/313 (11%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN--PQFSNY 65
+ ++L VI +A ++ LQ+ I +NE WKA N P +
Sbjct: 1 MARVFMLLSVIFVSVYA-------TEQAYFLQEDFINNINEQATT-WKAGMNFDPNTPHD 52
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRILDQ 120
+ + GV+ K + KTHD++ ++P+ FDAR+ W C TI R+ DQ
Sbjct: 53 DIIKLLGSRGVQNPDK--VNHKLYKTHDEAYDNLFGRIPEHFDARNKWVYCDTIGRVRDQ 110
Query: 121 GHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
G+CGSCWA A +DR C+ N LS ++ CC CG GC GGYPI AW+ F
Sbjct: 111 GNCGSCWAVATSSAFADRLCVATTGDFNELLSAEEITFCC-HTCGFGCHGGYPIKAWKRF 169
Query: 179 VHHGVVT-------EECDPYF---DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH- 227
HG+VT E C+PY + G S +P C R C + N H
Sbjct: 170 STHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHCYGNQSIDFNDDHR 229
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLI 286
Y+ Y + I ++ GP+E SF VY+DF YKSGVY K +GGHAVKLI
Sbjct: 230 YTRDYYYLTYGS--IQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLI 287
Query: 287 GWGTSDDGEDYWV 299
GWG +DG YW+
Sbjct: 288 GWG-EEDGTPYWL 299
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 101/282 (35%), Positives = 140/282 (49%), Gaps = 26/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
L D IK +NE K WKA R +N + LLG + V +KT+D
Sbjct: 23 QFLSDERIKYINEVAKT-WKAERFFP-ANTSKEYIMGLLGSRGYTN-YSSEVEIKTYDPL 79
Query: 96 LKLPKS---FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
+ S FD+R W C I RI DQG+CGSCWAFG A +DR C+ G N LS
Sbjct: 80 YEENASVEQFDSRENWKSCKQIGRIRDQGNCGSCWAFGTTGAFADRLCVSTGGKFNELLS 139
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 198
D+ CC CG GC+GGYPI AW+YF GV T E C PY FD G +
Sbjct: 140 PEDVAFCCQ-NCGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKN 198
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
+P +C + C + K Y + + + P + ++ K GP+E SF +
Sbjct: 199 TCAGKPLERNHQCPKTCYGSTTV---QKRYKVKNEYVLNSPNTMEQDLIKYGPIEASFNL 255
Query: 259 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
++D + YKSG+Y+ + GH++K+IGWG ++G YW+
Sbjct: 256 FDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWG-KENGVPYWL 296
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 116/189 (61%), Gaps = 18/189 (9%)
Query: 128 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGA EA+SDR CIH +S LS DLL+CC CG GC+GGYP +AW ++ G+V+
Sbjct: 25 AFGASEAMSDRICIHSNAKISVELSAEDLLSCC-ESCGMGCNGGYPSAAWDFWTKDGLVS 83
Query: 186 EE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSIS 231
C PY S P C TP+CV +C ++ KHY +
Sbjct: 84 GGLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKT 143
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
+Y ++SD +DI EIYKNGPVE +FTVYEDF YK+GVY+H+TG +GGHA+K++GWG
Sbjct: 144 SYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWG-E 202
Query: 292 DDGEDYWVC 300
++G YW+C
Sbjct: 203 ENGIPYWLC 211
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 143/282 (50%), Gaps = 26/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 94
L D IK +NE K WKA R +N + F LLG + K +K +D
Sbjct: 23 QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEFEIKKYDPL 79
Query: 95 --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
P+ FD+R+ W C I I DQG+CGSCW+F A +DR C+ G N LS
Sbjct: 80 YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 198
+L CC CG GC GGYPI AW+YF GV T E C PY ++ G +
Sbjct: 140 PEELAFCCK-DCGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKN 198
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
G +P +C + C K + +++ + S Y INS + I ++ GPVE SF V
Sbjct: 199 TCGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYSINS-IKTIEQDLKTYGPVEASFDV 255
Query: 259 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+DF+ YKSG+Y+ G H++K+IGWG ++G YW+
Sbjct: 256 YDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWG-QENGTTYWL 296
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/284 (37%), Positives = 139/284 (48%), Gaps = 27/284 (9%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD- 93
S L D I+ +N+ K WKA R +N + LLG + K L V +K D
Sbjct: 22 SQFLSDERIEYINKIAKT-WKAERYFP-ANMSKEYITGLLGSRGY-KNYLNEVEIKKDDP 78
Query: 94 ---KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLS 148
K+ K FDAR W C I + DQG+CGSCWAFG A +DR C+ G N
Sbjct: 79 LYTKNNNKIKHFDARENWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQ 138
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTG 196
LS L CC + CG GC GG PI AW+YF G+ T E C PY +D G
Sbjct: 139 LSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPCYDDQG 197
Query: 197 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+P KC R C + + Y + + + + I +I GPVE SF
Sbjct: 198 EFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVESIYVLDSFKTIEQDIRTYGPVEASF 254
Query: 257 TVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWV 299
VY+DF YKSG+Y+ + +GGH+VKLIGWG +DG YW+
Sbjct: 255 DVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWG-EEDGIPYWL 297
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 100/268 (37%), Positives = 142/268 (52%), Gaps = 29/268 (10%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L + ++ +SI++ VN +P + W A P+ T+ + + +LG + P + +
Sbjct: 5 LFASVIAESIVETVNNDPSSTWVAIEYPR-EVITLAKMRAMLGEEVLP------LEDVEY 57
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+ +P++FDAR WP I + DQ CGSCWA A EA+ +RF I LSV
Sbjct: 58 VEPNNVPENFDAREQWP--GKIYPVRDQASCGSCWAHAASEAIGNRFSIKGCGKGMLSVQ 115
Query: 153 DLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
DL++C GD GC+GG + ++ V +GV TEEC PY G P C
Sbjct: 116 DLVSCDK---GDSGCNGGSGPLSSKWLVSNGVTTEECLPYVSGNG----------RVPAC 162
Query: 212 VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
KC +Q+ R K+ Y + ++I E+ KNGPV FTVY DF +YKSGVY+
Sbjct: 163 AAKCSNGSQIIR-YKYEKAETYTV----QNIQEELMKNGPVYFRFTVYSDFMNYKSGVYQ 217
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
H +G GGHAV LIGWG +DG YW+
Sbjct: 218 HKSGYQEGGHAVLLIGWGV-EDGVPYWL 244
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/279 (36%), Positives = 142/279 (50%), Gaps = 25/279 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF-KHLLGVKPTPKGLLLGVPVKTHDKSL 96
L D + E+ ++ + WKA RN F+ F K L V+ P + +P+K +
Sbjct: 20 LSDEFL-ELLQSKQMTWKAGRN--FAKDISKDFLKSLNCVRKNPD--IPKLPLKNVTPTK 74
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
++P FDAR WP C I I DQG+CGSCWA A ++DR CI ++ S ++
Sbjct: 75 EIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENV 134
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PG 201
ACC CG+ C GG +A+ ++V G V+ E C PY C H P
Sbjct: 135 AACCT-ECGNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPY-SVEECEHHIEGPRPP 192
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
CE P C C ++ + + Y + AY + D I EI NGPV +F VY+
Sbjct: 193 CEGDMPELVCSETCHEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYD 252
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF YKSGVY+H TG + G HAV++IGWG ++G YW+
Sbjct: 253 DFLSYKSGVYQHETGLLDGYHAVRVIGWG-EEEGTPYWL 290
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 144/285 (50%), Gaps = 29/285 (10%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKT 91
++ L I +N K WKA N F T K +LG+ + KG+ + P K+
Sbjct: 20 QTYFLNKDYISTINSVAKT-WKAGIN--FHPET--PLKFILGLLGS-KGVEVSSAGPFKS 73
Query: 92 HDK----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 145
HD + +P FDAR W C+TI I DQG+CGSCWAF A +DR CI +
Sbjct: 74 HDPLYSPTGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSF 133
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 198
N LS + +CC + CG GC GGYPI AWRY+ HG+VT E C PY
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192
Query: 199 HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 255
+ C + KC +KC + +R + Y S Y + D ++ +I GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250
Query: 256 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VY+DF YKSGVY K +GGH+VK IGWG + YW+
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-VSYWL 294
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/283 (38%), Positives = 153/283 (54%), Gaps = 31/283 (10%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPVKTHDKS 95
+ II+ VN PK WKA N F + HL+GV P + K +LL V +S
Sbjct: 28 NQIIQLVNNIPKHTWKAGIN--FHPSLLTNVSHLMGVVPWNKLSEKDILLTYDVSIDLES 85
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVND 153
L P+S+D W +C ++ I DQ +CGSCWA A SDR CI + G+N LS
Sbjct: 86 L--PESYDITQTWSECKSVVSIRDQSNCGSCWALSTASAFSDRLCITSNMGVNKVLSGEY 143
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 200
+ +CC CG+GC+GG+P AW+Y +G+ T E C PY ++ CS
Sbjct: 144 INSCCNGKCGNGCNGGHPEKAWKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKE 203
Query: 201 GCEPAYPTPKCVR-KCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
+ TP+C + +C N + +Y+ Y + PE IM+E++KNGPV +
Sbjct: 204 NED----TPQCYKDQCTNNNYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMK 259
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VY+DF YK G+Y++ TG + G HAVK++GWG DDG DYW+C
Sbjct: 260 VYDDFLCYKGGIYQYTTGGLKGDHAVKIMGWG-EDDGIDYWLC 301
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/267 (36%), Positives = 139/267 (52%), Gaps = 20/267 (7%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKS 101
+I ++N ++ W A NP F + + LG+ P P L ++ + +P +
Sbjct: 23 LINQINSQ-QSSWTARINP-FDD--IESRLGFLGIHPDPNFQL--EVLEWEEPRTVIPAT 76
Query: 102 FDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACC 158
FDAR WPQC I I +QG CGSCWAF A E +SDR C+ + + S DL+ CC
Sbjct: 77 FDAREYWPQCKDVIGNIRNQGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDLINCC 136
Query: 159 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKCVRKC 215
CG C GGY AW+Y+ G+V+ Y S GC P + + +P+C + C
Sbjct: 137 E-TCGKKCKGGYSYYAWKYYTSTGLVSG--GDYNTSRGC-QPYSKSNFNDGVSPECSKTC 192
Query: 216 --VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY-KNGPVEVSFTVYEDFAHYKSGVYKH 272
K + N +H+ Y I + I EI + GPV F VYEDF Y+ GVY H
Sbjct: 193 QNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRGGPVMAGFDVYEDFKLYREGVYVH 252
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+G ++G HAVK+IGWGT ++G YW+
Sbjct: 253 TSGALLGSHAVKIIGWGT-ENGWAYWL 278
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 95/267 (35%), Positives = 133/267 (49%), Gaps = 27/267 (10%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L + + +SI++ VN +P A W A P T + + LG +G VP
Sbjct: 5 LIASVFAESIVETVNNHPGATWVAVEYPP-EVITTAKLRARLGAIDLNEGPSNYVP---- 59
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
LP +FDAR WP I + +Q CGSCWAF E +R I +S
Sbjct: 60 --DTSLPDNFDAREQWP--GKILPVRNQEQCGSCWAFAVAETTGNRLNILGCGRGDMSPQ 115
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
DL++C GC+GG P+ +W + H G+ TEEC PY G P C
Sbjct: 116 DLVSC--DKVDHGCNGGSPLFSWEWVKHSGITTEECIPYVSGGG----------RVPSCP 163
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
+KC + + R +K S+ + + + E+Y GP E +F+VYEDF YKSGVY H
Sbjct: 164 KKCTNGSAIVR-TKAKSVGLVK----GDKMQNELYSRGPFEAAFSVYEDFKSYKSGVYHH 218
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
ITG ++GGHAV ++GWG +DG YW+
Sbjct: 219 ITGKMLGGHAVMVVGWGV-EDGTPYWL 244
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 144/285 (50%), Gaps = 29/285 (10%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKT 91
++ L I +N K WKA N F T K +LG+ + KG+ + P K+
Sbjct: 20 QTYFLNKDYISTINSVAKT-WKAGIN--FHPET--PLKFILGLLGS-KGVDVSSAGPFKS 73
Query: 92 HDK----SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGM 145
HD + +P FDAR W C+TI I DQG+CGSCWAF A +DR CI +
Sbjct: 74 HDPLYSPAGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSF 133
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 198
N LS + +CC + CG GC GGYPI AWRY+ HG+VT E C PY
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192
Query: 199 HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 255
+ C + KC +KC + +R + Y S Y + D ++ +I GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250
Query: 256 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VY+DF YKSGVY K +GGH+VK IGWG + YW+
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-VSYWL 294
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 100/277 (36%), Positives = 138/277 (49%), Gaps = 21/277 (7%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D+ +L + +N+ WKA + + N T + K L G L V
Sbjct: 27 DAPVLTQKFVDRINQLNGGMWKAVYDGKMQNLTFSEAKRLTGAFSRKTSTLPPVRFTEEQ 86
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVN 152
+LP+SFDA WP C TI I DQ C + WA A+SDR+C + G L +S
Sbjct: 87 LRTELPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDRYCTVGNGKQLRISAA 146
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 207
DL+ACC CG GC+GGYP +AW Y+V +G+ + +C PY C H G + P
Sbjct: 147 DLMACCT-GCGGGCEGGYPDAAWEYYVSNGITSSQCQPY-PFPRCEHRGAQGKKPPCSKY 204
Query: 208 ---TPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
TP C C K+ +R + Y + ED E+Y NGP V F V+ D
Sbjct: 205 NFDTPTCNATCTDKSVPLIKYRGNHSYEVRG------EEDYKRELYFNGPFVVRFQVHSD 258
Query: 262 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
F YKSGVY+H+ G+ +GG AV+++GWG +G YW
Sbjct: 259 FLAYKSGVYQHVAGNFLGGKAVRIVGWGKM-NGTPYW 294
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 89/195 (45%), Positives = 115/195 (58%), Gaps = 23/195 (11%)
Query: 124 GSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWA AVEA+SDR CI ++LS +DLL+CC CG GC GG P++AW+Y+V
Sbjct: 15 GSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLR 73
Query: 182 GVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQLWRN 224
G+VT Y + +GC P CE YPTPKCV+KC K + ++
Sbjct: 74 GIVTG--SEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKA 131
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
K+Y S Y + S+ E I EI GPVE SF VY DF +Y G+YKH+ G + GGHAVK
Sbjct: 132 DKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVK 191
Query: 285 LIGWGTSDDGEDYWV 299
++GWG D G YW+
Sbjct: 192 VLGWGI-DQGVPYWL 205
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 119/209 (56%), Gaps = 22/209 (10%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWA-----FGAVEALSDRFCIHFG--MNLSLS 150
LP+SFD+R WP C I I +Q CGSCWA + E LSDRFCI G +N+ LS
Sbjct: 2 LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
DL++C + GCDGG +AW Y H G+VT++C PY G + P
Sbjct: 60 PQDLVSCNWY--NAGCDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVA----------PS 107
Query: 211 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
C + C + + K+ + Y + S E IM EI NGPV+ F+VY+DF YKSGVY
Sbjct: 108 CPKYCNGTSTPIDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKSGVY 167
Query: 271 KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
H TG +GGHA+K++GWG ++ + YW+
Sbjct: 168 THQTGSFLGGHAIKIVGWGVENNVK-YWL 195
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 148/314 (47%), Gaps = 27/314 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP---QFS 63
FL L+I V T AE + D+ L + ++ ++A +P +F
Sbjct: 4 FLIALLIIPPVEKPLTVAEYLARPKSEDAAKLDGKAFVDYINQQQSFFRAEYSPDAEEFV 63
Query: 64 NYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 123
+ K + + T +L + + +P +FDAR WP C+++ I DQ C
Sbjct: 64 RNRIMDVKFAVDPEKTEPNYVLA----NTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSC 119
Query: 124 GSCWAFGAVEALSDRFC--IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWA A A+SDR C + +N LS ++L+CC CG GC GGYP A+ Y +
Sbjct: 120 GSCWAVAAASAMSDRVCALTNGRINRILSDTEVLSCCFGSCGFGCKGGYPARAFGYAWRY 179
Query: 182 GVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKKNQL-WRNS 225
G+ T + C PY C + EP Y PTP C R C + +
Sbjct: 180 GLSTGGPYGEKDACQPY-AFYPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKD 238
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
K ++ Y I + +I EI GPV ++ VY DF +YK GVY H G+V G HAVK+
Sbjct: 239 KIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKI 298
Query: 286 IGWGTSDDGEDYWV 299
IGWG +D YW+
Sbjct: 299 IGWGKGND-VPYWL 311
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 136/275 (49%), Gaps = 28/275 (10%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD- 93
+H +D +I +N++P W+AA QF+ + + + LLG K + T D
Sbjct: 24 THFTKD-MIDSLNQDPSVKWEAANYDQFAGKSFAELRKLLGGKRGEESSSEEARYNTRDV 82
Query: 94 -KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
++ +P +FD+R+ WPQC I I +QG CGSCWAF SDR CI N+ +S
Sbjct: 83 KSTVAIPDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVFSDRLCITTNNVSNVVIS 140
Query: 151 VNDLLAC--CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY-- 206
L+ C F C GGY +W++F++ G+ E C PY + Y
Sbjct: 141 PEFLIECDKTSFAC----QGGYGYYSWKFFMNTGIPLESCVPYTKDS--------LVYGN 188
Query: 207 -PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+C C + L + + SAY I S + EI NGPVE F VY DF Y
Sbjct: 189 TTNAQCRSTCTDGSPL---KLYKAASAYYIYSPITNYQTEIMTNGPVEADFDVYSDFYSY 245
Query: 266 KSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWV 299
KSG+Y+ G +GGHAVK++GW + +G YW+
Sbjct: 246 KSGIYQKTAGSTYVGGHAVKVLGWASDSNGTPYWI 280
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 155/318 (48%), Gaps = 42/318 (13%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI + + ++ L+ I ++NE W A N S
Sbjct: 1 MARVLILLSVILFSVY-------MTEQAYFLEKDYINKINEKAST-WTAGFNFDPSTPKE 52
Query: 68 GQFKHLLGVK--PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRILDQ 120
+ LLG K TP + + K+ DK ++PK FDAR W C+TI + DQ
Sbjct: 53 DILR-LLGSKGVQTPSKINHKM-YKSEDKEYDNLFGRIPKKFDARKKWRHCTTIGAVRDQ 110
Query: 121 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
G+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW F
Sbjct: 111 GNCGSCWAIATSSAFADRLCVATNADFNQLLSAEEITFCC-HKCGYGCNGGYPIKAWERF 169
Query: 179 VHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
HG+VT E C+PY +D +G + +P +C R C L +
Sbjct: 170 KKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCSGKPMEQNHRCTRMCYGDQDLDFDD 229
Query: 226 KH-YSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGH 281
H ++ +Y I S +D+M GP+E SF VY+DF YKSGVY + +GGH
Sbjct: 230 DHRHTRDSYYLTIGSIQKDVMTY----GPIEASFDVYDDFLSYKSGVYVRSENASYLGGH 285
Query: 282 AVKLIGWGTSDDGEDYWV 299
AVKLIGWG + G YW+
Sbjct: 286 AVKLIGWG-EEYGTPYWL 302
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 150/284 (52%), Gaps = 30/284 (10%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V H+ +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHNLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVND 153
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G + LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALD 147
Query: 154 LLACCGFLCGDGCD---------GGYPISAWRYFV--HHGVVTEECDPY-FDS----TGC 197
L++CC G G S WR+ H G C PY F T
Sbjct: 148 LISCCEDCGGGCKGGFPGQAWDMGKTRDSHWRFRKKNHTG-----CQPYPFPKCEHLTKG 202
Query: 198 SHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
+P C Y TP+C + C K + + K + + + ++ + +I GPVE +
Sbjct: 203 KYPACGTKIYKTPQCKQTCQKGYKTPFEQDKPFGEGSSNVQNNEKVFQRDIMMYGPVEAA 262
Query: 256 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
F VYEDF + KSG+ +H+TG ++GGH +++IGWG + G YW+
Sbjct: 263 FDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGV-EKGNPYWL 305
>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
Length = 247
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 91/222 (40%), Positives = 120/222 (54%), Gaps = 23/222 (10%)
Query: 81 KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFC 140
+G + G+P + +PK+FD+R W C + I DQ CGSCWAFGA E LSDR C
Sbjct: 13 QGPVEGIPEPAQHNDI-VPKTFDSREQWGNC--VHPIRDQAQCGSCWAFGASETLSDRIC 69
Query: 141 IHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS 198
I ++ LS DL+AC G+ GC+GG AW Y + G V + C PY G
Sbjct: 70 IASDKKTDVILSPEDLVACDGW--NMGCNGGILPWAWSYLTNTGAVEDSCFPYSSDKG-- 125
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P C +KC + K S + S + I AEI KNGP+E FTV
Sbjct: 126 --------AVPTCAKKCQNDKDSFTKYKCKKNSVVQA-SGVDKIKAEISKNGPMETGFTV 176
Query: 259 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
YEDF +Y+SGVY H TG+ +GGHAVK++G+ G+ YW+C
Sbjct: 177 YEDFMNYESGVYHHTTGNQLGGHAVKIVGY-----GDGYWIC 213
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 99/289 (34%), Positives = 135/289 (46%), Gaps = 49/289 (16%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK------PTPKGLLLGVP 88
+ L D+ +++V K W RN S + + L+GV P P +
Sbjct: 22 ADFLSDAFMEKVRRKAKT-WNLGRNFHES-ISEKYLRGLMGVHEESYKYPLPDKQEVLGE 79
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 146
LP FDAR W C TIS I +QG CGSCWA +SDR CI MN
Sbjct: 80 SDDEISLADLPVDFDARLRWTSCPTISEIREQGSCGSCWAIATTSVMSDRLCIGSNGVMN 139
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 199
LS D+L+CC +CG C GGYP +AW Y+ G+V+ + C PY C H
Sbjct: 140 FRLSGLDMLSCCA-ICGFACQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEP-CDH 197
Query: 200 PG------------------CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED 241
G CEP+Y ++ K+++ Y I++D +
Sbjct: 198 SGNGSRPVCTVGGGVRCQHLCEPSYKVD------------FQRDKNFASKVYSISNDVLE 245
Query: 242 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
I EI NGPV+ TVYEDF YK+GVY H+ G+ +G HAV+++GWG
Sbjct: 246 IQKEIMTNGPVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGV 294
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 94/228 (41%), Positives = 121/228 (53%), Gaps = 31/228 (13%)
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 154
LP FDAR+A+P CS I I DQ CGSCWAFG EA +DR CI H LS ++
Sbjct: 21 LPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSAGEM 80
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCSH-- 199
AC GC+GG+P SAW + G+ T + C PY D C+H
Sbjct: 81 NACAP---SHGCNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPY-DFPPCAHHV 136
Query: 200 -----PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
P C + +Y TP C +C K R+ +H+ + + D I +GP
Sbjct: 137 NDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQYSVNDAKNAIRTDGP 196
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V SFTVYEDF YKSGVYKH +G+ +GGHAVK+IGWG + G+ YW+
Sbjct: 197 VSASFTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWG-EESGQAYWL 243
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/234 (37%), Positives = 128/234 (54%), Gaps = 26/234 (11%)
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
VP + D L + FDAR WP+C +I +I D C S WAF A E++SDR CI+ G
Sbjct: 21 VPTENSD----LSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSGGT 76
Query: 145 MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 195
+N LS +LL+CC G L CG+GC GG AW+Y+ HG+ T C PY +
Sbjct: 77 INTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAP 136
Query: 196 ------GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAE 245
++P C PTP C +KC KN +HY S ++ + +I ++
Sbjct: 137 CGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSD 196
Query: 246 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ NGP+E +F VY+DF Y +G+Y H+TG+ G +V+++GWG +G YW+
Sbjct: 197 VMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWL 249
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/302 (34%), Positives = 147/302 (48%), Gaps = 17/302 (5%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL +++ GV + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLLSTALVTL-----GVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + K L G L V +LP+SFD+ WP C TI I DQ C
Sbjct: 57 ITFAEAKRLTGAWIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACR 116
Query: 125 SCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
+ WA +SDR+C G+ L +S LL+CC CG GC GG+P AWRY+V +G+
Sbjct: 117 ASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCK-QCGGGCKGGFPGFAWRYYVEYGI 175
Query: 184 VTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 236
+ C PY + G P + + TPKC C K+ K+ + Y +
Sbjct: 176 ASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPL--VKYRGNATYLLL 233
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
ED E+Y NGP F VY D YKSGVY+H+ GD +GG AVK++GWG +G
Sbjct: 234 HGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTP 292
Query: 297 YW 298
YW
Sbjct: 293 YW 294
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 112/194 (57%), Gaps = 23/194 (11%)
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA AVEA+SDR CI + LS +DLL+CC CG GC GG P++AW+Y+V G
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSG 221
Query: 183 VVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQLWRNS 225
+VT Y + +GC P CE YPTPKC R+C K + ++
Sbjct: 222 IVTG--SDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKAD 279
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
K+Y AY + +D E I EI GPVE SF VY DF HY G+YKH+ G V GGHAVK+
Sbjct: 280 KYYGEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKI 339
Query: 286 IGWGTSDDGEDYWV 299
+GWG D G YW+
Sbjct: 340 LGWGI-DQGVSYWL 352
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/290 (34%), Positives = 140/290 (48%), Gaps = 27/290 (9%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLG 86
L ++ L+ I +N+ WKA N N LLG + P +
Sbjct: 17 LTEQAYFLEKDFIDNINKQATT-WKAGVNSA-PNTPKEHILRLLGSRGVQIPDKVNYNMY 74
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFG 144
D ++P FDAR W +C TI + DQG+CGS WA A +DR C+ +
Sbjct: 75 KNDDHADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATNGD 134
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 191
N LS ++ CC CG+GC+GGYPI AW+ F +HG+VT E C+PY
Sbjct: 135 FNQLLSAEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCP 193
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNG 250
+D G + +P KC +KC + N H Y+ Y + I ++ G
Sbjct: 194 YDKDGKNTCSGQPMESNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYG 251
Query: 251 PVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
P+E SF VY+DF +YKSG+Y K +GGH+VKLIGWG + G YW+
Sbjct: 252 PIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWL 300
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 148/302 (49%), Gaps = 17/302 (5%)
Query: 5 HLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSN 64
+++ CLL +++ GV + L D+ +L + + +N+ WKA N + N
Sbjct: 2 RVYVALCLLSTALVTL-----GVSALLVKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQN 56
Query: 65 YTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCG 124
T + K L G L V +LP+SFD+ WP C TI I DQ C
Sbjct: 57 ITFAEAKRLTGAWIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACR 116
Query: 125 SCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
+ WA +SDR+C G+ L +S LL+CC CG GC GG+P AWRY+V +G+
Sbjct: 117 ASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCK-QCGGGCKGGFPGFAWRYYVEYGI 175
Query: 184 VTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 236
+ C PY + G P + + TPKC C K+ K+ + Y +
Sbjct: 176 ASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPL--VKYRGNATYLLL 233
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
ED E+Y NGP F VY D YKSGVY+++ GD++GG AV+++GWG +G
Sbjct: 234 HGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQAVRIVGWGKL-NGTP 292
Query: 297 YW 298
YW
Sbjct: 293 YW 294
>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
Length = 314
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/276 (35%), Positives = 134/276 (48%), Gaps = 30/276 (10%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
LD +L D++I +N N K+ W A RN F T G ++G K T L +
Sbjct: 25 LDKPVLDDNLINSINNNKKSSWTAHRNKNFEGKTFGDIIGMMGTKKTAAPFKL----TEN 80
Query: 93 DKSLK--LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--- 147
+ LK +P SFD+R WP C I IL+Q CGSCWAF + E LSDR CI
Sbjct: 81 GEELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPG 138
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
+LS L+A C DGC GG P AW Y G+ T+ C PY G +
Sbjct: 139 ALSPQTLVA-CDVYGNDGCSGGIPQLAWEYMELKGLPTDSCVPYTAGNGTVY-------- 189
Query: 208 TPKCVRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
C R C L+R +K +++ + S + I I GP+ + VYEDF Y
Sbjct: 190 --SCQRSCSDSEDYSLYR-AKPFTL---KTCSSVQCIQENILAYGPIVGTMEVYEDFMSY 243
Query: 266 KSGVYKHITG-DVMGGHAVKLIGWGTSDDGE-DYWV 299
SGVY G ++GGHA+K++GWG + +YW+
Sbjct: 244 SSGVYVMTPGSSLLGGHAIKIVGWGFDQTSQLNYWI 279
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 81/183 (44%), Positives = 104/183 (56%), Gaps = 19/183 (10%)
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAFGA EA+SDR CI +++S +D+L+CCG CG+GC+GGYPI AW+Y+V G
Sbjct: 1 SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTG 60
Query: 183 VVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKKNQL-WRNSK 226
+ T C PY C H P Y TP C KC+ + + + K
Sbjct: 61 ICTGGSYESQSGCKPY-PIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDK 119
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
HY SAY + I EI NGPVE ++TVYEDF Y GVY H G +GGHAV+++
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRIL 179
Query: 287 GWG 289
GWG
Sbjct: 180 GWG 182
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 88/224 (39%), Positives = 121/224 (54%), Gaps = 21/224 (9%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D ++P FDAR W +C TI + DQG+CGS WA A +DR C+ + N LS
Sbjct: 23 DNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADRLCVATNGDFNQLLS 82
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGC 197
++ CC CG+GC+GGYPI AW+ F +HG+VT E C+PY +D G
Sbjct: 83 AEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGK 141
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+ +P P KC +KC + N H Y+ Y + I ++ GP+E SF
Sbjct: 142 NTCSGQPMEPNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYGPIEASF 199
Query: 257 TVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY+DF +YKSG+Y K +GGH+VKLIGWG + G YW+
Sbjct: 200 DVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWL 242
>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
Length = 142
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 66/87 (75%), Positives = 79/87 (90%)
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
+KC +NQ+W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKH
Sbjct: 1 KKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKH 60
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
ITG +MGGHAVKLIGWGT+D GEDYW+
Sbjct: 61 ITGGMMGGHAVKLIGWGTTDAGEDYWL 87
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 110/192 (57%), Gaps = 16/192 (8%)
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWA A SDR CI G + +LS L CC + CG+GCDGG P +AW +F+
Sbjct: 1 CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCC-YRCGNGCDGGSPEAAWYFFMR 59
Query: 181 HGVVT-------EECDPY-FDSTGCSHPGC-EPAYPTPKC-VRKCVKKN--QLWRNSKHY 228
HG+VT + C PY G C + TP C +R C N + +R HY
Sbjct: 60 HGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHY 119
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 288
+ Y ++ EDIM +IYKNGPV+ +F VY DF +YKSGVY + G + GGHA+K++GW
Sbjct: 120 VDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGW 179
Query: 289 GTSDDGEDYWVC 300
G DD YW+C
Sbjct: 180 GV-DDNTKYWLC 190
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 130/235 (55%), Gaps = 27/235 (11%)
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
VP + D L + FDAR WP+C++I +I D C S WAF A E++SDR CI+ G
Sbjct: 65 VPTENSD----LSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGGM 120
Query: 145 MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 195
+N LS +LL+CC G L CG+GC GG AW+Y+ HG+ T C PY +
Sbjct: 121 INTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAP 180
Query: 196 ------GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAY-RINSDPEDIMA 244
++P C PTP C +KC KN +HY S+ ++ + +I +
Sbjct: 181 CGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQS 240
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
++ NGP+E +F VY+DF Y +G+Y H+TG+ G +V+++GWG +G YW+
Sbjct: 241 DVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWL 294
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 105/294 (35%), Positives = 145/294 (49%), Gaps = 42/294 (14%)
Query: 12 LLILGVISSQTFA-----EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
L+I+G I + A E +V+ +K + + Q E NP F+N T
Sbjct: 4 LVIIGTIVAVAVATHPINEEMVAHIKAKTSLWQP---HETTTNP-----------FNNMT 49
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
Q G P K + +P++FDAR W S I I DQ CGSC
Sbjct: 50 KEQLLAKCGTYIVPANKEY-----PGSKIMTVPENFDARQQWG--SKIHAIRDQQQCGSC 102
Query: 127 WAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
WAFGA EA SDRF I+ G ++ LS DL++C GC+GGY AW Y HG T+
Sbjct: 103 WAFGATEAFSDRFAIN-GKDVILSPEDLVSC--DTNDYGCNGGYMDVAWEYLADHGAATD 159
Query: 187 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 246
C PY +G + P C KC + + R + ++ R + I +EI
Sbjct: 160 SCFPYSAGSGFA----------PACSDKCADGSAMQR--FKCAPNSVRQSKGVAQIQSEI 207
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+GPVE +FTVY DF +Y+SGVY T DV GGHA+K++G+G ++G YW+C
Sbjct: 208 VSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGV-ENGTPYWLC 260
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/190 (46%), Positives = 112/190 (58%), Gaps = 18/190 (9%)
Query: 128 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVE++SDR CIH +S LS +LL+CC CG GC GG P AW Y+ + G+VT
Sbjct: 45 AFGAVESMSDRICIHSKNKISVELSAINLLSCCT-RCGFGCRGGIPGMAWDYWKYEGIVT 103
Query: 186 -------EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSI 230
C PY S+ S+P CE Y PTP+C C + ++ K Y
Sbjct: 104 GGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGK 163
Query: 231 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
S+Y + S+ IM EI NGPVE F VYEDF +YKSGVYKHITG +GGHA+++IGWG
Sbjct: 164 SSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGI 223
Query: 291 SDDGEDYWVC 300
+ YW+C
Sbjct: 224 QQNHIPYWLC 233
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 87/192 (45%), Positives = 113/192 (58%), Gaps = 20/192 (10%)
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA + A+SDR CI + LS D+LACC + CG GC+GG+P+ AW+YF G
Sbjct: 1 SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSW-CGYGCEGGWPMKAWQYFXLEG 59
Query: 183 VVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN-QLWRNSKH 227
VVT C PY + C G EP Y TPKC + C + + ++ KH
Sbjct: 60 VVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDSAKTPKCQKTCQRGYLKPYKEDKH 118
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G + GGHAVK+IG
Sbjct: 119 FGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIG 178
Query: 288 WGTSDDGEDYWV 299
WG + G YW+
Sbjct: 179 WG-KEXGTPYWL 189
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 100/294 (34%), Positives = 148/294 (50%), Gaps = 27/294 (9%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPK-AGWKAARNPQFSNYTVGQF 70
+ I + + V + + + +L D I+ N N K A W A RN +F +T+GQ
Sbjct: 13 MRIFAITITLAILLNVAFAINMGAPVLNDKFIQ--NHNSKNAPWVAKRNARFEGHTIGQV 70
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
++G K +K D S+ P +FDAR WP C + +L+Q CGSCWAF
Sbjct: 71 MAMMGTKKVINNNA-APSIKIVDASI--PSTFDAREQWPGC--VHAVLNQEQCGSCWAFS 125
Query: 131 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 188
+ EALSDR CI +N++LS L+A C + GC+GG P AW Y G+ T EC
Sbjct: 126 SSEALSDRLCIASKGQVNVTLSPQALVA-CDDIGNQGCNGGVPQLAWEYMEWKGLPTFEC 184
Query: 189 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 247
PY G C R+C + + + +K +S++ + I EI
Sbjct: 185 YPYTAGNGTDG----------TCQRQCADGSAMTYYRAKPFSMTTC---NSVACIQNEII 231
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGE-DYWV 299
GPV + VY+DF Y SGVY + T +++GGHA++++GWGT + DYW+
Sbjct: 232 TYGPVVGTMMVYQDFMSYSSGVYVYDGTAELLGGHAIEIVGWGTDATSKLDYWI 285
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 85/212 (40%), Positives = 117/212 (55%), Gaps = 22/212 (10%)
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSL 149
D + +P+SFD+R WP C I I DQ CGSCWAF + LSDRFCIH +N L
Sbjct: 119 QDLNETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDL 176
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPT 208
S DL++C GC GG + + ++ G+V+E+C PY + T C P
Sbjct: 177 SPQDLVSCS--YENFGCSGGQLTESVDFLIYEGIVSEKCKPYMNQDTYCKFKCQNDKQPY 234
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
K C +K+ L I SD E+I E+ NGP+ V +VYED +YK G
Sbjct: 235 TKYF--CEQKSML-------------ILSDIEEIQLELMTNGPMMVGLSVYEDLMNYKEG 279
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VY++ TG+ +GGHA+K+IGWG ++ GE +W C
Sbjct: 280 VYEYTTGNQVGGHAIKIIGWGHTEKGELFWKC 311
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 98/281 (34%), Positives = 146/281 (51%), Gaps = 36/281 (12%)
Query: 39 QDSIIKEVNENPKAGWKAARNP-QFSNYTVGQFKHLLGVKPT-PKGLLLGVPVKTHDKSL 96
Q + + +N N GWKA NP + Y G + + P+G++L + +
Sbjct: 81 QAAFVAAIN-NRTRGWKAGVNPLRHDQYRTGALLYEEAARAKLPQGIVLKL------QEE 133
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDL 154
P+SFDAR W C ++ I +QG C S +A AV ++DR+CIH S D+
Sbjct: 134 PFPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHSEGKSQFSFGAYDV 193
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----TPK 210
L+CC CG GCDGG P + W Y+V +G+ + SH GC+ +YP P+
Sbjct: 194 LSCC-HRCGFGCDGGVPSAVWHYWVENGITS-------GGAYESHEGCQ-SYPFGVCKPQ 244
Query: 211 ----------CVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C+R+C N + KH+ AY + D + I+ E++ GPV+ SFTVY
Sbjct: 245 EIFAPHVDLICLRQCQPGYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVY 304
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
DF YKSGVY+H G +G H+VK++GWG ++G +W+C
Sbjct: 305 TDFIQYKSGVYRHTYGVRVGDHSVKIVGWGV-ENGTKFWLC 344
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 108/187 (57%), Gaps = 12/187 (6%)
Query: 122 HCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
CGSCWAF E +SDR CI ++S D+LACCG CGDGC+GGYPI A+R++
Sbjct: 60 QCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWWN 119
Query: 180 HHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISA 232
GVVT C PY + C+ C P TP C C + + K + +SA
Sbjct: 120 SRGVVTGGDFRGSGCRPYPFAP-CNSYKC-PEEKTPTCSLSCQFGYSTAYAKDKRFGVSA 177
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y + + I EI NGPV +FT+YED YKSGVY+H G ++GGHA+K+IGWGT
Sbjct: 178 YAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-Q 236
Query: 293 DGEDYWV 299
+G YW+
Sbjct: 237 NGIPYWL 243
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 32/51 (62%), Positives = 40/51 (78%), Gaps = 1/51 (1%)
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPVE SFTVYEDF YK GVY++ G V+G HA+K++GWGT + G DYW+
Sbjct: 3 NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGT-EHGTDYWL 52
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 96/294 (32%), Positives = 139/294 (47%), Gaps = 12/294 (4%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+ LG++S+ G + D+ +L + + +N+ WKA N + N T + K
Sbjct: 5 VALGLLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFSEAKR 64
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
L G L V +LP+SFD+ WP C TI I DQ C + WA
Sbjct: 65 LTGAWIQKNSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTA 124
Query: 133 EALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
A+SDR+C + G L +S LL+CC G G +P AW Y+V +G+ + C PY
Sbjct: 125 SAISDRYCTVGGGKQLRISAAHLLSCCKQCGGGCKGG-FPGFAWLYYVEYGIASSGCQPY 183
Query: 192 -------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 244
+ G P + + TPKC C K+ K+ + Y + ED
Sbjct: 184 PFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKR 241
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP F VY D YKSGVY+++ GD +GG AV+++GWG +G YW
Sbjct: 242 ELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYW 294
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 118/223 (52%), Gaps = 17/223 (7%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D ++ LP+SFDAR WP+C +I I DQ G CWA + E ++DR CI + +S
Sbjct: 89 DLAVSLPESFDAREKWPECPSIGLIRDQSAGGGCWAVSSAEVMTDRICIQSNGTKQVYVS 148
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-FDSTGC-SH-- 199
D+L+CCG CG GC G P A+ Y + GV + C PY F G +H
Sbjct: 149 ETDILSCCGQRCGSGCTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPYPFYPCGYHAHLP 208
Query: 200 ---PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
P + +PTP C + C + N S + + E I EI+ NGP+ ++
Sbjct: 209 YYGPCPDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNNGPLVATY 268
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
TVYEDFA+YK+G+Y G G HAVK+IGWG ++G YW+
Sbjct: 269 TVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWG-EENGVKYWL 310
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/220 (40%), Positives = 117/220 (53%), Gaps = 21/220 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
+P+ +D R + +CST I DQ +CGSCWA A+SDR CI +++S D+L
Sbjct: 86 IPEEYDPREKF-KCSTFY-IRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDIL 143
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPG------- 201
CC CG GC GG+ I AW YFV+ GVV+ C PY C H G
Sbjct: 144 TCCNPQCGFGCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPY-PIHPCGHHGNDTYYGE 202
Query: 202 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C TP C +KC +++R K AY + E I EI ++GPV SF VYE
Sbjct: 203 CPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYE 262
Query: 261 DFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
DF+ YK+GVYKH G + G HAVK++GWG S YW+
Sbjct: 263 DFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWL 302
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 121/225 (53%), Gaps = 23/225 (10%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
D+ +P+SFDAR+ W C+++ I DQ +CGSCWA ALSDR CI L +S
Sbjct: 89 DEDDDIPESFDARTHWANCTSLRHIRDQANCGSCWAVSTASALSDRICIASKGETQLHIS 148
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE------CDPY---------FDST 195
D+++CC LCG GCDGG+PI A+ YF G VT E C PY D+
Sbjct: 149 SIDIVSCCK-LCGYGCDGGWPIEAFDYFSRQGAVTGETTSKDGCRPYPFHPLWTYGNDTV 207
Query: 196 GCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
G G C+ + + V++ V +N R + RI + + NGPV
Sbjct: 208 GRRMSGRCKHSKTVGEGVKR-VTRNHTRRTG--LTARRLRITEFCQSHSEGDHGNGPVVA 264
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
FTVYEDF++YK G+Y HI G G HA+K+IGWG ++G YW+
Sbjct: 265 VFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGV-ENGLPYWL 308
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 104/302 (34%), Positives = 155/302 (51%), Gaps = 42/302 (13%)
Query: 36 HILQDSIIKEVNE-NPKAGWKAARNPQFSNYTVGQFKHLLG---VKPTPKGLLL--GVPV 89
++++ + K + K W+ + +F ++ K L+G V +GL L GVP+
Sbjct: 96 QLIKEKMAKRAETGDAKHMWEPEVSLRFKFLSLKDAKKLMGTFLVNTRVEGLRLPSGVPL 155
Query: 90 KT----HDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
+ + +P +FDAR+A+P C + + DQG CGSCWAF + EA +DR CI
Sbjct: 156 PAKTVFENANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQ 215
Query: 145 MN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDP 190
+ LS +CC + C GC+GG P AWR+F GVVT C P
Sbjct: 216 GKGVMPLSTQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWP 275
Query: 191 YFDSTGCSH------PGCEP---AYPTPKCVRKCVKKNQL-----WRNSKHYSISAYRIN 236
Y + C+H P C+ TPKC + C + + H + S+Y +
Sbjct: 276 Y-EIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYSLR 334
Query: 237 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 296
S + + ++ +G V +F VYEDF +YKSGVYKH+ G +GGHA+K+IGWGT +DGE+
Sbjct: 335 SR-DAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGT-EDGEE 392
Query: 297 YW 298
YW
Sbjct: 393 YW 394
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/294 (36%), Positives = 150/294 (51%), Gaps = 39/294 (13%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
L+I+G I++ A V++ ++ +HI + + + +E NP FS+ T Q
Sbjct: 4 LVIVGTIAAMVAATHPVNE-EMVAHIKAKTSLWQPHET-------TTNP-FSDLTKEQLL 54
Query: 72 HLLGVKPTPKGLLL-GVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
G P G P+ + P +FDAR W S I I DQ CG+CWAFG
Sbjct: 55 AKCGTYIVPSNKQYPGSPL------ISTPDNFDARQQWG--SKIHAIRDQQQCGACWAFG 106
Query: 131 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 188
A EALSDRF I + +++ S DL++C GC+GGY AW + HGVV + C
Sbjct: 107 ATEALSDRFTIASNGSVDVVFSPEDLVSC--DTNDYGCNGGYMDMAWEFLDQHGVVADSC 164
Query: 189 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEI 246
PY +G + P C KC + K YS + R + E I +EI
Sbjct: 165 FPYSAGSGFA----------PACASKCADGSA----EKKYSCVHGSIRQSQGVEQIKSEI 210
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+GPVE +FTVY DF +Y+SGVY T DV GGHA+K++G+G ++G YW+C
Sbjct: 211 VAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGV-ENGTPYWLC 263
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 148/314 (47%), Gaps = 34/314 (10%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARN--PQFSNY 65
+ L++L VI + + ++ L+ I +N WKA N P
Sbjct: 1 MARVLILLSVILFSVY-------MTEQAYFLEKDYIDSINAQATT-WKAGVNFPPSTPKE 52
Query: 66 TVGQFKHLLGVK-PTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGH 122
+ + GV+ P + ++ +L ++PK FDAR W +C TI + DQG+
Sbjct: 53 AILRLLGSRGVQIPNKANYKMYKSRDSNYDNLFGRIPKKFDARKKWRKCKTIGAVRDQGN 112
Query: 123 CGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWA A +DR C+ + + LS +L CC CG GC+GGYPI AW F
Sbjct: 113 CGSCWALATSSAFADRLCVATDADFNEFLSPEELTFCC-HTCGYGCNGGYPIKAWERFKS 171
Query: 181 HGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 227
HG+VT E C+PY + G + +P +C R C L + H
Sbjct: 172 HGLVTGGDYKSGEGCEPYRVPPCRHHAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFDDDH 231
Query: 228 -YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKL 285
Y+ +Y + I ++ GP+E SF VY+DF YKSGVY + +GGHAVKL
Sbjct: 232 RYTRDSYYLTYG--SIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKL 289
Query: 286 IGWGTSDDGEDYWV 299
IGWG + G YW+
Sbjct: 290 IGWG-EESGVPYWL 302
>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
Length = 351
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 80/207 (38%), Positives = 115/207 (55%), Gaps = 26/207 (12%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLL 155
+P FD R+ WPQC + +I DQ +CG+CWAF L+DR CI + +N LS D++
Sbjct: 120 IPLEFDFRTKWPQC--LRKIRDQANCGACWAFTGSGMLADRICILTNGTINEELSPQDMV 177
Query: 156 ACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
C F GC+GGY ++A Y ++ GV E C PY D T KC
Sbjct: 178 DCSHDNF----GCEGGYLMNALDYLMNEGVTKESCTPYKDKTN-------------KCQY 220
Query: 214 KCVKKNQLWRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C K + + KHY R+ ++ E I ++ +NGP+ V TVYEDF +Y +G YK
Sbjct: 221 TCQNKTEEFH--KHYCKPGTLRVLTNEEQIKRDLMQNGPLMVGLTVYEDFINYATGDYKF 278
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ G+++GGHAVKL+GW T+ G+ W+
Sbjct: 279 VAGEIVGGHAVKLMGWRTTQKGQTSWL 305
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 88/224 (39%), Positives = 115/224 (51%), Gaps = 21/224 (9%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D ++P FDAR W +C TI + DQGHCGS WA A SDR C+ + N LS
Sbjct: 20 DNYQEIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLCVATNGDFNQLLS 79
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------DSTGC 197
++ CC CGDGC GGYPI AW+ + HG+VT E C+PY D G
Sbjct: 80 AEEITFCC-HTCGDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGN 138
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+ +P +C R C L + H Y+ Y + I ++ GP+E SF
Sbjct: 139 NTCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY--RGIQKDVINYGPIEASF 196
Query: 257 TVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY+DF YKSG+Y K +GGH+VKLIGWG + G YW+
Sbjct: 197 DVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWL 239
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 99/285 (34%), Positives = 141/285 (49%), Gaps = 32/285 (11%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG-VKPTPKGLLLGV--PVKTHDKSLK 97
S++ E+N + +F ++G K L G + +GL V P + D
Sbjct: 3 SLVDEINSKQNLWTASTDQERFYGRSLGDAKKLCGTLLEETEGLEKRVYPPGELAD---- 58
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
+P SFDAR A+ +C I + DQ C SCWA VEA + R CI G N LS ++
Sbjct: 59 IPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAGEM 118
Query: 155 LACCGFLCG---DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH----- 199
+ACC GC GG ++AW + HG+ TE C PY + C+H
Sbjct: 119 IACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPY-NFPKCAHHQKKS 177
Query: 200 ---PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
P + Y TP C+ +C K +H++ + + ++I EI NGP
Sbjct: 178 KYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTSA 237
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+F+VYEDF YKSGVYKH G +MG H+V++IGWGT + G DYW+
Sbjct: 238 TFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGT-EKGVDYWL 281
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 81/217 (37%), Positives = 122/217 (56%), Gaps = 20/217 (9%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC- 158
FDAR WP+CS+I I D C S WAF A E++SDR CI+ G ++ LS +LL+CC
Sbjct: 89 FDARERWPECSSIPLINDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELLSCCT 148
Query: 159 GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST------GCSHPGC-E 203
G L CG+GC GG P+ AW+Y+ HG+ T C PY + ++P C
Sbjct: 149 GVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTN 208
Query: 204 PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
PTP C +KC + +HY +S ++ + +I +++ NGPVE + +Y+DF
Sbjct: 209 TTLPTPTCEKKCKPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDF 268
Query: 263 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y +G+Y H+ G+ G +V+++GWG +G YW+
Sbjct: 269 LQYTTGIYVHLAGNKQGHLSVRILGWGMF-EGVPYWL 304
>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 515
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 98/262 (37%), Positives = 125/262 (47%), Gaps = 34/262 (12%)
Query: 33 LDSHILQDSIIKEVNENPK----AGWKAA---RNPQFSNYTVGQFKHLLGVKPTPKGLLL 85
LD H+ S+ + NP A WK++ + P N + G K
Sbjct: 16 LDKHV---SLFSPIGFNPHKQTGAKWKSSAVSKGPYMEN-----VRWRFGAKRETTEQKA 67
Query: 86 GVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG 144
P V ++ +P FDAR W +C +I I Q CGSCWAFGAVEA+SDR CIH G
Sbjct: 68 RRPTVNNRFSNVDIPMQFDARKYWLKCPSIREIRGQSSCGSCWAFGAVEAMSDRLCIHSG 127
Query: 145 MNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY---- 191
LS DLL+CC + CG GCDGG+P AW Y+ G+VT C Y
Sbjct: 128 AKYQKGLSAVDLLSCC-WKCGYGCDGGFPAQAWNYWSTDGIVTGGSKENPSGCRSYPFPS 186
Query: 192 --FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
D G HP C Y TP+C +KC + + S+Y + +IM EI
Sbjct: 187 CSHDERG-RHPLCPSEIYHTPRCTKKCDTDKLHYSAELTKANSSYNVLDSDREIMMEIMN 245
Query: 249 NGPVEVSFTVYEDFAHYKSGVY 270
NGPVE F VYEDF Y+ G+Y
Sbjct: 246 NGPVEAVFDVYEDFLQYEKGIY 267
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 118/225 (52%), Gaps = 30/225 (13%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVND 153
L LPKSFDAR+ W C +I + DQG+C S +A A+SDR CIH + LS
Sbjct: 51 LNLPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQ 110
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHP 200
+L+CC +LCGDGC GG +W ++ HG+V+ E C PY T +
Sbjct: 111 ILSCC-YLCGDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTETAVENA 169
Query: 201 GCEPAYPTPKCVRKCVKKNQLWRNSK------HYSISAYRINSDPEDIMAEIYKNGPVEV 254
TP+C +C + R K HY + AY M EIY+NGP+
Sbjct: 170 CSNKTLFTPECKVQCYNPDYGTRYVKDNHQGTHYRVPAYT-------AMKEIYENGPITA 222
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SF +Y+DF +Y+SGVY + +G + AVK++GWG ++G YW+
Sbjct: 223 SFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWG-EENGTPYWL 266
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 100/273 (36%), Positives = 137/273 (50%), Gaps = 25/273 (9%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK-GLLLGVPVKT- 91
DS ++ D + WKA N +F+ T K LLG +P LG
Sbjct: 273 DSALINDEQHVNYLNQEEMSWKAGVNERFAGMTYADVKGLLGADTSPHIAEYLGETRSQD 332
Query: 92 -HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-HFGMNLSL 149
+D +P F+A + W + I DQ CGSCWAF A E LSDR I H L
Sbjct: 333 FYDNITDVPSEFNAVTQWK--GLVQPIRDQQQCGSCWAFSAAEVLSDRNAIQHNKAEPVL 390
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
S DL++C GC+GG +AW Y + G+VT+ C PY G + P
Sbjct: 391 SPEDLVSCD--RVDQGCNGGNLGTAWTYLKNTGIVTDACFPYTAGGGDA----------P 438
Query: 210 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
KC C K W +K+ + SAY +N E++ EI +GP++V+F VY+ F YKSGV
Sbjct: 439 KCETSC-KDGSSW--TKYKAASAYAVNG-VENMQKEIMTHGPIQVAFNVYKSFMSYKSGV 494
Query: 270 YKHITGDVM--GGHAVKLIGWGTSDDGEDYWVC 300
Y ++M GGHAVK++GWGT + G+DYW+
Sbjct: 495 YAKKWYELMPEGGHAVKIVGWGT-EGGKDYWLV 526
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/295 (34%), Positives = 141/295 (47%), Gaps = 14/295 (4%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+ LG++S+ A G + D+ +L + + +N+ WKA N + N T + K
Sbjct: 5 VALGLLSTALVALGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFSEAKR 64
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
L G L V +LP+SFD+ WP C TI I DQ C + WA
Sbjct: 65 LTGAWIQKNSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTA 124
Query: 133 EALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
A+SDR+C + G L +S LL+CC CG GC GG+P AWRY+V +G+ + C PY
Sbjct: 125 SAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKGGFPGFAWRYYVEYGIASSYCQPY 183
Query: 192 FDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 243
C H G + + TP+C C K K+ AY + E+
Sbjct: 184 -PFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL--IKYRGKDAYMLLPGEEEFK 240
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP VY D YKSGVY+++ G MG AVK++GWG +G YW
Sbjct: 241 RELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVTAVKVVGWGKL-NGTPYW 294
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 106/168 (63%), Gaps = 14/168 (8%)
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF----- 192
+++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 1 VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE 60
Query: 193 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
S P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGP
Sbjct: 61 HHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGP 120
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 121 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 167
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/282 (36%), Positives = 138/282 (48%), Gaps = 26/282 (9%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 94
L D IK +NE K WKA R +N + F LLG + K +K +D
Sbjct: 23 QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEFEIKKYDPL 79
Query: 95 --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
P+ FD+R+ W C I I DQG+CGSCW+F A +DR C+ G N LS
Sbjct: 80 YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCS 198
+L CC CG GC GG P+ AW YF GV T E C PY + G +
Sbjct: 140 PEELTFCCK-DCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQGEN 198
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
+P +C + C K + +++ + S Y INS + I +I GPVE SF
Sbjct: 199 ICDEQPMERNHQCPKTCYGKTTV--QNRYKTKSEYYINS-IKTIEQDIKTYGPVEASFDC 255
Query: 259 YEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
Y+D + YKSG+Y K GGH++K+IGWG +DG YW+
Sbjct: 256 YDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWG-QEDGTPYWL 296
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/166 (43%), Positives = 104/166 (62%), Gaps = 14/166 (8%)
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DS 194
+ +S DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 1 VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHH 60
Query: 195 TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
S P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE
Sbjct: 61 VNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVE 120
Query: 254 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 121 GAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 165
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 86/193 (44%), Positives = 111/193 (57%), Gaps = 22/193 (11%)
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWAFGAVEA+SDR CI ++LS DLL+CC CG GC+GG P+SAW+++V G
Sbjct: 1 SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCC-RSCGFGCNGGDPLSAWKFWVKEG 59
Query: 183 VVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKK--NQLWRNS 225
+VT C PY C H P +PTPKC + C + ++
Sbjct: 60 IVTGSNHSTNAGCKPY-PFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKED 118
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 285
K++ SAY + + E I EI GPVEV+F VYEDF +Y G+Y H G + GGHAVK+
Sbjct: 119 KYFGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKM 178
Query: 286 IGWGTSDDGEDYW 298
IGWG D+G YW
Sbjct: 179 IGWGI-DNGVPYW 190
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 101/306 (33%), Positives = 141/306 (46%), Gaps = 52/306 (16%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTHDK 94
+ S++ E+N + +F N ++ K L G K + G + ++
Sbjct: 82 IMQSLVDEINSKQNTWTASTGQKRFKNLSLRDAKMLCGTLKRGSNDKVIRKGYAI---EE 138
Query: 95 SLKLPKSFDARSAWPQCSTISR-ILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
LP FDAR+A+P CS + R I DQ CGSCWAFG EA +DR CI + LS
Sbjct: 139 LQDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSA 198
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCS 198
++ AC GCDGG P AW + + G+ T + C PY D C+
Sbjct: 199 GEMNACAPSF---GCDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPY-DFPPCA 254
Query: 199 H-------PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 248
H P C + +Y TP C +C K R+ +H+ + + D I
Sbjct: 255 HHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRT 314
Query: 249 NGPV---------------EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+GPV SF VYEDF Y+SGVYKH +G +GGHAVK+IGWG +
Sbjct: 315 DGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWG-EET 373
Query: 294 GEDYWV 299
G+ YW+
Sbjct: 374 GQAYWL 379
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 103/268 (38%), Positives = 140/268 (52%), Gaps = 21/268 (7%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK-GLLLGVPVKTHDKSLKLP 99
S+I ++N A W A NP F + + LG+ P P + P T + +P
Sbjct: 21 SLINQINSQQSA-WTAGINP-FDD--IESRLGFLGIHPDPNFKPEIKEPQATQNV---IP 73
Query: 100 KSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLA 156
++FDAR WP+C+ I I +QG C S WAF A E +SDR CI + + + LS DL+
Sbjct: 74 ETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLID 133
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRK 214
CC + CG+ C GGY AW YF+ G+V+ Y STGC P E Y TP C
Sbjct: 134 CCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTT 189
Query: 215 CV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG-PVEVSFTVYEDFAHYKSGVYK 271
C K + + KH+ S Y I + I EI G PV +F VY DF Y+ GVY
Sbjct: 190 CQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYI 249
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ +G + G AVK+IGWGT ++G YW+
Sbjct: 250 YTSGALFGRTAVKIIGWGT-ENGWAYWL 276
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 112/187 (59%), Gaps = 17/187 (9%)
Query: 128 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV- 184
AFGAVEA+SDR CIH + + +S DL+ CC CG GC GG +AW+Y+ G+V
Sbjct: 1 AFGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCD-KCGSGCSGGVSAAAWQYWKDAGLVS 59
Query: 185 ------TEECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 232
T+ C PY S+ S P C PTPKC R+C + + + + K+++ +
Sbjct: 60 GGLYNTTDGCKPYSLAPCEHSSQGSLPECVGTLPTPKCKRQCREGYERSYDDDKYFAKNV 119
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y IN + I EI++NGPVE FT Y DF YKSGVY+H + D++G HA++++GWG S+
Sbjct: 120 YSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWG-SE 178
Query: 293 DGEDYWV 299
D YW+
Sbjct: 179 DNNPYWL 185
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 92/247 (37%), Positives = 120/247 (48%), Gaps = 39/247 (15%)
Query: 89 VKTHDKSLK---------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
+++H++S + +P SFDAR WP CS I + DQ CGS A E SDR
Sbjct: 76 IRSHEQSTENDNSQVFEEIPNSFDARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIASDRT 135
Query: 140 CIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT------- 185
CI N LS D L+CC L CGDG CDG +P +++ HG+ T
Sbjct: 136 CIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQ 195
Query: 186 --------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAY 233
CD + + S P P Y TP C +C N W + KH+ + Y
Sbjct: 196 FGCKPYTIYPCDKKYPNGTTSVPC--PGYHTPVCEERCTS-NITWPISYKQDKHFGKAHY 252
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+ DI EI +NGPV SF +Y+DF YKSG+Y H GD GG K+IGWG D+
Sbjct: 253 NVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DN 311
Query: 294 GEDYWVC 300
G YW+C
Sbjct: 312 GVPYWLC 318
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 99/294 (33%), Positives = 139/294 (47%), Gaps = 12/294 (4%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+ LG++S+ G + D+ +L + + +N+ WKA N + N T + K
Sbjct: 5 VALGLLSTALVTLGASALRAKDAPVLTKTFVDRINQLNGGMWKAVYNGKMQNITFSEAKR 64
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
L G L V +LP+SFD+ WP C TI I DQ C + WA
Sbjct: 65 LTGAWIQKTSSLPPVRFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTA 124
Query: 133 EALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY 191
A+SDR+C + G L +S LL+CC CG GC GG+P AWRY+V +G+ + C PY
Sbjct: 125 SAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKGGFPGFAWRYYVEYGIASSYCQPY 183
Query: 192 -------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 244
+ G P + TP+C C K K+ AY + E+
Sbjct: 184 PFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIPL--IKYRGKDAYMLLPGEEEFKR 241
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+Y NGP VY D YKSGVY+++ G MG AVK++GWG +G YW
Sbjct: 242 ELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVTAVKVVGWGKL-NGTPYW 294
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 114/231 (49%), Gaps = 30/231 (12%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+ +P SFD+R WP+C+ I + DQ CGS AVE SDR CI N LS D
Sbjct: 89 INIPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTCISSNGTFNWPLSAQD 148
Query: 154 LLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT---------------EECDPYFD 193
L+CC L CGDG CDG +P +++ HG+ T CD +
Sbjct: 149 PLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYP 208
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDIMAEIYKN 249
+ S P C P Y TP C C N W + KH+ + Y + DI EI N
Sbjct: 209 NGTTSVP-C-PGYHTPPCEDHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTN 265
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
GPV SF +YEDF YKSG+Y H GD GG K+IGWG D+G YW+C
Sbjct: 266 GPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLC 315
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 149/322 (46%), Gaps = 41/322 (12%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSK-LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
FL + + SS T+ + S+ +KL L + K+ + WKA +
Sbjct: 8 FLLPLFICTPLHSSVTYQSSISSEAIKLSGSDLTSYVNKK-----QKLWKAETSRMTFQE 62
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISRILDQGHC 123
+ + K + +K + V KT + ++ +P SFD+R WP CS I + DQ C
Sbjct: 63 KMARAKSIKFIKSNDE-----VSEKTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDC 117
Query: 124 GSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFL---CGDG--CDGGYPISAWR 176
GS AVE SDR CI + N LS D L+CC L CGDG CDG +P +
Sbjct: 118 GSAAHLVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILK 177
Query: 177 YFVHHGVVTEE-------CDPYFD-------STGCSHPGCEPAYPTPKCVRKCVKKNQLW 222
++ HG+ T C PY + G + C P Y TP C C N W
Sbjct: 178 WWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPC-PGYHTPTCEEHCTS-NITW 235
Query: 223 ----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 278
+ KH+ + Y + DI EI NGPV SF +Y+DF YK+G+Y H GD
Sbjct: 236 PIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQE 295
Query: 279 GGHAVKLIGWGTSDDGEDYWVC 300
GG K+IGWG D+G YW+C
Sbjct: 296 GGMDTKIIGWGV-DNGVPYWLC 316
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/334 (31%), Positives = 147/334 (44%), Gaps = 87/334 (26%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
+ S++ E+N + +F N ++ K L G L+ G ++DK++K
Sbjct: 477 IMQSLVDEINSKQNTWTASTGQKRFKNLSLRDAKMLCGT------LMRG----SNDKAIK 526
Query: 98 ----------LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
LP FDAR+A+P CS I I DQ CGSCWAFG EA +DR CI
Sbjct: 527 KGYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGT 586
Query: 147 LS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPY 191
+ LS ++ AC GC+GG+P SAW + G+ T + C PY
Sbjct: 587 FTELLSAGEMNACAP---SHGCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPY 643
Query: 192 FDSTGCSH-------PGC----------------------EPAYPTPKCVRKC--VKKNQ 220
D C+H P C + +Y TP C +C K
Sbjct: 644 -DFPPCAHHINDTKYPECPKVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTT 702
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPV---------------EVSFTVYEDFAHY 265
R+ +H+ + + D I +GPV SF+VYEDF Y
Sbjct: 703 TLRDDRHFMLESSPYQYSVNDAKNAIRTDGPVGPIYFCDPNVNFDQVSASFSVYEDFLAY 762
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KSGVYKH +G+ +GGHAVK+IGWG + G+ YW+
Sbjct: 763 KSGVYKHTSGEYLGGHAVKIIGWG-EESGQAYWI 795
>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
Length = 215
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 82/214 (38%), Positives = 111/214 (51%), Gaps = 20/214 (9%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLS 150
D ++P+ FDAR W +C TI + DQG+C S WA A +DR C+ + N LS
Sbjct: 1 DNYQEIPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLS 60
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGC 197
++ CC CG+GC GGYPI AW+ F HG+VT E C+PY +D G
Sbjct: 61 AEEITFCC-HTCGNGCYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGN 119
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+ +P +C R C L + H Y+ Y + I ++ GP+E SF
Sbjct: 120 NTCSGQPMESNHRCTRMCYGNQDLDFDQDHRYTRDHYYLTY--RGIQKDVINYGPIEASF 177
Query: 257 TVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWG 289
VY+DF YKSG+Y K +GGH+VKLIGWG
Sbjct: 178 DVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWG 211
>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 233
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 107/178 (60%), Gaps = 15/178 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +NE+P AGWKA ++ +F +++ + L+G + + V HD +
Sbjct: 30 LSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVND 153
+++P FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G + LS D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
L++CC CGDGC GG+P AW Y+V G+VT + +H GC+P YP PKC
Sbjct: 148 LISCCKD-CGDGCKGGFPGQAWDYWVKRGIVTGGSEE-------NHTGCQP-YPFPKC 196
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/194 (42%), Positives = 109/194 (56%), Gaps = 22/194 (11%)
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA + A+SDR CI + +S D+++CC + CG GC GG+ I AW YF G
Sbjct: 1 SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTW-CGYGCQGGWSIRAWYYFAEQG 59
Query: 183 VVTE-------ECDPYFDSTGCSHPGCEPAY-------PTPKCVRKC-VKKNQLWRNSKH 227
VVT C PY + C + EP Y TP+C R+C + + + + KH
Sbjct: 60 VVTGGNYNTKGSCRPY-EIHPCGYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKH 118
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y +AY++ E I EI +NGPV FTVYEDFAHYK G+YKH +G GGHAVK+IG
Sbjct: 119 YGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIG 178
Query: 288 WGTSDDGED---YW 298
WG+ G + YW
Sbjct: 179 WGSEQKGSEKIPYW 192
>gi|290973645|ref|XP_002669558.1| predicted protein [Naegleria gruberi]
gi|284083107|gb|EFC36814.1| predicted protein [Naegleria gruberi]
Length = 343
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/298 (32%), Positives = 144/298 (48%), Gaps = 49/298 (16%)
Query: 26 GVVSKLKLDSH---ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-------HLLG 75
+V+ ++ SH I +I +N NPK+ WKA +F+N TVG+FK H
Sbjct: 4 AIVAMGEMASHHEPIHDHHVIHSINNNPKSSWKAKVYEKFANMTVGEFKQKYLGAIHEEA 63
Query: 76 VKPTPKG---LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAF--- 129
+ P+ K ++ G P + P +FD+R WPQC + + +Q CGSCWAF
Sbjct: 64 ITPSSKSRFSIVTGPPT-----AYTPPTNFDSRQKWPQC--VHTVRNQLDCGSCWAFWIE 116
Query: 130 -----GAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
A + LSDRFCI + +N+ +S + C + GC GG W + + G
Sbjct: 117 FNDLVSATKVLSDRFCIASNGSVNVIMSPQYQIDCN--MDNLGCSGGSLPKTWNFLTNVG 174
Query: 183 VVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
V+E+C PY ++ C KCV + Y +Y + I
Sbjct: 175 SVSEQCRPYKNND------------DDDCPSKCVDG----KAPSFYKAKSYASIKGLDSI 218
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
M EI GPV S TVY+D Y+SGVY H+TG+ +GGHA+ +IG+G S + YW+
Sbjct: 219 MYEIQNYGPVHASLTVYKDLMSYQSGVYSHLTGNEIGGHAIVIIGFGMDSLSKKPYWI 276
>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
Length = 239
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 97/238 (40%), Positives = 120/238 (50%), Gaps = 37/238 (15%)
Query: 71 KHLLGVK----PTPKGLLLGVPVKTHD----KSLKLPKSFDARSAWPQCSTISRILDQGH 122
K LLG K P + + KT+D S K+PK+FDAR W QC TI R+ DQG
Sbjct: 15 KRLLGSKGVQIPNKNNMHM---YKTNDVAYISSGKIPKTFDARKKWVQCDTIGRVRDQGQ 71
Query: 123 CGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
CGSCWA A +DR CI N LS +++ CC + CG GCDGGYPI AW+ F
Sbjct: 72 CGSCWAVSTSSAFADRLCIATDGDFNELLSADEITFCC-YTCGFGCDGGYPIKAWKQFSR 130
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPAYPTPK-----------CVRKCVKKNQ--LWRNSKH 227
HG+VT FDS GCEP P C KC NQ +
Sbjct: 131 HGLVT---GGDFDSG----EGCEPYRVPPSGSNSSNSYNHFCRGKCYGDNQNISYSEDHR 183
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVK 284
Y+ Y ++ + I ++ GP+E SF VY+DF YKSGVY K +GGHAVK
Sbjct: 184 YTRDYYYLSYNA--IQKDVLLYGPIEASFEVYDDFMIYKSGVYVKSENATHLGGHAVK 239
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 134/267 (50%), Gaps = 27/267 (10%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
L + ++ +SI++ +N +P + W AA P+ S V +F+ +LG + P +P
Sbjct: 5 LFASVVAESIVETINNDPTSTWVAAEYPR-SVINVAKFRAMLGAELGPH-----MPY-VQ 57
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
SL P FDAR WP I + DQ CGSCWA EA+ D I ++SV
Sbjct: 58 PLSLSEPTEFDAREQWP--GKILPVRDQASCGSCWAHSVAEAMGDAQNIAGCPRGAMSVQ 115
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
DL++C C+GG A Y V G+ TE C Y +G P C
Sbjct: 116 DLVSC--DKTDSACNGGDMKKAQEYLVKTGITTEACVKYVSGSG----------RVPACP 163
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
KC +Q+ R Y + +++ + +P +IM + + GP+ F VY DF +Y+SGVY+H
Sbjct: 164 SKCDNGSQIIR----YKLQSWK-SVEPSEIMQALMEYGPLSCGFMVYSDFMNYRSGVYQH 218
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+G GGHAV L GWG ++G YW+
Sbjct: 219 KSGYFEGGHAVLLCGWGV-ENGLPYWL 244
>gi|290991959|ref|XP_002678602.1| predicted protein [Naegleria gruberi]
gi|284092215|gb|EFC45858.1| predicted protein [Naegleria gruberi]
Length = 286
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 100/295 (33%), Positives = 139/295 (47%), Gaps = 26/295 (8%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
FL CLL+L V + FAE K + I +++++VN GW+A P F N
Sbjct: 9 FLVICLLLLAV--TFLFAE---EKDFWNKPIQTRALVEQVNSQVGVGWRATSYPHFDNMK 63
Query: 67 VGQFKHLLGVKPTPKGLLLGVPVKTH-DKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ F+ LGV + V+ K LP+ FDAR WP C I+ I +Q CGS
Sbjct: 64 LSDFRKYLGVHNFTEPTRSKFNVRAELTKVRNLPEQFDARKEWPHC--ITPIRNQEQCGS 121
Query: 126 CWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
CWAF A LSDRFC++ + + LS +L C + C+GG +AW++ V G+
Sbjct: 122 CWAFSASAVLSDRFCVYSNGSVQVMLSPEYMLECSA--QNNACNGGTLHAAWQFLVSVGI 179
Query: 184 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 243
T+ C PY G C KC Q SK Y +A + + +IM
Sbjct: 180 PTDSCVPYSSGNG----------TVGHCPSKCTVPGQ---TSKFYKAAAAKKLENMVEIM 226
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
EI +G V+V+ VY D YKSGVY H+T + G+ L G G+D W
Sbjct: 227 TEIKTHGSVQVAIAVYRDLFSYKSGVYHHVTWG-LDGYFWILRGHNECGFGKDVW 280
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 117/213 (54%), Gaps = 21/213 (9%)
Query: 90 KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNL 147
KT ++ +FD+R+ WP C + I +Q CGSCWAF A E LSDRFCI G +++
Sbjct: 5 KTATGAVAAVPAFDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDV 62
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 207
LS +++C GCDGGY +AW + G+ +++C PY G
Sbjct: 63 VLSPQYMVSCDS--TDYGCDGGYLNNAWAFLAGTGIPSDKCAPYTSQNGD---------- 110
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
V C K Q + K Y + +D IM ++ +NGPV+ +F+VY DF YKS
Sbjct: 111 ----VAACPSKCQDGSSVKLYKAKNPQQLNDIPSIMEDMQQNGPVQAAFSVYRDFMSYKS 166
Query: 268 GVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
GVY H++G ++GGHA+K++GWG S + YW+
Sbjct: 167 GVYHHVSGSLLGGHAIKMVGWGVDSATNKPYWI 199
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 128/242 (52%), Gaps = 17/242 (7%)
Query: 68 GQFKHLLGVKPTPKGLLLGVPVKTHDKSLK-LPKSFDARSAWPQCSTISRILDQGHCGSC 126
G K LG+ + L +P + +S++ LP SFDAR WP C ++++I QG CGSC
Sbjct: 19 GVMKMSLGLNESE---LNNLPRLQNQRSVRALPASFDARQKWPYCPSLNQIRSQGSCGSC 75
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
+A ++DR+CIH G L+CC CDGGY + Y+V +G+
Sbjct: 76 YAVSTAAVITDRYCIHSGGERQFYFGSTGYLSCCTDCYK--CDGGYVHKTFDYWVKYGLT 133
Query: 185 TEECDPYFDSTGCS-HP---GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 239
+ PY GC +P + KC R+C L + + S+Y +
Sbjct: 134 SG--GPYHSGQGCKPYPFGGATQDVNIVLKCDRQCQAGYPLTYSQDLKHGASSYILPWGD 191
Query: 240 EDIM-AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E+ M AEIY+NGP+ SF VY DF Y+SGVY+H+TG G HAV++IGWG ++G YW
Sbjct: 192 ENAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGV-ENGVKYW 250
Query: 299 VC 300
+C
Sbjct: 251 LC 252
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 126/280 (45%), Gaps = 31/280 (11%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK------HLLGVKPTPK 81
V++ + + +I ++N N GWKA P+F+N ++ + + LL P
Sbjct: 63 VNETSASTPVNDKELIDKINANETLGWKATEYPRFANLSISEARDSLFGLSLLSTDPDTP 122
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
L + + + LP +FDAR+ W C I + DQ CG+CWAF A L+ R CI
Sbjct: 123 RLDI-------EPRVDLPMNFDARTQWRGC--IPAVRDQQTCGACWAFSATYVLAHRLCI 173
Query: 142 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
N+ LS + C C GGY AW + G + C PY
Sbjct: 174 ATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYAWSFLERTGTTVDSCIPYASGRATFS 231
Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
G PA KC Q + Y R S +I A I G V+ FT+Y
Sbjct: 232 SGTCPA--------KCKVSTQ---SMTMYKAKNSRYISGVNNIKAAIMSYGSVQSGFTIY 280
Query: 260 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
DF Y+SGVYKH++ +GGHAV LIGWG + G +YW+
Sbjct: 281 RDFMSYRSGVYKHVSTTTLGGHAVALIGWGV-ESGTNYWL 319
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 81/192 (42%), Positives = 104/192 (54%), Gaps = 18/192 (9%)
Query: 125 SCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA A E +SDR C+ LS D+LACCG CG GC+GGY AW Y + G
Sbjct: 1 SCWAVSAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGYGCNGGYSARAWLYARNSG 60
Query: 183 VVT----EE---CDPY------FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSKH 227
V + +E C PY + + C + Y TP C + C + + K
Sbjct: 61 VCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKI 120
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y+ AYR++SD I AEI+ GPV+ SF YEDFAHYKSG+Y H G GGHAVK+IG
Sbjct: 121 YAXDAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIG 180
Query: 288 WGTSDDGEDYWV 299
WG ++G W+
Sbjct: 181 WGV-ENGTKXWI 191
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 94/284 (33%), Positives = 138/284 (48%), Gaps = 22/284 (7%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKG 82
+G + D + D++++ VN + GW A + ++ Y G K L +PT +
Sbjct: 70 DGGIVDCDRDLCLTDDNLVRNVNSIHRLGWSARKYDEWWGHKYAEGLTKRLGTKEPTYR- 128
Query: 83 LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
+ + H+ LP+SF++ W S IS +LDQG CGS W SDRF I
Sbjct: 129 --VKAMSRLHNIVDHLPRSFNSIDKWA--SYISDVLDQGWCGSSWVISTASVASDRFAIQ 184
Query: 143 FGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSH 199
+ LS ++L+C GC+GG+ +AWRY GVV E C PY C
Sbjct: 185 SRGKEVIQLSPQNILSCTRRQ--QGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDACKI 242
Query: 200 PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
P + C V +++L+ YS++ + DIMAEI+ +GPV+ + TV
Sbjct: 243 PHNSRSLRNNGCRSYSGVDRDELYTVGPAYSLN------NETDIMAEIFMSGPVQATLTV 296
Query: 259 YEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
Y DF Y G+Y+H G +G H+VKLIGWG DG YW+
Sbjct: 297 YRDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWI 340
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 84/207 (40%), Positives = 116/207 (56%), Gaps = 22/207 (10%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG 159
FD+R WP C + I DQG+CGSC++F + E +SDRFCI + +N+ LS DL+ C
Sbjct: 6 FDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCSW 63
Query: 160 FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN 219
+ GC+GG P + Y G+V++ C PY G +H C P + C K
Sbjct: 64 Y--SFGCNGGIPGLVFDYIHKDGLVSDACFPYLSYDGNTHVKC-PDF----CYNN---KT 113
Query: 220 QLWRNSKHYSISAYRINSDPED-------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
+ +++ KH++ Y + ED I EI +GPV F VY DF YKSGVY+H
Sbjct: 114 KSFKSDKHFADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMVYSDFTVYKSGVYRH 173
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
TG G HAVK+IGWGT ++G DYW+
Sbjct: 174 QTGSFEGIHAVKIIGWGT-ENGVDYWL 199
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 102/309 (33%), Positives = 141/309 (45%), Gaps = 53/309 (17%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTP-KGLLLGVPVKTHDKSLKLP 99
S++ EVN + +F ++G K L G P KGL V ++ +P
Sbjct: 3 SLVDEVNSKQNLWTASTDQERFYGRSLGDAKKLCGTLPEETKGLE--KKVYPTEELADIP 60
Query: 100 KSFDARSAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 156
SFDAR A+ +C I + DQ CGSCWA VEA + R CI G N LS ++LA
Sbjct: 61 SSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAGEMLA 120
Query: 157 CCGFL--CGD-GCDGGYPISAWRYFVHHGVVT-------------EECDPY------FDS 194
CC + C GC GG +AW + HG+VT + C PY D
Sbjct: 121 CCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQ 180
Query: 195 TGCSHPGC---------------------EPAYPTPKCVRKC--VKKNQLWRNSKHYSIS 231
+ C + Y TP C+ +C K +H++
Sbjct: 181 EDSKYEPCPEVRVPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTAR 240
Query: 232 AY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
A + ++I EI NGP SF+ YEDF+ YKSGVYKH +G +G H+V++IGWGT
Sbjct: 241 ALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGT 300
Query: 291 SDDGEDYWV 299
+ G DYW+
Sbjct: 301 -EKGVDYWL 308
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 95/255 (37%), Positives = 128/255 (50%), Gaps = 24/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K VP T + + P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V ++ DR C G++ + S +++C GD
Sbjct: 85 PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDM 138
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHL 189
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
K Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV
Sbjct: 190 YKATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVD 247
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT DDG DYW+
Sbjct: 248 MVGYGTDDDGVDYWI 262
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 89/231 (38%), Positives = 111/231 (48%), Gaps = 30/231 (12%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
L +P FD+R WP+C+ I + DQ CGS AVE SDR CI N LS D
Sbjct: 92 LDIPTYFDSRQKWPECTQIGAVRDQSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQD 151
Query: 154 LLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT---------------EECDPYFD 193
L+CC L CGDG CDG +P +++ HG+ T CD +
Sbjct: 152 PLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYP 211
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDIMAEIYKN 249
+ S P P Y TP C C N W + KH+ + Y + DI EI N
Sbjct: 212 NGTTSVPC--PGYHTPTCEEHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTN 268
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
GPV SF +Y+DF YKSG+Y H GD GG K+IGWG D G YW+C
Sbjct: 269 GPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DSGVPYWLC 318
>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
Length = 274
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/259 (35%), Positives = 123/259 (47%), Gaps = 25/259 (9%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 93
++ L++S I+ +N+ W A N S F +LG K + KTHD
Sbjct: 17 AYFLEESYIEMINDVATT-WTAGVNFDPST-PEKDFIKMLGSKGVEAAKNASAHMFKTHD 74
Query: 94 -----KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
+ +P++FDAR W C TI + DQGHCGSCWA A +DR C+ + N
Sbjct: 75 VANDNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFN 134
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------D 193
LS ++ CC CG GC+GGYPI AW+YF HG+VT E C+PY D
Sbjct: 135 ELLSAEEITFCC-HTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQD 193
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
G S +P +C R C L N H Y + I ++ GP+E
Sbjct: 194 EEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNEDHRFTRDYYYLT-YGSIQKDVMNYGPIE 252
Query: 254 VSFTVYEDFAHYKSGVYKH 272
SF VY+DF YKSGVY+
Sbjct: 253 ASFDVYDDFPSYKSGVYQR 271
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 146/296 (49%), Gaps = 32/296 (10%)
Query: 23 FAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKG 82
FA V+ + + + + ++ +N+N +KA NP Y G+ P K
Sbjct: 4 FATLVLFLIPVAASLSGQELVDYINKN--GLFKAVYNPSAGAYHFGRIN-----DPLRKS 56
Query: 83 LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTI-SRILDQGHCGSCWAFGAVEALSDRFCI 141
L +D S ++P+SFDA WP+C+ + + I DQ +CGSCWA + +SDR C+
Sbjct: 57 TLKKRTEADYDLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGSCWAVSSAGVMSDRICV 116
Query: 142 HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY--- 191
+ +S++ + A + GDGC+GG A+ F+ +G T + C PY
Sbjct: 117 ATNGKVKVSISGI-ATASCVGGDGCNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFK 175
Query: 192 -----FDSTGCSHPGCE--PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIM 243
+ST +P C+ P Y C +C K ++ + +Y Y SD I
Sbjct: 176 HCAHHVNST--EYPPCDSVPEYKADTCSHECQKDYDRKYEEDLYYGKEQYGF-SDEAPIQ 232
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGTSDDGEDYW 298
EI NGPV VSFTVYE F +Y G+Y+ G+ + G HAV+++GWG ++G YW
Sbjct: 233 REIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGYHAVRVVGWGV-ENGTKYW 287
>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
Length = 422
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 164/361 (45%), Gaps = 73/361 (20%)
Query: 8 LTTCLLILGVISSQTF-----AEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP-- 60
L L +LGVI F K + D ++ + ++++VN++P+ WKA N
Sbjct: 34 LLLILAVLGVIYGSYFLYRRYVTDANDKRESDEYLRK--LVRQVNDSPETTWKAKFNKFG 91
Query: 61 --------QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQ 110
+++ +++ ++ + + ++ D KS LPK+FDAR WP
Sbjct: 92 VKNRSYGFKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELDNYKSSDLPKAFDARQKWPN 151
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDG 168
C +IS + +QG CGSC+A A SDR CIH LS D++ CC +CG+ C G
Sbjct: 152 CPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDIIGCCS-VCGN-CYG 209
Query: 169 GYPISAWRYFVHHGVVT---EECDPYFDSTGCSHPGCEPAY-----PTPKCVRKC--VKK 218
G P+ A Y+V+ G+VT + C PY C P C PA C+R+C +
Sbjct: 210 GDPLKALTYWVNQGLVTGGRDGCRPYSFDLSCGVP-CSPATFFEAEEKRTCMRRCQNIYY 268
Query: 219 NQLWRNSKHYSISAYRI-------------------------NSDPEDIMAEIYKN---- 249
Q + KH++ AY + + + E + Y+N
Sbjct: 269 QQRYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNVIKK 328
Query: 250 -----GPVEVSFTVYEDFAHYKSGVYKHITGD-----VMGGHAVKLIGWGTSDDGEDYWV 299
GP ++F V E+F HY SGV++ D ++ H V+LIGWG S+DG YW+
Sbjct: 329 EILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGQSEDGTHYWL 388
Query: 300 C 300
Sbjct: 389 A 389
>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
Length = 345
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 160/326 (49%), Gaps = 64/326 (19%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIK------------EVNENPKAGWKAARN 59
+L+LGV + +V+ L + H ++ +I E+ ENP K+ ++
Sbjct: 10 ILLLGVTT-------LVNGLNFNKHPVRQEVIDRIKNSNVSWTPFEIEENPFKN-KSLQS 61
Query: 60 PQFSNYTVGQFKHLLGVKPTPKGL--------------LLGVPVKTHDKSLK------LP 99
+ +G K G++ K L L G + D+ L LP
Sbjct: 62 MRNMGGNLGYIKEESGIQGNIKHLKSKFFQELKKMGHKLKGEHIHVQDEGLNPKLGASLP 121
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLAC 157
+++ ++A+P C ILDQ +CGSCWA AV L +RFCI G +N+ S D+++C
Sbjct: 122 TAYNTKTAFPSCP--HTILDQANCGSCWAHAAVTMLQNRFCIKSGGSINMQFSRQDMVSC 179
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
L C+GGY S+ +Y GVV+E+C Y + G S P+C +C
Sbjct: 180 D--LGNAACNGGYLSSSVQYLQTEGVVSEQCLAYASADGNS---------VPRCNYRCDD 228
Query: 218 KNQLWRNSKHY--SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 275
K+ + K Y ++ +I + EDI EIY NGPV V F VY+DF+ Y +G+Y+ +T
Sbjct: 229 KSLEY---KKYGCKYNSMKILTTYEDIKEEIYTNGPVMVGFVVYDDFSSYSTGIYE-VTP 284
Query: 276 DVM--GGHAVKLIGWGTSDDGEDYWV 299
D + GGHAV L GWG D+G YW+
Sbjct: 285 DSVEEGGHAVTLNGWGY-DNGRLYWI 309
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 94/250 (37%), Positives = 124/250 (49%), Gaps = 18/250 (7%)
Query: 66 TVGQFKHLLGVKPTP----KGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 121
T F +L + P K L + + L LPKSFDAR WPQCS+++ I QG
Sbjct: 26 TTSPFAWILDLPGVPLEKLKETRLHPAINVFAEDLVLPKSFDARQQWPQCSSLNEIRTQG 85
Query: 122 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
CGSC A++DR+CIH + DLL+CC G GG P W Y+V
Sbjct: 86 CCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDLLSCCYECGGGCTGGGIPGPIWSYWV 145
Query: 180 HHGVVT-------EECDPYFDSTGCSHPGCEPAYP-TPKCVRKCVKKNQLWRN--SKHYS 229
GV + + C PY C P E YP P C +C + + + +
Sbjct: 146 KQGVSSGGPYGSNQGCHPYPMPPSCPKPS-EGDYPDEPNCSTRCNAGYNVTEDLRDRRFG 204
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
AY I +D IM +I+ NGPV+ F YED +Y GVY+H +G + GGHAVKLIGWG
Sbjct: 205 RVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVKLIGWG 264
Query: 290 TSDDGEDYWV 299
+DG YW+
Sbjct: 265 V-EDGTKYWL 273
>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
Length = 426
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 151/333 (45%), Gaps = 68/333 (20%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNP-QFSNYTVGQFKHLLGVKPTP------KGLL 84
K +S ++++VN++P+ WKA N N + G FK+ +
Sbjct: 65 KRESDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYG-FKYTRNQTAVEEYMEHIRKFF 123
Query: 85 LGVPVKTHDKSLK------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 138
+K H + L+ LPK FDAR WP C +IS + +QG CGSC+A A SDR
Sbjct: 124 ESDAMKRHLEELENYKSSSLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDR 183
Query: 139 FCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EECDPYFD 193
CIH LS D++ CC +CG+ C GG P+ A Y+V+ G+VT + C PY
Sbjct: 184 ACIHSNGTFKSLLSEEDIIGCCS-VCGN-CYGGDPLKALTYWVNQGLVTGGRDGCRPYSF 241
Query: 194 STGCSHPGCEPAY-----PTPKCVRKC--VKKNQLWRNSKHYSISAYRI----------- 235
C P C PA C+R+C + Q + KH++ AY +
Sbjct: 242 DLSCGVP-CSPATFFEAEEKRTCMRRCQNIYYQQKYEEDKHFATFAYSLYPRSMTVSPDG 300
Query: 236 --------------NSDPEDIMAEIYKN---------GPVEVSFTVYEDFAHYKSGVYKH 272
+ + E + Y+N GP ++F V E+F HY SGV++
Sbjct: 301 KERVKVPTIIGHFNDKNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRP 360
Query: 273 ITGD-----VMGGHAVKLIGWGTSDDGEDYWVC 300
D ++ H V+LIGWG SDDG+ YW+
Sbjct: 361 FPLDGFDDRIVYWHVVRLIGWGESDDGQHYWLA 393
>gi|21697|emb|CAA46813.1| cathepsin B [Triticum aestivum]
Length = 130
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 63/96 (65%), Positives = 72/96 (75%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL 96
I+Q II+ VN +P AGW A NP +NYT+ QFKH+LGVKPTP GL V KTH +S
Sbjct: 35 IIQKDIIQTVNNHPNAGWTAGHNPYLANYTIEQFKHMLGVKPTPPGLRAAVRTKTHSRSE 94
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
+LPK FDARS W CSTI +ILDQGHCGSCWAFGAV
Sbjct: 95 QLPKVFDARSKWSGCSTIGKILDQGHCGSCWAFGAV 130
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 102/295 (34%), Positives = 139/295 (47%), Gaps = 49/295 (16%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV---KPTPKGLLLGVPVKTHDK 94
+ S++ E+N A + +F ++ K L G KP + + T D+
Sbjct: 80 IMQSLVDEINSKQNAWMASIEQERFKGASMSDAKRLCGTWLEKPEN----IREKLYTADE 135
Query: 95 SLKLPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
LP SF+A + +CS+ I I DQ CGSCWAF EA +DR CI N + LS
Sbjct: 136 LKDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSP 195
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCS 198
++ AC GC GG + AW++ GVVT + C PY D C+
Sbjct: 196 GNVAACSK---TSGCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPY-DIPPCA 251
Query: 199 H-------PGC-EPAYPTPKCVRKCVKK--NQLWRNSKHY----SISAYRINSDPEDIMA 244
H P C + Y P C C K + +H+ S+SA R + I
Sbjct: 252 HYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALR---SIDAIKK 308
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI NGPV S+ VY+DF YKSGVYK + + +GGHAVK+IGW GEDYW+
Sbjct: 309 EIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGW-----GEDYWL 358
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 94/270 (34%), Positives = 134/270 (49%), Gaps = 30/270 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLK 97
LQ +I+E+N + WKA N +G LG+ P P + K H +
Sbjct: 24 LQPQLIQEINSR-QTSWKAGTNSLDIKSRLG----FLGLHPDPD---YKIQTKHHKIAKS 75
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
+P+SFDAR WP+C I +I DQG CGSCWAF + E ++DR CI S +L
Sbjct: 76 IPESFDAREKWPECKDVIGKIRDQGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENL 135
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKC 211
L CC C C GGY AW Y+++ G+V+ Y S GC P + ++ KC
Sbjct: 136 LTCCED-CRLECVGGYTAKAWDYYINEGIVSG--GDYNSSEGC-QPYSKASFQYAVASKC 191
Query: 212 VRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
V+ C K + + + KHY S Y + ++ I EI NGPV +F V+ED +YKSG+
Sbjct: 192 VKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFNVFEDIIYYKSGI 251
Query: 270 YKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V ++ WGT ++G YW+
Sbjct: 252 QL---------SNVSILRWGT-EEGVPYWL 271
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/180 (41%), Positives = 105/180 (58%), Gaps = 19/180 (10%)
Query: 136 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 186
SDR CIH + +++S DLL CC CG GC+GGYP +AW+++ G+VT +
Sbjct: 1 SDRICIHTKGKVQVNISAEDLLTCCD-SCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTED 59
Query: 187 ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDP 239
C PY+ C H P C PTP+C + C + + + KH+ Y I+SD
Sbjct: 60 GCQPYYFPP-CEHHTVGPLPNCTGIKPTPECAKTCREGYEKSYTRDKHFGKKVYSISSDE 118
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
I EI KNGPVE F VY DF YKSGVY+ + +++GGHA++++GWGT +DG YW+
Sbjct: 119 TQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGT-EDGVPYWL 177
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 107/188 (56%), Gaps = 19/188 (10%)
Query: 128 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
AFGAVEA+SDR CIH + +S DL++CCG+ CG GC GG+P +AW ++ G+VT
Sbjct: 1 AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVT 59
Query: 186 -------EECDPYFDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRNSKHYSIS 231
C Y CSH G + Y TP CV+KC + + K +
Sbjct: 60 GGSKENPTGCRSY-PFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANI 118
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 291
Y + + IM EI NGPVE +F VYEDF YKSGVY H G ++GGHA++++GWG
Sbjct: 119 TYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-E 177
Query: 292 DDGEDYWV 299
++G YW+
Sbjct: 178 ENGVAYWL 185
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 103/277 (37%), Positives = 140/277 (50%), Gaps = 30/277 (10%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK-GLLLGVPVKTHDKSLKLP 99
S+I ++N A W A NP F + + LG+ P P + P T + +P
Sbjct: 21 SLINQINSQQSA-WTAGINP-FDD--IESRLGFLGIHPDPNFKPEIKEPQATQNV---IP 73
Query: 100 KSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLA 156
++FDAR WP+C+ I I +QG C S WAF A E +SDR CI + + + LS DL+
Sbjct: 74 ETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLID 133
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRK 214
CC + CG+ C GGY AW YF+ G+V+ Y STGC P E Y TP C
Sbjct: 134 CCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTT 189
Query: 215 CV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYK-NGPVEVSFTVYEDFAHYK----- 266
C K + + KH+ S Y I + I EI GPV +F VY DF Y+
Sbjct: 190 CQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGEQH 249
Query: 267 ----SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY + +G + G AVK+IGWGT ++G YW+
Sbjct: 250 DTILEGVYIYTSGALFGRTAVKIIGWGT-ENGWAYWL 285
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 141/307 (45%), Gaps = 41/307 (13%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQF 70
CLL+L S+T+ H L ++ +N+ P +A N F +
Sbjct: 23 CLLVLASAGSRTYL-----------HPLSKXLVNYINK-PNTMQQAGHN--FHKMXISYL 68
Query: 71 KHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFG 130
+ G P L V + LP+SFD WP I DQG G CWA G
Sbjct: 69 RRPCGTFPGRSKLPQRVKFAX---DINLPESFDPXEQWPD-XPXREIRDQGSYGFCWALG 124
Query: 131 AVEALSDRFCIH-------FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 183
A+EA+SD CIH G ++ +S D L C LCGDGC+GG P W ++ G+
Sbjct: 125 ALEAISDWICIHPNVGGAQGGNHVEVSAEDKLTC---LCGDGCNGGXPNEGWNFWTGKGL 181
Query: 184 VTEE-------CDPYFDSTGCSHPGCEPAY----PTPKCVRKCVKKNQLWRNSKHYSISA 232
V+ C + C H Y +PKC C + Q ++ KHY S+
Sbjct: 182 VSGGLYDSHVGCRLFPSLLPCKHHIHGXPYVXTGDSPKCSMTC-EPGQTYKXDKHYGCSS 240
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
Y I+ +DIM IYKN VE +F+VY DF YK Y+ +TG++ GGHA+ ++G +
Sbjct: 241 YSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKV-E 299
Query: 293 DGEDYWV 299
+ YW+
Sbjct: 300 NSTSYWL 306
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 102/172 (59%), Gaps = 17/172 (9%)
Query: 124 GSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHH 181
GSCWAFGA EA+SDR CIH +S+ ++ DLLACC CG GC+GGYP +AW ++
Sbjct: 1 GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCD-SCGMGCNGGYPSAAWDFWTDV 59
Query: 182 GVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 227
G+V+ C PY G P TP+C+ +C ++ KH
Sbjct: 60 GLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKH 119
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 279
Y S+Y + SD E I +EIYKNGPVE +FTVYEDF YK+GVY+H+TG +G
Sbjct: 120 YGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 104/281 (37%), Positives = 142/281 (50%), Gaps = 24/281 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 94
L D IK +NE K WKA R +N + F LLG + K V +K +D
Sbjct: 23 QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEVEIKKYDPL 79
Query: 95 --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS-LSV 151
P+ FD+R+ W C I I DQG+CGSCW+F A +DR C+ G + L
Sbjct: 80 YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSH 199
+ LA C CG GC GGYPI AW+YF GV T E C PY ++ G +
Sbjct: 140 PEELAFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNT 199
Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
G +P +C + C K + +++ + S Y INS + I +I GPVE SF VY
Sbjct: 200 CGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYVINS-IKTIERDIMTYGPVEASFDVY 256
Query: 260 EDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
+D + YKSG+Y+ GGH++K+IGWG +G YW+
Sbjct: 257 DDLSAYKSGIYRKTPKAKYQGGHSIKIIGWG-QQNGTPYWL 296
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 137 bits (344), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 97/286 (33%), Positives = 131/286 (45%), Gaps = 34/286 (11%)
Query: 38 LQDSI-IKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 95
L DSI I +VNE+ GW+A+ T + + LG P + L V +
Sbjct: 135 LVDSITISDVNEDYYLGWRASNYSFLWGLTQAEGVLYRLGTFPPGRALSEMAEVNIDTEG 194
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVND 153
+LP++FDAR WP I ++DQG CGS WA SDR I +N LS
Sbjct: 195 ARLPETFDARENWP--GLIDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTP 209
LL+C GC GGY AW + G V+ C PY + T C AY +
Sbjct: 253 LLSC-NIRGQRGCSGGYLDRAWYHLRRAGAVSRACYPYHSGLDEDTIMQKLRCRVAYGSS 311
Query: 210 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
+C + V + + S YRI + DIM EIY+NGPV+ +F V DF Y GV
Sbjct: 312 QCPERGVTSD------LYLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGV 365
Query: 270 YKHIT---------GDVMGGHAVKLIGWGTSDDGED------YWVC 300
Y+++ D G H+VK++GWG D D YW+C
Sbjct: 366 YRNVKQEFTASQSDSDQAGWHSVKIVGWGI--DRSDWYNPIKYWLC 409
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/178 (44%), Positives = 103/178 (57%), Gaps = 16/178 (8%)
Query: 130 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 185
GAVEA+SDR CIH N SLS DLL+CC CG GCDGG+P AW ++ HG+VT
Sbjct: 1 GAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCK-DCGYGCDGGFPPMAWDFWKTHGIVTGG 59
Query: 186 --EE---CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 234
EE C PY S G P YPTPKCV+ C ++ K + ++Y
Sbjct: 60 SKEEPAGCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYN 119
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
++ IM EI NGPVE +F V+EDF YKSG+Y H G +GGHA++++GWG +
Sbjct: 120 VHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEEN 177
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/255 (36%), Positives = 130/255 (50%), Gaps = 25/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL VP T + K+P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPISFLNRDRAAVPRGTIADT-KVPDSFDFREEY 84
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V +L DR C G++ ++ S +++C GD
Sbjct: 85 PHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFA-GLDKKAVTYSPQYVVSCDH---GDM 138
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
CDGG+ S WR+ G T EC PY T + C PT KC +L
Sbjct: 139 ACDGGWLQSVWRFLTKTGTTTNECVPYQSGTTGARGTC----PT-----KCADGGEL--- 186
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
S + A D + IM + GP++ +FTVY DF +Y+ GVY+H++G V GGHAV+
Sbjct: 187 STVKAKKAVDYGLDCDLIMKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVE 246
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT + DYW+
Sbjct: 247 MVGYGTDEYDVDYWI 261
>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
Length = 410
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 165/365 (45%), Gaps = 75/365 (20%)
Query: 4 SHLFLTTCLLILGVISS-----QTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAAR 58
+++ L + ++ V+ + + V K + D ++ + ++++VN++P+ WKA
Sbjct: 18 NYILLNLIITVIAVVYGSYYLYRRYVTDVNDKRENDEYLRK--LVRQVNDSPETTWKAKF 75
Query: 59 NP-QFSNYTVGQFKHLLGVKPTP------KGLLLGVPVKTHDKSLK------LPKSFDAR 105
N N + G FK+ + +K H + L+ LPK FDAR
Sbjct: 76 NKFGVKNRSYG-FKYTRNQTAVEEYMEHIRKFFESDAMKRHLEELENYKSSDLPKHFDAR 134
Query: 106 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG 163
WP C +IS + +QG CGSC+A A SDR CIH LS D++ CC +CG
Sbjct: 135 QKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKALLSEEDIIGCCS-VCG 193
Query: 164 DGCDGGYPISAWRYFVHHGVVT---EECDPYFDSTGCSHPGCEPAY-----PTPKCVRKC 215
+ C GG P+ A Y+V+ G+VT + C PY C P C PA C+R+C
Sbjct: 194 N-CYGGDPLKALTYWVNQGLVTGGRDGCRPYSFDLSCGVP-CSPATFFEAEEKRTCMRRC 251
Query: 216 --VKKNQLWRNSKHYSISAYRI-------------------------NSDPEDIMAEIYK 248
+ Q + KH++ AY + + + E + Y+
Sbjct: 252 QNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYR 311
Query: 249 N---------GPVEVSFTVYEDFAHYKSGVYKHITGD-----VMGGHAVKLIGWGTSDDG 294
N GP ++F V E+F HY SGV++ D ++ H V+LIGWG S DG
Sbjct: 312 NVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGESGDG 371
Query: 295 EDYWV 299
+ YW+
Sbjct: 372 QHYWL 376
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 103/281 (36%), Positives = 143/281 (50%), Gaps = 24/281 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK- 94
L D IK +NE K WKA R +N + F LLG + K V +K +D
Sbjct: 23 QFLSDERIKYINEVAKT-WKAERYFP-ANTSEEYFIGLLGSRGY-KNYTNEVEIKKYDPL 79
Query: 95 --SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS-LSV 151
P+ FD+R+ W C I I DQG+CGSCW+F A +DR C+ G + L
Sbjct: 80 YVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLS 139
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSH 199
+ LA C CG GC GGYPI AW+YF GV T E C PY ++ G +
Sbjct: 140 PEELAFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNT 199
Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
G +P +C + C K + +++ + S Y +NS + I ++ GPVE SF VY
Sbjct: 200 CGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYVMNS-IKTIEQDLKTYGPVEASFDVY 256
Query: 260 EDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
+DF+ YKSG+Y+ GGH++K+IGWG +G YW+
Sbjct: 257 DDFSVYKSGIYRKTPKAKYQGGHSIKIIGWG-QQNGTPYWL 296
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 132/273 (48%), Gaps = 26/273 (9%)
Query: 31 LKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV--KPTPKGLLLGVP 88
L+ I ++KE+ W A N +F T + G K P + L P
Sbjct: 2 FNLEEKIQGSKLLKELKGEKDLPWVAGENERFKGMTFKDASVISGNAHKLRPDTIPLARP 61
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS 148
K + + +P S++ +PQC +LDQG CGSCW+F ++ S R+C + +
Sbjct: 62 PKIN---ISIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSKSFSHRYCRKYNKPVL 116
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 208
S + L+AC GC GG ++AWRY G+ + C PY +
Sbjct: 117 FSQSHLVACD--RRNSGCGGGIEVNAWRYIDLRGLPLDSCQPY-----------DGNITK 163
Query: 209 PKCVRKCVKKNQLWRN--SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
C +KC +++ + ++++S++ Y + E++ I GPV S VY D +YK
Sbjct: 164 YNCSKKCTNESETYEAQFTEYWSVARY---ASIEEMQIGIMTEGPVTTSLKVYSDLMYYK 220
Query: 267 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
SG+Y H G+ +G HAV++IGWGT +G DYW+
Sbjct: 221 SGIYTHTKGEFLGHHAVEIIGWGTK-NGIDYWI 252
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/274 (35%), Positives = 140/274 (51%), Gaps = 22/274 (8%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTH 92
D ++ +S+++ VN + W+A P+F N + + + LG P P++ +
Sbjct: 127 DPCLMSNSVVEGVNRG-GSSWRAYNYPEFRNKKLKEGLIYKLGTFPLNAETRRMGPLR-Y 184
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLS 150
DK + P FDAR+ WP IS I+DQG CGS WA SDRF I N+ LS
Sbjct: 185 DKDVPYPTQFDARTRWP--GFISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVLS 242
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTP 209
LL+C GC GG+ AW + HG+V E+C PY S T C P P
Sbjct: 243 PQTLLSC-NVRAQQGCHGGHIDVAWNFARGHGLVDEKCFPYKASVTRC------PFRPRG 295
Query: 210 KCVRK-CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
++ C+ + R + Y + S +DIM +I ++GPV+ TVY+DF HY+ G
Sbjct: 296 NLIQDGCMP--LVKRRTSRYKLGPPAKLSHEKDIMYDIMESGPVQAVMTVYQDFFHYRDG 353
Query: 269 VYK---HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY+ H ++ G H+V++IGWG D G+ YWV
Sbjct: 354 VYRRSYHGNNELKGFHSVRIIGWG-EDRGDRYWV 386
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 86/211 (40%), Positives = 111/211 (52%), Gaps = 33/211 (15%)
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG---FLCGDGCDGGYPIS 173
DQ CGSCWAFG VEA + R CI G +N LS ++LACC F GC GG PI+
Sbjct: 1 DQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPIT 60
Query: 174 AWRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCV 212
+W + +G+V+ + C PY C+H P + Y TP C
Sbjct: 61 SWTFLHTNGIVSGGGFVPEKNMKAADGCWPY-SFPKCAHHQDGSDYKPCAKEIYDTPSCS 119
Query: 213 RKC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C K + +HY+ S + R S I EI NGP +F+VYEDF YKSG
Sbjct: 120 SSCPNAKYGTAFDKDRHYTESLFPSRFGS-TSSIKKEIMTNGPTSAAFSVYEDFLSYKSG 178
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYKH +G +GGHAV++IGWGT + G DYW+
Sbjct: 179 VYKHTSGGFLGGHAVEIIGWGT-EKGVDYWL 208
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/255 (35%), Positives = 131/255 (51%), Gaps = 25/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K VP T + ++P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQVPDSFDFREEY 84
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V ++ DR C+ G++ + S +++C GD
Sbjct: 85 PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCVA-GLDKKAVRYSPQYVVSCDR---GDM 138
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
CDGG+ S WR+ V G T+EC PY G A T C KC ++L
Sbjct: 139 ACDGGWLPSVWRFLVKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSEL--- 186
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+ + A D + IM + GP++ +FTVY DF +Y+ GVY+H+ G GGHAV+
Sbjct: 187 PIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVE 246
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT + DYW+
Sbjct: 247 MVGYGTDEYDVDYWI 261
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 104/181 (57%), Gaps = 18/181 (9%)
Query: 135 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 185
++DR CI G S LS DL++CC CGDGC GG+P AW Y+V G+VT
Sbjct: 1 MTDRICIQSGGQQSAELSALDLISCC-EDCGDGCQGGFPGQAWDYWVTQGIVTGGSKENH 59
Query: 186 EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 238
C PY T +P C Y TP+C + C K + + KHY +Y + S+
Sbjct: 60 TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISN 119
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ I EI NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYW 178
Query: 299 V 299
+
Sbjct: 179 L 179
>gi|403331769|gb|EJY64852.1| hypothetical protein OXYTRI_15000 [Oxytricha trifallax]
Length = 259
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 21/228 (9%)
Query: 76 VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 135
+KP P L + + + LP SFD+ WP C +R +QG CGSC+AF A +
Sbjct: 11 IKPQPSSYSLNLNITQKLLASNLPLSFDSTVEWPDCIHATR--NQGSCGSCYAFAASGMM 68
Query: 136 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-- 191
SDR CI +NL LS +L++C GC GG+ + Y + +G+ +E C PY
Sbjct: 69 SDRLCIKSNGQINLVLSPQELVSC--DYQNYGCSGGWMTNTLYYLMSYGIPSETCLPYDM 126
Query: 192 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
F+S T C +C N + K ++ +I SDPE IM +I +NGP
Sbjct: 127 FNSE------------TKACSGRCDSPNYEYTRHKCKKGTS-KIMSDPETIMRDIMENGP 173
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+F +EDF ++ G+YK+ +G + GHA KL GWG G YW+
Sbjct: 174 SIVAFQAFEDFLNFGGGIYKYTSGKFLVGHATKLTGWGLDSAGRLYWI 221
>gi|3859607|gb|AAC72873.1| contains similarity to cysteine proteases (Pfam: PF00112, E=.21,
N=1) [Arabidopsis thaliana]
gi|7268204|emb|CAB77731.1| putative cysteine protease [Arabidopsis thaliana]
Length = 129
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/96 (66%), Positives = 76/96 (79%)
Query: 28 VSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV 87
++K KLDS ILQD I+K+VNENP AGWKAA N +FSN TV +FK LLGVKPTPK LGV
Sbjct: 33 LTKQKLDSKILQDEIVKKVNENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV 92
Query: 88 PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 123
P+ +HD SLKLPK+FDAR+AWPQC++I IL C
Sbjct: 93 PIVSHDPSLKLPKAFDARTAWPQCTSIGNILGLVLC 128
>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
Length = 237
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/195 (41%), Positives = 102/195 (52%), Gaps = 30/195 (15%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
H L D +I +N+ W+A RN F N + K L G +LG P S
Sbjct: 24 HPLSDDLINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGP--KLPGS 71
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+ LP++FDAR W C TI +I DQG CGSCWAFGAVEA+SDR CIH +N+ +S D
Sbjct: 72 IDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAED 131
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SH 199
LL CCG CGDGC+GGYP AW ++ G+V+ Y GC S
Sbjct: 132 LLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGV--YNSHVGCLPYTIPPCEHHVNGSR 189
Query: 200 PGCEPAYPTPKCVRK 214
P C TP+C +K
Sbjct: 190 PPCTGEGDTPRCNKK 204
>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
Length = 210
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 81/196 (41%), Positives = 102/196 (52%), Gaps = 28/196 (14%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH--- 92
H L D +I +N+ W+A RN F N + K L G +LG P
Sbjct: 24 HPLSDDMINYINKQ-NTTWQAGRN--FYNVDISYLKKLCGT-------VLGGPKLPERVG 73
Query: 93 -DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSL 149
+ + LP+SFDAR W C TI++I DQG CGSCWAFGAVEA+SDR CIH +N+ +
Sbjct: 74 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEV 133
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGC 197
S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 134 SAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNG 193
Query: 198 SHPGCEPAYPTPKCVR 213
S P C TPKC +
Sbjct: 194 SRPPCTGEGDTPKCNK 209
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 104/181 (57%), Gaps = 18/181 (9%)
Query: 135 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 185
++DR CI G S LS DL++CC CG GC GG+P AW Y+V G+VT
Sbjct: 1 MTDRICIQSGGGQSAELSALDLISCC-EDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENH 59
Query: 186 EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 238
C PY T +P C Y TP+C +KC K + ++ KHY +Y + S+
Sbjct: 60 TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISN 119
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ I EI NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKR-TPYW 178
Query: 299 V 299
+
Sbjct: 179 L 179
>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 261
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/180 (43%), Positives = 100/180 (55%), Gaps = 24/180 (13%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L D ++ +N+ WKA N F N + K L G LG P
Sbjct: 26 LSDDLVNHINKL-NTTWKAGHN--FHNADMSYVKKLCGT-------FLGGPKLPERVDFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
++LP +FD+R+ WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 ADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
DLL+CCGF CG GC+GGYP AWRY+ G+V+ +D SH GC P Y P C
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGG---LYD----SHVGCRP-YSIPPC 187
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 85/252 (33%), Positives = 123/252 (48%), Gaps = 25/252 (9%)
Query: 59 NPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL--KLPKSFDARSAWPQCSTISR 116
NP FS + + +G K + ++++L KLPK FD+R WP+C I
Sbjct: 239 NPYFSGMSKEEILIRMGTKLMNSSTEFDSKLSNNNEALIKKLPKHFDSREKWPECEWIRF 298
Query: 117 ILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISA 174
I DQ +CGSCWA A ++DR CI + ++D +LAC G S
Sbjct: 299 IRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQILAC-----------GMIPSP 347
Query: 175 WRYFVHHGVVTEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKH 227
+ Y+ G+ T PY D + C C TP C C + + K
Sbjct: 348 FNYWKKMGIATG--GPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYDIPISDDKF 405
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y+ Y ++S+ +IM EIY +GPV F VYEDF +Y SG+Y+ T MGGHA+++IG
Sbjct: 406 YASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRIIG 465
Query: 288 WGTSDDGEDYWV 299
WG ++G YW+
Sbjct: 466 WG-EENGIPYWL 476
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 62/132 (46%), Gaps = 13/132 (9%)
Query: 165 GCDGGYPISAWRYFVHHGVVT-----EE--CDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
GC G +A+ Y+ G+VT E+ C PY S C+ C P PKC R C
Sbjct: 69 GCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISP-CTM--CRPYMLAPKCQRTCQA 125
Query: 218 KNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
L + K+Y S Y +N D DIM EIY+ GPV F VY DF +Y SG + I G+
Sbjct: 126 SYNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQF--ICGN 183
Query: 277 VMGGHAVKLIGW 288
L W
Sbjct: 184 KRCEEEENLTSW 195
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 88/255 (34%), Positives = 128/255 (50%), Gaps = 23/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K P T +P+SFD R +
Sbjct: 28 NPR--WKAGIPRRFEGLTKDEISSLLMPVSFLKSAKGAAPRGTFADKDDVPESFDFREEY 85
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACC-GFLCGD 164
P C I ++DQG CGSCWAF +V DR CI G++ + S +++C G +
Sbjct: 86 PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIA-GLDKKPVKYSPQYVVSCDHGNMA-- 140
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
C+GG+ +AW++ G T+EC PY + C PT KC +
Sbjct: 141 -CNGGWLPNAWKFLTKTGTTTDECVPYQSGSTTLRGTC----PT-----KCADGSSKVHL 190
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+ S Y + D +M + GP++V+F VY DF +Y+SGVY+H G + GGHAV+
Sbjct: 191 TTATSYKDYGL--DIPAMMKALSTTGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVE 248
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT DDG DYW+
Sbjct: 249 MVGYGTDDDGVDYWI 263
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 80/195 (41%), Positives = 107/195 (54%), Gaps = 22/195 (11%)
Query: 125 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA A+SDR CI + +S D+++CC + CG GC+GG+PI AW+Y V G
Sbjct: 1 SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTW-CGAGCEGGWPIEAWKYGVTEG 59
Query: 183 VVT------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVK--KNQLWRNSKH 227
VVT +EC ++ C + G EP Y TP C ++C KN + K
Sbjct: 60 VVTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRCRPGYKNSYMMD-KR 118
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y SAY + + I +I +NGPV F VYEDF +YKSG+Y+H G GGHAVK+IG
Sbjct: 119 YGTSAYELPNSVXAIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIG 178
Query: 288 WG---TSDDGEDYWV 299
WG T + YW+
Sbjct: 179 WGEEXTENGTIPYWI 193
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 155/334 (46%), Gaps = 56/334 (16%)
Query: 6 LFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNY 65
+ TT ++L ++SS L D++ +I+ VN N W+A N +N
Sbjct: 1 MLRTTMKIVLLLVSS--------FWLTCDANDKLHNIVTHVN-NANVTWQAGINSFHTN- 50
Query: 66 TVGQFKHLLGVKPTPKGLLL------GVPVKTHDKSL----------KLPKSFDARSAWP 109
K L+G P+ + L GV VK D + P+SFDAR W
Sbjct: 51 ---DHKKLVGTFYHPEWIGLEHETFDGVLVKGGDCDNDDEDDGGDANETPESFDARYHWF 107
Query: 110 QCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCD 167
C++IS I +QG+C + WA A++DR CI N++ S L++CC CG+GC
Sbjct: 108 NCTSISHIWNQGNCAADWAISVTSAMNDRICIASQGNITALYSPQKLVSCCE-DCGNGCS 166
Query: 168 GGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHP----------GCEPA 205
GGY +AWRY + G+VT E C P+ ST + P G +PA
Sbjct: 167 GGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPA 226
Query: 206 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
TPKC C + + D + K+GP V+ VYEDF Y
Sbjct: 227 -TTPKCDLSCYNARHEGKYLDDIIKAKKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAY 285
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KSGVY H+TGD +G +V++IGWG + G+ +W+
Sbjct: 286 KSGVYHHVTGDYLGLLSVRMIGWGL-EGGQAFWL 318
>gi|17510377|ref|NP_490763.1| Protein Y65B4A.2 [Caenorhabditis elegans]
gi|373220066|emb|CCD71920.1| Protein Y65B4A.2 [Caenorhabditis elegans]
Length = 421
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 153/333 (45%), Gaps = 68/333 (20%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNP-QFSNYTVGQFKHLLGVKPTP------KGLL 84
K D+ ++++VN++P+ WKA N N + G FK+ +
Sbjct: 60 KRDNDEYLRKLVRQVNDSPETTWKAKFNKFGVKNRSYG-FKYTRNQTAVEEYVEQIRKFF 118
Query: 85 LGVPVKTHDKSLK------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 138
+K H L+ +PK+FDAR WP C +IS + +QG CGSC+A A SDR
Sbjct: 119 ESDAMKRHLDELENFNSSDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDR 178
Query: 139 FCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EECDPYFD 193
CIH LS D++ CC +CG+ C GG P+ A Y+V+ G+VT + C PY
Sbjct: 179 ACIHSNGTFKSLLSEEDIIGCCS-VCGN-CYGGDPLKALTYWVNQGLVTGGRDGCRPYSF 236
Query: 194 STGCSHPGCEPA-YPTPKCVRKCVKK------NQLWRNSKHYSISAY------------- 233
C P C PA + + R C+K+ Q + KH++ AY
Sbjct: 237 DLSCGVP-CSPATFFEAEEKRTCMKRCQNIYYQQKYEEDKHFATFAYSMYPRSMTVSPDG 295
Query: 234 -------------------RIN-SDPEDIM-AEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
++N ++ DI+ EI GP ++F V E+F HY SGV++
Sbjct: 296 KERVKVPTIIGHFNDKKTEKLNVTEYRDIIKKEILLYGPTTMAFPVPEEFLHYSSGVFRP 355
Query: 273 ITGD-----VMGGHAVKLIGWGTSDDGEDYWVC 300
D ++ H V+LIGWG SDDG YW+
Sbjct: 356 YPTDGFDDRIVYWHVVRLIGWGESDDGTHYWLA 388
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 101/278 (36%), Positives = 136/278 (48%), Gaps = 31/278 (11%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSL 96
++ S+I+ +N GW+AA F + KH LG + + + K
Sbjct: 120 VRPSLIQAINHG-GFGWRAANYTTFWGMKLTDAVKHKLGTLKVERDVHTMTEIDIKMKK- 177
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
K+PKSFDAR W S I+ ILDQG+C S WAF V SDR I ++LS L
Sbjct: 178 KIPKSFDARDKWG--SMITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLSPQHL 235
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTG-CSHPGCEPAYPTP 209
L+C GC GG+ AW + GVV+ +C PY D G C PG P+
Sbjct: 236 LSC-NTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYPYTSGDQDKKGVCMMPGKLPS---- 290
Query: 210 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
C + N+L H+S YRI ++ +I EI +NGPV+ SF V EDF Y SGV
Sbjct: 291 DCPTGRERNNEL-----HHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGV 345
Query: 270 YKHI---TGDVMGGHA-----VKLIGWGTSDDGEDYWV 299
Y+H + D HA VKL+GWG ++G YW+
Sbjct: 346 YRHTPIASNDAEQYHASEWHSVKLLGWGV-ENGIKYWL 382
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 87/255 (34%), Positives = 128/255 (50%), Gaps = 23/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K P T +P+SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V DR C+ G++ + S +++C GD
Sbjct: 86 PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDM 139
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
C+GG+ + W++ G T+EC PY + C PT KC +
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+ S Y + D +M + +GP++V+F VY DF +Y+SGVY+H G + GGHAV+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVE 248
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT DDG DYW+
Sbjct: 249 MVGYGTDDDGVDYWI 263
>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 236
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/180 (43%), Positives = 100/180 (55%), Gaps = 24/180 (13%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L D ++ +N+ WKA N F N + K L G LG P
Sbjct: 26 LSDDLVNHINKL-NTTWKAGHN--FHNADMSYVKKLCGT-------FLGGPKLPERVDFA 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
++LP +FD+R+ WP C TIS I DQG CGSCWAFGAVEA+SDR C+H +S+ V+
Sbjct: 76 ADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
DLL+CCGF CG GC+GGYP AWRY+ G+V+ +D SH GC P Y P C
Sbjct: 136 EDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGG---LYD----SHVGCRP-YSIPPC 187
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/283 (33%), Positives = 135/283 (47%), Gaps = 21/283 (7%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 81
EG + D + D+I+ VN + GW A + Q+ Y+ G K LG K PT +
Sbjct: 115 EGGSVQCDEDLCLTDDAIVHSVNSIHRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
+ + + + LP SF+A W S IS + DQG CG+ W SDRF I
Sbjct: 174 ---VKAMTRLKNPTDGLPNSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228
Query: 142 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
N+ LS ++L+C GC+GG+ +AWRY GVV E C PY
Sbjct: 229 QSKGKENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYTQ----HR 282
Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
C+ + C K + R+S + AY +N + DIMAEI+ +GPV+ + V
Sbjct: 283 DTCKIRHSRSLKANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVN 341
Query: 260 EDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWV 299
DF Y GVY+ + G H+VKL+GWG +GE YW+
Sbjct: 342 RDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWI 384
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/271 (38%), Positives = 133/271 (49%), Gaps = 32/271 (11%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGV-----KP--TPKGLLLGVPVKTHDKSLKLPKSFDARS 106
WKA N +Y +F ++G+ KP TP L P S LP FD+R
Sbjct: 5 WKADYN--IDSYIDNRFLGMMGINYSELKPNVTPD---LEPPFVVSKISENLPDEFDSRV 59
Query: 107 AWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGD 164
WP C TI I DQG CG+CWAF A EA+SDR CIH + S +LL+CC C
Sbjct: 60 RWPNCPTIREIRDQGSCGACWAFAAAEAMSDRVCIHSSQTKHFHFSALNLLSCCD-SCEK 118
Query: 165 GCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKC 211
GC G AW ++V HG+V+ E C PY C H C PTP C
Sbjct: 119 GCLGCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYH-LPPCEHHRAGPRRNCTKYGPTPSC 177
Query: 212 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIM-AEIYKNGPVEVSFTVYEDFAHYKSGV 269
R C ++ + + H+ Y + E I+ EI+ NGPVE + YEDF Y+SG+
Sbjct: 178 ARVCQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHNGPVEATMAAYEDFYTYESGI 237
Query: 270 YKHITGDVMGGHAVKLIGWGTSDD-GEDYWV 299
Y HI G + HAVK+IGWGT YW+
Sbjct: 238 YHHIEGTFVCDHAVKIIGWGTDKKTNTPYWL 268
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/165 (44%), Positives = 100/165 (60%), Gaps = 18/165 (10%)
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 199
+S N+LLACC CGDGC+GGYP +AW F H GVVT + C PY + C H
Sbjct: 12 VSANELLACC-ESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA-CDHHV 69
Query: 200 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
C+ TP+C +KC N +++ KHY +Y ++S DIM E+ GPVE
Sbjct: 70 VGKLKPCKGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-VNDIMEELVTRGPVEA 128
Query: 255 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+FTVY DF Y SGVY+H TG +GGHAVK++G+G ++G+ YW+
Sbjct: 129 AFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGV-ENGDKYWL 172
>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
Length = 230
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 116/226 (51%), Gaps = 31/226 (13%)
Query: 69 QFKHLLGVKPTPKGLLLGV---PVKTHDK----SLKLPKSFDARSAWPQCSTISRILDQG 121
Q LLG K LLGV P+K +D+ + ++P+ FD+R W C TI + +QG
Sbjct: 13 QIVRLLGSK-----RLLGVSKSPIKENDELYMDNSEVPEFFDSRLEWDYCETIGHVRNQG 67
Query: 122 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
+CGSCWA G A +DR C+ N +S +L CC CG GC+GGYP+ AW+YF
Sbjct: 68 NCGSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCC-HTCGFGCNGGYPLKAWQYFK 126
Query: 180 HHGVV-------TEECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK 226
HGVV T+ C PY D G + +P KC +KC + +
Sbjct: 127 RHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKN 186
Query: 227 HYSI-SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 271
HY AY + + +Y GP+E SF VY+DF +Y+SGVY+
Sbjct: 187 HYKTKDAYYLKNTTMQKDTMVY--GPIEASFDVYDDFMNYESGVYQ 230
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 153/335 (45%), Gaps = 51/335 (15%)
Query: 6 LFLTTCLLILGV-ISSQTFAEGVVSKLKLDS------------HI--LQDSIIKEVNENP 50
+F+ C+++ G Q + EG V K +S H+ +Q +I+ VNE
Sbjct: 1 IFICVCVILTGCHRDGQHYEEGSVIKENCNSCTCSGQQWNCSQHVCLVQPGLIEHVNEG- 59
Query: 51 KAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD--KSLKLPKSFDARSA 107
GW A QF T+ + FK+ LG P P LLL + T ++ LP+ F A
Sbjct: 60 DFGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLPETTDLPEFFVASYK 118
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 165
WP LDQ +C + WAF +DR I +LS +L++CC G
Sbjct: 119 WP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCA-KNRHG 175
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCV 216
C+ G AW Y G+V+ C P F ++ GC A + T C
Sbjct: 176 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNFE 235
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
K N++++ S YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+T
Sbjct: 236 KSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTST 290
Query: 277 --------VMGGHAVKLIGWGT----SDDGEDYWV 299
+ HAVKL GWGT E +W+
Sbjct: 291 NEESDKYRKLRTHAVKLTGWGTLKGAQGRKEKFWI 325
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 119/227 (52%), Gaps = 24/227 (10%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LS 150
D S +P++FDAR+ W +C +I+ I +QG+C + WA A++DR CI N++ S
Sbjct: 82 DGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVTSAINDRICIKSKKNITAFYS 141
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE 203
+L+CC CGDGC+GGY +AW+Y++ G+VT E C P+ C+H +
Sbjct: 142 PQKMLSCCD-DCGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPP-CNHTVMD 199
Query: 204 PAYP----------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED-IMAEIYKNGPV 252
P TP+C C N K S RI+ I E+ K+GP
Sbjct: 200 ERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDIS-KGIRIDWHCSGMIRNELKKHGPA 258
Query: 253 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYEDF YKSG+Y+H+TG ++G VK+IGWG G YW+
Sbjct: 259 TAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVY-RGVQYWL 304
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 77/181 (42%), Positives = 101/181 (55%), Gaps = 21/181 (11%)
Query: 136 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 193
+DR C + + S DLL+CC +CG GC+GG P AW Y+ H G+V+ Y
Sbjct: 1 TDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHMGLVSG--GNYNS 57
Query: 194 STGCSH---PGCEPAYP-----------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 238
S GCS P CE P TPKC + C N L++ K Y Y +
Sbjct: 58 SQGCSPYVIPPCEHHVPGNRLPCNGDTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGG 117
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ I AE++KNGPVE +FTVY D YKSGVYKH+ GD +GGHA+K+IGWG ++G YW
Sbjct: 118 EDHIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGV-ENGNKYW 176
Query: 299 V 299
+
Sbjct: 177 L 177
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 130/281 (46%), Gaps = 39/281 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
+ D++I VN + GW A + ++ Y+ G L +PT + + +
Sbjct: 127 LTDDALIHSVNSIHQLGWSARKYDEWWSHKYSEGLRLRLGTKEPT---FRVKSMTRLTNP 183
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 152
S LP+SF+A W + IS + DQG CG+ W SDRF I + LS
Sbjct: 184 SNDLPRSFNAVEKWS--TFISEVPDQGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQ 241
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDST-----------GCSHPG 201
++L+C GCDGG+ +AWRY +GV+ C PY G
Sbjct: 242 NILSCTRRQ--QGCDGGHLDAAWRYMHKNGVLDANCYPYIQQRDTCKVQRHRGRSLKAYG 299
Query: 202 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
C+PA+ V ++ + YS+S DIMAEIY +GPV+ + TVY D
Sbjct: 300 CQPAHG--------VNRDNFYTVGPAYSLSR------EADIMAEIYHSGPVQATMTVYRD 345
Query: 262 FAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
F Y SGVY+H G G H+VKL+GWG +G YW+
Sbjct: 346 FFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWI 386
>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
Length = 150
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 95/184 (51%), Gaps = 38/184 (20%)
Query: 117 ILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISA 174
I +Q +CGSCWAFGA E +SDR CI +S D+L CCG CG GCDG
Sbjct: 2 IRNQTNCGSCWAFGAAEVISDRICIVTKGARQPIISPTDMLDCCGEYCGYGCDGC----- 56
Query: 175 WRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAY 233
P TPKC C K N + K++ SAY
Sbjct: 57 -----------------------------PKAVTPKCALSCQSKYNTEYAKDKNFGSSAY 87
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
+ + I EI NGPVE SFTVYEDF YK GVY++ G+V+GGHA+K+IGWGT ++
Sbjct: 88 YVGRNFSVIQTEIMTNGPVEASFTVYEDFYIYKKGVYQYTAGEVLGGHAIKIIGWGT-EN 146
Query: 294 GEDY 297
G DY
Sbjct: 147 GTDY 150
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/293 (31%), Positives = 134/293 (45%), Gaps = 39/293 (13%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKG 82
+G + D + D ++ VN + GW A + ++ Y+ G L +PT +
Sbjct: 115 DGGRVQCDTDLCLTDDELVHSVNSIHRLGWSARKYDEWWGHKYSEGLRLRLGTKEPTYR- 173
Query: 83 LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
+ + + S LP+ F+A W S IS + DQG CGS W SDRF I
Sbjct: 174 --VKAMTRLTNPSDDLPRKFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQ 229
Query: 143 FGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY--------- 191
+ LS ++L+C GC+GG+ +AWRY GV+ E+C PY
Sbjct: 230 SQGKEVVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKI 287
Query: 192 --FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
+S GC+PAY V ++ L+ YS+S DIMAEIY +
Sbjct: 288 QRHNSRSLKANGCQPAYG--------VNRDSLYTVGPAYSLSR------EADIMAEIYHS 333
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
GPV+ + +Y DF Y G+Y+ G G H+VKL+GWG DG YW+
Sbjct: 334 GPVQATMRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWI 386
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 132/284 (46%), Gaps = 25/284 (8%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGV-KPTPKGLLLGVPVKTHDK 94
++ +I VN GW+AA QF T+ ++ LG +P P + + D
Sbjct: 140 LMDGDLIDAVNRG-NYGWRAANYSQFWGMTLEDGMRYRLGTFRPPPTVMNMNEMHMAMDS 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
+ LP+ FDA + WP I LDQG+C WAF SDR IH M SLS
Sbjct: 199 NEVLPRHFDAATKWP--GMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTPK 210
+LL+C GC GG AW Y GVVT+EC P+ DS + P + T +
Sbjct: 257 NLLSC-DTRNQRGCSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGR 315
Query: 211 CVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
R+ + Q N + S AYR+ ++IM E+ +NGPV+ V+EDF YKS
Sbjct: 316 GKRQATARCPNPQTHANDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKS 375
Query: 268 GVYKHIT--------GDVMGGHAVKLIGWGTSD--DG--EDYWV 299
G+Y+H G H+VK+ GWG DG + YW
Sbjct: 376 GIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWT 419
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 90/255 (35%), Positives = 127/255 (49%), Gaps = 25/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K VP T + + P SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKRDRAAVPRGTV-SATQAPDSFDFREEY 84
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V ++ DR C G++ + S +++C GD
Sbjct: 85 PHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDM 138
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 139 ACDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDL--- 186
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+ + A D + IM + GP++ +FTVY DF +Y+ GVY+H G V GGHAV+
Sbjct: 187 PIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVE 246
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT + DYW+
Sbjct: 247 MVGYGTDEYDVDYWI 261
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 86/255 (33%), Positives = 128/255 (50%), Gaps = 23/255 (9%)
Query: 49 NPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAW 108
NP+ WKA +F T + LL K P T +P+SFD R +
Sbjct: 28 NPR--WKAGIPKRFEGLTKDEISSLLMPVSFLKNAKGAAPRGTFTDKDDVPESFDFREEY 85
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD- 164
P C I ++DQG CGSCWAF +V DR C+ G++ + S +++C GD
Sbjct: 86 PHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDM 139
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
C+GG+ + W++ G T+EC PY + C PT KC +
Sbjct: 140 ACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHL 190
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
+ S Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV+
Sbjct: 191 ATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVE 248
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT DDG DYW+
Sbjct: 249 MVGYGTDDDGVDYWI 263
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 80/177 (45%), Positives = 97/177 (54%), Gaps = 20/177 (11%)
Query: 130 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
GAVEA+SDR CIH N SLS DLL+CC CG GC GGYP AW Y+ HG+VT
Sbjct: 1 GAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCEN-CGFGCRGGYPAVAWDYWKTHGIVTGG 59
Query: 188 CDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLWRNSKHYSISA 232
D +GC P CE YPTP+CV++C + + K + +
Sbjct: 60 SKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMS 117
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
Y I + IM EI GPVE FT+YEDF Y SGVY H G M GHAV+++GWG
Sbjct: 118 YNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWG 174
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 99/293 (33%), Positives = 137/293 (46%), Gaps = 40/293 (13%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 81
EG + D + D+II VN + GW A + Q+ Y+ G K LG K PT +
Sbjct: 115 EGGRVQCDQDLCLTDDAIIHSVNSISRLGWSAHKYDQWWGRKYSEG-LKLRLGTKEPTYR 173
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
+ + + + LP+SF+A W S IS + DQG CG+ W SDRF I
Sbjct: 174 ---VKAMTRLRNPTDGLPRSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228
Query: 142 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-------- 191
+ LS ++L+C GCDGG+ +AWRY GVV E C PY
Sbjct: 229 QSKGKETVQLSAQNILSCTRRQ--QGCDGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCK 286
Query: 192 --FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 249
+S GCE TP V R++ + AY +N + DIMAEI+ +
Sbjct: 287 IRHNSRSLRANGCE----TPVNVD---------RDTFYTVGPAYSLNREA-DIMAEIFNS 332
Query: 250 GPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWV 299
GPV+ + V DF Y GVY+ + G H+VKL+GWG +GE YW+
Sbjct: 333 GPVQATMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWI 385
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 147/296 (49%), Gaps = 47/296 (15%)
Query: 31 LKLDSH----ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK-PTPK---- 81
L LDS + ++ I+ +N+ K W+A ++ F + + L+G+ PTP+
Sbjct: 37 LNLDSSSDPLVHDEAFIQLINKYAKT-WQAGKSKFFEGKRLSHARRLIGLGLPTPEQRAS 95
Query: 82 -----GLLLGVPVKTHDKSL----KLPKSFDAR--SAWPQCSTISRILDQGHCGSCWAFG 130
L++G + +K L LP S++A S + C + RI +Q CGSCWAF
Sbjct: 96 YPKKNSLMMGEEANSLEKYLVKMDALPDSYNAANDSNYYMCQQLHRIRNQEQCGSCWAFS 155
Query: 131 AVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 188
E ++DRFCI +N +S +++C +GC+GG +A+++ G+V++ C
Sbjct: 156 ISEMVADRFCIGTRGKINTIMSPQWMVSCD--TADNGCNGGEFPTAFQFVETTGLVSDGC 213
Query: 189 DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIM 243
PY G P C C + +NS+++ ++ D + +
Sbjct: 214 VPYQSGNGF----------VPPCPNSCANGEDINVRYRTKNSRNFDVN------DMKSVQ 257
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
A I NGPV F VY DF +Y+SG YKH+ G ++GGHA+K++GWG + YW+
Sbjct: 258 ASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQSNVPYWI 312
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 96/285 (33%), Positives = 136/285 (47%), Gaps = 24/285 (8%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 81
EG + D + D+I+ VN + GW A + Q+ Y+ G K LG K PT +
Sbjct: 115 EGGSVQCDEDLCLTDDAIVHSVNSIHRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
+ + + + LP SF+A W S IS + DQG CG+ W SDRF I
Sbjct: 174 ---VKAMTRLKNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228
Query: 142 HFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
N+ LS ++L+C GC+GG+ +AWRY GVV E C PY H
Sbjct: 229 QSKGKENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPY-----TQH 281
Query: 200 PGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
+ +R C K + R+S + AY +N + DIMAEI+ +GPV+ +
Sbjct: 282 RDTCKIRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMR 340
Query: 258 VYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWV 299
V DF Y GVY+ + G H+VKL+GWG +GE YW+
Sbjct: 341 VNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWI 385
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 99/177 (55%), Gaps = 19/177 (10%)
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA + A+SDR CI + +S D+++CC + CG GCDGG+PI AW++F G
Sbjct: 1 SCWAVSSAAAMSDRICIASKGVKQVLISAQDMVSCCSY-CGYGCDGGWPIKAWQFFAREG 59
Query: 183 VVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQL-WRNSKH 227
VVT C PY + T C H G EP Y TP+C RKC + ++ K
Sbjct: 60 VVTGGNYGRQGCCRPY-EITPCGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKR 118
Query: 228 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
Y AY++ + + I EI +GPV +TVYEDF++Y G+YKH G GGHAVK
Sbjct: 119 YGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVK 175
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 106/193 (54%), Gaps = 19/193 (9%)
Query: 125 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY----- 177
SCWA + EA+SD C+ + + +S +D+L+CCG CG GC GG+ I A+++
Sbjct: 1 SCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRER 60
Query: 178 --FVHHGVVTEECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKKN-QLWRNSK 226
+ C P S + +P Y PTPKC + C +K + ++ K
Sbjct: 61 CCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQEDK 120
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
H++ AY + ++ I EIYKNGPV +F VY+DF++YK G+Y H G G HAVK++
Sbjct: 121 HFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVV 180
Query: 287 GWGTSDDGEDYWV 299
GWG ++ DYW+
Sbjct: 181 GWG-RENATDYWL 192
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 114/232 (49%), Gaps = 32/232 (13%)
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP SFDAR + C+ I + +QG C +CWA AV +DR CI G ++ LS+ L
Sbjct: 145 LPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSLGYL 204
Query: 155 LACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------EE------CDPYFDSTGC 197
+CC G +GC G + +HG+VT EE C PY C
Sbjct: 205 TSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPY-PFPKC 263
Query: 198 SH-PGCEPAYPT-------PKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIY 247
+H PG E YP P C C K + H + S R+ PE I EI+
Sbjct: 264 NHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIF 323
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPV T+YEDF YKSGVY H TG ++ H +KLIGWG + G++YW+
Sbjct: 324 DNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGV-ESGQEYWL 374
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 94/292 (32%), Positives = 131/292 (44%), Gaps = 38/292 (13%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKG 82
+G + D + D +I VN + GW A + ++ Y+ G L +PT +
Sbjct: 115 DGGRVQCDTDLCLTDDELINSVNSIHQLGWSARKYDEWWSHKYSEGLRLRLGTKEPTYR- 173
Query: 83 LLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
+ + + S LP+ F+A W S IS + DQG CGS W SDRF I
Sbjct: 174 --VKAMTRLSNPSSGLPRKFNAVERWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQ 229
Query: 143 FGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY--------- 191
+ LS ++L+C GC+GG+ +AWRY GVV E C PY
Sbjct: 230 SQGKEVVQLSPQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDETCYPYTQRRDSCKI 287
Query: 192 -FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 250
+S GC PAY V ++ L+ YS+ DIMAEIY +G
Sbjct: 288 RHNSRSLKANGCRPAYG--------VNRDSLYTVGPAYSLKG------ETDIMAEIYHSG 333
Query: 251 PVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
PV+ + VY DF Y GVY+ G G H+VK++GWG DG YW+
Sbjct: 334 PVQATMRVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWI 385
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 129/283 (45%), Gaps = 32/283 (11%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPT 79
+ + ++ + S+I +N N GWKA +F N T+ Q + +L G+ + T
Sbjct: 32 IANHTHANTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSTDEDT 91
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
P+ ++ + + +P +FDAR+ W C + I DQ CG+CWAF A L+ R
Sbjct: 92 PR-------MENIETRMDIPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRL 142
Query: 140 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 197
CI N+ LS + C C GGY +W + + G + C PY G
Sbjct: 143 CIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDTCIPYASGRGT 200
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
G P +C + ++ Y R + +I I G V+ FT
Sbjct: 201 FSSGTCPT----QCKIASMSMSK-------YKAKNTRYITGINNIKTAIMTYGSVQAGFT 249
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VY D YKSGVYKH+ V+GGHAV LIG+G + G +YW+
Sbjct: 250 VYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNYWLA 291
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 117/237 (49%), Gaps = 33/237 (13%)
Query: 90 KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
KT D S K +P+ FDAR + C+ I + DQG+C S WA +DR CI
Sbjct: 54 KTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQ 113
Query: 147 LS--LSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+ LS +L++C GDG CDGG AW ++ G+VT E C PY
Sbjct: 114 FTDNLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-K 167
Query: 194 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 242
+ C H G C T C +KCV KN + + H + Y + ++ + I
Sbjct: 168 NRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 227
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI +GPV VYE+F YK G+YK TG+++G H VKLIGWG DG +YW+
Sbjct: 228 QQEIMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWL 284
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/177 (42%), Positives = 103/177 (58%), Gaps = 20/177 (11%)
Query: 130 GAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 185
GAVEA++DR CIH + +S DLL+CC CG GC GG+P AW +++ +G+VT
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISATDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGG 59
Query: 186 -----EECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSISA 232
C Y CSH G + YP TP CV C K + + K ++ S+
Sbjct: 60 SKENPSGCRSY-PFPRCSHHG-KGKYPPCPKTIFDTPNCVDHCDKPDIDYAADKTHAKSS 117
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
Y + S+ IM EI +NGPVE +F VYEDF YKSG+Y H G ++GGHA++++GWG
Sbjct: 118 YNVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWG 174
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/275 (32%), Positives = 129/275 (46%), Gaps = 32/275 (11%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPTPKGLLLG 86
++ + S+I +N N GWKA +F N T+ Q + +L G+ + TP+
Sbjct: 76 NTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSSDEDTPR----- 130
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-- 144
+ + + +P +FDAR+ W C + I DQ CG+CWAF A L+ R CI
Sbjct: 131 --MANIETRIDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQ 186
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 204
N+ LS + C C GGY +W + + G + C PY G
Sbjct: 187 TNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDSCIPYASGRG-------- 236
Query: 205 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
+ + C +C K SK+ + + I S +I I G V+ FTVY D
Sbjct: 237 TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFTVYRDLTG 293
Query: 265 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
YKSGVYKHI V+GGHAV LIG+G + G +YW+
Sbjct: 294 YKSGVYKHIENTVLGGHAVALIGFGV-EGGSNYWL 327
>gi|324514184|gb|ADY45787.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 476
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 159/357 (44%), Gaps = 72/357 (20%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDS------IIKEVNENPKAGWKAARNPQFSNY 65
+LIL IS A G KL+ D + ++ ++++VN+ P+ WKA NP +
Sbjct: 89 ILILLGISFIAAAIGFYLKLQKDVEEVHETKAYLMGLVQQVNQAPELKWKAKYNPFGTRK 148
Query: 66 TVGQF---KHLLGVKPTPKGL---LLGVPVKTHDKSL------KLPKSFDARSAWPQCST 113
F K+ ++ L +K H + L LP FDAR W CS+
Sbjct: 149 KDHNFPFDKNSTAIREYLNRLSEFFNSEKMKQHLRELTEFPADSLPSEFDARRKWSYCSS 208
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 171
+ + +QG CG+C+A AV SDR CI L S D+L CC +CG+ C GG P
Sbjct: 209 LHNVPNQGGCGACYAVAAVGVASDRACIASNGTLQSMFSEEDVLGCCA-VCGN-CYGGDP 266
Query: 172 ISAWRYFVHHGVVT---EECDPYFDSTGCSHPGCEPA-YPTPKCVRKCVKKNQ------L 221
+ A Y+V G+VT + C PY C P C PA YP + RKC ++ Q
Sbjct: 267 LKALVYWVDEGLVTGGRDGCRPYSVDLSCGVP-CSPAVYPLAEYRRKCYRQCQDIYFQYN 325
Query: 222 WRNSKHYSISAYRI---------------------------NSDP-------EDIMAEIY 247
+ + KHY AY + + +P + IM E+Y
Sbjct: 326 YESDKHYGSMAYSMFPRTMSLDNKGSERVKLPTVIGYLNETSDEPLTDKEIRQIIMKELY 385
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYK-----HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GP+ ++F V E+F HY SGV+ + + ++ H +LIGWG D YW+
Sbjct: 386 LWGPMTMAFPVTEEFLHYSSGVFSPFPAANFSDRIVYWHVARLIGWGKYDGDNHYWL 442
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 135/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++Q +I+ VN N GW A QF T+ + FK+ LG + P+P+ L + +
Sbjct: 155 LIQPELIERVN-NGDYGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPRLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW + G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNDGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 331 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTRTNEEASKYRKFQTHAVKLTGWGTLKGAQGQKEKFWI 433
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/263 (34%), Positives = 135/263 (51%), Gaps = 33/263 (12%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS--LKLP 99
+I+E+N + + WKA N +G LG+ P P + K H S + +P
Sbjct: 26 VIQEIN-SEQISWKAETNCLDIKSRLG----FLGLHPDPN---YKIQTKQHKISRIISIP 77
Query: 100 KSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 156
+SFDAR WP+C I +I +QG+CGSCWAF + E ++DR CI + S +LL
Sbjct: 78 ESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENLLT 137
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
CC G Y +AW Y+++ G+ + Y S GC P E ++ + +CV
Sbjct: 138 CCKDCGCGCKGG-YIKNAWDYYINEGIAS--GGDYNSSEGC-QPYSESSFQYAE-ASECV 192
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
K Y + ++ I EI NGPV + V+EDFA +KSGVY + +G
Sbjct: 193 K--------------FYTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGK 238
Query: 277 VMGGHAVKLIGWGTSDDGEDYWV 299
+G H+VK+IGWGT ++G YW+
Sbjct: 239 FVGRHSVKVIGWGT-EEGIPYWL 260
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 114/233 (48%), Gaps = 34/233 (14%)
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
+P SFDAR A+ +C I + DQ C SCWA V+A S R CI G N LS +L
Sbjct: 83 IPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKSGGKFNQLLSAGEL 142
Query: 155 LACCGFL--C-GDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGCS 198
LACC C GC GG AW + HG+ T + C PY + C+
Sbjct: 143 LACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPY-NFPRCA 201
Query: 199 H--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISA--YRINSDPEDIMAEI 246
H P + +Y TP C+ +C K +H++ A Y N I EI
Sbjct: 202 HYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWFNG-IRSIKKEI 260
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
K+GP SF YEDF YKSGVYK+ +G + H V+LIGWGT + G DYW+
Sbjct: 261 MKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGWGT-EKGVDYWL 312
>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
Length = 183
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/185 (41%), Positives = 106/185 (57%), Gaps = 24/185 (12%)
Query: 131 AVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 185
AV ++SDR CIH N + LS DLL+CC CG GC GG+ AW Y+ +G+VT
Sbjct: 1 AVTSMSDRVCIHSNQNKTNVQLSARDLLSCC-TSCGFGCVGGWIGDAWDYWRDNGIVTGG 59
Query: 186 -----EECDPY-------FDSTGCS---HPGCEPAYPTPKCVRKCVKKNQ-LWRNSKHYS 229
C PY S G +P + YPTP CV KC + + K ++
Sbjct: 60 DYQDKSTCLPYPFPPSHHLVSKGTPFEIYP--QTLYPTPPCVSKCQEGYPGEYEKDKIFA 117
Query: 230 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
+S+Y+I+ + +I EI NGPVE VY DF +YK+GVY+H TG+++GGHA++L+GWG
Sbjct: 118 LSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKTGVYQHTTGEILGGHAIRLLGWG 177
Query: 290 TSDDG 294
+ DG
Sbjct: 178 KTKDG 182
>gi|56758470|gb|AAW27375.1| unknown [Schistosoma japonicum]
Length = 217
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 104/178 (58%), Gaps = 15/178 (8%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLL--LGVPVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD +
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLREKRRPTVDHHDLN 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ CGS WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVD 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
L++CC + CG GCDGG+ +W Y+V G+VT + +H GC P YP PKC
Sbjct: 148 LISCCKY-CGSGCDGGFLGPSWDYWVLRGIVT-------GGSKENHTGCRP-YPFPKC 196
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 116/237 (48%), Gaps = 33/237 (13%)
Query: 90 KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
KT D S K +P+ FDAR + C+ I + DQG+C S WA +DR CI
Sbjct: 54 KTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQ 113
Query: 147 LS--LSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+ LS +L++C GDG CDGG AW ++ G+VT E C PY
Sbjct: 114 FTDNLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-K 167
Query: 194 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 242
+ C H G C T C +KCV KN + + H + Y + ++ + I
Sbjct: 168 NRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 227
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI GPV VYE+F YK G+YK TG+++G H VKLIGWG DG +YW+
Sbjct: 228 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWL 284
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 137/289 (47%), Gaps = 35/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++Q +I+ VN+ GW+A QF T+ + FK+ LG + P+P L + +
Sbjct: 152 LVQPELIERVNKG-DYGWRAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEVTASLPA 210
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 211 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 268
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW + G+V+ C P F + ++ GC A
Sbjct: 269 NLISCCP-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKNQNATNHGCAMASRSDGRG 327
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 328 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 382
Query: 264 HYKSGVYKHITGDV---------MGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+HIT + HAVKL GWGT E +W+
Sbjct: 383 HYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQGRKEKFWI 431
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 115/215 (53%), Gaps = 31/215 (14%)
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD-GCD 167
C ++ I DQ +CGSCWAFG+ EA++DR CI ++ LS D+ +C GD GC+
Sbjct: 1 CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKL--GDMGCN 58
Query: 168 GGYPISAWRYFVHHGVVTEECDPYFDSTGC---------------SHPGCEPAYPTPKCV 212
GG P S + Y+ G+V + Y D +GC +P C PKC
Sbjct: 59 GGIPSSVYSYWALSGIV--DGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCA 116
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPE-------DIMAEIYKNGPVEVSFTVYEDFAHY 265
RKC +++ W +K Y + E + A+IY+NGP+ F V +DF Y
Sbjct: 117 RKCESEDKDWTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAY 176
Query: 266 KSGVYK-HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KSGVY+ + +GGHA+K++G+GT +DG+DYW+
Sbjct: 177 KSGVYEPKLLSPPLGGHAIKIMGFGT-EDGKDYWL 210
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/285 (34%), Positives = 135/285 (47%), Gaps = 34/285 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLL-LGVPVKTHDK 94
+++ +++ +N GWKA QF TV + FK LG P LL +
Sbjct: 160 LVRQDLLQRINSG-DYGWKADNYSQFWGMTVEEAFKKRLGTFPPSHSLLNMRESPGNSLP 218
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
K P F A AWP+ I LDQ +CG+ WAF +DR IH ++ LSV
Sbjct: 219 EEKFPVFFAATYAWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQ 276
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-----DSTGCSHPGCEPAY- 206
+L++C GC+GG SAWRY HGVV+ C P F + +G +H Y
Sbjct: 277 NLISC-DTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPSFWKKHLEPSGENHCYVSSEYG 335
Query: 207 ------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
P P + K N+L+R + HY R++S +IM EI GPV+ VYE
Sbjct: 336 KNYTNGPCPNALEK---SNRLYRCASHY-----RVSSKETNIMKEIMDKGPVQAIMKVYE 387
Query: 261 DFAHYKSGVYKHI--TGDVMGGHAVKLIGWGTSDD----GEDYWV 299
DF YK G+Y+H G H+VKL+GWG D + +W+
Sbjct: 388 DFFLYKEGIYRHSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWI 432
>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
Length = 257
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 103/205 (50%), Gaps = 16/205 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN-DLLA 156
+P+SFDAR+ WP C I IL+Q CGSCWAF A E LSDR CI + ++ L
Sbjct: 31 IPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQALV 88
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 216
C GC+GG P AW Y HG+ T C PY G CV+
Sbjct: 89 SCDIFGNQGCNGGIPQLAWEYMELHGIPTYGCFPYTSGNGTDG----------SCVKNSC 138
Query: 217 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG- 275
N+ + + ++ + + E I +I K GP++ + VY DF Y SGVY G
Sbjct: 139 VDNEQYTLYRAKPLT-LKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGS 197
Query: 276 DVMGGHAVKLIGWGTSD-DGEDYWV 299
++GGHA+K++GWG ++YW+
Sbjct: 198 SLLGGHAIKIVGWGFDQASNQNYWI 222
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 132/283 (46%), Gaps = 32/283 (11%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPT 79
+ + ++ + S+I +N N GWKA +F N T+ Q + +L G+ + T
Sbjct: 17 IANHTHANTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSSDEDT 76
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
P+ + + + + +P +FDAR+ W C + I DQ CG+CWAF A L+ R
Sbjct: 77 PR-------MASIETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRL 127
Query: 140 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 197
CI N+ LS + C C GGY +W + + G + C PY G
Sbjct: 128 CIATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDTCIPYASGRG- 184
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
+ + C +C K SK+ + + I S +I I G V+ FT
Sbjct: 185 -------TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFT 234
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VY D YKSGVYKH+ V+GGHAV LIG+G + G +YW+
Sbjct: 235 VYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNYWLA 276
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 135/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++Q +I+ VN+ GW A QF T+ + FK+ LG + P+P L + +
Sbjct: 155 LVQPELIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW + G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 331 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 385
Query: 264 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+HIT + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHITRTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEKFWI 433
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 82/256 (32%), Positives = 128/256 (50%), Gaps = 29/256 (11%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLG-VP----VKTHDKSLKLPKSFDARSAW 108
WKA +F N T +F+ +L ++P G G +P + + + +P FD R +
Sbjct: 31 WKAGMPKRFENITEDEFRGML-IRPDILGAGSGSLPPSSVTEIQEPADPIPSQFDFRDEY 89
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
PQC ++ ++DQG CG CWAF A+ DR C+ G++ + S L++C G
Sbjct: 90 PQC--VTPVMDQGSCGGCWAFSAIGVFGDRRCVA-GIDKEGVPYSQQYLISCS--TENHG 144
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGG W + G T EC Y D P C C +Q+
Sbjct: 145 CDGGDFWPTWSFLTLTGATTAECVKYIDY---------PNIVASPCPAVCDDGSQI---- 191
Query: 226 KHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAV 283
+ Y Y +++ + + IM + GPV+ VY D ++Y+SGVYKH G + +G HA+
Sbjct: 192 QLYKAHGYGQVSKNVQAIMHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHAL 251
Query: 284 KLIGWGTSDDGEDYWV 299
+++G+GT+DDG DYW+
Sbjct: 252 EMVGYGTTDDGTDYWI 267
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 131/281 (46%), Gaps = 40/281 (14%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPKGLLLGVPVKTHD 93
+ D++I VN + GW A + Q+ Y+ G K LG K PT + + + +
Sbjct: 127 LTDDALIHSVNSIQRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR---VKAMTRLKN 182
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSV 151
+ LP SF+A W S IS + DQG CG+ W SDRF I + LS
Sbjct: 183 PTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSA 240
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPG 201
++L+C GC+GG+ +AWRY GVV E C PY +S G
Sbjct: 241 QNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYPYTQQRDTCKIRHNSRSLRANG 298
Query: 202 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
C+ Y R++ + AY +N + DIMAEI+ +GPV+ + V D
Sbjct: 299 CQTPYNVD-------------RDTFYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRD 344
Query: 262 FAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGEDYWV 299
F Y GVY+ + M G H+VKL+GWG +GE YW+
Sbjct: 345 FFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYWI 385
>gi|325303156|tpg|DAA34330.1| TPA_inf: cysteine proteinase cathepsin L [Amblyomma variegatum]
Length = 207
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 68/139 (48%), Positives = 90/139 (64%), Gaps = 13/139 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS---LKLPKSFDARSAWPQ 110
WKA NP + + +LLGV+P + L +P +T D + LP++FDAR WP
Sbjct: 62 WKAGHNPGYDD--PDYVANLLGVRP--ENSLYRLPERTLDVNALPTALPENFDAREQWPD 117
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG-----MNLSLSVNDLLACCGFLCGDG 165
C TI I DQG CGSCWAFGAVEA+SDR CIH +N+ L+ +D+L+CC CG G
Sbjct: 118 CPTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPARKPRVNVHLAADDVLSCCK-DCGAG 176
Query: 166 CDGGYPISAWRYFVHHGVV 184
C+GG+P +AW Y+VHHG+V
Sbjct: 177 CNGGFPGAAWSYWVHHGIV 195
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 135/289 (46%), Gaps = 36/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
K+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
+YK+G+Y+HIT HAVKL GWGT E +W+
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 94/152 (61%), Gaps = 14/152 (9%)
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPT 208
+CGDGC+GGYP AW ++ G+V+ C PY S P C T
Sbjct: 1 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDT 60
Query: 209 PKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
PKC + C + ++ KHY ++Y +++ + IMAEIYKNGPVE +F+VY DF YKS
Sbjct: 61 PKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKS 120
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 121 GVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 151
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 135/285 (47%), Gaps = 24/285 (8%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 81
EG + D + D+I+ VN + GW A + Q+ Y+ G K LG K PT +
Sbjct: 115 EGGSVQCDQDLCLTDDAIVHSVNSINRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
+ + + + LP SF+A W S IS + DQG CG+ W SDRF I
Sbjct: 174 ---VKAMTRLRNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228
Query: 142 HFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
+ LS ++L+C GC+GG+ +AWRY GVV E C PY H
Sbjct: 229 QSKGKEAVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYT-----QH 281
Query: 200 PGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
+ +R C + R++ + AY +N + DIMAEI+ +GPV+ +
Sbjct: 282 RDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFHSGPVQATMR 340
Query: 258 VYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGEDYWV 299
V DF Y GVY+ + + G H+VKL+GWG +GE YW+
Sbjct: 341 VNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWI 385
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 71/178 (39%), Positives = 100/178 (56%), Gaps = 16/178 (8%)
Query: 130 GAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 185
GAVEA++DR CIH + +S DLL+CC CG GC GG+P AW +++ +G+VT
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISSTDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGG 59
Query: 186 -----EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 234
C Y G P E +PTP C + C + K + S+Y
Sbjct: 60 SKENPSGCRSYPFPKCNHHGKGPDAPCPEKIFPTPACNKTCDTPEVNYILDKTKAKSSYN 119
Query: 235 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
+ + + IM EI +NGPVE +F VYEDF HY+SGVY H G ++GGHA++++GWG +
Sbjct: 120 VPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEEN 177
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 86/237 (36%), Positives = 116/237 (48%), Gaps = 33/237 (13%)
Query: 90 KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
KT D S K +P+ FDAR + C+ I + DQG+C S WA +DR CI
Sbjct: 16 KTVDNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQ 75
Query: 147 LS--LSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+ LS +L++C GDG CDGG AW ++ G+VT E C PY +
Sbjct: 76 FTDNLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKN 130
Query: 194 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 242
C H G C T C +KCV KN + + H + Y + ++ + I
Sbjct: 131 RP-CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI GPV VYE+F YK G+YK TG+++G H VKLIGWG DG +YW+
Sbjct: 190 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWL 246
>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 131/283 (46%), Gaps = 32/283 (11%)
Query: 27 VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK-HLLGV------KPT 79
+ + ++ + S+I +N N GWKA +F N T+ Q + +L G+ + T
Sbjct: 17 IANHTHANTPVNDKSLIDRINSNHTHGWKATEYSRFDNMTISQLRDNLFGLSLMSSDEDT 76
Query: 80 PKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRF 139
P+ + + + +P +FDAR+ W C + I DQ CG+CWAF A L+ R
Sbjct: 77 PR-------MANIETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANYVLAHRL 127
Query: 140 CIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC 197
CI N+ LS + C C GGY +W + + G + C PY G
Sbjct: 128 CIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTFLENTGTPLDTCIPYASGGG- 184
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
+ + C +C K SK+ + + I S +I I G V+ FT
Sbjct: 185 -------TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFT 234
Query: 258 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWVC 300
VY D YKSGVYKH+ V+GGHAV LIG+G + G +YW+
Sbjct: 235 VYRDLTGYKSGVYKHLVSTVLGGHAVALIGFGV-EGGSNYWLA 276
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 134/289 (46%), Gaps = 36/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
+++ +I+ VN+ GW A QF T+ FK LG P P LLL + T
Sbjct: 143 LVRPELIENVNKG-DYGWIAQNYSQFWGMTLEDGFKFRLGTLP-PSPLLLSMNEMTASLP 200
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
K+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 201 KTTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 258
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 259 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGCAMASRSDGR 317
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YRI+S+ +IM EI +NGPV+ V+EDF
Sbjct: 318 GKRHATKPCPNNIEKSNRIYQCS-----PPYRISSNETEIMKEIMQNGPVQAIMQVHEDF 372
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYKSG+Y+H+ + HAVKL+GWGT E +W+
Sbjct: 373 FHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQGRKEKFWI 421
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 129/280 (46%), Gaps = 38/280 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
+ +SII +N GW A + ++ Y+ G L +PT + + + +
Sbjct: 129 LTDESIIHSINTIYHLGWSARKYDEWWGHKYSEGLRLRLGTKEPTYR---VKAMSRLTNP 185
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 152
+ LP +F+A W S IS + DQG CGS W SDRF I + LS
Sbjct: 186 TAGLPAAFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGC 202
++L+C GC+GG+ +AWRY GVV E C PY +S GC
Sbjct: 244 NILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGC 301
Query: 203 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
P+ R+S + AY +N + DIMAEIY +GPV+ + VY DF
Sbjct: 302 RPSANVD-------------RDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDF 347
Query: 263 AHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
Y SGVY+ G G H+VKL+GWG +G+ YW+
Sbjct: 348 FSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWI 387
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 98/290 (33%), Positives = 134/290 (46%), Gaps = 30/290 (10%)
Query: 30 KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLL--LG 86
K D +++ +I+ +N GWKA QF TV + FK LG P LL
Sbjct: 153 KCSTDVCLVRQDLIQHINSG-DFGWKADNYSQFWGMTVEEGFKKRLGTFPPSHSLLNMRE 211
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
VP K+ + K P F A WP+ I LDQ +CG+ WAF +DR IH
Sbjct: 212 VPGKSLPEE-KFPAIFSAIYEWPE--WIHDPLDQRNCGASWAFSTASVAADRIAIHSKGQ 268
Query: 147 LS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 204
++ LS +L++C GC+GG AWRY HGVV+ C P F +
Sbjct: 269 ITDNLSAQNLISC-DTRNQHGCNGGSIDGAWRYLKTHGVVSYACYPSFWNKHLGPSAENQ 327
Query: 205 AYPTPK---------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 255
Y + + C K N+L+R + HY R++S DIM EI GPV+
Sbjct: 328 CYVSNEYGKNHTNGPCPNAFEKSNRLYRCASHY-----RVSSKETDIMKEIKDRGPVQAI 382
Query: 256 FTVYEDFAHYKSGVYKHI--TGDVMGGHAVKLIGWGTSDD----GEDYWV 299
VYEDF YK G+Y+H G H+VKL+GWG D + +W+
Sbjct: 383 MKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALPDKNGQKQKFWI 432
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 129/280 (46%), Gaps = 38/280 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
+ +SII +N GW A + ++ Y+ G L +PT + + + +
Sbjct: 129 LTDESIIHSINTIYHLGWSARKYDEWWGHKYSEGLRLRLGTKEPTYR---VKAMSRLTNP 185
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 152
+ LP +F+A W S IS + DQG CGS W SDRF I + LS
Sbjct: 186 TAGLPAAFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQ 243
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGC 202
++L+C GC+GG+ +AWRY GVV E C PY +S GC
Sbjct: 244 NILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGC 301
Query: 203 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
P+ R+S + AY +N + DIMAEIY +GPV+ + VY DF
Sbjct: 302 RPSANVD-------------RDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDF 347
Query: 263 AHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
Y SGVY+ G G H+VKL+GWG +G+ YW+
Sbjct: 348 FSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWI 387
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 135/287 (47%), Gaps = 36/287 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPELIERVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEMTASLP 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 153
+ LP+ F A WP LDQ +C + WAF +DR + NLS +
Sbjct: 213 ATTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIXGRYTANLS--PQN 268
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-------- 205
L++CC GC+ G AW + G+V+ C P F ++ GC A
Sbjct: 269 LISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRGK 327
Query: 206 -YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF H
Sbjct: 328 RHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFH 382
Query: 265 YKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
YK+G+Y+H+T + HA+KL GWGT E +W+
Sbjct: 383 YKTGIYRHVTRTNEESSKYRKLQTHAIKLTGWGTLKGARGQKEKFWI 429
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 114/239 (47%), Gaps = 41/239 (17%)
Query: 94 KSLKLPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLS 150
+ P++FD+ + WP+C+ I I DQ +CG CWAF EA SDR CI G + + LS
Sbjct: 20 RGGAAPEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIATGGAVAVPLS 79
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG-------------- 196
D+ C DGCDGG I+ W Y G VT ++ TG
Sbjct: 80 AQDV---CFNANVDGCDGGQIITPWTYVAKAGAVT---GGQYNGTGPFGAGLCADWFAPH 133
Query: 197 CSHPGCE-------------PAYPTPKCVRKC----VKKNQLWRNSKHYSISAYRINSDP 239
C H G P+ +P+ + C + + KH + S
Sbjct: 134 CHHHGPRGDDPYPAEGDAGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGE 193
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
IMA I + GPVE +FTVYEDF +Y G+Y H+TG+ GGHAVK +GWG ++G YW
Sbjct: 194 AAIMAMIAEGGPVETAFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGV-ENGTKYW 251
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 134/285 (47%), Gaps = 24/285 (8%)
Query: 25 EGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVK-PTPK 81
EG + D + D+I+ VN + GW A + Q+ Y+ G K LG K PT +
Sbjct: 115 EGGSVQCDQDLCLTDDAIVHSVNSINRLGWSARKYDQWWGRKYSEG-LKLRLGTKEPTYR 173
Query: 82 GLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI 141
+ + + + LP SF+A W S IS + DQG CG+ W SDRF I
Sbjct: 174 ---VKAMTRLRNPTDGLPSSFNALDKWS--SYISEVPDQGWCGASWVLSTTSVASDRFAI 228
Query: 142 HFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH 199
+ LS ++L+C GC+GG+ +AWRY GVV E C PY H
Sbjct: 229 QSKGKEAVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPY-----TQH 281
Query: 200 PGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
+ +R C + R++ + AY +N + DIMAEI+ +GPV+ +
Sbjct: 282 RDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFHSGPVQATMR 340
Query: 258 VYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWV 299
V DF Y GVY+ + G H+VKL+GWG +GE YW+
Sbjct: 341 VNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWI 385
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 130/280 (46%), Gaps = 38/280 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK-PTPKGLLLGVPVKTHDK 94
+ D +I VN GW A + ++ + + + LG K PT + + + +
Sbjct: 126 LTDDELIYSVNSIHNLGWSARKYNEWWGHKYAEGLRLRLGTKEPTYR---VKAMTRLTNP 182
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 152
+ LP SF+A WP S IS + DQG CGS W SDRF I + LS
Sbjct: 183 TDGLPSSFNAVERWP--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQ 240
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGC 202
++L+C GCDGG+ +AWR+ GVV + C PY +S GC
Sbjct: 241 NILSCTRRQ--QGCDGGHLDAAWRFLHKKGVVDDSCYPYTQQRDTCKIRHNSRSLKANGC 298
Query: 203 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
P+ + R+S + AY +N + DIMAEIY +GPV+ + VY DF
Sbjct: 299 RPS-------------PNVDRDSFYTVGPAYTLNREG-DIMAEIYHSGPVQATMRVYRDF 344
Query: 263 AHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
Y G+Y+ G G H+VKL+GWG +G+ YW+
Sbjct: 345 FSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWI 384
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 81/253 (32%), Positives = 118/253 (46%), Gaps = 56/253 (22%)
Query: 102 FDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG 159
FDAR WP+CS+I I D C S WAF A E++SDR CI+ G +N LS +LL+CC
Sbjct: 85 FDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCT 144
Query: 160 --FLCGDG------------------------------------CDGGYPISAWRYFVHH 181
F CG+G C GG AW+Y+ H
Sbjct: 145 GVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREKCAGGNVFKAWQYWQKH 204
Query: 182 GVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSK 226
G+ T C PY S + PGC TP C +KC + +
Sbjct: 205 GLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDR 264
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
HY +S ++ + +I +++ NGP+ + VY+DF Y +G+Y H+TG+ G +V+++
Sbjct: 265 HYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRIL 324
Query: 287 GWGTSDDGEDYWV 299
GWG +G YW+
Sbjct: 325 GWGMY-EGVPYWL 336
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 134/289 (46%), Gaps = 36/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
+++ +I++VN+ GW A QF T+ FK LG P P +LL + T
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSLMLLSMNEMTASLP 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 385 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 133/283 (46%), Gaps = 30/283 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGV----KPTPKGLLLGVPVKT 91
I + +I+++NE GW+A F + ++ LG +PT + L +
Sbjct: 140 INRPELIRQINEG-NFGWQATNYSIFYGKLLEDGIRYRLGTHQPERPTAEMNELHL---- 194
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FGMN-LSL 149
K +LP+ FDAR W + + DQG C + WAF SDR I G++ + L
Sbjct: 195 -KKREQLPEEFDARIRW--SGLVHGVRDQGDCANSWAFSTAAVASDRLSIQSRGVDKVEL 251
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE-PAYPT 208
S DL++C C GG+P WR+ +++G V+EEC PY ++ C P
Sbjct: 252 SPQDLMSCLNGGRRVVCQGGHPDRGWRFLLNYGGVSEECYPYEGVHSSANATCRIPRRRD 311
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
P +C KH+S YR+ ++ EDIM EIY NGPV+ V EDF Y+SG
Sbjct: 312 PIEDARCPTGRT---EQKHFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDFFLYRSG 368
Query: 269 VYKHI--------TGDVMGGHAVKLIGWGTSDDGE---DYWVC 300
VY+H G H+V+++GWG YW+C
Sbjct: 369 VYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQYRPIKYWLC 411
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRSELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 134/289 (46%), Gaps = 36/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
+++ +I++VN+ GW A QF T+ FK LG P P +LL + T
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSLMLLSMNEMTASLP 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 385 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 124/266 (46%), Gaps = 32/266 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK---THD 93
+L +S++ VN +P + W A P+ + L K T +G + T
Sbjct: 2 VLAESVVDIVNNDPSSTWVATEYPR---------EILTLAKMTAMISQIGNGFEGEWTFA 52
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND 153
++ P SFD R WP + +Q CGSCWA A E + R I +S D
Sbjct: 53 ENENAPASFDCRQKWP--GKAEPVRNQASCGSCWAHAASETMGFRMGIRGCYKGVMSPQD 110
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
L++C GC+GGY W + G+ TE+C PY +G P C
Sbjct: 111 LVSCESN--NMGCEGGYADRVWNWIQKKGITTEQCLPYVSGSG----------RVPTCPS 158
Query: 214 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
KC + + R+ + S NS + +M E+ NGPV F V+EDF +YKSG+Y+H
Sbjct: 159 KCKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFLNYKSGIYQHK 213
Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYWV 299
TG G H V L+GWGT ++G YW+
Sbjct: 214 TGKSKGWHHVMLMGWGT-ENGVPYWL 238
>gi|149436731|ref|XP_001513125.1| PREDICTED: cathepsin B-like [Ornithorhynchus anatinus]
Length = 211
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/134 (48%), Positives = 81/134 (60%), Gaps = 6/134 (4%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
W+AA N F + + K L G G L V + +KLP++FDAR WP C T
Sbjct: 41 WRAAHN--FPHADMSYVKRLCGT--FLNGPKLPARVGLANSDMKLPENFDARQQWPNCPT 96
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 171
I I DQG CGSCWAFGAVEA+SDR C+H +S+ V+ DLL CCG CG GC+GGYP
Sbjct: 97 IKEIRDQGSCGSCWAFGAVEAISDRVCVHTNGQVSVEVSAEDLLTCCGLECGMGCNGGYP 156
Query: 172 ISAWRYFVHHGVVT 185
AW Y+ G+V+
Sbjct: 157 TGAWTYWTKKGLVS 170
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S +IM EI +NGPV+ V+EDF
Sbjct: 331 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFF 385
Query: 264 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRGAQGRKEKFWI 433
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 129/276 (46%), Gaps = 31/276 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
+ D +I VN + GW A + ++ Y+ G L +PT + + + +
Sbjct: 129 LTDDELIHSVNSIHRLGWSARKYEEWWGRKYSEGLRLRLGTKEPTYR---VKTMTRLTNP 185
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 152
+ LP SF+A W + IS + DQG CGS W SDRF I + LS
Sbjct: 186 TDGLPASFNAVDKWSR--YISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQ 243
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG---CSHPGCEPAY--- 206
++L+C GC+GG+ +AWRY GV+ E C PY S G H G A+
Sbjct: 244 NILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDESCYPYTQSRGTCKVRHSGSLKAHGCR 301
Query: 207 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
P P V ++ L+ YS+S DI AEI+ +GPV+ + VY DF Y
Sbjct: 302 PAPG-----VDRDSLYTVGPAYSLSR------EADIKAEIFHSGPVQATMRVYRDFFSYS 350
Query: 267 SGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWV 299
G+Y+ G G H+VKL+GWG +G+ YW+
Sbjct: 351 GGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWI 386
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/289 (32%), Positives = 135/289 (46%), Gaps = 36/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLA 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
++ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 ETTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
+YK+G+Y+HIT HAVKL GWGT E +W+
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAHGQKEKFWI 433
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 127/272 (46%), Gaps = 18/272 (6%)
Query: 37 ILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDK 94
+++ SI + +N N GW A+ +F + + + K LG + ++ PV+
Sbjct: 16 LIEPSITEAINSNYANYGWSASNYSKFWGHKLEEGIKLRLGTLQPQRFVMHMNPVRRIYD 75
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
LP+ FD+ WP +S I DQG CGS WA SDRF I ++LS
Sbjct: 76 PNSLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQ 133
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C C+GGY AW Y G+V E+C PY ++ C
Sbjct: 134 HLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRIPRRGDLVT 188
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C + R SK+ AYR+ ++ DIM EI +GPV+ + VY DF YK G+Y+H
Sbjct: 189 ANCQLPTNVDRRSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGIYRH 247
Query: 273 ---ITGDVMGGHAVKLIGWGTSDDGE---DYW 298
T D G H+V+++GWG E YW
Sbjct: 248 SPISTNDRTGYHSVRIVGWGEEYSPEGLKKYW 279
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S +IM EI +NGPV+ V+EDF
Sbjct: 331 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRGAQGRKEKFWI 433
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 108/213 (50%), Gaps = 24/213 (11%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV- 151
+ S +P SFD R +PQC I+ + DQGHCGSCWAF A A DR C+ G++ S V
Sbjct: 73 EPSGSIPASFDFREEYPQC--ITPVYDQGHCGSCWAFSATSAFGDRRCMQ-GLD-SAGVP 128
Query: 152 --NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDST-GCSHPGCEPAYPT 208
C +L GC GG S W + HG T EC PY D+ S P
Sbjct: 129 YSQQYTISCDYL-DLGCAGGLSFSVWTFLTEHGTTTLECVPYTDANKDISSP-------- 179
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C C +++ R K Y N IM + +GPV+ S VY DF +Y+SG
Sbjct: 180 --CPDACADGSEI-RLVKADGCLDYSGNVTA--IMQALANDGPVQASMAVYRDFLYYRSG 234
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGED--YWV 299
VY+H+ G + HAV++IG+G +DD + YW+
Sbjct: 235 VYRHVYGSQISSHAVEIIGYGAADDEDSTPYWI 267
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRPQLIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/255 (34%), Positives = 126/255 (49%), Gaps = 27/255 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
WKA +F N T +F+ +L ++P G L + + + + +P FD R +
Sbjct: 31 WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
PQC + LDQG CGSCWAF A+ DR C G++ +S S L++C L G
Sbjct: 90 PQC--VKPALDQGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGG W + G T EC Y D G A P P QL++
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 284
+ +S S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT+DDG DYW+
Sbjct: 253 IVGYGTTDDGTDYWI 267
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 126/270 (46%), Gaps = 26/270 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLL--LGVPVKTHD 93
+++ +I +N GWKA QF T+ + F+ LG P LL +P +
Sbjct: 160 LVRPDLIHHINSG-DYGWKADNYTQFWGMTLEEGFRKRLGTLPPSHSLLNMKAIPGSSVP 218
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
+ K P+ F A AWP I LDQ +CG+ WAF +DR IH ++ LSV
Sbjct: 219 EE-KFPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRITIHSDGQITDNLSV 275
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK- 210
+L++C GC+GG AWRY HGVV+ C P F P Y + +
Sbjct: 276 QNLISC-DTGNQRGCNGGSIDGAWRYLTTHGVVSYACYPSFWKHHLDSPSENQCYVSSEY 334
Query: 211 --------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
C N+L+R HY R++S DIM EI GPV+ VYEDF
Sbjct: 335 GKNHTNGPCPNALEDSNRLYRCGSHY-----RVSSKETDIMEEIMAKGPVQAIMKVYEDF 389
Query: 263 AHYKSGVYKHI--TGDVMGGHAVKLIGWGT 290
YK G+Y+H G H+VKL+GWG+
Sbjct: 390 FLYKEGIYRHSYKAGSKWKTHSVKLLGWGS 419
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 137/281 (48%), Gaps = 31/281 (11%)
Query: 42 IIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK-SLKL 98
+I+ VN NPK GWKA N +F S F+ + ++ + + +H+ ++++
Sbjct: 33 LIEYVNRNPKFGWKAGTNHRFRSSKDIEKMFRKYIEIENIQTKHIKTI---SHNSINMEI 89
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 156
P+SFDAR W CSTI +I D+ C + WA V+++SDR CI +++ LS D ++
Sbjct: 90 PRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSARDAIS 149
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---------HPGCE---- 203
CGF GC G + Y++ +G+VT Y D +GC HP
Sbjct: 150 -CGF--SPGCFHGSEVEVLVYWITYGIVTG--GSYEDQSGCQPYPLPKCSYHPESRFLDC 204
Query: 204 --PAYPTPKCVRKCVK-KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
+ P+C +C N+ + + K Y Y + EDI EI NGPV S +V
Sbjct: 205 NNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASISVNT 264
Query: 261 DFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWVC 300
DF YKSGVY +G +++IGWG + YW+C
Sbjct: 265 DFLVYKSGVYLPTPRSRNLGWITLRIIGWGY-EGKIPYWLC 304
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/273 (32%), Positives = 128/273 (46%), Gaps = 17/273 (6%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVK 90
+ D+ I+ D +I VN W+A QF + + LG P P++
Sbjct: 122 ERDACIISDDVIYGVNRG--NSWRAYNYTQFYGKKLRDGIIYKLGTMPLSHETRRMGPIR 179
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-GMNLSL 149
+DK + P+ FDAR WP + IS +LDQG CGS WA SDRF I G +
Sbjct: 180 -YDKDIPYPRDFDARRRWP--NFISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMV 236
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
+L C GC GG+ AW + HG+V EEC PY +T P P
Sbjct: 237 LSPQVLLSCNIRRQQGCRGGHIDVAWNFARGHGLVDEECFPYKAATTSC-----PFRPKA 291
Query: 210 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
+ + R S+ Y + + DIM +I ++GPV TV++DF HY G+
Sbjct: 292 NLIEDGCRPPVRQRTSR-YKVGPPGKLATENDIMYDIMESGPVHAVMTVHQDFFHYHDGI 350
Query: 270 YKHIT-GD--VMGGHAVKLIGWGTSDDGEDYWV 299
Y+ GD + G H+V+++GWG D G+ YWV
Sbjct: 351 YRRSPYGDNTLQGLHSVRIVGWG-EDRGDKYWV 382
>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
Length = 273
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 91/233 (39%), Positives = 118/233 (50%), Gaps = 29/233 (12%)
Query: 90 KTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 146
K + K+L LP+SFDAR+ WP C+ I DQG+CGSCWA E +SDR CI G ++
Sbjct: 10 KFNPKALGLPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEID 69
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 199
LS LLAC GC+GG A+ + +GVVT C PY + C H
Sbjct: 70 AELSPFQLLACA--QGSFGCEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAP-CHH 126
Query: 200 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED----IMAEIYKNGPV-EV 254
P CE +PTP C CV + + S I P + EIY NGPV
Sbjct: 127 P-CE-VFPTPACPATCVGGSNDGVQNGKASFKVKAIVDCPSFDYGCVANEIYHNGPVSSY 184
Query: 255 SFTVYEDFAHYKSGVYKHI-----TGDVMGGHAVKLIGWGTSD----DGEDYW 298
+ +YE+F YKSGV++ G GGH VK+IGWG +D +GE Y+
Sbjct: 185 AGDIYEEFYAYKSGVFRESPSVAQRGANHGGHVVKVIGWGKADPAKGEGEGYY 237
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/260 (33%), Positives = 130/260 (50%), Gaps = 15/260 (5%)
Query: 37 ILQDSIIKEVN-ENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDK 94
+++ +++EVN + P GW+A +F T+ L LG + + PV+
Sbjct: 140 LIEPELMEEVNLQGPTLGWQAGNYSEFWGRTLRDGVELRLGTLNPSQSMYKMNPVRRIYD 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
LP+ FDAR+ WP+ IS I DQG CG+ WA + SDRF I ++ LS
Sbjct: 200 PDALPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGAEDVELSAQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C GC GGY AW + G+V +EC P+ TG + C + V
Sbjct: 258 HLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPW---TG-RNDQCRLRKRSNLNV 312
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C K R + AYR+ ++ DIM EI +GPV+ + VY+DF YK+GVY+H
Sbjct: 313 AGCRKPPNPLRQELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFVYKNGVYRH 371
Query: 273 ITGDVM---GGHAVKLIGWG 289
+ G H++++IGWG
Sbjct: 372 SRSAELHDSGYHSMRIIGWG 391
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 100/183 (54%), Gaps = 19/183 (10%)
Query: 128 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
A AV A+SDR CI G ++ LS DL++CC CG GCDGG+P AW Y+V HG+VT
Sbjct: 42 AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVT 100
Query: 186 -------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSI 230
C PY C H P C + Y TP+C RKC K + + KHY
Sbjct: 101 GGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGG 159
Query: 231 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 290
+ + + I EI GPVE ++EDF +YKSG+Y++ TG +G H V++IGWG
Sbjct: 160 ISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI 219
Query: 291 SDD 293
++
Sbjct: 220 ENE 222
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/289 (32%), Positives = 134/289 (46%), Gaps = 36/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q +I+ VN+ GW A QF T+ + FK+ LG P P LLL + T
Sbjct: 155 LVQPGLIEHVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLP-PSPLLLSMNEVTASLT 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
K+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 KTTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCAKK-RRGCNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGR 329
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 GKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 263 AHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWV 299
+YK+G+Y+HIT HAVKL GWGT E +W+
Sbjct: 385 FNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/270 (34%), Positives = 125/270 (46%), Gaps = 26/270 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 95
+++ +I +N GWKA QF T+ + F+ LG P P LL +
Sbjct: 160 LVRPDLIHHINSG-DYGWKADNYTQFWGMTLEEGFRKRLGTLP-PSHSLLNMEAIPGSSL 217
Query: 96 L--KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSV 151
L K P+ F A AWP I LDQ +CG+ WAF +DR IH ++ LSV
Sbjct: 218 LEEKFPEFFAATYAWPD--WIHDPLDQRNCGASWAFSTASVAADRIAIHSDGQITDNLSV 275
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK- 210
+L++C GC GG AWRY HGVV+ C P F P Y + +
Sbjct: 276 QNLISC-DTKNQHGCGGGNIEGAWRYLKTHGVVSYACYPSFWKHSLDSPSENHCYVSSEY 334
Query: 211 --------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
C N+L+R + HY RI+S DIM EI GPV+ VYEDF
Sbjct: 335 GKNHTNGPCPNALEDSNRLYRCASHY-----RISSKETDIMEEIMAKGPVQAIMKVYEDF 389
Query: 263 AHYKSGVYKHI--TGDVMGGHAVKLIGWGT 290
YK G+Y+H G H+VKL+GWG+
Sbjct: 390 FLYKEGIYRHSYKAGSKWKTHSVKLLGWGS 419
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 77/196 (39%), Positives = 102/196 (52%), Gaps = 24/196 (12%)
Query: 125 SCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA A E +SDR CI LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G
Sbjct: 1 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKG 60
Query: 183 VVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKN--QLWRN 224
VT Y D TGC +P CE YPT + K + +
Sbjct: 61 YVTG--GSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHK 118
Query: 225 SKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
H+ +I + + I I +G + TV+EDF HY GVY H G +GGHAV
Sbjct: 119 DLHFRTILHTPASKEAAGIPKGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAV 178
Query: 284 KLIGWGTSDDGEDYWV 299
K++GWG D+G YW+
Sbjct: 179 KMLGWGV-DNGTPYWL 193
>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
Length = 228
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 101/238 (42%), Positives = 128/238 (53%), Gaps = 33/238 (13%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-----VK 90
H L D ++ +N+ W+A N F N V K L G LG P V+
Sbjct: 3 HPLSDELVNFINKQ-NTTWQAGHN--FFNVEVSYLKKLCGT-------FLGGPKLPRRVE 52
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLS 148
D +KLP+SFDAR WP C TI I DQG CGSCWAFGAVEA+SDR CIH +N+
Sbjct: 53 FAD-DIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVE 111
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCSH-- 199
+S D+L CCG CGDGC+GGYP AW ++ G+V+ C PY C H
Sbjct: 112 VSAEDMLTCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPY-SIPPCEHHV 170
Query: 200 ----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 252
P C TP+C + C + ++ KHY S+Y ++SD +I AEIYKNGPV
Sbjct: 171 NGSRPACTGEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPV 228
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 133/288 (46%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++Q +I+ VN+ GW A QF T+ + FK+ LG + P+P L + +
Sbjct: 159 LIQPELIERVNKG-DYGWTAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEVTPSLPA 217
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 218 TTDLPEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRFTANLSPQ 275
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ C A
Sbjct: 276 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNDCAMASRSDGRG 334
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V++DF
Sbjct: 335 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHDDFF 389
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK G+Y+H+T + HA+KL GWGT E +W+
Sbjct: 390 HYKKGIYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRGAQGRKEKFWI 437
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 135/283 (47%), Gaps = 24/283 (8%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+L++CC GC+ G AW Y G+V+ C P F ++ GC A +
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRG 330
Query: 213 RKCVKK---NQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
++ K N + ++++ Y S YR++S+ +IM EI +NGPV+ V EDF HYK+G
Sbjct: 331 KRDATKPCPNNVEKSNRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG 390
Query: 269 VYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
+Y+H+T + HAVKL GWGT E +W+
Sbjct: 391 IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 125/265 (47%), Gaps = 32/265 (12%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVK---THDK 94
L +S++ VN +P + W A P+ T + + ++ +G + T +
Sbjct: 1 LAESVVDIVNNDPSSTWVATEYPR-EILTPAKMRAMIS--------QIGNGFEGEWTFAE 51
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL 154
+ P SFD R WP + +QG CGSCWA A E + R I +S DL
Sbjct: 52 NENAPASFDCRQKWP--GKAEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGVMSPQDL 109
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 214
++C GC+GGY W + G+ TE+C PY +G P C K
Sbjct: 110 VSC--ESNNMGCNGGYADRVWNWIQKKGITTEQCIPYVSGSG----------RVPTCPSK 157
Query: 215 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 274
C + + R+ + S NS + +M E+ NGPV F V+EDF +Y+SGVY+H T
Sbjct: 158 CKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFYNYRSGVYQHKT 212
Query: 275 GDVMGGHAVKLIGWGTSDDGEDYWV 299
G G H V L+GWGT ++G YW+
Sbjct: 213 GRSQGWHHVMLMGWGT-ENGVPYWL 236
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 127/272 (46%), Gaps = 18/272 (6%)
Query: 37 ILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDK 94
+++ SI + +N N GW A+ +F + + + K LG + ++ PV+
Sbjct: 142 LIEPSITEAINSNYANYGWSASNYSKFWGHKLEEGIKLRLGTLQPQRFVMHMNPVRRIYD 201
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
LP+ FD+ WP +S I DQG CGS WA SDRF I ++LS
Sbjct: 202 PNSLPREFDSEFKWP--GWMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQ 259
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C C+GGY AW Y G+V E+C PY ++ C
Sbjct: 260 HLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRIPRRGDLVT 314
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C + R SK+ AYR+ ++ DIM EI +GPV+ + VY DF YK G+Y+H
Sbjct: 315 ANCQLPTNVDRRSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGIYRH 373
Query: 273 ---ITGDVMGGHAVKLIGWGTSDDGE---DYW 298
T D G H+V+++GWG E YW
Sbjct: 374 SPISTNDRTGYHSVRIVGWGEEYSPEGLKKYW 405
>gi|60600065|gb|AAX26576.1| unknown [Schistosoma japonicum]
Length = 190
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 74/172 (43%), Positives = 94/172 (54%), Gaps = 11/172 (6%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSL- 96
L +I +N WKA +F TV + +LG P P G L ++ +L
Sbjct: 5 LSKELIHFINYEANTTWKAGPTRRFK--TVSDIRRMLGALPDPNGEQLETLCTGYELTLN 62
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
+LPKSFDAR W C +IS I DQ CGSCWAFGAVEA+SDR CI LS +L
Sbjct: 63 ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENL 122
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCE 203
++CC CG GC+GG+P SAW Y+ + G+VT D Y + GC P CE
Sbjct: 123 VSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTG--DLYNTTNGCQPYEFPPCE 171
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 132/288 (45%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVHPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
Length = 255
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 109/223 (48%), Gaps = 23/223 (10%)
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
+P++FDAR + CS I + DQG+C S WA +DR CI + LS +L
Sbjct: 26 IPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNL 85
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG------ 201
++C G GCDGG AW + G+VT E C PY + C H G
Sbjct: 86 MSC-GNEEKMGCDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRP-CDHYGDSSLTN 143
Query: 202 CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSF 256
C T C KCV KN + + H + Y + ++ + I EI GPV
Sbjct: 144 CSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTALM 203
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VYE+F YK G+YK G+++G H VKLIGWG +DG +YW+
Sbjct: 204 YVYENFMGYKKGIYKSTAGELIGYHHVKLIGWGVDEDGTEYWL 246
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 86/255 (33%), Positives = 125/255 (49%), Gaps = 27/255 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
WKA +F N T +F+ +L ++P G L + + + + +P FD R +
Sbjct: 31 WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
PQC + LDQG CG CWAF A+ DR C G++ +S S L++C L G
Sbjct: 90 PQC--VKPALDQGSCGECWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGG W + G T EC Y D G A P P QL++
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 284
+ +S S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT+DDG DYW+
Sbjct: 253 IVGYGTTDDGTDYWI 267
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 86/255 (33%), Positives = 125/255 (49%), Gaps = 27/255 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAW 108
WKA +F N T +F+ +L ++P G L + + + + +P FD R +
Sbjct: 31 WKAGMPKRFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEY 89
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDG 165
PQC + LDQG CG CWAF A+ DR C G++ +S S L++C L G
Sbjct: 90 PQC--VKPALDQGSCGGCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFG 144
Query: 166 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 225
CDGG W + G T EC Y D G A P P QL++
Sbjct: 145 CDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAH 197
Query: 226 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 284
+ +S S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++
Sbjct: 198 GYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALE 252
Query: 285 LIGWGTSDDGEDYWV 299
++G+GT+DDG DYW+
Sbjct: 253 IVGYGTTDDGTDYWI 267
>gi|403365594|gb|EJY82586.1| Cathepsin B [Oxytricha trifallax]
Length = 333
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 136/282 (48%), Gaps = 29/282 (10%)
Query: 17 VISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAA---RNPQFSNYTVGQFKHL 73
VISS T S+ + I+ + K+ + W A NP F Y F+ L
Sbjct: 26 VISSVTQHTNAGSRATVGKEIVDEIASKQQD------WDAMPPDENP-FKGYAKEDFQSL 78
Query: 74 LGVKPTPKGLLLGVP--VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
LG+ L L K + +PK++D+R + C I +LDQ C +CWAF
Sbjct: 79 LGISKRAPSLFLADSSFYKPKANGVTIPKTYDSRKIYKNC--IHGVLDQVKCSACWAFAI 136
Query: 132 VEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 189
+ +SDRFCI + ++ LS +L++C GC G A++Y G+++++C
Sbjct: 137 AQVVSDRFCIVSNSTTDVVLSYQNLISCVNPKIF-GCKIGVIDVAFQYMEKTGIMSDQCM 195
Query: 190 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS--AYRINSDPEDIMAEIY 247
PY G P C KC N +++ Y ++++ +DI A +
Sbjct: 196 PYTAQEG-------PNATIEACRTKC---NNASDSNRKYQCKKGSFKVAQGADDIKAMLV 245
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 289
G + V+F V+EDF +Y+ G+Y++ TG+++G HA KLIGWG
Sbjct: 246 DKGSIFVTFDVFEDFFNYRRGIYRYTTGELVGYHACKLIGWG 287
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 131/288 (45%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK LG + P+P L + +
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F + GC A
Sbjct: 272 NLISCCS-KNRPGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATSNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 386 HYKTGIYRHVTSANKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWI 433
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 88/152 (57%), Gaps = 16/152 (10%)
Query: 162 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPT 208
C C+GG+P SAW Y+ G+VT + C PY G P C+ PT
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEGPT 227
Query: 209 PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
P+C KC + + KHY++S I+++PE EI NGPVE FTVYEDF YKS
Sbjct: 228 PECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKS 287
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY+H TG V+GGHA+K++GWG ++G YW+
Sbjct: 288 GVYQHTTGGVLGGHAIKILGWGV-EEGTKYWL 318
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 86/151 (56%), Gaps = 18/151 (11%)
Query: 166 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKC 211
C+GGYPI AW+++V HG+VT C PY + G + P C E PTPKC
Sbjct: 14 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73
Query: 212 VRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
V C N + KH+ +AY + E I EI +GP+EV+FTVYEDF Y +G
Sbjct: 74 VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTG 133
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY H G +GGHAVK++GWG D+G YW+
Sbjct: 134 VYVHTAGKSLGGHAVKILGWGV-DNGTPYWL 163
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 113/237 (47%), Gaps = 33/237 (13%)
Query: 90 KTHDKSLKL--PKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
KT D S K+ P+ FDAR + C+ I + DQG+C S WA SDR CI
Sbjct: 16 KTVDISYKIDIPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNGQ 75
Query: 147 LS--LSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+ LS +LL+C GD GCDGG AW + G+VT E C PY
Sbjct: 76 FTDNLSAQNLLSC-----GDEEKMGCDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPY-K 129
Query: 194 STGCSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDI 242
C+H G C T C KCV KN + + H + Y + ++ + I
Sbjct: 130 IRPCNHYGNGNLKNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQI 189
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI GPV VYE+F YK G+YK G+++G H VKLIGWG DG +YW+
Sbjct: 190 QQEIMTYGPVTAFMYVYENFMGYKEGIYKSTAGELIGYHHVKLIGWGVDGDGTEYWL 246
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/233 (36%), Positives = 116/233 (49%), Gaps = 21/233 (9%)
Query: 84 LLGVPVKTHDKSLKLPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIH 142
LLG P K K L P +FDAR + C+ I + DQ C +CW + L+DR CI
Sbjct: 26 LLG-PTKPELKDL--PSNFDARQKFASCAGVIGHVRDQSACHNCWTVSSTGMLNDRVCIK 82
Query: 143 FGMNLS--LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-EECDP---YF 192
G LSV +CC G GC GG + + +HG+VT +E P
Sbjct: 83 SGGTFRDILSVGYFTSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEFKPAGQLS 142
Query: 193 DSTGC---SHPGCEPA-YPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEI 246
+ GC P C+ A Y +P C KC K + H + S R+ + P++I EI
Sbjct: 143 SADGCWPYPFPKCKHAGYSSPACQTKCTNKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEI 202
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ NGPV ++YED YK+GVY H TG G H +K+IGWG + G+DYW+
Sbjct: 203 FTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV-ESGQDYWL 254
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 135/297 (45%), Gaps = 49/297 (16%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLL--GVPVKTHD 93
+++ II+ VN GWKAA + T+ + ++ LG + + ++ + +
Sbjct: 164 LIEPDIIQAVNRG-NYGWKAANYSELYGMTLNEGIRYRLGTQRPSRTVMNMNEIQMNMDP 222
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSV 151
++ LP F++ WP I LDQG+C + WAF SDR I M LS
Sbjct: 223 QTDNLPPYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPRLSP 280
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
+L++C G GC GG AW Y GVVTE+C PY +P + TP
Sbjct: 281 QNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTEDCYPY-----------QPPHQTPAE 328
Query: 212 VRKCVKKN-----------------QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
V +C+ ++ Q + N + S YR++S+ ++IM EI NGPV+
Sbjct: 329 VGRCMMQSRSVGRGKRQATQRCPNTQNYHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQA 388
Query: 255 SFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWGTSDD----GEDYWV 299
V+EDF YK+G+YKH G H+V++ GWG + YW+
Sbjct: 389 IMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVRITGWGEDRNVDGTSRKYWI 445
>gi|38048307|gb|AAR10056.1| similar to Drosophila melanogaster CG10992, partial [Drosophila
yakuba]
Length = 174
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/180 (38%), Positives = 95/180 (52%), Gaps = 17/180 (9%)
Query: 12 LLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFK 71
LL+ S T + G + +L D I+ V K W RN ++ T G +
Sbjct: 5 LLVATAASVATLSAG-------EPSLLSDEFIELVRSKAKT-WTVGRNFD-ASVTEGHIR 55
Query: 72 HLLGVKPTPKGLLLGVPVKT-----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSC 126
L+GV P L + + ++P+ FD+R WP C TI I DQG CGSC
Sbjct: 56 RLMGVHPDAHKFALADKREVLGDLYMNSVDEIPEEFDSRKQWPNCPTIGEIRDQGSCGSC 115
Query: 127 WAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
WAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y+ G+V
Sbjct: 116 WAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIV 174
>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
Length = 246
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/249 (34%), Positives = 113/249 (45%), Gaps = 25/249 (10%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV-PVKTHD 93
++ L++S I +NE W A N S F +LG K KT+D
Sbjct: 1 AYFLEESYIDMINEVATT-WTAGVNFDPST-PEEHFVKMLGSKGVESAKQASAHEFKTND 58
Query: 94 KSLK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMN 146
+ +P++FDAR W C TI + DQG+CGSCWAFG A +DR C+ N
Sbjct: 59 VAYDNYYGYIPRTFDARKRWRHCKTIGEVRDQGNCGSCWAFGTSSAFADRLCVATDGDFN 118
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FD 193
LS ++ CC CG GC GGYPI AW+YF HG+VT E C+PY
Sbjct: 119 ELLSPEEIAFCC-HTCGFGCHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHH 177
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
G + +P +C R C L N H Y + I ++ GP+E
Sbjct: 178 HQGNNSCSDKPMEKNHRCTRMCYGDQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPIE 236
Query: 254 VSFTVYEDF 262
SF VY+DF
Sbjct: 237 ASFDVYDDF 245
>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
Length = 244
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 111/231 (48%), Gaps = 24/231 (10%)
Query: 90 KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
KT D + + +PK FDAR + C+ I + DQG+C S WA +DR CI G
Sbjct: 15 KTVDANYRTDVPKEFDARRHFVSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIATGGK 74
Query: 147 LS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 197
+ LS +L++C GC GG AW + + +G+VT E C PY + C
Sbjct: 75 FTDNLSAQNLMSCGDSEKFVGCHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRP-C 133
Query: 198 SHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAEI 246
H G C T C KCV KN + + H + Y + ++ I EI
Sbjct: 134 DHYGDSSMTNCSSFRRTQMSICREKCVNKNYKVKYEDDLHKTSVVYMTSWTNVTQIQQEI 193
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 297
GPV VYE+F YK G+YK GD++G H VKLIGWG DDG +Y
Sbjct: 194 MTYGPVTALMYVYENFMGYKEGIYKSTVGDLVGYHHVKLIGWGVDDDGNEY 244
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/260 (33%), Positives = 127/260 (48%), Gaps = 17/260 (6%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTV--GQFKHLLGVKPTPKGLLLGVPVKTHDK 94
+ + +I EVN P W+A +F+ T+ G L + P+ + + +D
Sbjct: 139 LQEPDLIDEVNAMP-LNWRARNYSEFNGRTLKDGMRLRLGTLNPSRSVYRMNAVRRIYDP 197
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVN 152
LP+ FD+R+ WP+ IS+I DQG CG+ WA + + SDRF I + LS
Sbjct: 198 E-SLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGTDAVELSAQ 254
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C GC GG+ AW + G+V E C P+ ST C T
Sbjct: 255 HLLSC-NNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKAST----ETCRLRKRTDLRS 309
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C R + AYR+ ++ DIM EI +GPV+ + VY+DF Y+SGVYKH
Sbjct: 310 AGCAPPPNPLRTELYKVGPAYRL-ANETDIMQEILTSGPVQATMRVYQDFFSYESGVYKH 368
Query: 273 -ITGDVMGG--HAVKLIGWG 289
+T ++ H+V++IGWG
Sbjct: 369 SVTAELYESDYHSVRIIGWG 388
>gi|219565128|dbj|BAH04068.1| cathepsin B [Equus caballus]
Length = 162
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/173 (41%), Positives = 94/173 (54%), Gaps = 23/173 (13%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD---- 93
L + ++ VN+ WKA N F N + K L G LG P
Sbjct: 2 LSNELVNYVNKR-NTTWKAGHN--FHNVDLSYVKRLCGT-------FLGGPKLPQRVWFA 51
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN- 152
+ + LP++FDAR WP C TI I DQG CGSCWAFGAVEA+SDR CI ++S+ V+
Sbjct: 52 EDVVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSA 111
Query: 153 -DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 204
D+L CCG CGDGC+GG+P AW ++ G+V+ +D SH GC P
Sbjct: 112 EDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVS---GGLYD----SHVGCRP 157
>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
Length = 301
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 131/267 (49%), Gaps = 33/267 (12%)
Query: 43 IKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH------DKSL 96
+KE+ + + W A + +F N TV +F+ L P L + +TH K+
Sbjct: 21 LKELQQLATS-WTPAIHDRFRNMTVDEFRARL----IPVENLRSLRTETHVSQLNLGKTK 75
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN---D 153
+LPK +D R C + + DQ CGSCWAF AV +DR C +G++ S V+
Sbjct: 76 ELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATFADRRCA-YGLD-SKQVHYSEQ 131
Query: 154 LLACCGFLCGDG-CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+ C F GDG C+GG+ + W++ GV +C YF C+
Sbjct: 132 YVVSCDF--GDGACNGGWLSNVWKFLTKTGVPKLDCLKYFSGMTGDRE---------SCI 180
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C + + + I+ D + +M + +GP++V+F VY DF +Y SGVY+H
Sbjct: 181 THCTDGSPVELYQASHVIN---YGMDLDRMMEALVYDGPLQVAFVVYSDFGYYSSGVYQH 237
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ G + GGHAV+++G+G + G YW+
Sbjct: 238 VNGMMEGGHAVEMVGYGIDESGLKYWI 264
>gi|1763661|gb|AAB58259.1| cysteine protease [Giardia intestinalis]
Length = 198
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 113/206 (54%), Gaps = 21/206 (10%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDL 154
+P+SFD R +P C I ++DQG CGSCWAF +V DR C+ G++ + S +
Sbjct: 5 VPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYV 61
Query: 155 LACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
++C GD C+GG+ + W++ G T+EC PY + C PT
Sbjct: 62 VSCDH---GDMACNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT----- 109
Query: 214 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
KC + + S Y + D +M + +GP++V+F V+ DF +Y+SGVY+H
Sbjct: 110 KCADGSSKVHLATATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHT 167
Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYWV 299
G + GGHAV+++G+GT DDG DYW+
Sbjct: 168 YGYMEGGHAVEMVGYGTDDDGVDYWI 193
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/291 (34%), Positives = 130/291 (44%), Gaps = 31/291 (10%)
Query: 27 VVSKLKLDSHI--LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGL 83
V S K S I ++ S+IK++N+ GWKA QF + + + LG P P L
Sbjct: 150 VNSHWKCSSEICLVRPSLIKQINDG-NYGWKAHNYSQFWGMNLKEGYNSRLGTFPPPAAL 208
Query: 84 LLGVPVKTHDKSLK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH 142
L PV + + P+ F A WP I LDQ +C + WAF +DR IH
Sbjct: 209 LDMKPVTENIIAEDDFPEFFVAWHEWP--GWIHDPLDQRNCAASWAFSTASVAADRIAIH 266
Query: 143 ----FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DS 194
F NLS L C GC GG AW Y +G+V+ C P F
Sbjct: 267 SKGRFTDNLS---PQHLISCDTRNQYGCKGGSITGAWSYLKKYGLVSHACYPLFWNNLHQ 323
Query: 195 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA--YRINSDPEDIMAEIYKNGPV 252
T C A + ++ C + W S H YRI+S DIM EI +NGPV
Sbjct: 324 TSCEMSSVFDAEGKRQAIQPCPNR---WEPSNHIYQCGLPYRISSQDADIMKEIKENGPV 380
Query: 253 EVSFTVYEDFAHYKSGVYKHI---TGDVMGG-----HAVKLIGWGTSDDGE 295
+ VY+DF YKSG+YKHI G H++K++GWGT D E
Sbjct: 381 QAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHSIKIVGWGTLRDAE 431
>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 309
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/253 (32%), Positives = 124/253 (49%), Gaps = 26/253 (10%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCS 112
WKA + N T FK L+ K P+G + + + T++ +P FD R +PQC
Sbjct: 31 WKAGIPERLKNLTETDFKRLVSAK-DPRGQIPTLHLIHTYESEDPIPDHFDFREEYPQC- 88
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDG- 168
I+ ++D G C S WA VEA R C++ G++ S +L+C +GC
Sbjct: 89 -ITEVIDMGTCSSSWAHSPVEAFGHRRCMN-GVDQEATRYSAQYILSCA---TTNGCLAF 143
Query: 169 -GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 227
G + +W + G+ E C Y D + E +YP P C + L
Sbjct: 144 PGQGVVSWDFIATTGIPLESCVKYTD-----YDKTESSYPCPSL---CNDNSSL----VL 191
Query: 228 YSISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
Y Y + +PE + I GP++ FTVYEDFA+Y G+Y H+ G G +V+++
Sbjct: 192 YKSDGYEGVGFNPEKLRRAIALRGPMQAMFTVYEDFAYYLEGIYSHVYGGTAGYLSVEIV 251
Query: 287 GWGTSDDGEDYWV 299
G+GTSD+G+DYW+
Sbjct: 252 GYGTSDEGQDYWI 264
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 132/288 (45%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FK-HLLGVKPTPKGLLLGVPVKTHDK 94
+++ +I++VN+ GW A QF T+ FK HL + P+P L + +
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFHLGTLPPSPMLLSMNEMTASLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P ++ GC A
Sbjct: 272 NLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLSKDQNATNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V EDF
Sbjct: 331 KRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFF 385
Query: 264 HYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +WV
Sbjct: 386 HYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWV 433
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 139/299 (46%), Gaps = 53/299 (17%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLL--GVPVKTHD 93
+++ II VN GWKAA QF ++ + ++ LG + + ++ + +K
Sbjct: 139 LIEADIIHAVNRG-NYGWKAANYSQFFGMSLDEGIRYRLGTQRPSRTVMNMNEIQMKMDP 197
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSV 151
++ LP+ F++ WP + I LDQG+C + WAF SDR I M LS
Sbjct: 198 QNDHLPRYFNSSEKWP--NKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSP 255
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
+L++C G GC GG AW Y GVVTE C PY +P P
Sbjct: 256 QNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTENCYPY-----------QPPQQAPAE 303
Query: 212 VRKCVKKNQL-----------------WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
V +C+ +++ + N + S Y+++S+ ++IM EI +NGPV+
Sbjct: 304 VGRCMMQSRAVGRGKRQATQRCPNTYNYHNDIYQSTPPYKLSSNEKEIMKEIMENGPVQA 363
Query: 255 SFTVYEDFAHYKSGVYKHITGDVM----------GGHAVKLIGWGTSDDGE----DYWV 299
V+EDF YK+G+YKH DV G H+V++ GWG D + YW+
Sbjct: 364 IMEVHEDFFVYKNGIYKHT--DVSSTKPPQYRKHGTHSVRITGWGEDKDYDGTPRKYWI 420
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 90/147 (61%), Gaps = 14/147 (9%)
Query: 166 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 213
C+GGYP AW ++ G+V+ C PY S P C TPKC +
Sbjct: 87 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSK 146
Query: 214 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H
Sbjct: 147 ICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH 206
Query: 273 ITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+TG++MGGHA++++GWG ++G YW+
Sbjct: 207 VTGEMMGGHAIRILGWGV-ENGTPYWL 232
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 90/152 (59%), Gaps = 16/152 (10%)
Query: 162 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 208
CG GC+GGYP +AW+++ +VT + C PY+ C H P C PT
Sbjct: 3 CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPP-CEHHTVGPLPNCTGIKPT 61
Query: 209 PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
P+C + C + Q + KH+ Y I+SD I EIYKNGPVE F+VY DF YKS
Sbjct: 62 PECAKTCREGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKS 121
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
GVY+ + +++GGHA++++GWGT +DG YW+
Sbjct: 122 GVYQRHSEEMLGGHAIRILGWGT-EDGVPYWL 152
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/272 (33%), Positives = 132/272 (48%), Gaps = 31/272 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
++Q+ ++ ++ ++ + W QF T+ +H LG L V+ +
Sbjct: 21 LIQEDLLMKI-QSGRYTWTGRNYSQFWGRTLKDGIRHRLGT------LFPERSVQNMNEM 73
Query: 94 --KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 149
K +LP SFDAR WP I I DQG C S WA +DR + N++L
Sbjct: 74 IVKPRELPTSFDARQKWP--DFIHPIQDQGDCASSWAQSTAATSADRLALITEGRQNVAL 131
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
S L+C GC+GGY AW Y GVV+EEC PY T C
Sbjct: 132 SAQQFLSCNQHR-QKGCEGGYLDRAWWYIRKFGVVSEECYPYISGTTRKPEICYMQKSKH 190
Query: 210 KCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
R+C + NS+ Y + +YR++S +DIM+EI NGPV+ +F V+ DF + +G
Sbjct: 191 ANGRQCPSGHP---NSRVYRTTPSYRVSSREQDIMSEILTNGPVQATFRVHGDF--FIAG 245
Query: 269 VYKH---ITGDVMGGHAVKLIGWGTSDDGEDY 297
VYKH + ++ G H+V+L+GW GEDY
Sbjct: 246 VYKHLPTVGEEIEGYHSVRLLGW-----GEDY 272
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 114/241 (47%), Gaps = 41/241 (17%)
Query: 90 KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
KT D + K +PK FDAR + C+ I + DQG+C S WA +DR CI
Sbjct: 54 KTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGK 113
Query: 147 LS--LSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+ LS +L++C GD GCDGG AW + + G+VT E C PY
Sbjct: 114 FTDNLSAQNLMSC-----GDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPY-K 167
Query: 194 STGCSHPG------CEPAYPTPK--CVRKCVKKN-------QLWRNSKHYSISAYRINSD 238
+ C H G C T C KCV KN L++ S Y S ++
Sbjct: 168 NRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW----TN 223
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ I EI GPV VYE+F YK GVYK G+++G H VKLIGWG + G +YW
Sbjct: 224 VKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYW 283
Query: 299 V 299
+
Sbjct: 284 L 284
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 137/287 (47%), Gaps = 29/287 (10%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ ++ VN GW+A+ QF T+ + ++ LG +KP + + D+
Sbjct: 192 LINGDMMDAVNRG-NYGWRASNYSQFWGMTLDEGIQYRLGTIKPPTSVMNMNELQMNMDE 250
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
+ LP F+A W I LDQG+C WAF SDR IH M +LS
Sbjct: 251 NDVLPSYFNAADKW--SGMIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQ 308
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-----YP 207
+LL+C GC+GG AW + GVVT+EC P F + +H PA
Sbjct: 309 NLLSC-NTRHQQGCNGGRIDGAWWFLRRRGVVTDECYP-FSNQETNHSPNAPACMMHSRS 366
Query: 208 TPKCVRKCVKKNQLWR---NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
T + R+ + + R N + S AYR++S+ ++IM E+ +NGPV+ V+EDF
Sbjct: 367 TGRGKRQAIARCPNPRSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFM 426
Query: 265 YKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DG--EDYWV 299
Y++G+Y+H G H+VK+ GWG DG + YW+
Sbjct: 427 YRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDGSNQKYWI 473
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 111/233 (47%), Gaps = 32/233 (13%)
Query: 98 LPKSFDARSAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP FDAR + C I + DQG CG+CWA E L+DR CI + LS +
Sbjct: 33 LPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLNDRLCIKSSGKIQEILSAGYV 92
Query: 155 LACC----GFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPY------ 191
+CC G L GC+GG + A + HGVVT + C PY
Sbjct: 93 TSCCNPAHGCLHAKGCNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCN 152
Query: 192 -FDSTGCSHPGCEPA--YPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEI 246
+ G +P C+ P P C C K + H + S ++ +D + I EI
Sbjct: 153 HVPTEGTGYPKCKDVVQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEI 212
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+ NGPV +F +Y+DF +YKSGVY T +V H +K+IGWG +D +YW+
Sbjct: 213 FDNGPVFSAFEMYKDFRYYKSGVYVPTTKEVDCLHVIKIIGWG-ADSVREYWL 264
>gi|403355865|gb|EJY77523.1| Cathepsin B [Oxytricha trifallax]
Length = 299
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 113/229 (49%), Gaps = 29/229 (12%)
Query: 82 GLLLGVPVKTHDKSLK------LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 135
G LG+ +++ K LP S+D R+A P C+ +L+Q CGSCW+F A L
Sbjct: 54 GTALGIESSPDNQNTKKKLTTTLPSSYDYRTAHPGCT--HAVLNQQSCGSCWSFAATSML 111
Query: 136 SDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 193
DR C+H +N+ LS D+++C GC GG+ Y V HGVVT +C Y
Sbjct: 112 QDRLCLHSNGAVNVQLSQQDMVSC--DFDNAGCSGGWLSHTINYLVVHGVVTSQCLAYAS 169
Query: 194 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS--ISAYRINSDPEDIMAEIYKNGP 251
G +C +C N + K Y ++ ++ + E++M EIY NGP
Sbjct: 170 VDGAGR----------ECSFRCDDANTEY---KKYGCKFNSLKMTTSKEEMMEEIYLNGP 216
Query: 252 VEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V V F VY DF Y G Y+ + + GGHAV + GWG + G YW+
Sbjct: 217 VMVGFIVYSDFMSYGGGYYEVSPSASISGGHAVIVHGWGY-NGGRLYWI 264
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 131/288 (45%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I+ VN GW A QF T+ + +K LG + P+P L + T
Sbjct: 147 LVRPELIENVNTR-DYGWTAHNYSQFWGMTLEEGYKFRLGTLPPSPTLLSMNEMTVTLPS 205
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP+ F + WP LDQ +C + WAF +DR I + LS
Sbjct: 206 QTDLPEFFISSYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSKGRYTDNLSPQ 263
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC GG AW Y G+V+ C P F ++ GC+ A
Sbjct: 264 NLISCC-VKNRHGCKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMASRSDGRG 322
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 323 KRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 377
Query: 264 HYKSGVYKHITG--------DVMGGHAVKLIGW----GTSDDGEDYWV 299
HYKSG+Y+HI + HAVKL GW G E +W+
Sbjct: 378 HYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQGKKEKFWI 425
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/285 (32%), Positives = 128/285 (44%), Gaps = 48/285 (16%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 95
++Q+ I+K VN + W A F T+ ++ LG K + + K
Sbjct: 125 LIQEDILKRVNAG-RYTWSARNYSNFWGRTLEDGMRYRLGTLFPDKSVQNMNEILM--KP 181
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVND 153
+LP SFDAR WP I + DQG C S W+ +DR I +N+ LS
Sbjct: 182 RELPSSFDAREKWPL--YIHPVRDQGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQ 239
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
LL+C GC+GGY AW Y GVV+E C PY +S PG
Sbjct: 240 LLSCNQHR-QRGCEGGYLDRAWWYIRKLGVVSELCYPY-ESGATQQPG------------ 285
Query: 214 KCVKKNQLWRNSKH------------YSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYE 260
+C +R H Y ++ YR++S +DIM EI NGPV+ +F VYE
Sbjct: 286 ECRIPKSAYRTGAHIDCPSGAADPSVYRMTPPYRVSSREQDIMTEIITNGPVQATFLVYE 345
Query: 261 DFAHYKSGVYKHI--------TGDVMGGHAVKLIGWGTSDDGEDY 297
DF Y GVY+H+ V G H+V++IGW GEDY
Sbjct: 346 DFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGW-----GEDY 385
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 135/297 (45%), Gaps = 49/297 (16%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLL--GVPVKTHD 93
+++ +I VN GW+AA QF T+ + ++ LG + K ++ + +
Sbjct: 142 LIEPDVISAVNRG-NYGWRAANYSQFYGMTLDEGIRYRLGTQRPAKTIMNMNEIQMNMDP 200
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSV 151
+ +LP F++ WP I LDQG+C + WAF SDR I M LS
Sbjct: 201 ERDQLPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSP 258
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
+L++C G GC GG AW + GVVTE+C PY P TP
Sbjct: 259 QNLISCDTRNQG-GCTGGRIDGAWWFLRRRGVVTEDCYPY-----------RPPQQTPAE 306
Query: 212 VRKCVKKNQL-----------------WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 254
+ +C+ +++ ++N + S YR++++ ++IM EI NGPV+
Sbjct: 307 LGRCMMQSRSVGRGKRQATQRCPNTNNYQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQA 366
Query: 255 SFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWGTSD--DG--EDYWV 299
V+EDF YKSG+YKH G H+VK+ GWG DG YW+
Sbjct: 367 IMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDGAKRKYWI 423
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 132/287 (45%), Gaps = 33/287 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I+ +N+ GW A QF T+ + F LG + P+P L +
Sbjct: 155 LVRPELIEHINKG-DYGWTAENYSQFWGMTLEEGFTFRLGTLAPSPMLLSMNEVTAALPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
LP+ F A WP LDQ +C + WAF +DR I ++LS
Sbjct: 214 KTDLPEFFIASYKWP--GWTHDPLDQKNCAASWAFSTASVAADRIAIQSNGRYTVNLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHP----GCEP 204
+L++CC GC GG AW Y G+V+ C P F + GC+ G
Sbjct: 272 NLISCC-LKHRYGCSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGCAMASRSDGRGK 330
Query: 205 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
+ T C K N++++ S YR++S+ IM EI KNGPV+ V+EDF +
Sbjct: 331 RHATTPCPNNIEKSNRIYQCS-----PPYRVSSNETQIMKEIMKNGPVQAIMQVHEDFFY 385
Query: 265 YKSGVYKHITGDV--------MGGHAVKLIGWGT----SDDGEDYWV 299
YK+G+Y+H+T + + HAVKL GWGT E +W+
Sbjct: 386 YKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKGRKEKFWI 432
>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 260
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 82/245 (33%), Positives = 111/245 (45%), Gaps = 29/245 (11%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
++ LQ S I +NE + WKA N P S + + GV+ K K
Sbjct: 21 AYFLQKSYIDTINE-VASTWKAGVNFDPNTSQEDIVKLLGSTGVESAMKAS--ANEFKMD 77
Query: 93 DKSLK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GM 145
D + P++FDAR W C TI + DQGHCGSCWAFG A +DR C+
Sbjct: 78 DVAYNKLYGYTPRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDF 137
Query: 146 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 192
N LS ++ CC CG GC+GG PI AW+YF HG+VT E C+PY
Sbjct: 138 NELLSAEEITFCC-HTCGFGCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPR 196
Query: 193 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
D G + +P +C R C L +R Y+ Y + I ++ GP
Sbjct: 197 DDKGKNTCAGKPREKNHRCTRMCYGNQDLDYREDHRYTRDFYYLTYGS--IQKDVMTYGP 254
Query: 252 VEVSF 256
+E +F
Sbjct: 255 IEATF 259
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 54/67 (80%), Positives = 61/67 (91%)
Query: 234 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
R +SDP IM E+YKNGPVEV+FTVYEDFAHYKSGVYKH+TGD +GGHAVKLIGWGTS+D
Sbjct: 2 RGSSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSED 61
Query: 294 GEDYWVC 300
GEDYW+
Sbjct: 62 GEDYWLL 68
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 131/288 (45%), Gaps = 23/288 (7%)
Query: 32 KLDSHILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPV 89
+ D+ +++ I+ +N N + GW A + F + + LG K +L P+
Sbjct: 117 EADACLVEPEAIQAINGNSAQFGWTAGNHSDFWGRKLEDGLVYRLGTLEPEKFVLAMHPI 176
Query: 90 KTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--L 147
K LP SFD R W T+ + DQG CG+ WAF +DR I +
Sbjct: 177 KQKYDRNTLPMSFDGRIEWR--DTLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVY 234
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 205
LS+ +LLAC GC+GG+ AW Y GVV EEC PY C+
Sbjct: 235 PLSMQNLLAC-NNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVPRR 293
Query: 206 --YPTPKCV------RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
T KC RK + ++ R S AYRI +DIM EI ++GPV+ +
Sbjct: 294 GNLATMKCQLVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFEDDIMNEILQHGPVQATMR 353
Query: 258 VYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGED---YWV 299
V+ DF Y+ GVY++ + G H+V+++GWG + YW+
Sbjct: 354 VHPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWL 401
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 136/297 (45%), Gaps = 40/297 (13%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+IL ++ + A+ +VS+ +L I+ +N + W AA +F N T +F+
Sbjct: 1 MILALLLAVVCAKPLVSRAELRR-------IQALNPS----WVAAMPKRFENVTEDEFRG 49
Query: 73 LLGVKP----TPKGLLLGVPVK-THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
+L + P G + P+K +D + LP FD R +P C +S + DQG CG CW
Sbjct: 50 ML-INPDRLKARSGSMPSAPLKEINDPTDPLPAQFDFRDEYPHC--VSPVFDQGSCGGCW 106
Query: 128 AFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
AF A+ R C G++ + S L++C GC GG W + G
Sbjct: 107 AFSAIGMFGSRRCA-VGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWSFLTQTGAT 163
Query: 185 TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RINSDPEDIM 243
T EC Y D C PT C +Q+ + Y Y +++ IM
Sbjct: 164 TAECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQVSKSVPAIM 210
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWV 299
+ GPV+ VY D +Y GVY+H G + G HA++++G+GT+DDG DYW
Sbjct: 211 QMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWT 267
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 119/270 (44%), Gaps = 32/270 (11%)
Query: 54 WKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 112
W A QF T+ + FK+ LG P LL V + LP+ F A WP
Sbjct: 121 WTAQNYSQFWGMTLEEGFKYRLGTLPPSPMLLSMNEVTAVPAIIDLPEFFVAYYKWP--G 178
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 170
LDQ +C + WAF +DR I +LS +L++CC GC G
Sbjct: 179 WTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCSSGS 237
Query: 171 PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQL 221
AW Y G+V+ C P+ ++ C A + T C K N++
Sbjct: 238 IDRAWWYLRKRGLVSHACYPFLKDQNTTNNACAMASRSDGRGKRHATKPCPNNIEKSNRI 297
Query: 222 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG------ 275
++ S YR++S+ +IM EI NGPV+ V+EDF HYKSG+Y+H+T
Sbjct: 298 YQCS-----PPYRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSE 352
Query: 276 --DVMGGHAVKLIGWGT----SDDGEDYWV 299
+ HAVKL GWGT E +W+
Sbjct: 353 KYQKLQTHAVKLTGWGTLRGAQGRKEKFWI 382
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 114/241 (47%), Gaps = 41/241 (17%)
Query: 90 KTHDKSLK--LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
KT D + K +PK FDAR + C+ I + DQG+C S WA +DR CI
Sbjct: 18 KTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGK 77
Query: 147 LS--LSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFD 193
+ LS +L++C GD GCDGG AW + + G+VT E C PY +
Sbjct: 78 FTDNLSAQNLMSC-----GDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKN 132
Query: 194 STGCSHPG------CEPAYPTPK--CVRKCVKKN-------QLWRNSKHYSISAYRINSD 238
C H G C T C KCV KN L++ S Y S ++
Sbjct: 133 RP-CDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW----TN 187
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ I EI GPV VYE+F YK GVYK G+++G H VKLIGWG + G +YW
Sbjct: 188 VKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYW 247
Query: 299 V 299
+
Sbjct: 248 L 248
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 124/267 (46%), Gaps = 23/267 (8%)
Query: 53 GWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 111
GW A QF T+ + ++ LG ++ + + + LP F+A WP
Sbjct: 175 GWTAGNYSQFWGMTLDEGIQYRLGTAKPSSSVMNMNEIHVNMNNDILPSHFNAAEKWP-- 232
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 169
+ LDQG+C WAF SDR I M SLS +LL+C GC GG
Sbjct: 233 GLVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGG 291
Query: 170 YPISAWRYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNS 225
AW Y GVV+E C P+ ++ G S P + + R+ NQ + ++
Sbjct: 292 RVDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSN 351
Query: 226 KHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GD 276
+ Y S AYR+ S +DIM E+Y+NGPV+ V+EDF YKSG+Y+
Sbjct: 352 EIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHR 411
Query: 277 VMGGHAVKLIGWGTS--DDGE--DYWV 299
G H+VK+ GWG DG+ YW+
Sbjct: 412 RHGTHSVKITGWGEERGRDGQTHKYWL 438
>gi|414886871|tpg|DAA62885.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 129
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/89 (61%), Positives = 69/89 (77%), Gaps = 2/89 (2%)
Query: 34 DSH--ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKT 91
D+H I+Q+ II+ VN +P AGW A+RNP FSNYT+ QFKH+LGVKP P+ L VPVKT
Sbjct: 27 DNHMRIIQEDIIETVNNHPSAGWTASRNPYFSNYTIAQFKHILGVKPAPQNALSNVPVKT 86
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQ 120
+ +SL+LPK FDARSAW +CSTI IL +
Sbjct: 87 YSRSLELPKEFDARSAWSRCSTIGNILGR 115
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 132/289 (45%), Gaps = 37/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
+++ +I++VN+ GW A QF T+ FK LG P P +LL + T
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSPMLLSMNEMTXPLP 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASRSDGR 328
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S +IM EI +NGPV+ V EDF
Sbjct: 329 GKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDF 383
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 384 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWI 432
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 132/289 (45%), Gaps = 37/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
+++ +I++VN+ GW A QF T+ FK LG P P +LL + T
Sbjct: 155 LVRPELIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSPMLLSMNEMTAPLP 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASRSDGR 328
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S +IM EI +NGPV+ V EDF
Sbjct: 329 GKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDF 383
Query: 263 AHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 384 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWI 432
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 127/260 (48%), Gaps = 15/260 (5%)
Query: 37 ILQDSIIKEVN-ENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDK 94
+++ +++E++ + P GW+A +F T+ L LG + + PV+
Sbjct: 140 LIEPELMEEIHLQGPTLGWQAGNYSEFWGRTLKDGVQLRLGTLNPSQSVYKMNPVRRIYD 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP+ F++R+ WP+ IS I DQG CG+ WA + SDRF I + LS
Sbjct: 200 PDALPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSAQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C GC GGY AW + G+V EEC P+ TG + C +
Sbjct: 258 HLLSC-NNRGQQGCKGGYLDRAWLFMRKFGLVDEECYPW---TG-RNDQCRLRKRSNLKT 312
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C R + AYR+ ++ DIM EI +GPV+ + VY+DF Y+SGVY+H
Sbjct: 313 AGCQNPPNSLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFVYQSGVYRH 371
Query: 273 ITGDVM---GGHAVKLIGWG 289
+ G H+V++IGWG
Sbjct: 372 SRSAELHDSGYHSVRIIGWG 391
>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 261
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/277 (32%), Positives = 126/277 (45%), Gaps = 42/277 (15%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI V L ++ LQ I +NE WKA N F T
Sbjct: 1 MARVLMLLSVIF-------VSFYLTEQAYFLQKDFIDNINERATT-WKAGVN--FDPDTP 50
Query: 68 GQ-FKHLLGVK----PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRI 117
+ F +LG K P + + KTHD + ++P+ FDAR W +C TI +
Sbjct: 51 KEHFLKMLGSKGVQIPNKHNIHM---YKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAV 107
Query: 118 LDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAW 175
DQG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 108 RDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAW 166
Query: 176 RYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 222
F G+VT E C+PY +D+ G + +P +C R C L
Sbjct: 167 ERFKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLD 226
Query: 223 RNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
+ H Y+ +Y + I ++ GP+E SF V
Sbjct: 227 FDEDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDV 261
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 127/260 (48%), Gaps = 15/260 (5%)
Query: 37 ILQDSIIKEVN-ENPKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDK 94
+++ +++E+N + P GW+A+ +F T+ + L LG + + PV+
Sbjct: 198 LIESELMEELNLQGPTLGWQASNYSEFWGRTLLEGVELRLGTLNPSQSVYKMNPVRRIYD 257
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP+ FD+R+ W + IS + DQG CG+ WA + +DRF I + LS
Sbjct: 258 PDALPREFDSRTRWSR--DISNVHDQGWCGASWAISTADVATDRFSIMSKGAEDAELSAQ 315
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C GC GGY AW + G+V ++C P+ G C+
Sbjct: 316 HLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKDCYPWTGKNG----QCKLRKRNNLQA 370
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C K R + AYR+ ++ DIM EI +GPV+ + VY+DF YK+G+Y+H
Sbjct: 371 AGCRKPPNPLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFVYKNGIYRH 429
Query: 273 ITGDVM---GGHAVKLIGWG 289
+ G H+V++IGWG
Sbjct: 430 SQSAELHDSGYHSVRIIGWG 449
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 136/307 (44%), Gaps = 39/307 (12%)
Query: 21 QTFAEGVVSKLKLDSHILQDSI--IKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK 77
Q + EG V K +S I++VN+ GW A QF T+ FK LG
Sbjct: 125 QHYEEGSVIKENCNSXXXXXXXXXIEQVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTL 183
Query: 78 PTPKGLLLGVPVKTHD--KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEAL 135
P P +LL + T + LP+ F A WP LDQ +C + WAF
Sbjct: 184 P-PSPMLLSMNEMTAPLPATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVA 240
Query: 136 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 193
+DR I +LS +L++CC GC+ G AW Y G+V+ C P F
Sbjct: 241 ADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFK 299
Query: 194 STGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA 244
++ GC A + T C K N++++ S YR++S +IM
Sbjct: 300 DQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMK 353
Query: 245 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SD 292
EI +NGPV+ V EDF HYK+G+Y+H+T + HAVKL GWGT
Sbjct: 354 EIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQG 413
Query: 293 DGEDYWV 299
E +W+
Sbjct: 414 RKEKFWI 420
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 135/297 (45%), Gaps = 40/297 (13%)
Query: 13 LILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKH 72
+IL ++ + A+ +VS+ +L I+ +N W AA +F N T +F+
Sbjct: 1 MILALLLAVVCAKPLVSRAELRR-------IQALNPP----WVAAMPKRFENVTEDEFRG 49
Query: 73 LLGVKP----TPKGLLLGVPVK-THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW 127
+L + P G + P+K +D + LP FD R +P C +S + DQG CG CW
Sbjct: 50 ML-INPDRLKARSGSMPSAPLKEINDPTDPLPAQFDFRDEYPHC--VSPVFDQGSCGGCW 106
Query: 128 AFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV 184
AF A+ R C G++ + S L++C GC GG W + G
Sbjct: 107 AFSAIGMFGSRRCA-VGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWSFLTQTGAT 163
Query: 185 TEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RINSDPEDIM 243
T EC Y D C PT C +Q+ + Y Y +++ IM
Sbjct: 164 TAECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQLSKSVPAIM 210
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWV 299
+ GPV+ VY D +Y GVY+H G + G HA++++G+GT+DDG DYW
Sbjct: 211 QMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDGTDYWT 267
>gi|437323|gb|AAB00354.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 133
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 88/178 (49%), Gaps = 53/178 (29%)
Query: 125 SCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA A E +SDR CI LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G
Sbjct: 1 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKG 60
Query: 183 VVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI 242
VT Y D TGC YP P
Sbjct: 61 YVTG--GSYQDKTGCK------PYPYP--------------------------------- 79
Query: 243 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-GGHAVKLIGWGTSDDGEDYWV 299
P EV+FTVYEDF HY GVY H G + GGHAVK++GWG D+G YW+
Sbjct: 80 --------PFEVAFTVYEDFEHYSGGVYVHTAGASLGGGHAVKMLGWGV-DNGTPYWL 128
>gi|403359042|gb|EJY79178.1| Cysteine protease [Oxytricha trifallax]
Length = 366
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 120/267 (44%), Gaps = 22/267 (8%)
Query: 36 HILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKS 95
++ +S I N P AG++ N ++N+T+ K L +G D+
Sbjct: 45 QVIDESQILVHNGQPNAGFQQGANSFYTNWTLSNAKSLFQ-NSLSDTQNIGPCKSKDDEE 103
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLL 155
+P+ +D R +P C + +++QG+C S + A+ ++DR C + LS +LL
Sbjct: 104 TIIPEKYDWREVYPDC--VQPVVNQGNCSSSYITAALSTVADRICQTTKKPIQLSAQELL 161
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 215
C CDGGY + + G + E+C PY G +C
Sbjct: 162 DCDK--SSYQCDGGYVSRTFNWGKRKGFIPEQCYPYTGVVG-------------ECEDDH 206
Query: 216 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 275
++ N+ N+ Y + Y + SD + EI KNGPV +Y DF YK GVY H T
Sbjct: 207 LETNECRVNNMFYRVIDYCLASDELGLKKEILKNGPVVAQMVIYTDFLTYKEGVY-HRTE 265
Query: 276 DVM---GGHAVKLIGWGTSDDGEDYWV 299
D G H VK++GW DG D+W+
Sbjct: 266 DAFKFNGQHVVKIVGWDRQGDGNDFWI 292
>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
Length = 238
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/243 (33%), Positives = 109/243 (44%), Gaps = 29/243 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
LQ S I +NE + WKA N P S + + GV+ K K D
Sbjct: 1 FLQKSYIDTINE-VASTWKAGVNFDPNTSQEDIVKLLGSTGVESAMKAS--ANEFKMDDV 57
Query: 95 SLK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNL 147
+ P++FDAR W C TI + DQGHCGSCWAFG A +DR C+ N
Sbjct: 58 AYNKLYGYTPRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNE 117
Query: 148 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDS 194
LS ++ CC CG GC+GG PI AW+YF HG+VT E C+PY D
Sbjct: 118 LLSAEEITFCC-HTCGFGCNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDD 176
Query: 195 TGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 253
G + +P +C R C L +R Y+ Y + I ++ GP+E
Sbjct: 177 KGKNTCAGKPREKNHRCTRMCYGNQDLDYREDHRYTRDFYYLTYGS--IQKDVMTYGPIE 234
Query: 254 VSF 256
+F
Sbjct: 235 ATF 237
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 90/279 (32%), Positives = 138/279 (49%), Gaps = 22/279 (7%)
Query: 33 LDSHILQDSIIKEVNENPKAGWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVK 90
+D++IL D++ + N + GW A +F Y G + LG + + +L P+K
Sbjct: 133 VDTYIL-DTLRHQAN---RFGWSAGNYSEFWGRRYDEG-LQLRLGTLHSKRKILQMKPLK 187
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS-- 148
+ KL +S+DAR W + IS +DQG CG+ WA V+ +DRF I +S
Sbjct: 188 AAFQRGKLRRSYDAREVWG--NYISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDV 245
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYP 207
LS LL+C L GC GG+ AW + G++TEEC P+ + C+ P +
Sbjct: 246 LSPQHLLSC-NNLNQQGCQGGHLTRAWNWIRKFGLITEECYPWQGRMSTCAVPK-KKKET 303
Query: 208 TPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
+C + N ++ + + YR+ ++ E IM EI +GPV+ V DF YK
Sbjct: 304 MAQCPSRVRSNNDRTTKTRLHRVGPVYRVATE-EGIMHEILTSGPVQAVMKVSRDFFMYK 362
Query: 267 SGVYK---HITGDVMGGHAVKLIGWGTSDDGE---DYWV 299
SGVYK +G G H+V+++GWG G YW+
Sbjct: 363 SGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWI 401
>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 280
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 103/223 (46%), Gaps = 30/223 (13%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP +FDAR WP C +I I +QG+C S +A A++DR CIH N +S ++
Sbjct: 63 LPINFDARKRWPNCPSIGHIYNQGNCRSSYAISVASAVTDRICIHSNETKNPIMSAQQII 122
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----------DSTGC 197
+CC +LCG GCDGG +W ++ HG V+ + C PY C
Sbjct: 123 SCC-YLCGYGCDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRHSC 181
Query: 198 SHPGCEPAYPTPKCVRKCVKKNQLWR-NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+ E TP C KC N + Y Y++ P M EI+ NGP+ F
Sbjct: 182 TTYNRE---ETPACEIKCNNPNYYSSFKTDIYKGKYYQVY--PFMAMKEIFDNGPITTQF 236
Query: 257 TVYEDFAHYKSGVYKH---ITGDVMGGHAVKLIGWGTSDDGED 296
+Y D YKSGVY++ GD K+IGWG + D
Sbjct: 237 YMYRDLIDYKSGVYQYDEGFYGDFFTVQGXKIIGWGEENGDPD 279
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 131/289 (45%), Gaps = 37/289 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD-- 93
+++ +I+ VN+ GW A QF T+ FK LG P P +LL + T
Sbjct: 155 LVRPELIEHVNKG-DYGWTAQNYSQFWGMTLEDGFKFRLGTLP-PSPMLLSMNEMTAPLP 212
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 ATTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSP 270
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------ 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 271 QNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASRSDGR 328
Query: 206 ---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 262
+ T C K N++++ S YR++S +IM EI +NGPV+ V EDF
Sbjct: 329 GKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQVREDF 383
Query: 263 AHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWV 299
HYK+G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 384 FHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWI 432
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 119/260 (45%), Gaps = 14/260 (5%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 95
++ II E+N GW A +F T K LG + +PV H
Sbjct: 172 LMDQEIINEINYLESPGWIARNYSKFWGRTFDDGLKLRLGTINPSQSTRQMLPVTRHYNP 231
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
LP+ FD+R W + I+ + DQG CG+ WA V+ SDRF I + LS
Sbjct: 232 NDLPREFDSRIQWG--NDITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQH 289
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
L++C GC GGY AW + GVV E+C P+ C
Sbjct: 290 LISC-NNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWLSG---RSDKCRIPRRGKLSDA 345
Query: 214 KCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C ++N ++ Y + AYR+ ++ DIM EI +GPV+ + V+ DF HY+SG+Y H
Sbjct: 346 GCQRRNSYNLRNEMYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVHRDFFHYESGIYVH 404
Query: 273 ---ITGDVMGGHAVKLIGWG 289
G H+V+++GWG
Sbjct: 405 SRPFDTRQSGYHSVRIVGWG 424
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 76/196 (38%), Positives = 99/196 (50%), Gaps = 27/196 (13%)
Query: 119 DQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWR 176
DQ CGSCWAFG EA +DR CI + LS ++ AC F GC GG P SAW
Sbjct: 1 DQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACTLFF---GCGGGDPYSAWS 57
Query: 177 YFVHHGVVT-------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR 223
+ G+ T + C PY D C+H + YP KC + +
Sbjct: 58 WVHDKGIATGGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYP--KCPKVSCSGDD--- 111
Query: 224 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 283
+H+ + + + D I +GPV SFTVYEDF Y+SGVYKH +G +GGHAV
Sbjct: 112 --RHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAV 169
Query: 284 KLIGWGTSDDGEDYWV 299
K+IGWG G+ YW+
Sbjct: 170 KIIGWGEK-SGQAYWL 184
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/248 (33%), Positives = 122/248 (49%), Gaps = 27/248 (10%)
Query: 61 QFSNYTVGQFKHLLGVKP----TPKGLLLGVPV-KTHDKSLKLPKSFDARSAWPQCSTIS 115
+F N T +F+ +L ++P G L + + + + +P FD R +PQC +
Sbjct: 4 RFENVTEDEFRSML-IRPDRLRARSGSLPPISITEVQELVDPIPPQFDFRDEYPQC--VK 60
Query: 116 RILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPI 172
LDQG CG CWAF A+ DR C G++ +S S L++C L GCDGG
Sbjct: 61 PALDQGSCGECWAFSAIGVFGDRRCA-MGIDKEAVSYSQQHLISCS--LENFGCDGGDFQ 117
Query: 173 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 232
W + G T EC Y D G A P P QL++ + +S
Sbjct: 118 PTWSFLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS- 169
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTS 291
S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++++G+GT+
Sbjct: 170 ---KSVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTT 225
Query: 292 DDGEDYWV 299
DDG DYW+
Sbjct: 226 DDGTDYWI 233
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 91/284 (32%), Positives = 129/284 (45%), Gaps = 37/284 (13%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTH--DKSLKL 98
+I+ +N+ GW A QF T+ + FK LG P P LLG+ T + L
Sbjct: 160 LIEHINKG-DYGWTAQNYSQFWGMTLEEGFKFRLGTLP-PSPALLGMNEVTAALPAKIDL 217
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLA 156
P+ F A WP LDQ +C + WAF +DR I +LS +L++
Sbjct: 218 PEFFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNLIS 275
Query: 157 CCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YP 207
CC GC GG AW Y G+V+ C P F ++ GC A +
Sbjct: 276 CCARK-RHGCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATN-GCAMASRSDGRGKRHA 333
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
T C K N++++ S YR++S+ IM EI +NGPV+ V+EDF YK+
Sbjct: 334 TTPCPNHIEKSNRIYQCS-----PPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKT 388
Query: 268 GVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWV 299
G+Y+H+T + HAVKL GWGT E +W+
Sbjct: 389 GIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKFWI 432
>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
Length = 311
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 119/274 (43%), Gaps = 48/274 (17%)
Query: 58 RNPQFSNYTVGQFKHLLGVKPTPKGLL-----LGVPVKT--------------------- 91
+NP N+T Q K +LGVK TP G P KT
Sbjct: 19 KNP-MKNFTTEQLKKILGVK-TPAGYFDANYGQQSPSKTTSAYTFSAPKSPVSARGTSGT 76
Query: 92 ----HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL 147
+ ++P S+D R+ +P C +RI DQ CGSCWAF L R+C+
Sbjct: 77 DYLNRQVAKQMPSSYDVRTVYPMCE--NRIKDQAQCGSCWAFATTNVLEYRYCMATKGKK 134
Query: 148 --SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 205
LS +L++C GCDGGY + Y GV TE+C PY G
Sbjct: 135 YPELSPQNLISCFNSASW-GCDGGYIDQTFLYLEMMGVNTEQCMPYKSGDG--------- 184
Query: 206 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
C KC L+ N + + + + ++ GP+ F V+EDF +Y
Sbjct: 185 -NMTACPSKCANGENLYMNKYYCRPGSTQYMRGEQQFKNYLFNKGPMVAVFDVFEDFINY 243
Query: 266 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
G+Y ++GD +G HAVKL+G+G ++ +Y++
Sbjct: 244 GGGIYNKVSGDKLGKHAVKLLGYGV-ENSTNYYI 276
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 100/209 (47%), Gaps = 20/209 (9%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP++FDA WP I LDQG+C WAF SDR IH M SLS +LL
Sbjct: 57 LPRNFDAAQKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSPQNLL 114
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 215
+C GC+GG AW + G+V+++C P + P + P + R+
Sbjct: 115 SC-NTRHQQGCNGGRLDRAWSFLRRRGLVSDKCYPLASQNSIAEPCRMYSRPMGRGKRQA 173
Query: 216 V-------KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
+ + N + S YR++S+ +DIM EI +NGPV+ V+EDF YK G
Sbjct: 174 TGPCPNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALMEVHEDFFLYKDG 233
Query: 269 VYKHITGD--------VMGGHAVKLIGWG 289
+Y+H G H+VK+ GWG
Sbjct: 234 IYRHTPASNGKPPQFRRQGTHSVKITGWG 262
>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
Length = 207
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 77/187 (41%), Positives = 97/187 (51%), Gaps = 27/187 (14%)
Query: 30 KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV 89
+L SH + D I K WKA P F N K L G LL G +
Sbjct: 21 RLPPLSHQMVDYINKA-----NTTWKAG--PNFHNVDYSYVKRLCGT------LLKGPKL 67
Query: 90 KT---HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
T + ++LP +FD R WP C T+ I DQG CGSCWAFGA EA+SDR CIH
Sbjct: 68 PTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAK 127
Query: 147 LSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 204
+S+ ++ DLL+CC CG GC+GGYP +AW ++ G+VT +D SH GC P
Sbjct: 128 VSVEISSEDLLSCCDS-CGMGCNGGYPSAAWDFWTTEGLVTGG---LYD----SHVGCRP 179
Query: 205 AYPTPKC 211
Y P C
Sbjct: 180 -YSIPPC 185
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 130/288 (45%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+++ +I+ VN+ GW A QF T+ + K LG + P+P L + +
Sbjct: 155 LVRPELIEYVNKG-DYGWTAKNYSQFWGMTLEEGLKFRLGTLPPSPMLLSMNEVTPSLPA 213
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
+ LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 214 TTDLPEFFVASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQ 271
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 272 NLISCCT-KNRHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNANNNGCAMASRSDGRG 330
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N +++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 331 KRHATKPCPNNIEKSNVIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFF 385
Query: 264 HYKSGVYKHITG--------DVMGGHAVKLIGWGTSDDG----EDYWV 299
HYK+G+Y+H+ + HAVKL GWG E +WV
Sbjct: 386 HYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGAKGRKEKFWV 433
>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
Length = 205
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 77/187 (41%), Positives = 97/187 (51%), Gaps = 27/187 (14%)
Query: 30 KLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV 89
+L SH + D I K WKA P F N K L G LL G +
Sbjct: 21 RLPPLSHQMVDYINKA-----NTTWKAG--PNFHNVDYSYVKRLCGT------LLKGPKL 67
Query: 90 KT---HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
T + ++LP +FD R WP C T+ I DQG CGSCWAFGA EA+SDR CIH
Sbjct: 68 PTMVQYAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAK 127
Query: 147 LSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 204
+S+ ++ DLL+CC CG GC+GGYP +AW ++ G+VT +D SH GC P
Sbjct: 128 VSVEISSEDLLSCCDS-CGMGCNGGYPSAAWDFWTTEGLVTGG---LYD----SHVGCRP 179
Query: 205 AYPTPKC 211
Y P C
Sbjct: 180 -YSIPPC 185
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/184 (35%), Positives = 96/184 (52%), Gaps = 18/184 (9%)
Query: 125 SCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 182
SCWA A ++DR C+ + +S D+L+CCG CG GC GG I AW++ + +G
Sbjct: 1 SCWAVSAASVMTDRLCVQSKGRIKRFISDTDILSCCGRFCGYGCRGGANIRAWKHVMRNG 60
Query: 183 VVTEE-------CDPY-FDSTGCSHPGC------EPAYPTPKCVRKCVKK--NQLWRNSK 226
V T C PY F G +Y TP+C + C + + +
Sbjct: 61 VCTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDR 120
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 286
+Y+ SAY + +D + IM EI + GPV ++ Y DF YK GVY+H G+ GGH++K++
Sbjct: 121 YYAASAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVYEHTAGERTGGHSIKIM 180
Query: 287 GWGT 290
GWG
Sbjct: 181 GWGN 184
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 125/260 (48%), Gaps = 15/260 (5%)
Query: 37 ILQDSIIKEVNEN-PKAGWKAARNPQFSNYTVGQFKHL-LGVKPTPKGLLLGVPVKTHDK 94
+++ +++EVN+ P GW+ +F T+ L LG + + PVK
Sbjct: 140 LIEPELLEEVNQQEPILGWQVGNYSEFWGRTLRDGVELRLGTLNPSQSVYKMNPVKRIYD 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
LP+ FD+R+ W + IS I DQG CG+ WA + SDR+ I LS
Sbjct: 200 PDALPREFDSRTRWSR--DISGIHDQGWCGASWAVSTADVASDRYSIMSKGAEAPELSAQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C GC GGY AW + G+V +EC P+ + C+ +
Sbjct: 258 QLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWSGK----NDQCKLRKRSTLKA 312
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C K + R + AYR+ ++ DIM EI +GPV+ + VY+DF YKSG+Y+H
Sbjct: 313 AGCRKPSHPLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFIYKSGIYRH 371
Query: 273 ITGDVM---GGHAVKLIGWG 289
+ G H+V++IGWG
Sbjct: 372 SRSAELHDSGYHSVRIIGWG 391
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/320 (28%), Positives = 141/320 (44%), Gaps = 48/320 (15%)
Query: 19 SSQTFAEGVVSKLKLDS------------HI--LQDSIIKEVNENPKAGWKAARNPQFSN 64
+SQ + EG V K +S H+ + +I+ +N+ GW A QF
Sbjct: 122 NSQHYEEGSVVKENCNSCTCSGRQWNCSQHVCLVHPELIEHINKG-DYGWTAQNYSQFWG 180
Query: 65 YTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGH 122
T+ + FK LG + P+P L + T LP+ F + WP LDQ +
Sbjct: 181 MTLEEGFKFRLGTLPPSPTLLSMNEMTATFPARADLPEVFISSYKWP--GWTHGPLDQKN 238
Query: 123 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 180
C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 239 CAASWAFSTASVAADRIAIQSRGRYTANLSPQNLISCCAKK-RHGCNSGSIDRAWWFLRK 297
Query: 181 HGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHYSIS 231
G+V+ C P F ++ C A + T C K N++++ S
Sbjct: 298 RGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS-----P 352
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAV 283
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+ + HAV
Sbjct: 353 PYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRSHAV 412
Query: 284 KLIGWGT----SDDGEDYWV 299
KL GWGT E +W+
Sbjct: 413 KLTGWGTLRGAGGKKEKFWI 432
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 133/290 (45%), Gaps = 43/290 (14%)
Query: 37 ILQDSIIKEVNE-NPKAGWKAARNPQF---------SNYTVGQFKHLLGVKPTPKGLLLG 86
+++ +I VN NP GW+A RN F Y +G FK P+G++
Sbjct: 153 LIRKEVIDHVNSHNP--GWQA-RNYTFLWGMTLKDGIKYRLGTFK--------PQGMIEE 201
Query: 87 VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN 146
+ D +P FDAR WP S I + DQG+CG+ +AF +DR IH G
Sbjct: 202 MSSLKVDADEVMPDEFDAREEWP--SFIHPVQDQGNCGASYAFSTSTVAADRLSIHSGGE 259
Query: 147 LS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG--C 202
L LS L++C GC+GG+ AW G V+++C PY S + PG
Sbjct: 260 LKDMLSAQYLISCTTDHHQKGCEGGHVDRAWWQLRRVGTVSKDCYPY-TSGDTNDPGKCL 318
Query: 203 EPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYED 261
Y PK +C + SK Y S YRI + +IM EI NGPV+ V +D
Sbjct: 319 MSKYKLPKKNIECPVGQGI--TSKLYQASPPYRIAAKEREIMNEIILNGPVQAVMHVKDD 376
Query: 262 FAHYKSGVYKHITGDVMGG---------HAVKLIGWGTSDDGED---YWV 299
F Y+ GVYKH H+V++IGWGT G+D YW+
Sbjct: 377 FYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGWGTDYTGDDPIKYWL 426
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/217 (33%), Positives = 105/217 (48%), Gaps = 32/217 (14%)
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH--------FG 144
+ S +P +FD R +PQC I+ + DQG+CG+CWAF A A DR C+ +
Sbjct: 99 EPSGPIPDAFDLREEYPQC--ITPVYDQGYCGACWAFSATGAFGDRRCMQWLDPVGVPYS 156
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 204
++S +DL GC GG + W + HG T EC Y D+ C P
Sbjct: 157 QQYTVSCDDLDL--------GCAGGTSFNVWTFLTEHGTTTLECVRYTDADKDLSSPC-P 207
Query: 205 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 264
A + VK + S + + IM + +GPV+ +VY DF +
Sbjct: 208 ALCDDGSEIQLVKADGCLDYSGNVTA-----------IMQTLANDGPVQAVMSVYRDFLY 256
Query: 265 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGED--YWV 299
Y+ GVYKH+ G + HAV++IG+GT+DD E YW+
Sbjct: 257 YRGGVYKHVYGIQISSHAVEIIGYGTTDDEERIPYWI 293
>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
(Silurana) tropicalis]
Length = 494
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/262 (32%), Positives = 120/262 (45%), Gaps = 20/262 (7%)
Query: 54 WKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 112
W A QF T+ + ++ LG ++ + + + LP F+A WP
Sbjct: 191 WTAGNYSQFWGMTLDEGIQYRLGTAKPSSSVMNMNEIHVNMNNDILPSHFNAAEKWP--G 248
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY 170
+ LDQG+C WAF SDR I M SLS +LL+C GC GG
Sbjct: 249 LVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGR 307
Query: 171 PISAWRYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSK 226
AW Y GVV+E C P+ ++ G S P + + R+ NQ + +++
Sbjct: 308 VDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNE 367
Query: 227 HY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDV 277
Y S AYR+ S +DIM E+Y+NGPV+ V+EDF YKSG+Y+H
Sbjct: 368 IYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRR 427
Query: 278 MGGHAVKLIGWGTSDDGEDYWV 299
G H+VK+ G G YW+
Sbjct: 428 HGTHSVKITG-GRDGQTHKYWL 448
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 129/294 (43%), Gaps = 43/294 (14%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N GW A + F T+ + ++ LG V+PT + +
Sbjct: 140 LVNPDLIDAINRG-NYGWTAGNHSVFWGMTLDEGIRYRLGTVRPTSSVMNMNEIQMVMSP 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F A + WP I LDQG+C WAF SDR IH M+ +LS
Sbjct: 199 DETLPSAFSASNKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 207
+LL+C GC GG AW + G+V+ C P+ + H G PA P
Sbjct: 257 NLLSC-NTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEG---DHNGAAPAAPCMMHS 312
Query: 208 ----------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 257
T C N +++ + YR++S +DIM E+ +NGPV+
Sbjct: 313 RHMGRGKRQATAHCPNSRTHANHIYQ-----ATPPYRLSSHEKDIMKELMENGPVQALLE 367
Query: 258 VYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGT--SDDGE--DYWV 299
V+EDF YKSG+YKH + G H+VK+ GWG DG+ YW
Sbjct: 368 VHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDGQKVKYWT 421
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/274 (32%), Positives = 133/274 (48%), Gaps = 42/274 (15%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTH 92
D+ ++ + ++ +VN+ W+A P+F+ + + LG P L V V ++
Sbjct: 127 DTCMMSEDLVNDVNQQGTT-WRATTYPEFNEKKLKDGLIYKLGTFP------LNVTVISY 179
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FGM-NLSLS 150
K + P FDAR W IS I DQ CGS WA + DRF I FG N+ +S
Sbjct: 180 SKDGQYPDEFDARREWY--GYISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMS 237
Query: 151 VNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
LL+C L G GC+GG A+ + HG+V+E+C PY
Sbjct: 238 SQTLLSC--HLKGQRGCNGGNLDIAFDFVKTHGLVSEQCFPY------------------ 277
Query: 210 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
V + ++ + + Y + S EDIM +I +GP TVY+DF HY+ G+
Sbjct: 278 ---EGAVTQCRIGNDCRRYRVGVPFSISKEEDIMYDIMTSGPALGIMTVYQDFFHYREGI 334
Query: 270 YKHIT-GDVM--GGHAVKLIGWGTSDDGED-YWV 299
Y+H GD + G H+V+++GWG +D ED YW+
Sbjct: 335 YRHTRHGDQLMRGLHSVRIVGWG--EDAEDKYWI 366
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 138/292 (47%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ + +IK +N+ GW+A + F T+ + ++ LG V+P+ +
Sbjct: 36 LVDEDMIKAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTVRPSSSVTNMNEIHTVLGP 94
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP++F+A WP + I LDQG+C WAF SDR IH ++S LS
Sbjct: 95 GEVLPRTFEASEKWP--NLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQ 152
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ S G + A P P C+
Sbjct: 153 NLLSC-DTHNQQGCHGGRLDGAWWFLRRRGVVSDHCYPF------SGHGRDEAVPAPPCM 205
Query: 213 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + N + Y ++ AYR+ S+ ++IM E+ +NGPV+ V+
Sbjct: 206 MHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 265
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+SG+Y H + G H+VK+ GWG T DG YW
Sbjct: 266 EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTVKYWT 317
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 126/265 (47%), Gaps = 16/265 (6%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 95
+++ +I VN N + GW A F T+ + + G + + +PVK K
Sbjct: 129 LVEPGVISAVNSNRELGWSATNYSMFWGKTLDEGITYKTGTLLPHRTVKRMMPVKVKSKG 188
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
KLP SFDAR+ WP IS DQG CG+ WA SDR+ I + LS
Sbjct: 189 -KLPNSFDARNKWP--GWISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQH 245
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCV 212
LL+C GC GG+ AW + G+V + C P+ + T C P P + +
Sbjct: 246 LLSCNKGQ--RGCQGGHLSRAWTFIRKFGLVDDYCYPWTGTPTKCKIPK-RPNFDALSSI 302
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
+ L R+ + AY+I D +DIM EI ++GPV+ + VY+DF YKSGVY
Sbjct: 303 CPPSLGSNL-RSELYRVGPAYKIQ-DEKDIMEEIMQSGPVQATMKVYQDFFSYKSGVYTK 360
Query: 273 ITGDV----MGGHAVKLIGWGTSDD 293
+ G H+VK++GWG +
Sbjct: 361 SNTERESSNFGYHSVKILGWGEETN 385
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 119/271 (43%), Gaps = 32/271 (11%)
Query: 53 GWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 111
GW A QF T+ + FK LG P LL + LP+ F A WP
Sbjct: 170 GWTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTASYPRADLPEVFIASYKWP-- 227
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGG 169
LDQ +C + WAF +DR I +LS +L++CC GC+ G
Sbjct: 228 GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSG 286
Query: 170 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQ 220
AW + G+V+ C P F ++ C A + T C K N+
Sbjct: 287 SIDRAWWFLRKRGLVSHACYPLFKEQSTNNNSCAMASRSDGRGKRHATRPCPNSFEKSNR 346
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---- 276
+++ S YRI+S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+
Sbjct: 347 IYQCS-----PPYRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP 401
Query: 277 ----VMGGHAVKLIGWGT----SDDGEDYWV 299
+ HAVKL GWGT E +W+
Sbjct: 402 EKYRKLRTHAVKLTGWGTLRGAQGKKEKFWI 432
>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 81/250 (32%), Positives = 110/250 (44%), Gaps = 32/250 (12%)
Query: 73 LLGVKPTPKGLLLGVP-VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGA 131
L G GL P V + S+ +P S+++ A+ +C IL QG CGSCWAF
Sbjct: 68 LSGSSEENIGLCASTPSVANLNTSMPIPDSYNSHEAYSKCK--PDILQQGSCGSCWAFAT 125
Query: 132 VEALSDRFCI---HFGMNLSLSVNDLLACCGFLC----GD-------------GCDGGYP 171
L+ R CI G L+ L++C +C GD GCDGGYP
Sbjct: 126 TGVLAQRMCIKSEQIGQGYELAPQALVSCTDQICYTKAGDRCSSPSSTCYCSLGCDGGYP 185
Query: 172 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 231
A+R+ G+ E C Y G C V +C + N
Sbjct: 186 DGAFRFMQDEGITPELCVKYVSKDGTDPLECSDVQTM---VSECTATSNATVNGDR---C 239
Query: 232 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK--HITGDVMGGHAVKLIGWG 289
Y +SD E I +I ++GPV S+ V+EDF Y SGVY D +G HAV ++GWG
Sbjct: 240 YYHSSSDIETIQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIGWHAVIIVGWG 299
Query: 290 TSDDGEDYWV 299
+D YW+
Sbjct: 300 V-EDNTPYWL 308
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 131/292 (44%), Gaps = 40/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK-PTPKGLLLGVPVKTHDK 94
+++D +I+E+N GW+AA QF T+ + + LG K PT + + +
Sbjct: 138 LIEDDMIQEINRR-DYGWRAANYSQFWGMTLDEGLRFRLGTKRPTRTIMNMNEMQMNMNG 196
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
+ LP F+A WP I LDQG+C + WAF SDR I M LS
Sbjct: 197 NDHLPSYFNAVDKWP--GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQ 254
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+L++C DGC GG AW + GVVT++C P+ P + A +C+
Sbjct: 255 NLISC-DTRHQDGCAGGRIDGAWWFMRRRGVVTQDCYPF-------SPPEQSAVEVARCM 306
Query: 213 RKC-------------VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ + + N + S YR++++ +IM EI NGPV+ V+
Sbjct: 307 MQSRAVGRGKRQATAHCPNSHSYHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVH 366
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTSDD----GEDYWV 299
EDF YKSG+++H + H+V++ GWG D YW+
Sbjct: 367 EDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYWI 418
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ + +
Sbjct: 141 LVDPDMINTINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLSP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + + P
Sbjct: 258 NLLSCDTHH-QQGCHGGRLDGAWWFLRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPM 316
Query: 209 PKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ + NQ+ N + AYR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATARCPNNQVQANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDVM--------GGHAVKLIGWG--TSDDGE--DYWV 299
+SG+Y H + G H+VK+ GWG T DG YW
Sbjct: 377 QSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/276 (30%), Positives = 127/276 (46%), Gaps = 31/276 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ ++ LG ++P+ + +
Sbjct: 141 LVDPDMIAAINQG-NYGWQAGNHSAFWGMTLDSGIRYRLGTIRPSSSVMNMNEIYTVLAP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LPK+F+A WP + I LDQG+C WAF SDR IH +++ LS
Sbjct: 200 GEVLPKAFEASKKWP--NMIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 207
+LL+C GC GG AW + GVV++ C P+ +G PA P
Sbjct: 258 NLLSCDTHH-QQGCQGGRLDGAWWFLRRRGVVSDHCYPF---SGHEQAEAGPATPCMMHS 313
Query: 208 ------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
+ R+C + N + AYR+ SD ++IM E+ +NGPV+ VYED
Sbjct: 314 RAMGRGKRQATRRCPNSHDD-ANEIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVYED 372
Query: 262 FAHYKSGVYKHITGDV--------MGGHAVKLIGWG 289
F YKSG+Y H + G H+VK+ GWG
Sbjct: 373 FFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWG 408
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 83/255 (32%), Positives = 118/255 (46%), Gaps = 15/255 (5%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLP 99
+I E+N + W+A +F T+ + K LG + + V+ LP
Sbjct: 145 ELIDEIN-SLDLSWRARNYSEFWGRTLDEGVKLRLGTLNPSRSVYRMNSVRRIYDPESLP 203
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLAC 157
+ FDAR WP+ IS I DQG CG+ WA A SDRF + ++ LS LL+C
Sbjct: 204 REFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGADSVLLSAQHLLSC 261
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
C GGY AW Y G+V E+C P+ + C+ T C
Sbjct: 262 -NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNA----QCKLRKRTDLKTAGCRP 316
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD- 276
R + AYR+ ++ DIM EI +GPV+ + VY+DF Y+SG+YKH
Sbjct: 317 PVNPLRTELYKVGPAYRLGNE-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTE 375
Query: 277 --VMGGHAVKLIGWG 289
G H+V++IGWG
Sbjct: 376 HYAFGYHSVRIIGWG 390
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 129/288 (44%), Gaps = 34/288 (11%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW A QF T+ + FK LG + P+P L + +
Sbjct: 154 LVHPELIDHINKG-DYGWTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTASFPP 212
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
LP+ F A WP LDQ +C + WAF +DR I +LS
Sbjct: 213 RADLPEIFIASYKWP--GWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQ 270
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA------- 205
+L++CC GC+ G AW + G+V+ C P F ++ C A
Sbjct: 271 NLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRG 329
Query: 206 --YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF
Sbjct: 330 KRHATKPCPNSFEKSNRIYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFF 384
Query: 264 HYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWV 299
+YK+G+Y+H+ + HAVKL GWGT E +W+
Sbjct: 385 YYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWI 432
>gi|56757237|gb|AAW26790.1| unknown [Schistosoma japonicum]
Length = 170
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 85/144 (59%), Gaps = 7/144 (4%)
Query: 38 LQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKS 95
L D +I +N++P AGWKA ++ +F ++V + LLG + L V HD
Sbjct: 30 LSDEMISFINKHPNAGWKADKSDRF--HSVDDARILLGGRREDPNLRQKRRPTVDHHDLK 87
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+++P FD+R WP+C +IS+I DQ C S WA AV A+SDR CI G ++ LS D
Sbjct: 88 VEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAID 147
Query: 154 LLACCGFLCGDGCDGGYPISAWRY 177
L++CC CG GCDGG+P AW Y
Sbjct: 148 LISCCEN-CGSGCDGGFPGPAWDY 170
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 139/319 (43%), Gaps = 48/319 (15%)
Query: 20 SQTFAEGVVSKLKLDS------------HI--LQDSIIKEVNENPKAGWKAARNPQFSNY 65
SQ + EG V K +S H+ + +I +N+ GW A QF
Sbjct: 123 SQHYEEGSVVKENCNSCTCSGQQWKCSQHVCLVHPELIDHINKG-DYGWTAQNYSQFWGM 181
Query: 66 TVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHC 123
T+ + FK LG + P+P L + + LP+ F A WP LDQ +C
Sbjct: 182 TLEEGFKFRLGTLPPSPMLLSMNEMTASFPPRADLPEIFIASYKWP--GWTHGPLDQKNC 239
Query: 124 GSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHH 181
+ WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 240 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKR 298
Query: 182 GVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHYSISA 232
G+V+ C P F ++ C A + T C K N++++ S
Sbjct: 299 GLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS-----PP 353
Query: 233 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGGHAVK 284
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+ + HAVK
Sbjct: 354 YRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVK 413
Query: 285 LIGWGT----SDDGEDYWV 299
L GWGT E +W+
Sbjct: 414 LTGWGTLRGARGKKEKFWI 432
>gi|145356617|ref|XP_001422524.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582767|gb|ABP00841.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 245
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 112/227 (49%), Gaps = 29/227 (12%)
Query: 98 LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LPK FD R WP+C+ +S LDQG CGSCWA + ++DR CI ++ LS L
Sbjct: 2 LPKDFDVREKWPKCAALVSEALDQGECGSCWAVAPAKVMADRLCIATNGAVASHLSAMQL 61
Query: 155 LAC---------CGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 198
L+C G CDGG+P A+ G+V+ + C PY + C
Sbjct: 62 LSCGKLENGTFDAGSTYSGSCDGGFPNEAYEKARTSGIVSGGLFGDDKTCMPYAFAP-CQ 120
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA-EIYKNGPVEVSF- 256
HP C P + +C C KN + ++ S ++ + MA E++ +GPV
Sbjct: 121 HP-CNPNH-VAQCPTTCRNKNVNLSSQRYEVTSLVTCGTNDFNCMALELFYHGPVSSYVG 178
Query: 257 TVYEDFAHYKSGVYK-----HITGDVMGGHAVKLIGWGTSDDGEDYW 298
V+++F YKSGVY G+ GGH +++IGWGT++ G YW
Sbjct: 179 DVFDEFYKYKSGVYSLSKDVAARGENHGGHVMEVIGWGTTESGTRYW 225
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 137/284 (48%), Gaps = 38/284 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 95
++Q+ I++ + + + W +A F T+ + F + LG LL VK ++
Sbjct: 251 LIQEDILERM-LHERNSWTSANYSTFWGKTLDEGFSYRLGT------LLPEKSVKNMNEI 303
Query: 96 LK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNL--S 148
L LP+SFDAR WP S I + DQG C S WAF +DR I G
Sbjct: 304 LIEMSNFLPESFDARERWP--SFIHPVRDQGDCASSWAFSTTAVSADRLAIQSGGKFYNP 361
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 208
LSV LL+C GC+GGY AW VV++EC Y S + PG E P
Sbjct: 362 LSVQQLLSC-NQARQRGCNGGYLDRAW------CVVSDECYTY-TSGQTNQPG-ECHIPR 412
Query: 209 PKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
+ ++ +++ Y ++ YRI+++ +IM EI NGPV+ +F V+EDF YKS
Sbjct: 413 TAYLDGEIRCPSGSADNRVYKMTPPYRISTNEREIMTEIMANGPVQATFLVHEDFFMYKS 472
Query: 268 GVYKHI--------TGDVMGGHAVKLIGWGTSDDGE---DYWVC 300
GVY+H+ G H+V+++GWG YW+C
Sbjct: 473 GVYQHLPYANDKGPAYARSGYHSVRILGWGVDHSTGVPIKYWLC 516
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 133/293 (45%), Gaps = 39/293 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ +
Sbjct: 20 LVDRDMIDAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVASMNEIHTVLGP 78
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 79 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 136
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG+ SAW + GVV++ C P F G + G P P+C+
Sbjct: 137 NLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCYP-FSGQGRTETG-----PAPRCM 189
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + +Q+ N + AYR+ S ++IM E+ +NGPV+ V+
Sbjct: 190 MHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVH 249
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DGE--DYWVC 300
EDF Y++G+Y H + G H+VK+ GWG DG YW
Sbjct: 250 EDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDGRTLKYWTA 302
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 133/290 (45%), Gaps = 35/290 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ + +
Sbjct: 142 LVDPDMINAINQG-DYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLAP 200
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH +++ LS
Sbjct: 201 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPLLSPQ 258
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP----- 207
+LL+C L GC GG+ AW + GVV++ C P+ +G PA P
Sbjct: 259 NLLSC-DTLHQQGCRGGHLDGAWWFLRRRGVVSDHCYPF---SGREQAEAGPAPPCMMHS 314
Query: 208 ------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
+ R+C + N + AYR+ SD ++IM E+ +NGPV+ V+ED
Sbjct: 315 RAMGRGKRQATRRCPNSHTD-ANDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHED 373
Query: 262 FAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSDDGE--DYWV 299
F YK G+Y H + G H+VK+ GWG T DG YW
Sbjct: 374 FFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDGRTLKYWT 423
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 93/312 (29%), Positives = 142/312 (45%), Gaps = 31/312 (9%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ- 69
C +ILG +T E + + ++ IIK +N+ GW+A + F T+ +
Sbjct: 114 CCVILG----RTCQENRQWQCDQEPCLVDPDIIKAINQG-NYGWQAGNHSAFWGMTLDEG 168
Query: 70 FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
++ LG ++P+ + + + LP +F+A WP + I LDQG+C WA
Sbjct: 169 IRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWA 226
Query: 129 FGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 186
F SDR IH M LS +LL+C GC GG AW + GVV++
Sbjct: 227 FSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSD 285
Query: 187 ECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS-AYRINSDP 239
C P+ D G + P + + R+ N N+ Y ++ YR+ S+
Sbjct: 286 HCYPFSGRERDEAGPAPPCMMHSQAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSND 345
Query: 240 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG-- 289
++IM E+ +NGPV+ V+EDF YK G+Y H + G H+VK+ GWG
Sbjct: 346 KEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEE 405
Query: 290 TSDDGE--DYWV 299
T DG YW
Sbjct: 406 TLPDGRTLKYWT 417
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 131/292 (44%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 89 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 147
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 148 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 205
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ A PTP+C+
Sbjct: 206 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 258
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + Q+ N + AYR+ SD ++IM E+ +NGPV+ V+
Sbjct: 259 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 318
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H G H+VK+ GWG T DG YW
Sbjct: 319 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 370
>gi|312266|emb|CAA51531.1| cathepsin B-like enzyme [Gallus gallus]
Length = 156
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 83/151 (54%), Gaps = 16/151 (10%)
Query: 108 WPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 165
WP C TIS I DQG CGSCWAFG+VE +SDR C+H +S+ V+ DLL+CCGF CG G
Sbjct: 3 WPNCPTISEIRDQGSCGSCWAFGSVEVISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 62
Query: 166 CDGGYPISAWRYFVHHGVVTEEC-DPYFDSTGCSHPGCE------------PAYPTPKCV 212
C+GGYP AWRY+ G+V+ D + G + P CE TP+C
Sbjct: 63 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCAGYTIPPCEHHVNGSRPPCTGEGGETPRCS 122
Query: 213 RKCVKK-NQLWRNSKHYSISAYRINSDPEDI 242
R C + ++ KHY Y + ++I
Sbjct: 123 RHCEPGYSPSYKEDKHYGSHIYGVPRSEKEI 153
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 131/292 (44%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPDMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSTVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ A PTP+C+
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQ------NEASPTPRCM 309
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + Q+ N + AYR+ SD ++IM E+ +NGPV+ V+
Sbjct: 310 MHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVH 369
Query: 260 EDFAHYKSGVYKHI--------TGDVMGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H G H+VK+ GWG T DG YW
Sbjct: 370 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 421
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 130/275 (47%), Gaps = 21/275 (7%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
+ + S+I EVN W+A +F + + K LG + P+ + + +D
Sbjct: 135 LQEQSLIDEVNSISSLNWRARNYSEFWGKRLSEGVKLRLGTLNPSNSVYRMNSVRRVYDP 194
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP+ FDAR+ W + IS + DQG CG+ WA + SDRF + S LS
Sbjct: 195 E-SLPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGTDSVLLSAQ 251
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
LL+C GCDGGY AW + G+V E+C P+ + C+ T
Sbjct: 252 HLLSC-NKKGQRGCDGGYLDRAWLFMRKFGLVDEQCYPWKGV----YEQCKLQKRTNLEA 306
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 272
C R + AYR+ ++ DIM EI +GPV+ + VY+DF Y+SG+Y H
Sbjct: 307 AGCRAPANPLRKELYKVGPAYRLGNE-TDIMREILTSGPVQATMKVYQDFFSYESGIYMH 365
Query: 273 ITGDVM---GGHAVKLIGWG---TSDDGE--DYWV 299
+ G H+V++IGWG ++D G YW+
Sbjct: 366 TPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWL 400
>gi|157058755|gb|ABV03135.1| cathepsin B-84 [Aulacorthum solani]
Length = 218
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/214 (35%), Positives = 107/214 (50%), Gaps = 31/214 (14%)
Query: 69 QFKHLLGVKPTPKGLLLGVP---VKTHDK----SLKLPKSFDARSAWPQCSTISRILDQG 121
Q LLG K LLGVP +K +D+ + ++P+ FD+R W C TI + +QG
Sbjct: 13 QIVRLLGSK-----RLLGVPKSPIKENDEFYMDNSEVPEFFDSRLEWKYCKTIGHVRNQG 67
Query: 122 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
+CGSCWA G A +DR C+ +N +S ++ CC CG GC+GG P+ AW+YF
Sbjct: 68 NCGSCWAHGTTGAFADRLCVATNGEVNQLISAEEVTFCC-HRCGFGCNGGNPLRAWQYFK 126
Query: 180 HHGVV-------TEECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK 226
HGVV T+ C PY D G + +P KC +KC + + S
Sbjct: 127 RHGVVTGGDYNTTDGCQPYRVPPCVKDDKGHNSCSGQPTERNHKCSKKCYGDDTVDYKSD 186
Query: 227 HYSI-SAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
HY AY +++ +Y GP+E SF VY
Sbjct: 187 HYKTKDAYYLSNTTMQKDTMVY--GPIEASFDVY 218
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 83/263 (31%), Positives = 118/263 (44%), Gaps = 15/263 (5%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSLKLP 99
+I E+N W+A +F T+ + K LG + + V+ LP
Sbjct: 145 ELIDEINSQ-DLSWRARNYSEFWGRTLDEGVKLRLGTLNPSRSVYRMNSVQRIYDPESLP 203
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLAC 157
+ FDAR WP+ IS I DQG CG+ WA SDRF + ++ LS LL+C
Sbjct: 204 REFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSAQHLLSC 261
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
C GGY AW Y G+V E+C P+ + + C+ T C
Sbjct: 262 -NNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGT----NVQCKLRKRTDLKTAGCRP 316
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD- 276
R + AYR+ ++ DIM EI +GPV+ + VY+DF Y+SG+YKH
Sbjct: 317 PVNPLRTELYKVGPAYRLGNE-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTE 375
Query: 277 --VMGGHAVKLIGWGTSDDGEDY 297
G H+V++IGWG Y
Sbjct: 376 HYAFGYHSVRIIGWGEDTSAHRY 398
>gi|157058759|gb|ABV03137.1| cathepsin B-84 [Rhopalosiphum padi]
Length = 219
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 110/226 (48%), Gaps = 28/226 (12%)
Query: 54 WKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD----KSLKLPKSFDARSAW 108
WKA +N P++ T Q LLG K KG L P+K +D +++P FDAR W
Sbjct: 1 WKAKQNFPEY--MTKEQIVRLLGSKSV-KGALKS-PIKEYDSKYTNDVEVPDFFDARIEW 56
Query: 109 PQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGC 166
C TI + +QG+CGSCWA G A +DR C+ + N +S +L CC CG GC
Sbjct: 57 KYCKTIGEVRNQGNCGSCWAHGTTGAFADRLCVATNGDFNELISAEELTFCC-HTCGFGC 115
Query: 167 DGGYPISAWRYFVHHGVV-------TEECDPY------FDSTGCSHPGCEPAYPTPKCVR 213
+GG PI AW YF HGVV T+ C PY D G + + +C +
Sbjct: 116 NGGNPIRAWLYFKRHGVVTGGNYNTTDGCQPYKVPPCIRDEEGHNSCSGQRTERNHRCSK 175
Query: 214 KCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
C ++N + + AY + ++ I IY GP+E SF V
Sbjct: 176 SCYGNTTSDYKNGHYKTKDAYYLTNNTMQIDTMIY--GPIESSFDV 219
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 126/279 (45%), Gaps = 25/279 (8%)
Query: 34 DSHILQDSIIKEVNENPKA-GWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVK 90
D + + ++K++N ++ GWKA ++ Y G+ L P K +
Sbjct: 121 DVCLTDNELLKQLNHLERSIGWKATNYSEWWGHKYDEGKVMRLGTFYPKIKVKSMSRLTN 180
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLS 148
D LP FDA + WP I ++ DQG CGS WA SDRF I +
Sbjct: 181 GLDH---LPTHFDATNYWP--GFIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQ 235
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 208
L+ +++C GC GG+ +AW Y G V EEC PY + H C+
Sbjct: 236 LAPQQIVSCVRR--SQGCSGGHLDTAWSYLRKVGTVNEECYPYISA----HNVCKIRPSD 289
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C ++ R + + A+ +N++ DIM EI K+GPV+ V+ DF YKSG
Sbjct: 290 TLITANCELPMKVDRTNMYKMGPAFSLNNE-TDIMLEIKKHGPVQAIMRVHRDFFSYKSG 348
Query: 269 VYKHITGDV-----MGGHAVKLIGWGTSDDGED---YWV 299
+Y+H G H+V+LIGWG G + YW+
Sbjct: 349 IYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWI 387
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/232 (34%), Positives = 114/232 (49%), Gaps = 34/232 (14%)
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP +F+A+ + C+ I I DQ C +CWA +V +DR CI G ++ LS+ L
Sbjct: 39 LPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSLAYL 98
Query: 155 LACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDSTGC 197
+CC G DGC G + +HG+VT + C PY C
Sbjct: 99 TSCCNHANGCPKSDGCRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPY-PFPKC 157
Query: 198 SH-PGCEPAYPTPKCVRK---------CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 247
+H PG + YP +C K C + H + S R+ PE I EI+
Sbjct: 158 NHVPGMKVKYP--RCGSKVGRLAAPSHCDGLHCRRAGDVHRAKSWGRLPISPEKIKQEIF 215
Query: 248 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
NGPV T++EDF YKSGVY++ TG ++G H +KLIGWG + G++YW+
Sbjct: 216 DNGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGV-EAGQEYWL 266
>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
gigas]
Length = 464
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/257 (34%), Positives = 117/257 (45%), Gaps = 24/257 (9%)
Query: 53 GWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQ 110
GW+ A +F N T Q +G++ + + H S +LP FDAR W
Sbjct: 149 GWQTANYTRFWNLTFTQGISEHVGIETESRAKNMS---SLHSYSRDQLPIHFDARINWT- 204
Query: 111 CSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDG 168
S I + DQ +C S WAF V+ +DR I L+ LS L++C GC G
Sbjct: 205 -SWIHPVRDQKNCASSWAFSTVDVAADRLAIESEGLLTNQLSPQHLVSCNTGRGQRGCRG 263
Query: 169 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
G AW + G++TEEC PY S G C T C N Y
Sbjct: 264 GSTEKAWWFVKRRGIITEECYPYTASDG----ECLDGETT------CPNANSSTAKIVLY 313
Query: 229 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH-AVKLIG 287
YR+ D EDI AEIY+NGPV+ +F V DF Y+SGVY+H D+ +V++IG
Sbjct: 314 VTPPYRVRQDEEDIKAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIG 373
Query: 288 WG----TSDDGEDYWVC 300
WG YW+C
Sbjct: 374 WGEKTNKKGKKRKYWIC 390
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 133/292 (45%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ +
Sbjct: 140 LVDRDMIDAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVASMNEIHTVLGP 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG+ SAW + GVV++ C P F G + G P P+C+
Sbjct: 257 NLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCYP-FSGQGRTETG-----PAPRCM 309
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + +Q+ N + AYR+ S ++IM E+ +NGPV+ V+
Sbjct: 310 MHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVH 369
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DGE--DYWV 299
EDF Y++G+Y H + G H+VK+ GWG DG YW
Sbjct: 370 EDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGWGEESLPDGRTLKYWT 421
>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
norvegicus]
gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; Flags:
Precursor
gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
Length = 467
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 133/292 (45%), Gaps = 38/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ ++IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRRRGVVSDNCYPF-----SGREQNDEASPTPRCM 310
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + +Q+ N + YR+ SD ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 131/282 (46%), Gaps = 27/282 (9%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLP 99
+IK +N+ GW+A + F T+ + ++ LG ++P+ + + + LP
Sbjct: 1 MIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVLNPGEVLP 59
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVNDLLAC 157
+F+A WP I LDQG+C WAF SDR IH G M LS +LLAC
Sbjct: 60 TAFEASEKWPNL--IHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLAC 117
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVR 213
GC GG AW + GVV++ C P+ D G + P + + R
Sbjct: 118 DTHHQ-QGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKR 176
Query: 214 KCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
+ + N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y
Sbjct: 177 QATARCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY 236
Query: 271 KHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWVC 300
H + G H+VK+ GWG T DG YW
Sbjct: 237 SHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTA 278
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 125/297 (42%), Gaps = 78/297 (26%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQ--FSNY 65
+ + L + S TFA+ +LD L D I+++N + WKA RN + S Y
Sbjct: 1 MKLAFIALAAVVSCTFAQP-----ELD--FLSDEYIEQLN-SKNLPWKAGRNFERDTSLY 52
Query: 66 TVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGS 125
+ + + + P + + D LP+ FDAR W +C +I I DQ CGS
Sbjct: 53 NIQRLLSVGTINPPSEF----ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGS 108
Query: 126 CWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 185
CW GC YP+
Sbjct: 109 CW-------------------------------------GC-MSYPLP------------ 118
Query: 186 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIM 243
C+P C+ Y P C ++C K + L + KHY+ AYRI S E I
Sbjct: 119 -RCNP----------SCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQ 167
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI KNGPV SFTVY DF HY SGVYK ++GGHAV++IGWG + YW+
Sbjct: 168 LEIIKNGPVVASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWL 224
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 105/215 (48%), Gaps = 26/215 (12%)
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGH-CGSCWAFGAVEALSDRFCIHFGMNLS- 148
T D S LP SFD+R W C S + DQG C SCWA A L+DR C+ G +
Sbjct: 27 TFDAS-NLPASFDSRQKWSDC--FSPVRDQGQKCSSCWAMTATGVLADRLCVASGGKVKK 83
Query: 149 -LSVNDLLAC--CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 205
LS +L+ C G L GC GG + YF +GVVTE+C+ Y A
Sbjct: 84 VLSPQELIDCDRNGNL---GCGGGRLDTPLAYFRDNGVVTEKCESY------------KA 128
Query: 206 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
C C +K++S YR++S E A+IY NGP+ F +Y D +Y
Sbjct: 129 TQASSCSNTCDDGTSFSNTTKYHSKDCYRLSS-IEQAKADIYLNGPIIAVFDLYTDIYNY 187
Query: 266 KSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
KSGVY K + HA ++IGWG +DG YW+
Sbjct: 188 KSGVYIKSDSATYKETHAGRVIGWGV-EDGVQYWL 221
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 126/279 (45%), Gaps = 25/279 (8%)
Query: 34 DSHILQDSIIKEVNENPKA-GWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVK 90
D + D ++++++ ++ GWKA ++ Y G+ L +P + +
Sbjct: 123 DVCLADDDLLRQLHHLERSIGWKATNYSEWWGHKYDEGKVLRLGTFQPR---FRVKAMKR 179
Query: 91 THDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI-HFGMNL-S 148
+K LP FDA W ++ DQG CGS WAF SDRF I G +
Sbjct: 180 LSNKGGHLPTRFDASEHWT--GLVAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQ 237
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 208
L+ +LAC GC GG+ +AW+Y GVV EEC PY + + T
Sbjct: 238 LAPQQMLACVRRQ--QGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCKISNDDTLIT 295
Query: 209 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 268
C VK N R + A+ +N++ DIMAEI G V+ VY DF Y+SG
Sbjct: 296 ANCELP-VKVN---RTLMYKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFSYRSG 350
Query: 269 VYKHITG-----DVMGGHAVKLIGWGTSDDGED---YWV 299
+Y+H + H+V+LIGWG G D YW+
Sbjct: 351 IYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWI 389
>gi|303289014|ref|XP_003063795.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
gi|226454863|gb|EEH52168.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
Length = 390
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/226 (36%), Positives = 108/226 (47%), Gaps = 42/226 (18%)
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS-------- 148
LP+ FDAR WP+C+ + LDQG CGSCWA L+DR CI L
Sbjct: 116 LPELFDARERWPRCARVVGTALDQGKCGSCWAVATAAVLTDRACIATNGALGGGGGGGEF 175
Query: 149 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHP 200
LS + LL+C DGC+GG A+ Y HGVVT C PY FD+ C HP
Sbjct: 176 LSASQLLSCG---AADGCEGGDERDAFEYAKTHGVVTGGAYGDESTCAPYLFDA--CQHP 230
Query: 201 GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN---SDPED----IMAEIYKNGPVE 253
CE + PTP+C CV+ + ++ AYR+ S PE + EI GPV
Sbjct: 231 -CEKS-PTPECPLSCVRP----KGTRVEDAPAYRVKEIVSCPERDYSCVAKEIATRGPVT 284
Query: 254 V-------SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 292
F +Y+ + S + + G+ GGH VKLIGWG +
Sbjct: 285 SYAGTIWGEFYLYDGRGVFASSGDERVRGENHGGHVVKLIGWGRDE 330
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 133/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMINAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPT 208
+LL+C GC GG+ AW + GVV++ C P+ D G P + T
Sbjct: 258 NLLSCNTHH-QQGCRGGHLDGAWWFLRRRGVVSDHCYPFLGRERDKAGPVPPCMMHSRAT 316
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ AYR+ S+ +IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H ++ G H+VK+ GWG T DG YW
Sbjct: 377 KGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWPDGRKLKYWT 422
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 127/281 (45%), Gaps = 27/281 (9%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLP 99
+I +N GW A + F T+ + ++ LG V+P + + LP
Sbjct: 144 LINAINHG-NYGWTAGNHSAFWGMTLEEGIQYRLGTVRPASSVMNMNEIQMVMAPQETLP 202
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLAC 157
+F+A WP I LDQG+C WAF SDR IH M +LS +LL+C
Sbjct: 203 LAFNASDKWP--GLIHEPLDQGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSC 260
Query: 158 CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVR 213
GC GG AW + G+V+ C P+ D+T + P + + R
Sbjct: 261 -DTHNQKGCRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMMHSRSMGRGKR 319
Query: 214 KCVK---KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
+ ++ N + + YR++SD +DIM E+ +NGPV+ V+EDF YKSG+Y
Sbjct: 320 QATAHCPNSRAHANHIYQATPPYRLSSDEKDIMKELMENGPVQALMEVHEDFFLYKSGIY 379
Query: 271 KHITGDV--------MGGHAVKLIGWGTSD--DGE--DYWV 299
KH + G H+VK+ GWG DG+ YW
Sbjct: 380 KHTPASLGKPARYRQHGTHSVKITGWGEERQPDGQRLKYWT 420
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 101/212 (47%), Gaps = 26/212 (12%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLL 155
+P SFD R+ W C +S + +Q CGSCWA L+DR CI N+ LS L+
Sbjct: 46 IPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLM 103
Query: 156 ACCGFL-------CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG-CEPAYP 207
C G C +GC GG+ A ++ G+V++EC Y S S P C+ P
Sbjct: 104 DCDGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSP 163
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
N+ Y ++ R +D EI NGPV +F +Y DF +K
Sbjct: 164 I--------------SNTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKW 209
Query: 268 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VY + + HAV+++GWGT+ DG DYW+
Sbjct: 210 DVYIKSSNTQVESHAVRVVGWGTTSDGVDYWI 241
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 134/292 (45%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 36 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 94
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 95 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ S + A PTP C+
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPF------SGRERDEAGPTPPCM 205
Query: 213 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+
Sbjct: 206 MHSRAMGRGKRQATASCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVH 265
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF YK G+Y H + G H+VK+ GWG T DG YW
Sbjct: 266 EDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 317
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 36 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 94
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 95 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 211
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 212 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 271
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 272 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 317
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 133/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 36 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVLNP 94
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 95 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 211
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ + N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 212 GRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 271
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 272 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 317
>gi|157058757|gb|ABV03136.1| cathepsin B-84 [Pterocomma populeum]
Length = 218
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 99/212 (46%), Gaps = 28/212 (13%)
Query: 69 QFKHLLGVKPTPKGLLLGVP---VKTHDKSL----KLPKSFDARSAWPQCSTISRILDQG 121
Q LLG K L GVP VK +D S +PK+FDAR W C TI ++ DQG
Sbjct: 13 QMVRLLGSK-----RLTGVPKTPVKENDISYVEDGGIPKAFDARLEWKYCKTIGQVRDQG 67
Query: 122 HCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 179
+CGSCWA G A +DR CI N +S +L CC LCG GC+GG P+ AW+YF
Sbjct: 68 NCGSCWAHGTSGAFADRLCIATKGDFNELISAEELTFCC-HLCGIGCNGGNPLRAWQYFK 126
Query: 180 HHGVV-------TEECDPYF----DSTGCSHPGC--EPAYPTPKCVRKCVKKNQLWRNSK 226
HGVV T C PY + H C + KC++ C +
Sbjct: 127 RHGVVTGGNYNTTNGCQPYRVPPCTNGDKGHYSCSGQQKERNHKCLKTCYGDKTVDYKRD 186
Query: 227 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 258
HY S+ + ++ GP+E SF V
Sbjct: 187 HYKTKDAYYLSNTTTMQKDVILYGPIEASFDV 218
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 134/293 (45%), Gaps = 39/293 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ +
Sbjct: 36 LVDPDMIKAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVANMNEIHTVLGP 94
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP++F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 95 GEVLPRAFEASEKWP--NLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ H E A P P+C+
Sbjct: 153 NLLSC-DTHNQQGCQGGRLDGAWWFLRRRGVVSDHCYPF-----SGHERNE-AGPAPRCM 205
Query: 213 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + N + Y ++ AYR+ S+ +DIM E+ +NGPV+ V+
Sbjct: 206 MHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVH 265
Query: 260 EDFAHYKSGVYKHITGD--------VMGGHAVKLIGWG--TSDDGE--DYWVC 300
EDF Y+SG+Y H G H+VK+ GWG T DG YW
Sbjct: 266 EDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRMLKYWTA 318
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 105/168 (62%), Gaps = 14/168 (8%)
Query: 145 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF----- 192
+N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY
Sbjct: 2 VNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCE 61
Query: 193 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 251
S P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 62 HHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 121
Query: 252 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW+
Sbjct: 122 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWL 168
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/293 (30%), Positives = 137/293 (46%), Gaps = 39/293 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ + +I+ +N GW+A + F T+ + ++ LG V+P+ +
Sbjct: 102 LVDEDMIEAINHG-DYGWRAGNHSAFWGMTLDEGIRYRLGTVRPSSFVANMNEIHTVLGP 160
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP++F+A WP + I LDQG+C WAF SDR IH ++S LS
Sbjct: 161 GEVLPRTFEASEKWP--NLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQ 218
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ S G + A P P C+
Sbjct: 219 NLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPF------SGHGRDEAVPAPPCM 271
Query: 213 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + N + Y ++ AYR+ S+ ++IM E+ +NGPV+ V+
Sbjct: 272 MHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 331
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWVC 300
EDF Y+SG+Y H + G H+VK+ GWG T DG YW
Sbjct: 332 EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTIKYWTA 384
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 129/285 (45%), Gaps = 38/285 (13%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLP 99
+I+ +N+ GW A QF T+ + F+ LG + P+P L + T ++ LP
Sbjct: 158 LIEHINKG-DYGWTAQNYSQFWGMTLEEGFRFRLGTLPPSPVLLSMNEMRATLPETTDLP 216
Query: 100 KSFDA--RSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
+ F A + AW S + +C + WAF +DR I +LS +L+
Sbjct: 217 EFFIAFLQMAWMD----SWAIGSKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLI 272
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE---------PAY 206
+CC GC+ G AW Y G+V+ C P F S+ C +
Sbjct: 273 SCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNISNNTCAMTSKADGRGKRH 331
Query: 207 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
T C K N++++ S YR++S+ +IM EI +NGPV+ V+EDF HYK
Sbjct: 332 ATRPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYK 386
Query: 267 SGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWV 299
+G+Y+H+ + HAVKL GWGT E +W+
Sbjct: 387 TGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGARGQKEKFWI 431
>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
Length = 466
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 134/285 (47%), Gaps = 35/285 (12%)
Query: 32 KLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPK---GLLLGVP 88
K +I I ++N + ++ W A P++ ++T+ + G PK G L +
Sbjct: 163 KHRKYIPNKDYINQIN-SAQSLWTATEYPEYEDFTLAELNMRSGRPTVPKSFAGPRLRMK 221
Query: 89 ----VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--H 142
+ D+ + PK FD R+ + +S + +QG CGSC+AF ++ R +
Sbjct: 222 RDRLSRNSDEFIYFPKQFDWRNV-SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSK 280
Query: 143 FGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPG 201
+ +S D+++C + GC GG+P + A +Y G+V E C PY G P
Sbjct: 281 NSVKRVMSPQDVVSCSEY--AQGCAGGFPYLIAGKYGEDFGLVEESCFPY---NGKDEPC 335
Query: 202 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 261
E KC R + +Y + + + +M E+ KNGP+ +SF VY D
Sbjct: 336 KETK---SKCRRHST--------TNYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGD 384
Query: 262 FAHYKSGVYKHI-TGDV-----MGGHAVKLIGWGTSD-DGEDYWV 299
F HYK G+Y+H GD + HAV L+G+GT G+DYW+
Sbjct: 385 FKHYKGGIYQHTGLGDSYNPWQITNHAVLLVGYGTDQKSGKDYWI 429
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 131/277 (47%), Gaps = 21/277 (7%)
Query: 34 DSHILQDSIIKEVNENPKA-GWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTH 92
D ++ D+++++++ ++ GW+A ++ + + K PK + + T+
Sbjct: 122 DVCLVDDALLRQLHHLERSIGWQATNYSEWWGHKYDEGKTFRLGTFYPKFKVKSMSRLTN 181
Query: 93 DKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLS 150
+ LP FDA + WP I + DQG CGS WA SDRF I + L+
Sbjct: 182 GQE-HLPTHFDATTYWP--GFIGEVKDQGWCGSSWALSTASVASDRFAILSKGREIVQLA 238
Query: 151 VNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK 210
+++C GC GG+ +AW Y G V +EC PY + C+
Sbjct: 239 PQQIISC--VRRSQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQN----ACKIRPSDTL 292
Query: 211 CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
C ++ R + + A+ +N++ DIM EI K+GPV+ V+ DF YKSG+Y
Sbjct: 293 ITANCDLPTKVDRTNMYKMGPAFSLNNE-TDIMIEIKKHGPVQAILRVHRDFFSYKSGIY 351
Query: 271 KH----ITGDVMGG-HAVKLIGWGTSDDGED---YWV 299
+H GD G H+V+LIGWG +G + YWV
Sbjct: 352 RHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWV 388
>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Pongo abelii]
Length = 467
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 133/292 (45%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A PTP C+
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRER------DEAGPTPPCM 310
Query: 213 ----------RKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATASCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF YK G+Y H + G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 96/206 (46%), Gaps = 18/206 (8%)
Query: 96 LKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVND 153
+ +P +FDAR+ W C + I DQ CG+CWAF A L+ R CI N+ LS
Sbjct: 1 MDIPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEY 58
Query: 154 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 213
+ C C GGY +W + + G + C PY G + + C
Sbjct: 59 QVQC--DTMNKACQGGYLKYSWTFLENTGTPLDTCIPYASGRG--------TFSSGTCPT 108
Query: 214 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 273
+C + + Y R + +I I G V+ FTVY D YKSGVYKH+
Sbjct: 109 QCKIASM---SMSKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHV 165
Query: 274 TGDVMGGHAVKLIGWGTSDDGEDYWV 299
V+GGHAV LIG+G + G +YW+
Sbjct: 166 VSTVLGGHAVALIGFGV-EGGSNYWL 190
>gi|290980579|ref|XP_002673009.1| predicted protein [Naegleria gruberi]
gi|284086590|gb|EFC40265.1| predicted protein [Naegleria gruberi]
Length = 218
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 104/216 (48%), Gaps = 17/216 (7%)
Query: 88 PVKTHDKSLKLPKSFDARS--AWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HF 143
P + SL +P ++ + + C+ + +I DQ C +C+AFG E ++DR+CI
Sbjct: 12 PTRPAISSLDIPTNYTLTTDPKYMNCTQLHKIRDQSQCAACYAFGVAEMVADRYCISSQG 71
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 203
+N LS +L+C + C GG + Y ++G+VT+ C P+ G +
Sbjct: 72 KVNTILSPQFILSCDEY--EGNCYGGDIGNTLLYVKNYGIVTDSCLPFLARNGSNL---- 125
Query: 204 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 263
C KC+ + W+ D + + I + G + +F VYEDFA
Sbjct: 126 ------SCPNKCLDGSN-WKLRYKVKNPNQIAQDDIQGMQQSILQGGSIIAAFQVYEDFA 178
Query: 264 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
HYK G+Y H G +GGH VK++G+G + G YWV
Sbjct: 179 HYKGGIYVHTGGAYIGGHVVKIVGFGETPSGIPYWV 214
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 126/275 (45%), Gaps = 26/275 (9%)
Query: 48 ENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDAR 105
+N W+A + F T+ + ++ LG ++P+ + + LP +F+A
Sbjct: 120 DNCNRWWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLSPGEVLPTAFEAS 179
Query: 106 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCG 163
WP + I LDQG+C WAF SDR IH M LS +LL+C
Sbjct: 180 EKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-Q 236
Query: 164 DGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK- 218
GC GG AW + GVV++ C P+ D G + + P + R+ +
Sbjct: 237 QGCHGGRLDGAWWFLRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARC 296
Query: 219 --NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
NQ+ N + AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H
Sbjct: 297 PNNQVQANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVS 356
Query: 277 VM--------GGHAVKLIGWG--TSDDGE--DYWV 299
+ G H+VK+ GWG T DG YW
Sbjct: 357 LQRPEGYRRHGTHSVKITGWGEETLPDGRTLKYWT 391
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 102/196 (52%), Gaps = 15/196 (7%)
Query: 117 ILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 174
+L + H WA + ++SDR CI M + LS +L++C G C G+ +
Sbjct: 20 LLPREHYTELWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG--CQIGFSEFS 77
Query: 175 WRYFVHHGVVTEE---CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 224
W Y++ +G+VT + C PY + S+P C Y P C + C + ++
Sbjct: 78 WDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKA 137
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 284
KHY Y + + DI EI NGPVE V+ DF +YKSGVY+HITG ++ H+V+
Sbjct: 138 DKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVR 197
Query: 285 LIGWGTSDDGEDYWVC 300
+IGWG +D YW+C
Sbjct: 198 IIGWGIEND-IPYWLC 212
>gi|159117627|ref|XP_001709033.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157437148|gb|EDO81359.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 308
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 113/257 (43%), Gaps = 33/257 (12%)
Query: 53 GWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCS 112
WKA + N T FK +L + PV+ + +P FD R +PQC
Sbjct: 30 AWKAGIPERLKNLTKNDFKKMLSAGSPRTQSSIVRPVRVPENEDPVPDHFDFREEYPQC- 88
Query: 113 TISRILDQGHCGSCWAFGAVEALSDRFCI--------HFGMNLSLSVNDLLACCGFLCGD 164
I+ ++D G C S WA+ AV+A S R C+ + LS + C GF +
Sbjct: 89 -ITEVIDIGLCSSSWAYSAVDAFSHRRCLTGLDQEATRYSAQYILSCSSTNGCFGFSTRE 147
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWR 223
AW + G+ E C Y D S P C C + L
Sbjct: 148 SI-------AWDFIATTGIPLESCVKYTDYDQTQSRP----------CPSTCDDDSFL-- 188
Query: 224 NSKHYSISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 282
+ Y Y + + E + + GP++ FTVYEDF +Y G+Y + G+ +G +
Sbjct: 189 --EVYKPDGYEGVGLNCERLKRAVALRGPMQAMFTVYEDFTYYLEGIYSYTYGNRVGFLS 246
Query: 283 VKLIGWGTSDDGEDYWV 299
V+++G+GTSD+G+DYW+
Sbjct: 247 VEIVGYGTSDEGQDYWI 263
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMINAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEALPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCNTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ AYR+ S+ +IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H ++ G H+VK+ GWG T DG YW
Sbjct: 377 KGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRPDGRKLKYWT 422
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 136/292 (46%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ + +I+ +N GW+A + F T+ + ++ LG V+P+ +
Sbjct: 208 LVDEDMIEAINHG-DYGWRAGNHSAFWGMTLDEGIRYRLGTVRPSSFVANMNEIHTVLGP 266
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP++F+A WP + I LDQG+C WAF SDR IH ++S LS
Sbjct: 267 GEVLPRTFEASEKWP--NLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQ 324
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ S G + A P P C+
Sbjct: 325 NLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPF------SGHGRDEAVPAPPCM 377
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + + + N + AYR+ S+ ++IM E+ +NGPV+ V+
Sbjct: 378 MHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 437
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+SG+Y H + G H+VK+ GWG T DG YW
Sbjct: 438 EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTIKYWT 489
>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
griseus]
gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
Length = 465
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 127/272 (46%), Gaps = 23/272 (8%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPDMINAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTALGR 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP++F+A WP + I LDQG+C WAF SDR IH +++ LS
Sbjct: 199 GEVLPRAFEASEKWP--NLIQEPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ + G S +
Sbjct: 257 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCYPFVGREQNEAGTSSRCMMHSRAM 315
Query: 209 PKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ + Q+ N + AYR+ SD ++IM E+ +NGPV+ V+EDF Y
Sbjct: 316 GRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLY 375
Query: 266 KSGVYKHI--------TGDVMGGHAVKLIGWG 289
+SG+Y H G H+VK+ GWG
Sbjct: 376 QSGIYSHTPISQGRPEQYRRHGTHSVKITGWG 407
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 75/225 (33%), Positives = 103/225 (45%), Gaps = 25/225 (11%)
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDL 154
+LP F++ WP I LDQG+C + WAF SDR I M LS +L
Sbjct: 7 QLPLYFNSAEKWP--GKIHEPLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNL 64
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-------DSTGCSHPGCEPAYP 207
++C G GC GG AW Y GVVTE+C PY + + C
Sbjct: 65 ISCDTRNQG-GCAGGRLDGAWWYLRRRGVVTEDCYPYRPPQQTPAELSRCMMQSRSVGRG 123
Query: 208 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
+ ++C N ++N + S YR+++ ++IM EI NGPV+ V+EDF Y S
Sbjct: 124 KRQATQRCPNTNN-YQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNS 182
Query: 268 GVYKHITGDVM--------GGHAVKLIGWGTSD--DG--EDYWVC 300
G+YKH G H+VK+ GWG DG YW+
Sbjct: 183 GIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDGTTRKYWIA 227
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 133/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSLVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ + N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|157058773|gb|ABV03144.1| cathepsin B-16D [Sitobion avenae]
Length = 215
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 99/217 (45%), Gaps = 30/217 (13%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVK----PTPKGLLLGVPVK 90
++ LQ I+ +NE WKA N N F +LG K P + L K
Sbjct: 3 AYFLQKDFIENINEQATT-WKAGVNFN-PNTPKEHFLKMLGSKGVQIPNRNNIHL---YK 57
Query: 91 THDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF-- 143
T D + ++P+ FDAR W C TI + DQG+CGSCWA A +DR C+
Sbjct: 58 TDDAAYDNLFGRIPRHFDARRKWRHCQTIGEVRDQGNCGSCWAVATSSAFADRLCVATDG 117
Query: 144 GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY----- 191
N LS ++ CC CG GC+GGYPI AW F HG+VT E C+PY
Sbjct: 118 DFNQLLSAEEITFCC-HTCGFGCNGGYPIKAWERFKKHGLVTGGDYKSEEGCEPYRVPPC 176
Query: 192 -FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 227
+D +G + +P +C R C L + H
Sbjct: 177 PYDESGNNTCAGKPMEKNHRCTRMCYGDQDLDFDQDH 213
>gi|308804940|ref|XP_003079782.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116058239|emb|CAL53428.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
Length = 498
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 77/219 (35%), Positives = 109/219 (49%), Gaps = 26/219 (11%)
Query: 98 LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLA 156
LP+ FDAR +P+C+ I + DQG CGSCWA A E ++DR CI G ++ A
Sbjct: 257 LPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFA 316
Query: 157 CCGFLCGDGCDGGYPIS----AWRYFVHHGVVTEE--CDPY-FDSTGCSHPGCEPAYPTP 209
+ G GC+GG + A V HG + ++ C PY F+ C HP P
Sbjct: 317 LSCYNSGAGCEGGDVVDTLTLALAKGVPHGGMLDKGACLPYQFEP--CDHPCMIPGTSPE 374
Query: 210 KCVRKCVKKNQ---LWRNSKHYSISAYRINSDPED---IMAEIYKNGPVEVSF-TVYEDF 262
C C ++ ++ + Y+ P+D I EI G V V+F V+EDF
Sbjct: 375 ACPATCADGSKFQLVYPKNLPYTCP-------PDDIACIAKEIKNRGSVAVTFGPVHEDF 427
Query: 263 AHYKSGVYK--HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+K GVYK +G +G HA KLIGWG + +G+ YW+
Sbjct: 428 YGHKEGVYKVTESSGRELGNHATKLIGWGVTQEGDHYWI 466
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSTFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVK--KNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 79/234 (33%), Positives = 111/234 (47%), Gaps = 47/234 (20%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
LP +FD+R WP C +I I +QG+C S +A A A SDR CI N +S ++
Sbjct: 61 LPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQQII 120
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE----- 203
+CC +LCG GCDGG +W Y+ HG V+ + C PY + P C+
Sbjct: 121 SCC-YLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPY------TIPPCKLMNEK 173
Query: 204 -PAY--------PTPKCVRKCVKKNQLWR------NSKHYSISAYRINSDPEDIMAEIYK 248
P + TP C +KC N K+Y +S Y M +I+
Sbjct: 174 PPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGKYYKLSPYM-------AMKDIFD 226
Query: 249 NGPVEVSFTVYEDFAHYKSGVYKHITG---DVMGGHAVKLIGWGTSDDGEDYWV 299
NGP+ F +Y D YKSGVY++ D H+VK+ GWG ++G YW+
Sbjct: 227 NGPITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWG-EENGVPYWL 279
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVK--KNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|157058775|gb|ABV03145.1| cathepsin B-16D [Myzus persicae]
Length = 236
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/217 (32%), Positives = 98/217 (45%), Gaps = 20/217 (9%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARN--PQFSNYTVGQFKHLLGVK-PTPKGLLLGVPVKT 91
++ L+ I +NE WKA N P+ S + + GV+ P + L
Sbjct: 16 AYFLEKDFIDNINEQATT-WKAGVNFDPKTSKEHIMKLLGSRGVQIPNKNNMNLYKSEDA 74
Query: 92 HDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSL 149
+ +P+ FDAR W CSTI R+ DQG+CGSCWA A +DR C+ + N L
Sbjct: 75 DYNNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFADRLCVATNADFNELL 134
Query: 150 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------DSTG 196
S ++ CC CG GC+GGYPI AW+ F G+VT E C+PY D G
Sbjct: 135 SAEEITFCC-HTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQG 193
Query: 197 CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 233
+ +P +C R C L + H Y
Sbjct: 194 NNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRYTRDY 230
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 146 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 204
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 205 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 262
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 263 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 321
Query: 209 PKCVRKCVK--KNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 322 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 381
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 382 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 427
>gi|60598652|gb|AAX25875.1| unknown [Schistosoma japonicum]
Length = 195
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 100/173 (57%), Gaps = 13/173 (7%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGV--PVKTHDKSLKLP 99
+I +NE+P AGWKA ++ F +++ + L+G + + V HD ++++P
Sbjct: 1 MISFINEHPDAGWKADKSEGF--HSLDDARILMGARKEDAEMKRKRRPTVDHHDLNVEIP 58
Query: 100 KSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACC 158
FD+R WP C +IS+I DQ CGSCWAFGAVEA++DR CI G S ++ L L C
Sbjct: 59 SQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISC 118
Query: 159 GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
CG GC GG+P AW Y+V G+VT + +H GC+P YP PKC
Sbjct: 119 CEDCGGGCKGGFPGQAWDYWVKRGIVT-------GGSKENHTGCQP-YPFPKC 163
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/225 (33%), Positives = 111/225 (49%), Gaps = 25/225 (11%)
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVN 152
++++P+SFDAR W CSTI +I D+ C + WA V+++SDR CI +++ LS
Sbjct: 25 NMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSAR 84
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---------HPGCE 203
D ++ CGF GC G + Y++ +G+VT Y D +GC HP
Sbjct: 85 DAIS-CGF--SPGCFHGSEVEVLVYWITYGIVTG--GSYEDQSGCQPYPLPKCSYHPESR 139
Query: 204 ------PAYPTPKCVRKCVK-KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
+ P+C +C N+ + + K Y Y + EDI EI NGPV S
Sbjct: 140 FLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIASI 199
Query: 257 TVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWVC 300
+V DF YKSGVY +G +++IGWG + YW+C
Sbjct: 200 SVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGY-EGKIPYWLC 243
>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 315
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 138/311 (44%), Gaps = 47/311 (15%)
Query: 1 MASSHLFLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNP 60
M + + LT+CL +++S+T A HI +S+ K ++A
Sbjct: 1 MITLFIVLTSCLSTQPMLNSRTLA-----------HI--NSLPKHWTAGISEKFRALTRD 47
Query: 61 QFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQ 120
T+ H L L G K + + P+SFD R +PQC + DQ
Sbjct: 48 DIELMTMSHLVHFLDANAHSH--LAGRTEK--NINYDYPESFDFREEYPQC--LLPTYDQ 101
Query: 121 GHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 177
GHCGSCWAF + A D C+ G++ + S L++C L GC GG +
Sbjct: 102 GHCGSCWAFASSRAFGDTRCMQ-GLDPVPVLYSPQYLVSCS--LQNMGCTGGTMEDVGDF 158
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 237
G+ T+ C PY D E A+ P C CV + + R + + R +
Sbjct: 159 LRDTGIATDTCVPYVD---------EDAHWEP-CPVSCVDGSPI-RTVQ--LMDFVRYDG 205
Query: 238 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-- 295
+ E +M I NGP+ S +YEDF +Y+SG+Y I G G HA++L+G+GT G+
Sbjct: 206 NLEAMMEAIAMNGPIHASMMIYEDFMYYQSGIYHFIYGSGCGMHAIELVGYGTDISGDSE 265
Query: 296 -------DYWV 299
DYW+
Sbjct: 266 AGEEVRVDYWI 276
>gi|157058731|gb|ABV03123.1| cathepsin B-16D1 [Acyrthosiphon pisum]
Length = 243
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 107/227 (47%), Gaps = 33/227 (14%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK----PTPKGLLLGVPV 89
++ LQ I +N N WKA N F T + F +LG K P + +
Sbjct: 19 TYFLQKDFIDNIN-NQATTWKAGVN--FDPDTPKEHFLKMLGSKGVQIPNKHNIHM---Y 72
Query: 90 KTHDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--H 142
KTHD + ++P+ FDAR W C TI + DQG+CGSCWA A +DR C+ +
Sbjct: 73 KTHDAAYDNLFGRIPRHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATN 132
Query: 143 FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY---- 191
N LS ++ CC + CG GC+GGYPI AW F G+VT E C+PY
Sbjct: 133 ADFNELLSAEEITFCC-YSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPP 191
Query: 192 --FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRI 235
+D+ G + +P +C R C L + H Y+ +Y +
Sbjct: 192 CPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYL 238
>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
[Nomascus leucogenys]
Length = 362
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 36 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVLNP 94
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 95 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 152
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 153 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 211
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ +++M E+ +NGPV+ V+EDF Y
Sbjct: 212 GRGKRQATAHCPNSHVNNNDIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHEDFFLY 271
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 272 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 317
>gi|412992960|emb|CCO16493.1| cysteine proteinase, putative [Bathycoccus prasinos]
Length = 396
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 106/227 (46%), Gaps = 25/227 (11%)
Query: 94 KSLKLPKSFDARSAWPQC-STISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN 152
+SL LP+ FDAR W +C I + DQG CGSCWA A E ++DR CI G LS
Sbjct: 142 ESLGLPRQFDARKEWAECKGLIGTVRDQGKCGSCWAVAATEVMNDRVCIAHGKTEELSPQ 201
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--------EECDPYFDSTGCSHPGCEP 204
L+C + G GC+GG I + + GV T C PY + C HP P
Sbjct: 202 YALSC--YSAGAGCEGGNVIDTLQEAIEKGVPTGGMFGDSSSACLPY-EFEACDHPCQVP 258
Query: 205 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED---IMAEIYKNGPVEVSF-TVYE 260
+C C + ++ ++ P D I E++K G + V+F V +
Sbjct: 259 GTIAEECPTTCADGTPI-SETEMMRPTSEPYECPPGDWKCITQELHKYGSMAVTFGPVCD 317
Query: 261 DFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGED-------YWV 299
DF +K GVY+ G +G HA K+IGWG D E+ YW+
Sbjct: 318 DFYGHKHGVYEQPEGGKPLGLHATKIIGWGFEGDDEETGKGGKPYWI 364
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 81/141 (57%), Gaps = 17/141 (12%)
Query: 174 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 219
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 220 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 278
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 279 GGHAVKLIGWGTSDDGEDYWV 299
GGHAVK+IGWG + G YW+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWL 139
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 132/291 (45%), Gaps = 37/291 (12%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHDKS 95
++ +I +N+ GW+A + F T+ + ++ LG P ++ + T S
Sbjct: 141 LVDPDMINAINQG-NYGWQAGNHSAFWGMTLEEGIRYRLGTNRPPSSVMNMNEIYTGLGS 199
Query: 96 LK-LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
+ LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHP-------- 200
+LL+C GC GG AW + GVV++ C P+ D G + P
Sbjct: 258 NLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHEQDEAGPAPPCMMHSRAM 316
Query: 201 GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
G T +C V N +++ + AYR+ S+ ++IM E+ +NGPV+ V+E
Sbjct: 317 GRGKRQATARCPNSHVHANDIYQVTP-----AYRLGSNEKEIMKELLENGPVQALMEVHE 371
Query: 261 DFAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSDDGE--DYWV 299
DF Y+ G+Y H + G H+VK+ GWG T DG YW
Sbjct: 372 DFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 128/308 (41%), Gaps = 64/308 (20%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGV------------KPTPKGLL 84
+ D I+ +N+ ++ WKA + QF T + K + G K K
Sbjct: 103 VNNDRYIQALNK-AQSTWKATAHKQFEGMTFAELKRITGSYRRSYQKTRNLKKQQAKLRA 161
Query: 85 LGVPVKT----------HDKSLKLPKSFDARSAWPQCST---ISRILDQGHCGSCWAFGA 131
+ T + KL S W + + + +Q CGSC+AF +
Sbjct: 162 MNADKVTLFNGKTGQFESQDAEKLRASLPTEFDWTNVNGRDFVVPVRNQEQCGSCYAFSS 221
Query: 132 VEALSDRFCIHFGMNLS----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 187
+ R + NL+ S D++ C + GCDGG+P +Y + +G+ E
Sbjct: 222 SDMFGSR--VRIPSNLTQVPVYSPQDIVDCSAY--SQGCDGGFPFLVGKYAMDYGLTVES 277
Query: 188 CDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEI 246
CDPY + KC +C V + Q +S +Y + Y NS +M EI
Sbjct: 278 CDPY------------QGHDLGKCSNQCPVNRQQRLHSSNYYFVGGYYGNSHELSMMHEI 325
Query: 247 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG----------------HAVKLIGWGT 290
Y+NGP+ + F VY D +YK GVYKH+T + + HAV ++GWG
Sbjct: 326 YQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELKAQGLSEDEMIPHFEVVNHAVLMVGWGV 385
Query: 291 SDDGEDYW 298
++G YW
Sbjct: 386 -ENGTPYW 392
>gi|156708124|gb|ABU93320.1| cathepsin B11 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 399
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 105/212 (49%), Gaps = 25/212 (11%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC 157
+P ++D+R+ +P C +++ +LDQ CGSCWAF + +D CI+ N+ S + C
Sbjct: 49 IPTAYDSRAVYPNCKSLTSVLDQKKCGSCWAFATTGSAADALCINGIANVIPSPQRQI-C 107
Query: 158 CGFLCGD--------GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 209
C C + GC+GG +A Y ++G+V++EC PY +++ S P
Sbjct: 108 CDNKCLEIPLKQCDYGCNGGVLTAASLYIENNGIVSDECIPYEENS--SKP--------- 156
Query: 210 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE-IYKNGPVEVSFTVYEDFAHYKSG 268
C C K+ K I E+ M + I G + V+ V++ F YK G
Sbjct: 157 -CFTSCSKEGVKMEVYKGLYAKIPSIMKVSENEMKKGIMIGGSIAVAMNVFQSFTLYKGG 215
Query: 269 VYKHITGDVMGGHAVKLIGWGTSDDGE-DYWV 299
VY GD +G HAV+L+GW +D E YW+
Sbjct: 216 VYNRTDGDFLGNHAVRLVGW--NDTAEVPYWI 245
>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 255
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/244 (32%), Positives = 112/244 (45%), Gaps = 35/244 (14%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGVK----PTPKGLLLGVPV 89
++ LQ I +N N WKA N F T + F +LG K P + +
Sbjct: 21 TYFLQKDFIDNIN-NQATTWKAGVN--FDPDTPKEHFLKMLGSKGVQIPNKHNIHM---Y 74
Query: 90 KTHDKSL-----KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--H 142
KTHD++ ++PK FDAR W C TI + DQG+CGSCWA A +DR C+ +
Sbjct: 75 KTHDEAYDNLFGRIPKHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATN 134
Query: 143 FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY---- 191
N LS ++ CC CG GC+GGYPI AW F G+VT E C+PY
Sbjct: 135 ADFNELLSAEEITFCC-HSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPP 193
Query: 192 --FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYK 248
+D+ G + +P +C R C L + H Y+ Y + I ++
Sbjct: 194 CPYDAEGHNTCAGKPRESNHRCTRMCYGNXDLDFDEDHRYTRDFYYLTYG--SIQKDVMT 251
Query: 249 NGPV 252
GP+
Sbjct: 252 YGPI 255
>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
Length = 467
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 132/292 (45%), Gaps = 38/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ ++IK +N GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 140 LVDPAMIKAINRG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEIYTVLGQ 198
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 199 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQ 256
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A PTP+C+
Sbjct: 257 NLLSCDTHH-QKGCRGGRLDGAWWFLRCRGVVSDNCYPF-----SGREQNDEASPTPRCM 310
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + + + N + YR+ SD ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATSRCPNSHVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWT 422
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 97/213 (45%), Gaps = 15/213 (7%)
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 154
++ + FDAR WPQC TI + D G+ WA+ L+DR CI + N LS +L
Sbjct: 85 QIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEEL 144
Query: 155 LACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK--- 210
+ C G G G + W Y HG+V+ Y + GC P P
Sbjct: 145 IFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPPIGNIPTHLY 200
Query: 211 ---CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
C +C N + H +S Y EDI E+ GPV V F VY+DF YKS
Sbjct: 201 NHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKS 260
Query: 268 GVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWV 299
GVY + + H KLIGWG ++G DYW+
Sbjct: 261 GVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWL 292
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 97/213 (45%), Gaps = 15/213 (7%)
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDL 154
++ + FDAR WPQC TI + D G+ WA+ L+DR CI + N LS +L
Sbjct: 85 QIHEEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEEL 144
Query: 155 LACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK--- 210
+ C G G G + W Y HG+V+ Y + GC P P
Sbjct: 145 IFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPPIGNIPTHLY 200
Query: 211 ---CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
C +C N + H +S Y EDI E+ GPV V F VY+DF YKS
Sbjct: 201 NHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRVYDDFFLYKS 260
Query: 268 GVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWV 299
GVY + + H KLIGWG ++G DYW+
Sbjct: 261 GVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWL 292
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 131/292 (44%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ +
Sbjct: 141 LVDQDMISAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTNMNEIHTVLVP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
+LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GERLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ A P P+C+
Sbjct: 258 NLLSCDKHN-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGQER------NEAGPEPRCM 310
Query: 213 RKCV-----KKNQLWRNSKH-------YSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVY 259
K+ + R H Y ++ AYR+ S+ ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQAIARCPNHHVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H + G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|157058761|gb|ABV03138.1| cathepsin B-84 [Myzus persicae]
Length = 220
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 108/229 (47%), Gaps = 34/229 (14%)
Query: 54 WKAARN-PQFSNYTVGQFKHLLGVKPTPKGLLLGV---PVKTHD----KSLKLPKSFDAR 105
WKA +N P+ N LLG K LLG+ P+K +D + ++P+ FD+R
Sbjct: 2 WKAKQNFPE--NTPREDIVRLLGSK-----RLLGLNKSPIKENDILYVDNGEVPEFFDSR 54
Query: 106 SAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 163
W C TI + +QG+CGSCWA G A +DR CI N +S +L CC CG
Sbjct: 55 LEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELTFCC-HTCG 113
Query: 164 DGCDGGYPISAWRYFVHHGVV-------TEECDP------YFDSTGCSHPGCEPAYPTPK 210
GC+GG P+ AW+YF HGVV T+ C P D G + +P K
Sbjct: 114 FGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPSRVPPCVRDDEGHNSCSGQPTERNHK 173
Query: 211 CVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTV 258
C +KC + HY AY +++ +Y GP+E SF V
Sbjct: 174 CSKKCYGDETINYKKNHYKTKDAYYLSNTTMQKDTMVY--GPIEASFDV 220
>gi|410966894|ref|XP_003989962.1| PREDICTED: tubulointerstitial nephritis antigen-like [Felis catus]
Length = 422
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 132/291 (45%), Gaps = 39/291 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ + +I +N GW+A + F T+ + ++ LG ++P+ +
Sbjct: 141 LVDEDMINAINRG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTNMNEIHTVLGP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH +++ LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHGPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMAPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ S + A P P+C+
Sbjct: 258 NLLSC-NTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFMGSER------DEAGPAPRCM 310
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + + + N + AYR+ S ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATARCPSSHVHANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYW 298
EDF Y+ G+Y H + G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGQETLPDGRTLKYW 421
>gi|330798471|ref|XP_003287276.1| hypothetical protein DICPUDRAFT_151351 [Dictyostelium purpureum]
gi|325082736|gb|EGC36209.1| hypothetical protein DICPUDRAFT_151351 [Dictyostelium purpureum]
Length = 317
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 70/222 (31%), Positives = 108/222 (48%), Gaps = 27/222 (12%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MN 146
V DK +P S+D R+ W +C IS I +Q CGSCWA + L+D+ CI G +
Sbjct: 31 VTYGDKYDTIPDSYDVRTTWSEC--ISPIREQKSCGSCWAQVSTGLLADKACIQTGGKIK 88
Query: 147 LSLSVNDLLACCGFL-----CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHP- 200
++LS ++ C G C +GC GG+ A+ + +++GVV + C Y S S P
Sbjct: 89 VTLSPQYMMDCDGSCTSNSGCNNGCKGGFVGKAFEFLINNGVVPDTCLSYKASKDDSCPQ 148
Query: 201 GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 260
C+ P + N + S+ + +D EI NGPV + +Y+
Sbjct: 149 SCDDGTP--------------FSNQTKFRASSCQKFPTIQDAQVEIMTNGPVVATLMLYD 194
Query: 261 DFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGEDYWV 299
DF YK ++ G+ + HAV+++GWG SD G YW+
Sbjct: 195 DFKPYKWANNIYVKGENAKTVESHAVRVVGWGKSDSGVLYWI 236
>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Nomascus leucogenys]
Length = 467
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 132/286 (46%), Gaps = 27/286 (9%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 141 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTMRPSSSVMNMHEIYTVLNP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 258 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 316
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ +++M E+ +NGPV+ V+EDF Y
Sbjct: 317 GRGKRQATAHCPNSHVNNNDIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHEDFFLY 376
Query: 266 KSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
K G+Y H + G H+VK+ GWG T DG YW
Sbjct: 377 KGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|390367767|ref|XP_787947.3| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 146
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 79/134 (58%), Gaps = 8/134 (5%)
Query: 34 DSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHD 93
D I+Q +++++VN + K WKA N F + + F+ +LG P G L + +T
Sbjct: 19 DLDIMQATVVQKVN-SLKTTWKAGIN--FEGWQLDDFRRMLGALKNPNGRLPKLENQTRI 75
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSV 151
K L P++FDAR WP C TI + DQG CGSCWAFGAVEA+SDR CI + +S
Sbjct: 76 KDL--PENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISA 133
Query: 152 NDLLACCGFLCGDG 165
DL+ CC CG+G
Sbjct: 134 EDLMTCCK-TCGNG 146
>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 360
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 24/204 (11%)
Query: 99 PKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLL 155
P+S+D R +P C I+ ++DQG+CGSCWAF +V+ +D C G++ +S SV +L
Sbjct: 141 PESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQTFADHRC-RSGLDATGVSYSVQYVL 197
Query: 156 ACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC 215
C GC+GG P++A+ + + G V C Y C KC
Sbjct: 198 DC--DRKDHGCNGGEPVNAFNFLHNTGTVLASCVGYTAGDDAVVKFCPQ-----KCDDGS 250
Query: 216 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 275
+N + + S + S + ++A +GPV +F V +DF +YKSGVY+H G
Sbjct: 251 AVENVV-------ATSGSKSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWG 299
Query: 276 DVMGGHAVKLIGWGTSDDGEDYWV 299
+GGHAV++IG+G +D G DYW
Sbjct: 300 LWLGGHAVEIIGYGVTDSGLDYWT 323
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/246 (33%), Positives = 111/246 (45%), Gaps = 54/246 (21%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCW-------------------------- 127
+SL L + FDAR WP+C I I DQ C CW
Sbjct: 56 ESLPLEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIILLFDFSSSSSH 115
Query: 128 --------AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 177
A + ++DR CI + LS +L +CC CG GC+GG+P+ A++Y
Sbjct: 116 WLFISTFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCT-SCGYGCNGGFPLLAFKY 174
Query: 178 FVHHGVVTEECDPYFDSTGCSHPGCEP------AYPTPKCVRKCVK--KNQLWRNSKHYS 229
+ GV T PY +GC P A TP C KC+ K +L ++ ++Y
Sbjct: 175 WNEIGVPTG--GPYGSKSGCKPFSIAPPTSSSTAAQTPLCQLKCISDYKRKLDKD-RYYG 231
Query: 230 ISAYRINSDPE---DIMAEIYKNGPVEVSFTVYEDFAHYKSGVY---KHITGDVMGGHAV 283
S Y I S + I EI +GPV + ++E F +YKSGVY K +G HAV
Sbjct: 232 ESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAV 291
Query: 284 KLIGWG 289
KLIGWG
Sbjct: 292 KLIGWG 297
>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 355
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 103/208 (49%), Gaps = 15/208 (7%)
Query: 98 LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLL 155
+P SFDAR+ WP C +I+ I +QG C S ++DR CI + S D L
Sbjct: 20 IPTSFDARTRWPNCPSIALIPNQGCCNSSAFQIPAAVITDRACIRSNGTSTRTYSAYDAL 79
Query: 156 ACCG---FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
ACC F C GG P+ W Y+ G+V++ C P+ S C C P P
Sbjct: 80 ACCTDCPFSQLFKCAGGDPLKVWNYWATTGLVSDSCMPFSLSPLCLGFNC-PLLCAPGYA 138
Query: 213 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK-SGVYK 271
V + + K +++ Y + I +EI NGPVE SF +Y DF H K S VY
Sbjct: 139 GSIVGDRK--KGLKVVTVAPYV-----DAIQSEIILNGPVEASFDLYLDFVHLKQSQVYN 191
Query: 272 HITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+G +G +VK+IGWG ++G +YW+
Sbjct: 192 SRSGPNLGRQSVKIIGWGV-ENGTEYWL 218
>gi|412985820|emb|CCO17020.1| cathepsin B-like cysteine proteinase [Bathycoccus prasinos]
Length = 541
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 112/228 (49%), Gaps = 30/228 (13%)
Query: 98 LPKSFDARSAWPQCST-ISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDL 154
LP+SFDAR WP+CS I DQG CGSCWA + +SDR CI G + L+ +++
Sbjct: 276 LPESFDAREKWPECSEFIGEAWDQGECGSCWAIAPTKVMSDRLCIASGGKVQERLAASEI 335
Query: 155 LACCGFLCGD----GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH---PGC-EPAY 206
L+ CG L + C+GG P A+ + GV + Y D GC+ P C P +
Sbjct: 336 LS-CGQLVSEFSFGSCEGGMPDDAYEFAKEFGVAS--GGKYGDEKGCAAYPFPPCHHPCH 392
Query: 207 --PTPKCVRKCVKK------NQLWRN--SKHYSISAYRINSDPEDIMAEIYKNGPV-EVS 255
PTP C K ++ RN ++H + + D + + EIY +GPV +
Sbjct: 393 VQPTPACPLKSDTAQCQGDLDEHTRNEVAQHIDKLIHCPDGDYDCMAREIYNSGPVSSYA 452
Query: 256 FTVYEDFAHYKSGVYK-----HITGDVMGGHAVKLIGWGTSDDGEDYW 298
T+Y++F YK G Y+ G GGH +++IGW DG W
Sbjct: 453 GTIYDEFYAYKDGAYRTSADSETRGRSHGGHVIEVIGWHKESDGTYSW 500
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 109/221 (49%), Gaps = 18/221 (8%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
K +LP+ FD+R W I+ ++DQG CGS WA SDR I +N SLS
Sbjct: 194 KPRELPEHFDSRDKWGH--LINPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSS 251
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 252 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 309
Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF Y GVY
Sbjct: 310 DRRGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 369
Query: 271 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWVC 300
+H + G H+V+++GWG ++ YW+C
Sbjct: 370 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLC 410
>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 363
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 140/300 (46%), Gaps = 35/300 (11%)
Query: 7 FLTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYT 66
+L T L LG I + ++ L + + + ++ + W P+
Sbjct: 53 YLYTQLGKLGKIEIKMIGASLLLGAVLAAPAVSHADLRTIKALDGLTWV----PELPKRF 108
Query: 67 VGQFKHLLGVKPTPKGLL-LGVPVKTHDKSLK----LPKSFDARSAWPQCSTISRILDQG 121
VG K L GV+ L+ P T S K P+S+D R +P C I+ ++DQG
Sbjct: 109 VG--KSLDGVRAMLGPLIDTSRPTITMKHSTKPPVGAPESYDFREEYPHC--ITEVVDQG 164
Query: 122 HCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
CGSCWAF +++ +D C G++ +S SV +L C GC+GG P++A+ +
Sbjct: 165 SCGSCWAFSSIQTFADHRC-RSGLDATGVSYSVQYVLDCD--RKDHGCNGGEPVNAFNFL 221
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
+ G V C Y C KC +N + + S + S
Sbjct: 222 HNTGTVLTSCVEYTAGDDAVVKFCPQ-----KCDDGSAVENIV-------ATSGAKSGSA 269
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
+ ++A +GPV +F V +DF +YKSGVY+H G +GGHAV+++G+G +D G DYW
Sbjct: 270 IDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVTDSGLDYW 325
>gi|355572434|ref|ZP_09043578.1| Dipeptidyl-peptidase I, partial [Methanolinea tarda NOBI-1]
gi|354824808|gb|EHF09050.1| Dipeptidyl-peptidase I, partial [Methanolinea tarda NOBI-1]
Length = 685
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 116/242 (47%), Gaps = 36/242 (14%)
Query: 69 QFKHLLGVKP-----TPKGLLLG--VPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQG 121
+FK LLGVKP +P+G G + + LP S D R+ +T I +QG
Sbjct: 203 EFKKLLGVKPKFSVVSPEGEEGGNDTSAPSFETLQGLPPSLDWRNNGGDFTT--PIRNQG 260
Query: 122 HCGSCWAFGAVEALSDRFCI---HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
+CGSCWAF + R I + +N + DL++C G CG GC G + +
Sbjct: 261 NCGSCWAFATLGTFESRMEIANNNPNLNPDYAEQDLVSCAG--CG-GCSGAWMDCPLNWV 317
Query: 179 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 238
++ G + E C PY +PGC +Y R S+ + I Y + +
Sbjct: 318 LNRGAMNESCYPYVARDTSCNPGCSRSY----------------RISEWHRI--YPLQN- 358
Query: 239 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 298
E + + GP+ +F VY+DF++Y G+Y+H G + G HA+ ++GWG + G YW
Sbjct: 359 -EAAIKDALTRGPIIGTFAVYQDFSYYSGGIYEHTWGSLRGYHAIVVVGWGQDERGT-YW 416
Query: 299 VC 300
+C
Sbjct: 417 IC 418
>gi|161343831|tpg|DAA06096.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 194
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/188 (36%), Positives = 94/188 (50%), Gaps = 17/188 (9%)
Query: 40 DSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDK---SL 96
+ II VN PK WKA N F+ + L+GV P K L + T+D S
Sbjct: 7 NRIIHLVNSVPKHSWKAGIN--FNPSLLTNVSRLMGVLPRNK-LSEKDTLLTYDSPAGSE 63
Query: 97 KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDL 154
LP+S+D W +C ++ I DQ +CGSCWA A S R CI M N+ LS +
Sbjct: 64 PLPESYDVTQTWSECKSVVSIRDQSNCGSCWALSTASAFSGRLCIASNMDFNIVLSGEYI 123
Query: 155 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--AYPTPKCV 212
+CC CGDGC+GG+P AW+Y +G+ T S+ GC+P +P P+
Sbjct: 124 NSCCNGKCGDGCNGGHPEKAWKYIKKNGLCT-------GGEYNSNEGCQPYSIFPCPRNS 176
Query: 213 RKCVKKNQ 220
C K+N+
Sbjct: 177 NSCSKENE 184
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 107/221 (48%), Gaps = 18/221 (8%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
K +LP+ FDAR W I I DQG CGS WA SDR I +N SLS
Sbjct: 254 KPRELPEHFDARDKWGH--LIHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSS 311
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 312 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 369
Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF Y GVY
Sbjct: 370 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 429
Query: 271 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWVC 300
+H + G H+V+++GWG ++ YW+C
Sbjct: 430 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLC 470
>gi|56756124|gb|AAW26240.1| unknown [Schistosoma japonicum]
Length = 159
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 85/147 (57%), Gaps = 7/147 (4%)
Query: 17 VISSQTFAEG-VVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLG 75
++S T E V ++ L D +I +NE+P AGWKA ++ +F +++ + L+G
Sbjct: 8 IVSQFTLLEAHVTTRNNERIEPLSDEMISFINEHPDAGWKADKSDRF--HSLDDARILMG 65
Query: 76 VKPTPKGLLLGV--PVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVE 133
+ + V HD ++++P FD+R WP C +IS+I DQ CGSCWAFGAVE
Sbjct: 66 ARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVE 125
Query: 134 ALSDRFCIHFG--MNLSLSVNDLLACC 158
A++DR CI G + LS DL++CC
Sbjct: 126 AMTDRICIQSGGQQSAELSALDLISCC 152
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 130/292 (44%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ +
Sbjct: 141 LVDQDMINAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTNMNEIHTVLRP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEAAEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A P P+C+
Sbjct: 258 NLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFVGREQ------DEAGPAPRCM 310
Query: 213 ----------RKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ + + + N + AYR+ ++ ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATARCPSSHVHANDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ G+Y H + G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
[Equus caballus]
Length = 467
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 129/292 (44%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW+A + F T+ + ++ LG ++P+ +
Sbjct: 141 LVDQDMINAINQG-NYGWRAGNHSAFWGMTLDEGIRYRLGTIRPSSSVTSMNEIHTVLGP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG+ AW + GVV++ C P+ + A P P+C+
Sbjct: 258 NLLSC-DTHNQQGCRGGHLDGAWWFLRRRGVVSDHCYPFSGRER------DEAGPAPRCM 310
Query: 213 ----------RKCVK---KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
R+ +++ N + AYR+ S ++IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATAHCPNSRVHTNDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGD--------VMGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ GVY H G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQGGVYSHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 115/263 (43%), Gaps = 32/263 (12%)
Query: 53 GWKAARNPQF--SNYTVGQFKHLLGVKPTPKGLLLGVPVKT----HDKSLKLPKSFDARS 106
GWKA ++ Y G+ L +P +PVK ++ LP FDA
Sbjct: 252 GWKAGNYSEWWGRKYDEGKVLRLGTFQPK-------IPVKAMKRLSNRGGPLPSHFDAAD 304
Query: 107 AWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD 164
WP+ +R DQG CGS WA SDRF I + L+ LLAC
Sbjct: 305 HWPRLVGEAR--DQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLLACVRR--QQ 360
Query: 165 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 224
C GG+ +AW+Y GVV +EC PY + C+ C + R
Sbjct: 361 ACSGGHLDTAWQYLRRVGVVNDECYPYIAAKN----QCKINDGDTLVSANCELPANVNRT 416
Query: 225 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-----DVMG 279
+ + AY +N++ DIM EI + G V+ VY DF Y++G+Y+H +
Sbjct: 417 AMYRMGPAYSLNNE-TDIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSA 475
Query: 280 GHAVKLIGWGTSDDGED---YWV 299
H+V+LIGWG G D YW+
Sbjct: 476 YHSVRLIGWGEERVGYDMVKYWI 498
>gi|161343825|tpg|DAA06093.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 199
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 100/200 (50%), Gaps = 27/200 (13%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI + + ++ L+ I ++NE + W A N S
Sbjct: 1 MARVLILLSVILFSVY-------MTEQAYFLEKDYINKINEKA-STWTAGFNFDPSTPKE 52
Query: 68 GQFKHLLGVK--PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRILDQ 120
K LLG K TP + L + K+ D++ ++PK FDAR W C+TI ++ DQ
Sbjct: 53 DILK-LLGSKGVQTPSKINLKM-YKSEDENYDNLFGRIPKKFDARKKWRHCTTIGKVRDQ 110
Query: 121 GHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 178
G+CGSCWA A +DR C+ + N LS +L CC CG GC+GGYPI AW F
Sbjct: 111 GNCGSCWALSTSSAFADRLCVATNGDFNQLLSAEELTFCC-HKCGYGCNGGYPIKAWERF 169
Query: 179 VHHGVVT-------EECDPY 191
HG+VT E C+PY
Sbjct: 170 KKHGLVTGGEYKSGEGCEPY 189
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 73/103 (70%), Gaps = 2/103 (1%)
Query: 198 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 256
S P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F
Sbjct: 8 SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 67
Query: 257 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW+
Sbjct: 68 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWL 109
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 124/269 (46%), Gaps = 26/269 (9%)
Query: 54 WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 111
W+A + F T+ + ++ LG ++P+ + + + LP +F+A WP
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEALPTAFEASEKWP-- 183
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 169
+ I LDQG+C WAF SDR IH M LS +LL+C GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGG 242
Query: 170 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWR 223
AW + GVV++ C P+ D G + P + + R+ N
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVN 302
Query: 224 NSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 277
N+ Y ++ AYR+ S+ +IM E+ +NGPV+ V+EDF YK G+Y H ++
Sbjct: 303 NNNIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPER 362
Query: 278 ---MGGHAVKLIGWG--TSDDGE--DYWV 299
G H+VK+ GWG T DG YW
Sbjct: 363 YRRHGTHSVKITGWGEETRPDGRKLKYWT 391
>gi|170595047|ref|XP_001902227.1| Papain family cysteine protease containing protein [Brugia malayi]
gi|158590214|gb|EDP28925.1| Papain family cysteine protease containing protein [Brugia malayi]
Length = 246
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 20/227 (8%)
Query: 48 ENPKAGWKAARNPQFSNYTVGQ-FKHLLGVKPTPKGLLLGVPVKTHD----KSLKLPKSF 102
+N + W A +F T+ +H LG L V++ + K +LP SF
Sbjct: 32 QNGRYTWTARNYSEFWGRTLRDGIRHRLGT------LFPEQSVQSMNEMIVKPRELPTSF 85
Query: 103 DARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGF 160
DAR WP + I I DQG C S WA +DR + N+SLS +L+C
Sbjct: 86 DARQKWP--NFIHPIQDQGECASSWAQSTAATSADRLALITDGRQNVSLSAQQILSCNQH 143
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQ 220
GC+GGY AW Y GVV+EEC PY CE R+C +
Sbjct: 144 R-QKGCEGGYLDRAWWYIRKFGVVSEECYPYVSGITKKPEICEMQKSRHTEGRECPSGHA 202
Query: 221 LWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 266
NS+ Y + +YR++S +DIM+EI NGPV+ +F V+ DF Y+
Sbjct: 203 ---NSRVYRTTPSYRVSSKEKDIMSEILTNGPVQATFLVHGDFFMYR 246
>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 422
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/246 (33%), Positives = 111/246 (45%), Gaps = 47/246 (19%)
Query: 98 LPKSFDARSAWPQCS-TISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDL 154
LP SFDAR + C+ I + +QG C +CWA AV +DR CI G ++ LS+ L
Sbjct: 145 LPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSLGYL 204
Query: 155 LACCGFLCG----DGCDGGYPISAWRYFVHHGVVT------------------EE----- 187
+CC G +GC G + +HG+VT EE
Sbjct: 205 TSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGRNFRFESFKLSGEYKPPEELGNDD 264
Query: 188 -CDPYFDSTGCSH-PGCEPAYPT-------PKCVRKCVKK--NQLWRNSKHYSISAYRIN 236
C PY C+H PG E YP P C C K + H + S R+
Sbjct: 265 GCWPY-PFPKCNHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLP 323
Query: 237 SDPEDIMAEIYKNGPVE---VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 293
PE I EI+ NGP+ T+YEDF + VY H TG ++ H +KLIGWG +
Sbjct: 324 IGPEKIKQEIFDNGPLRXXAAMMTLYEDF-DLQVCVYVHKTGQMLAAHTLKLIGWGV-ES 381
Query: 294 GEDYWV 299
G++YW+
Sbjct: 382 GQEYWL 387
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 127/292 (43%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW A + F T+ + ++ LG ++P+ +
Sbjct: 128 LVDQDMINAINQG-NYGWWAGNHSAFWGMTLDEGIRYRLGTMRPSSSVTNMNEIHTVLRP 186
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 187 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 244
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A P P+C+
Sbjct: 245 NLLSC-DTHNQRGCHGGRLDGAWWFLRRRGVVSDHCYPFVGREQ------DEAGPAPRCM 297
Query: 213 RKCVKKNQLWR-------------NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ R N + AYR+ S+ ++IM E+ +NGPV+ V+
Sbjct: 298 MHSRAMGRGKRQATARCPSSHAHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVH 357
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+SG+Y H + G H+VK+ GWG T DG YW
Sbjct: 358 EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 409
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/224 (34%), Positives = 108/224 (48%), Gaps = 24/224 (10%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
K +LP+ FDAR W I + DQG CGS WA SDR I +N SLS
Sbjct: 198 KPRELPEHFDARDKWGH--LIHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSS 255
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC---EPAYPT 208
LL+C GC+GGY AW Y GVV + C PY C + Y
Sbjct: 256 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPKRDYTN 314
Query: 209 PKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 267
+ +R C +Q +S + ++ Y+++S EDI E+ NGPV+ +F V+EDF Y
Sbjct: 315 RQGLR-CPSGDQ---DSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAG 370
Query: 268 GVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWVC 300
GVY+H + G H+V+++GWG ++ YW+C
Sbjct: 371 GVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLC 414
>gi|308163309|gb|EFO65659.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 309
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 115/252 (45%), Gaps = 24/252 (9%)
Query: 54 WKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCST 113
WKA + + T FK +L + P+ + P FD R +PQC
Sbjct: 31 WKAGIPERLKSLTKSDFKRMLSADSPRTQPSMVRPIHVPESEDPAPDHFDFREEYPQC-- 88
Query: 114 ISRILDQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGY 170
I+ ++D G C S WA AV+A S R C+ G++ S +L+C +GC G
Sbjct: 89 ITEVIDIGLCSSSWAHSAVDAFSHRRCLT-GLDQEATRYSAQYILSCAS---TNGCFGFS 144
Query: 171 PIS--AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 228
AW + GV E C Y D + + ++P P C + L + Y
Sbjct: 145 TQGDIAWDFIATTGVPLESCVKYTD-----YNETQSSWPCPSV---CNDNSFL----EIY 192
Query: 229 SISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 287
Y + + E + + GP++ F VYEDF +Y G+Y H G+ G +V+++G
Sbjct: 193 KPDGYEGVGFNSERLKRAVAFRGPMQAMFAVYEDFTYYLEGIYSHTYGNRAGFLSVEIVG 252
Query: 288 WGTSDDGEDYWV 299
+GTSD+G+DYW+
Sbjct: 253 YGTSDEGQDYWI 264
>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
Length = 228
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 88/158 (55%), Gaps = 19/158 (12%)
Query: 89 VKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
VK + +P +FDAR+ WP C +I+ I DQ CGSCWAF A EA SDRFCI + +N
Sbjct: 72 VKHDIQEDTIPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVN 131
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYF-----DS 194
LS D+L+CC CG GC+GGYPI+AW+Y V G T C PY ++
Sbjct: 132 TLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGET 190
Query: 195 TG-CSHPGCEP-AYPTPKCVRKCVKKNQ--LWRNSKHY 228
G + P C Y TP CV KC N +++ KH+
Sbjct: 191 VGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKHF 228
>gi|226472634|emb|CAX71003.1| hypotherical protein [Schistosoma japonicum]
Length = 458
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 144/319 (45%), Gaps = 56/319 (17%)
Query: 11 CLLILGVISSQTFAEGVVSKLKLDSHIL---QDSIIKEVNENPKAGWKAARNPQFSNYTV 67
C V SQ E L+LD + L IK +N + WKA P++S YT+
Sbjct: 127 CFTATKVNHSQRMIEYKSPVLQLDENQLYKVDTKFIKAINAK-QNSWKATIYPEYSKYTI 185
Query: 68 GQFKHLLG-VKPTPKGLLLGVPVKTHDKS-----LKLPKSFDARSAWPQC--STISRILD 119
+ + G + T K + +P K + L LPK FD + P+ S ++ + +
Sbjct: 186 KEMRRRAGGSRSTFKRQNVQLPKKNLTSAMMLELLALPKEFDWVNR-PEGLRSPVTPVRN 244
Query: 120 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWR 176
Q CGSC+AF + A+ R + F + LS D++ C + +GCDGG+P + A +
Sbjct: 245 QKTCGSCYAFASTAAIEARIRLASRFRLQPILSPQDIIDCSPY--SEGCDGGFPYLVAGK 302
Query: 177 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 236
+ G V E+C+PY TG C N+L +++Y+ + I
Sbjct: 303 HGEDFGFVEEKCNPY---TGVKSGTC----------------NRLLGCTRYYTTDYHYIG 343
Query: 237 ----SDPEDIMA-EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG-----------G 280
+ ED+M E+ KNGP V F VY DF YKSGVY H D++
Sbjct: 344 GYYGATNEDLMKLELVKNGPFPVGFEVYGDFLQYKSGVYSHT--DIINNHHPFNPFELTN 401
Query: 281 HAVKLIGWGTSDDGE-DYW 298
HAV L+G+G + YW
Sbjct: 402 HAVLLVGYGIDNSSNLPYW 420
>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
[Pongo abelii]
Length = 436
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 124/275 (45%), Gaps = 38/275 (13%)
Query: 54 WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 111
W+A + F T+ + ++ LG ++P+ + + + LP +F+A WP
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP-- 183
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 169
+ I LDQG+C WAF SDR IH M LS +LL+C GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGG 242
Query: 170 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK- 218
AW + GVV++ C P+ + A PTP C+ R+
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRER------DEAGPTPPCMMHSRAMGRGKRQATASC 296
Query: 219 -NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 276
N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H
Sbjct: 297 PNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVS 356
Query: 277 V--------MGGHAVKLIGWG--TSDDGE--DYWV 299
+ G H+VK+ GWG T DG YW
Sbjct: 357 LGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWT 391
>gi|256052325|ref|XP_002569723.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228438|emb|CCD74609.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 198
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/170 (38%), Positives = 87/170 (51%), Gaps = 32/170 (18%)
Query: 44 KEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPKSFD 103
KE P A WKA ++ +F +++ + +G + L
Sbjct: 30 KEEEHKPNAVWKAEKSNRF--HSLDDARIQMGARREESDLRR------------------ 69
Query: 104 ARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL 161
+ WP C +I+ I DQ CGS WAFGAVEA+SDR CI G N+ LS DLL+CC
Sbjct: 70 -KKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCEH- 127
Query: 162 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
CGDG +GG+P AW Y+V G+VT S+ +H C+P YP PKC
Sbjct: 128 CGDGFEGGFPALAWDYWVKEGIVT-------GSSKENHTVCQP-YPFPKC 169
>gi|157058771|gb|ABV03143.1| cathepsin B-16D [Aulacorthum solani]
Length = 201
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 86/172 (50%), Gaps = 18/172 (10%)
Query: 35 SHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPV-KTHD 93
++ LQ I+ +NE WKA N N F LLG K L + + KT D
Sbjct: 5 AYFLQRDFIENINEQATT-WKAGVNFD-PNTPKEHFLKLLGSKGVQIPNLNNINLYKTDD 62
Query: 94 KSLK-----LPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMN 146
+ +P+ FDAR W C TI ++ DQG+CGSCWA A +DR C+ + N
Sbjct: 63 AAYDNLFGLIPRHFDARRKWRHCQTIGKVRDQGNCGSCWAMATSSAFADRLCVATNGDFN 122
Query: 147 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY 191
LS ++ CC CG GC GGYPI AW+ F HG+VT E C+PY
Sbjct: 123 ELLSAEEITFCC-HTCGFGCHGGYPIKAWKRFNKHGLVTGGNYNSGEGCEPY 173
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 79/245 (32%), Positives = 111/245 (45%), Gaps = 28/245 (11%)
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSL----KLPKSFDARSAWPQCSTISRILDQGHCGSCWA 128
LLG + + KT D ++ K FDAR WPQC TI + ++G+ WA
Sbjct: 57 LLGTRGVEAATKSKMLYKTRDPRYIIDNQIHKEFDARKRWPQCKTIGEVHNEGNELLSWA 116
Query: 129 FGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY--PISAWRYFVHHGVV 184
+ A +DR CI N + LS +L++C G + GY + W YF HG+V
Sbjct: 117 YAATGVFADRMCIATNGNYNQLLSTEELISCSGI---KEREDGYVNRVLVWEYFKTHGLV 173
Query: 185 TEECDPYFDSTGCSHPGCEPAYPTP------KCVRKCVKKNQLWRNSKHYSISAY---RI 235
+ Y + GC Y + CV C K+ + N H +S + RI
Sbjct: 174 S--GGKYNTNEGCQPSKVPTVYNSQTKIYKRTCVEYCYGKDTINYNHDHVKVSNHYFIRI 231
Query: 236 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDG 294
+DI E+ GPV V F +++D YKSGVY K H KLIGWG ++G
Sbjct: 232 ----KDIQKEVQTYGPVSVFFDLHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGV-ENG 286
Query: 295 EDYWV 299
DYW+
Sbjct: 287 VDYWL 291
>gi|148694398|gb|EDL26345.1| tubulointerstitial nephritis antigen, isoform CRA_b [Mus musculus]
Length = 258
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 115/255 (45%), Gaps = 29/255 (11%)
Query: 54 WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 111
W A QF T+ + FK LG + P+P L + + LP+ F A WP
Sbjct: 12 WTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTASFPPRADLPEIFIASYKWPGW 71
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGG 169
+ LDQ +C + WAF +DR I +LS +L++CC GC+ G
Sbjct: 72 T--HGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSG 128
Query: 170 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQ 220
AW + G+V+ C P F ++ C A + T C K N+
Sbjct: 129 SIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNR 188
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG----- 275
+++ S YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+
Sbjct: 189 IYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP 243
Query: 276 ---DVMGGHAVKLIG 287
+ HAVKL G
Sbjct: 244 EKYKKLRTHAVKLTG 258
>gi|145517168|ref|XP_001444467.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411889|emb|CAK77070.1| unnamed protein product [Paramecium tetraurelia]
Length = 339
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 120/261 (45%), Gaps = 34/261 (13%)
Query: 45 EVNENPKAGWKAARNPQ--FSNYTVGQFKHLLGVKPTPKGLL--LGVPVKTHDKSLKLPK 100
E+N + G+ P F N T+ Q K L T + + VP+ + ++P
Sbjct: 73 EINSHNSQGYPYTLGPNNFFHNVTLMQAKTLFKNDFTQQINVEKCKVPI-----NFEIPT 127
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGF 160
F+ + ++P CS I +QG+C S ++ A SDR C LS +LL+C G
Sbjct: 128 YFNFKESYPNCS--HTIFNQGNCSSSYSIAVSSAFSDRVC-KLNQTQQLSAQNLLSCDGK 184
Query: 161 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQ 220
L GC GG+ + Y + HG+ T EC P+ G + K + KC
Sbjct: 185 L-NQGCTGGHITRSAEYIIKHGLTTNECHPF--------RGDDNFQECTKALEKC----- 230
Query: 221 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--VM 278
+ + +++ + +DI +I GPV VY+DF Y+ GVY+ + G
Sbjct: 231 -----QRFKANSFCQLQNKDDIKRDIINRGPVVAIMQVYKDFLVYRDGVYQVLEGTPRFH 285
Query: 279 GGHAVKLIGWGTSDDGEDYWV 299
GGHA+K+IGWG +G YW+
Sbjct: 286 GGHAIKIIGWG-EQNGYQYWI 305
>gi|161343827|tpg|DAA06094.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 207
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/216 (34%), Positives = 104/216 (48%), Gaps = 34/216 (15%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI + + ++ L+ I ++NE WKA N F T
Sbjct: 1 MARVLILLSVILFSVY-------MTEQAYFLEKDYINKINEQATT-WKAGVN--FDPKTP 50
Query: 68 GQFKHLLGVKPTPKGLLLGVPV-----KTHDKSL-----KLPKSFDARSAWPQCSTISRI 117
+ H+L + + KG+ + V K+ D++ ++P+ FDAR W C TI I
Sbjct: 51 KE--HILKLLGS-KGVQIPSKVNYKMYKSEDENYDNLLGRIPRKFDARKKWRNCKTIGAI 107
Query: 118 LDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAW 175
DQG+CGSCWA A +DR C+ N + LS +L CC CG GC+GGYPI AW
Sbjct: 108 RDQGNCGSCWALATSSAFADRLCVASNGNFNQLLSAEELTFCC-HKCGFGCNGGYPIKAW 166
Query: 176 RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
F+ HG+VT S GCEP Y P C
Sbjct: 167 ERFMKHGLVT-------GGDYKSREGCEP-YRVPPC 194
>gi|301777198|ref|XP_002924011.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ailuropoda
melanoleuca]
Length = 435
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 124/292 (42%), Gaps = 39/292 (13%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLGV-KPTPKGLLLGVPVKTHDK 94
++ +I +N+ GW A + F T+ + ++ LG +P+ +
Sbjct: 141 LVDQDMINAINQG-NYGWLAGNHSAFWGMTLDEGIRYRLGTFRPSSSVSNMNEIHTVLRP 199
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + + LDQG+C WAF SDR IH M LS
Sbjct: 200 GEVLPTAFEASEKWP--NLVHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 257
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV 212
+LL+C GC GG AW + GVV++ C P+ + A P P+C+
Sbjct: 258 NLLSC-DTHNQRGCRGGRLDGAWWFLRRRGVVSDHCYPFVGREQ------DEAGPAPRCM 310
Query: 213 RKCVKKNQLWR-------------NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 259
+ R N + AYR+ S E+IM E+ +NGPV+ V+
Sbjct: 311 MHSRAMGRGKRQATARCPSSHAHANDIYQVTPAYRLGSSEEEIMKELMENGPVQALMEVH 370
Query: 260 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWV 299
EDF Y+ GVY H + G H+VK+ GWG T DG YW
Sbjct: 371 EDFFLYQGGVYSHTPVSLGRPEQYRRHGTHSVKITGWGEETLPDGRTLKYWT 422
>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
paniscus]
Length = 436
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 123/269 (45%), Gaps = 26/269 (9%)
Query: 54 WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 111
W+A + F T+ + ++ LG ++P+ + + + LP +F+A WP
Sbjct: 126 WQAGNHSTFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP-- 183
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 169
+ I LDQG+C WAF SDR IH M LS +LL+C GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGG 242
Query: 170 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWR 223
AW + GVV++ C P+ D G + P + + R+ N
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVN 302
Query: 224 NSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 277
N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H +
Sbjct: 303 NNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPER 362
Query: 278 ---MGGHAVKLIGWG--TSDDGE--DYWV 299
G H+VK+ GWG T DG YW
Sbjct: 363 YRRHGTHSVKITGWGEETLPDGRTLKYWT 391
>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens]
gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
Length = 436
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 123/269 (45%), Gaps = 26/269 (9%)
Query: 54 WKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQC 111
W+A + F T+ + ++ LG ++P+ + + + LP +F+A WP
Sbjct: 126 WQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNPGEVLPTAFEASEKWP-- 183
Query: 112 STISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 169
+ I LDQG+C WAF SDR IH M LS +LL+C GC GG
Sbjct: 184 NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGG 242
Query: 170 YPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWR 223
AW + GVV++ C P+ D G + P + + R+ N
Sbjct: 243 RLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVN 302
Query: 224 NSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 277
N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H +
Sbjct: 303 NNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPER 362
Query: 278 ---MGGHAVKLIGWG--TSDDGE--DYWV 299
G H+VK+ GWG T DG YW
Sbjct: 363 YRRHGTHSVKITGWGEETLPDGRTLKYWT 391
>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
Length = 322
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 106/236 (44%), Gaps = 31/236 (13%)
Query: 73 LLGVKPTPKGLLLGVPVKTHDKSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAV 132
L+G + + L V T +P SFDAR+ WP C IS + DQG C SCWA +
Sbjct: 14 LVGTVYSQQQCLDNVVSYTDQDRANIPASFDARTQWPNC--ISPVRDQGSCSSCWAMTSS 71
Query: 133 EALSDRFCIHFGMNLS--LSVNDLLACCGFL-------CGDGCDGGYPISAWRYFVHHGV 183
L+DR CI G + LS ++ C C GC G+ + Y + +G+
Sbjct: 72 SILADRLCIASGGAIKKLLSPQYMVDCAKNCKTNSQSDCNSGCKFGFLDISMEY-LSNGI 130
Query: 184 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 243
E C PY +S C+ P QL+ S SI + D
Sbjct: 131 SAESCLPYKESDATCPSQCKDGSPI-----------QLYYGSGCISIGNLK------DAQ 173
Query: 244 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
EI KNGP+ F ++ + SG+Y+ TGD GHA ++IGWG ++G YW+
Sbjct: 174 LEIMKNGPILAVFQIFTSLYNIGSGLYRG-TGDPAEGHAARVIGWG-EENGTPYWL 227
>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain 1; AltName: Full=Dipeptidyl peptidase I
heavy chain 1; Contains: RecName: Full=Dipeptidyl
peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
peptidase I heavy chain 2; Contains: RecName:
Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
Full=Dipeptidyl peptidase I heavy chain 3; Contains:
RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
AltName: Full=Dipeptidyl peptidase I heavy chain 4;
Contains: RecName: Full=Dipeptidyl peptidase 1 light
chain; AltName: Full=Dipeptidyl peptidase I light chain;
Flags: Precursor
gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
Length = 435
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 130/269 (48%), Gaps = 30/269 (11%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
+K +N K+ W A R ++ T+ +G + P+ + + H++ +LP
Sbjct: 148 EFVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPT 206
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
S+D R+ + +S + +Q CGSC+AF + L R I + LS ++++C
Sbjct: 207 SWDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCS 265
Query: 159 GFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
+ GC+GG+P + A +Y G+V E C PY G P C+P C R
Sbjct: 266 QY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR---- 311
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH----- 272
+ +S++Y + + + + E+ ++GP+ V+F VY+DF HY+ G+Y H
Sbjct: 312 ----YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRD 367
Query: 273 -ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
+ HAV L+G+GT S G DYW+
Sbjct: 368 PFNPFELTNHAVLLVGYGTDSASGMDYWI 396
>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
Length = 459
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 130/269 (48%), Gaps = 30/269 (11%)
Query: 41 SIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPTPKGLLLGVPVKTHDKSLKLPK 100
+K +N K+ W A R ++ T+ +G + P+ + + H++ +LP
Sbjct: 172 EFVKAINTIQKS-WTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEISRLPT 230
Query: 101 SFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC 158
S+D R+ + +S + +Q CGSC+AF + L R I + LS ++++C
Sbjct: 231 SWDWRNV-RGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCS 289
Query: 159 GFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 217
+ GC+GG+P + A +Y G+V E C PY G P C+P C R
Sbjct: 290 QY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR---- 335
Query: 218 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH----- 272
+ +S++Y + + + + E+ ++GP+ V+F VY+DF HY+ G+Y H
Sbjct: 336 ----YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRD 391
Query: 273 -ITGDVMGGHAVKLIGWGT-SDDGEDYWV 299
+ HAV L+G+GT S G DYW+
Sbjct: 392 PFNPFELTNHAVLLVGYGTDSASGMDYWI 420
>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
Length = 470
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 128/277 (46%), Gaps = 44/277 (15%)
Query: 42 IIKEVNENPKAGWKAARNPQFSNYT-------VGQFKHLLGVKPTPKGLLLGVPVKTHDK 94
I+++N + ++ W+A P++ +T G K L +P P V +T
Sbjct: 179 FIEQIN-SAQSSWQAGVYPEYEKFTRNDLIRRAGGRKSRLPHRPRPAP----VSEETRLA 233
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVN 152
+ +LP+SFD R + +S I DQG CGSC+AF ++ L R + + LS
Sbjct: 234 AAQLPESFDWRKVM-GLNFVSPIRDQGQCGSCYAFASMGMLEARLRVLTNNTQQFVLSPQ 292
Query: 153 DLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTP 209
++++C + GC+GG+P + A +Y GVV EEC PY DS+ C Y T
Sbjct: 293 EIVSCGKY--SQGCEGGFPYLIAGKYAEDFGVVLEECYPYEGKDSSCKDTSRCGRGYAT- 349
Query: 210 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 269
+ + + + E + E+ KNGP+ V+F VY DF HYK GV
Sbjct: 350 ----------------NYRYVGGFYGGCNEELMQLELVKNGPMAVAFEVYSDFMHYKGGV 393
Query: 270 YKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWV 299
Y+H + HAV L+G+G + G +W
Sbjct: 394 YEHTGLSDPFNPFEITNHAVLLVGYGRDPETGAKFWT 430
>gi|403355691|gb|EJY77431.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 129/288 (44%), Gaps = 39/288 (13%)
Query: 20 SQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQFKHLLGVKPT 79
S+ ++ V + + +H LQ+ IIK N + A +K N +S+ T +F +K
Sbjct: 61 SENYSTNNVDRKSVFAHKLQE-IIKH-NSHDSASYKKGLNA-YSDMTDEEFFDYFNLKAD 117
Query: 80 PKGLLLGVPVKTHDKSL-KLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDR 138
KT S +P ++D W +S + +QG CGSCW F V AL
Sbjct: 118 QN--CSATNRKTFGASNGSIPTNWD----WRTYGVVSPVKNQGKCGSCWTFSTVGALESH 171
Query: 139 FCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS 198
F + +G +LS L+ C G GC+GG P A+ Y +G + EE
Sbjct: 172 FLLKYGQFRNLSEQQLVDCAGNYDNHGCNGGLPSHAFEYLKDNGGIAEET---------- 221
Query: 199 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS--AYRINSDPEDIMAEIYKNGPVEVSF 256
+YP C K + S+ + A ++ +D+ IY +GPV ++F
Sbjct: 222 ------SYPYVAVTNTCALK----KGSQSVGVKGGAVNVSLSEDDLKQAIYSHGPVSIAF 271
Query: 257 TVYEDFAHYKSGVY-----KHITGDVMGGHAVKLIGWGTSDDGEDYWV 299
V DF Y++GVY K+ DV HAV +G+GT ++ DYW+
Sbjct: 272 QVASDFRDYRAGVYTSKVCKNGPQDV--NHAVLAVGFGTDENKVDYWI 317
>gi|239793652|dbj|BAH72931.1| ACYPI000018 [Acyrthosiphon pisum]
Length = 239
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 97/203 (47%), Gaps = 33/203 (16%)
Query: 8 LTTCLLILGVISSQTFAEGVVSKLKLDSHILQDSIIKEVNENPKAGWKAARNPQFSNYTV 67
+ L++L VI + L ++ LQ I +NE WKA N F T
Sbjct: 1 MARVLMLLSVIFVSFY-------LTEQAYFLQKDFIDNINERATT-WKAGVN--FDPDTP 50
Query: 68 GQ-FKHLLGVK----PTPKGLLLGVPVKTHDKSL-----KLPKSFDARSAWPQCSTISRI 117
+ F +LG K P + + KTHD + ++P+ FDAR W +C TI +
Sbjct: 51 KEHFLKMLGSKGVQIPNKHNIHM---YKTHDAAYDNLFGRIPRHFDARRKWRRCHTIGAV 107
Query: 118 LDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAW 175
DQG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 108 RDQGNCGSCWAMATSSAFADRLCVATNTDFNELLSAEEITFCC-HSCGFGCNGGYPIKAW 166
Query: 176 RYFVHHGVVT-------EECDPY 191
F G+VT E C+PY
Sbjct: 167 ERFKKRGLVTGGDYQSGEGCEPY 189
>gi|194375129|dbj|BAG62677.1| unnamed protein product [Homo sapiens]
Length = 394
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 125/270 (46%), Gaps = 23/270 (8%)
Query: 37 ILQDSIIKEVNENPKAGWKAARNPQFSNYTVGQ-FKHLLG-VKPTPKGLLLGVPVKTHDK 94
++ +IK +N+ GW+A + F T+ + ++ LG ++P+ + + +
Sbjct: 129 LVDPDMIKAINQG-NYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMHEIYTVLNP 187
Query: 95 SLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVN 152
LP +F+A WP + I LDQG+C WAF SDR IH M LS
Sbjct: 188 GEVLPTAFEASEKWP--NLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQ 245
Query: 153 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCEPAYPT 208
+LL+C GC GG AW + GVV++ C P+ D G + P +
Sbjct: 246 NLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAM 304
Query: 209 PKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 265
+ R+ N N+ Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF Y
Sbjct: 305 GRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLY 364
Query: 266 KSGVYKHITGDV--------MGGHAVKLIG 287
K G+Y H + G H+VK+ G
Sbjct: 365 KGGIYSHTPVSLGRPERYRRHGTHSVKITG 394
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 107/221 (48%), Gaps = 18/221 (8%)
Query: 94 KSLKLPKSFDARSAWPQCSTISRILDQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSV 151
K +LP+ FDAR W I + DQG CGS W+ SDR I +N +LS
Sbjct: 180 KPRELPEHFDARDKWG--PLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSS 237
Query: 152 NDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC 211
LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 238 QQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYT 295
Query: 212 VRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 270
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF Y GVY
Sbjct: 296 NRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY 355
Query: 271 KH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWVC 300
+H + G H+V+++GWG ++ YW+C
Sbjct: 356 QHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLC 396
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.138 0.455
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,339,155,627
Number of Sequences: 23463169
Number of extensions: 235361115
Number of successful extensions: 459770
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4720
Number of HSP's successfully gapped in prelim test: 1587
Number of HSP's that attempted gapping in prelim test: 446512
Number of HSP's gapped (non-prelim): 7258
length of query: 300
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 159
effective length of database: 9,050,888,538
effective search space: 1439091277542
effective search space used: 1439091277542
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)