BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy15348
(298 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/245 (31%), Positives = 113/245 (46%), Gaps = 30/245 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
+C + A++ A+ ++ +C S V+ + A C +L CS G
Sbjct: 77 NCKSSYAISVASAVSDRICIHSNGTVK---PKLSAQQILSCCYLCGDG-----CSGGQHF 128
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+W + + GLV+GG + SN GCQP + PC H T E C P+C +C N
Sbjct: 129 ESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTE-TAVENACSNKTLFTPECKVQCYNP 187
Query: 147 NYGRGFFQDKYQINGLGLYFD-PHF---------GPFWPAFWRSFCTKYTRPLFQTNGRV 196
+YG + +D +Q G ++ P + GP +F+ + V
Sbjct: 188 DYGTRYVKDNHQ----GTHYRVPAYTAMKEIYENGPITASFYM------YQDFVNYQSGV 237
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
YA + S + V VKI+GWGEENG PYW ++F +GD G +KILRG NE IE +
Sbjct: 238 YAYN-SGKYVTTQAVKILGWGEENGTPYWLAANSFNTYWGDNGFVKILRGANECYIEEFM 296
Query: 257 NGALP 261
LP
Sbjct: 297 YAGLP 301
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 98/192 (51%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G+S++ W + GLV+GG +++ GC+P S PC H++ S PEC P PKC
Sbjct: 40 CSGGVSAAAWQYWKDAGLVSGGLYNTTDGCKPYSLAPCEHSS-QGSLPEC-VGTLPTPKC 97
Query: 140 HTRCTNDNYGRGFFQDKY------QINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+C + Y R + DKY ING GP F T Y L
Sbjct: 98 KRQC-REGYERSYDDDKYFAKNVYSINGSEKQIRTEIFQNGPVEAEF-----TAYADFLS 151
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY S +I+ ++I+GWG E+ PYW + +++ E +GD G K+LRG NE
Sbjct: 152 YKSG-VYQ-HHSRDIIGRHAIRILGWGSEDNNPYWLLANSWNEDWGDHGYFKMLRGVNEC 209
Query: 251 IIESLVNGALPK 262
IES VN +PK
Sbjct: 210 DIESFVNAGIPK 221
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 116/249 (46%), Gaps = 30/249 (12%)
Query: 26 LSCIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSG 83
++ I A ++ +C SK + VE ++ ++ RC + C G
Sbjct: 38 ITFINKHAFGAVESMSDRICIHSKNKISVELSAINLLSCCT-RCGF---------GCRGG 87
Query: 84 ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC 143
I W + G+VTGG++ ++TGCQP FP CNH + + S P C++ P P+CH C
Sbjct: 88 IPGMAWDYWKYEGIVTGGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETC 147
Query: 144 TNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
D+YG+ + +DK Y + + GP F+
Sbjct: 148 -QDDYGKPYKKDKFYGKSSYNVASEEISIMKEILLNGPVEGGFYV------YEDFLNYKS 200
Query: 195 RVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
VY + + +A ++I+GWG ++N PYW +++ Q+GD+G KILRG NE IE
Sbjct: 201 GVYKHITGSYLGGHA-IRIIGWGIQQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIE 259
Query: 254 SLVNGALPK 262
S+V LP
Sbjct: 260 SMVTAGLPN 268
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 94/194 (48%), Gaps = 16/194 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K G+VTG + +N+GC+P FPPC H + T C P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
+C D + + +DK Y + G+ D GP AF +
Sbjct: 234 EKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAF------EVYEDFL 287
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY V ++ VK+VGWG ENG PYWT +++ +G+ G +ILRG +E
Sbjct: 288 NYDGGVY-VHTGGKLGGGHAVKLVGWGIENGIPYWTCANSWNTDWGEDGFFRILRGVDEC 346
Query: 251 IIESLVNGALPKDN 264
IES V G +PK N
Sbjct: 347 GIESGVVGGVPKLN 360
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/191 (35%), Positives = 95/191 (49%), Gaps = 18/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
H +C + YGR + N + H GP AF T ++ L
Sbjct: 218 HQKCQKGYKTPYGKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEAAF-----TVHSDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y AEI +A V+I+GWG E PYW I +++ E +G+KG +ILRG++E
Sbjct: 272 YKSG-IYKYMTGAEIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDEC 329
Query: 251 IIESLVNGALP 261
IES V G LP
Sbjct: 330 GIESEVTGGLP 340
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/191 (35%), Positives = 95/191 (49%), Gaps = 18/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
H +C + YGR + N + H GP AF T ++ L
Sbjct: 218 HQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEAAF-----TVHSDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y AEI +A V+I+GWG E PYW I +++ E +G+KG +ILRG++E
Sbjct: 272 YKSG-IYKYMTGAEIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDEC 329
Query: 251 IIESLVNGALP 261
IES V G LP
Sbjct: 330 GIESEVTGGLP 340
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/191 (35%), Positives = 95/191 (49%), Gaps = 18/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
H +C + YGR + N + H GP AF T ++ L
Sbjct: 218 HQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEAAF-----TVHSDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y AEI +A V+I+GWG E PYW I +++ E +G+KG +ILRG++E
Sbjct: 272 YKSG-IYKYMTGAEIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDEC 329
Query: 251 IIESLVNGALP 261
IES V G LP
Sbjct: 330 GIESEVTGGLP 340
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 86/195 (44%), Gaps = 23/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT-PQPK 138
C G + W ++ + GLVTGG +HS+ GCQP PC H + S+P C T P P
Sbjct: 161 CEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEH-HMEGSKPNCSASPTEPTPA 219
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTR 187
C T CT +G K + G Y P GP AF K
Sbjct: 220 CETTCT---HGSSLAYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVAAF------KVYE 270
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
F VY + VK++GWGE+NG PYW + +++ +GDKG KI RG
Sbjct: 271 DFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQNGLPYWLVQNSWDYDWGDKGLFKIARG- 329
Query: 248 NEAIIESLVNGALPK 262
NE E + LPK
Sbjct: 330 NECDFEKSMTAGLPK 344
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 67/193 (34%), Positives = 96/193 (49%), Gaps = 18/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
H +C + YGR + N + H GP AF T ++ L
Sbjct: 218 HQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH-GPVEVAF-----TVHSDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y AEI +A V+I+GWG E PYW I +++ E +G+KG ++LRG++E
Sbjct: 272 YKSG-IYKYMTGAEIGEHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRMLRGKDEC 329
Query: 251 IIESLVNGALPKD 263
IES V LP+D
Sbjct: 330 GIESAVTSGLPRD 342
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/193 (35%), Positives = 97/193 (50%), Gaps = 18/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + + G+VTG + ++TGCQP FP C H N T P C PKC
Sbjct: 159 CLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEH-NTTGKYPACGQKIYETPKC 217
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+C + +YG+ + + + H GP SF T Y+ L
Sbjct: 218 QKKCQKGYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMH-GPV-----GSFFTVYSDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y EI + TV+IVGWG E G PYW I +++ E +G+KG +ILRG++E
Sbjct: 272 YKSG-IYKHMKGTEIGVH-TVRIVGWGVEKGTPYWLIANSWNEGWGEKGYFRILRGKDEC 329
Query: 251 IIESLVNGALPKD 263
IESLV G LP++
Sbjct: 330 DIESLVIGGLPRN 342
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 93/194 (47%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 14 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73
Query: 140 HTRCTNDN-YGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
CT++N Y G+ QDK+ ++ + H GP AF T Y
Sbjct: 74 VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAH-GPIEVAF-----TVY-ED 126
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+Q VY +A + +A VKI+GWG +NG PYW + +++ +G+KG +I+RG N
Sbjct: 127 FYQYTTGVYVHTAGKSLGGHA-VKILGWGVDNGTPYWLVANSWNVNWGEKGYFRIIRGLN 185
Query: 249 EAIIESLVNGALPK 262
E IE LP
Sbjct: 186 ECGIEHSAVAGLPD 199
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 96/195 (49%), Gaps = 22/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + + K GLVTG N CQ SFPPC H +T P CK P P+C
Sbjct: 165 CNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHVASTKYPPCKG-EVPTPEC 223
Query: 140 HTRCTNDN-YGRGFFQDKYQ-INGLGLYFDP--------HFGPFWPAF--WRSFCTKYTR 187
+C +D+ R + +D Y+ + DP + GP AF + F T Y
Sbjct: 224 KKKCDDDSKVKRPYNEDLYKGQKSYSVSSDPKAIMTEIMNNGPVEVAFTVYEDFVT-YKS 282
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
++Q + E + VK++GWG EN PYW IV+++ E +GD+GT KILRG
Sbjct: 283 GVYQ--------HVTGEQLGGHAVKMIGWGVENDTPYWLIVNSWNETWGDQGTFKILRGS 334
Query: 248 NEAIIESLVNGALPK 262
NE IE V ALP+
Sbjct: 335 NECGIEDEVVTALPQ 349
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 94/195 (48%), Gaps = 18/195 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K G+VTG + +N GC+P FPPC H + T C P PKC
Sbjct: 173 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 232
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C +D + + +DK+ + + H GP AF +
Sbjct: 233 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTH-GPLEIAF------EVYEDF 285
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY V ++ VK++GWG ++G PYWT+ +++ +G+ G +ILRG +E
Sbjct: 286 LNYDGGVY-VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDE 344
Query: 250 AIIESLVNGALPKDN 264
IES V G +PK N
Sbjct: 345 CGIESGVVGGIPKLN 359
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 94/195 (48%), Gaps = 18/195 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K G+VTG + +N GC+P FPPC H + T C P PKC
Sbjct: 164 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 223
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C +D + + +DK+ + + H GP AF +
Sbjct: 224 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTH-GPLEIAF------EVYEDF 276
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY V ++ VK++GWG ++G PYWT+ +++ +G+ G +ILRG +E
Sbjct: 277 LNYDGGVY-VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDE 335
Query: 250 AIIESLVNGALPKDN 264
IES V G +PK N
Sbjct: 336 CGIESGVVGGIPKLN 350
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 94/195 (48%), Gaps = 18/195 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K G+VTG + +N GC+P FPPC H + T C P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C +D + + +DK+ + + H GP AF +
Sbjct: 234 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTH-GPLEIAF------EVYEDF 286
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY V ++ VK++GWG ++G PYWT+ +++ +G+ G +ILRG +E
Sbjct: 287 LNYDGGVY-VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDE 345
Query: 250 AIIESLVNGALPKDN 264
IES V G +PK N
Sbjct: 346 CGIESGVVGGIPKLN 360
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 94/194 (48%), Gaps = 16/194 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K G+VTG +N+GC+P FPPC H + T C P PKC
Sbjct: 189 CNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 248
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
RC + + + +DK Y + G+ D GP AF +
Sbjct: 249 EKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIAF------EVYEDFL 302
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY V ++ VK++GWG E+G PYWT+ +++ +G+ G +ILRG +E
Sbjct: 303 NYDGGVY-VHTGGKLGGGHAVKLIGWGIEDGIPYWTVANSWNTDWGEDGFFRILRGVDEC 361
Query: 251 IIESLVNGALPKDN 264
IES V G +PK N
Sbjct: 362 GIESGVVGGIPKLN 375
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 94/194 (48%), Gaps = 16/194 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K G+VTG + +N+GC+P FPPC H + T C P PKC
Sbjct: 175 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 234
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
+C D + + +DK Y + G+ D GP AF +
Sbjct: 235 EKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLEIAF------EVYEDFL 288
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY V ++ VK++GWG E+G PYWT +++ +G+ G +ILRG +E
Sbjct: 289 NYDGGVY-VHTGGKLGGGHAVKLIGWGIEDGIPYWTCANSWNTDWGEDGFFRILRGVDEC 347
Query: 251 IIESLVNGALPKDN 264
IES V G +PK N
Sbjct: 348 GIESGVVGGIPKLN 361
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 92/193 (47%), Gaps = 18/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++ K G TGG++ + GC+P S PC T+ P C T P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+CTN NY + DK+ ++ + H GP AF T Y
Sbjct: 210 VNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAH-GPVEAAF-----TVY-EDF 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q VY + E+ +A ++I+GWG +NG PYW + +++ +G+ G +I+RG NE
Sbjct: 263 YQYKSGVYVHTTGEELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321
Query: 250 AIIESLVNGALPK 262
IE V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 93/193 (48%), Gaps = 18/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++ K G TGG++ S GC+P S PC T+ P+C P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSC 209
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+CTN+NY + DK+ ++ + H GP AF T Y
Sbjct: 210 VNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAH-GPVEAAF-----TVY-EDF 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q VY + E+ +A ++I+GWG +NG PYW + +++ +G+ G +I+RG NE
Sbjct: 263 YQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321
Query: 250 AIIESLVNGALPK 262
IE V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 86/193 (44%), Gaps = 34/193 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--CKTLATPQP 137
C+ G S + W + KRGLVTGG + SN GCQP PPCNH P C + P
Sbjct: 156 CNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETP 215
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
+C C N NY + F +D + G+ D H C+ R + +G
Sbjct: 216 QCTLNCYNPNYSKPFLKDISK----GIRIDWH------------CSGMIRNELKKHGPAT 259
Query: 198 AV----------------SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
A+ + +++ TVK++GWG G YW +++G +GDKG
Sbjct: 260 AIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQYWLAANSWGTSWGDKGFF 319
Query: 242 KILRGRNEAIIES 254
KI RG NE + E
Sbjct: 320 KIRRGYNECLFED 332
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 90/197 (45%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213
Query: 140 HTRCT-NDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
CT N Y + QDK+ QI L GP AF T Y
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEIL----KNGPIEVAF-----TVY 264
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
+Q VY +A A + +A VKI+GWG +NG PYW + +++ +G+KG +I+R
Sbjct: 265 -EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWNINWGEKGYFRIIR 322
Query: 246 GRNEAIIESLVNGALPK 262
G NE IE +P
Sbjct: 323 GLNECGIEHSAVAGIPD 339
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 90/197 (45%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213
Query: 140 HTRCT-NDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
CT N Y + QDK+ QI L GP AF T Y
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEIL----KNGPIEVAF-----TVY 264
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
+Q VY +A A + +A VKI+GWG +NG PYW + +++ +G+KG +I+R
Sbjct: 265 -EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWNINWGEKGYFRIIR 322
Query: 246 GRNEAIIESLVNGALPK 262
G NE IE +P
Sbjct: 323 GLNECGIEHSAVAGIPD 339
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 96/193 (49%), Gaps = 19/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W + + GLVTGG + ++ GC+P S PC H + S P C T P PKC
Sbjct: 154 CNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C YG+ + DK Y I+ GP F T Y L
Sbjct: 212 VHLCRK-GYGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADF-----TVYADFLS 265
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY S +++ ++I+GWG ENG PYW + +++ E +GD G KILRG++E
Sbjct: 266 YKSG-VYQ-HQSGDVLGGHAIRILGWGTENGTPYWLVANSWNEDWGDHGYFKILRGKDEC 323
Query: 251 IIESLVNGALPKD 263
IE +N +PK+
Sbjct: 324 GIEDDINAGIPKN 336
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 100/193 (51%), Gaps = 20/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C+ G + W++ K+GLV+GG + S+ GCQP + PC +HAN T P C PK
Sbjct: 151 CNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAPCEHHANGT--RPPCSG-GGRTPK 207
Query: 139 CHTRCTNDNYGRGFFQDK-YQINGLGLYFDP--------HFGPFWPAFWRSFCTKYTRPL 189
CHT C N++Y + +DK + + + DP + GP AF + Y+ L
Sbjct: 208 CHTFCENEDYSLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGPVEAAF-----SVYSDFL 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G V S ++ ++I+GWG ENG PYW + +++ +GD GT KIL+G +
Sbjct: 263 NYKSGVYRHVKGS--LLGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGTFKILKGSDH 320
Query: 250 AIIESLVNGALPK 262
IE + LP+
Sbjct: 321 CGIEGSIVAGLPQ 333
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 94/192 (48%), Gaps = 17/192 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + RG+VTG + +++GC+P FPPC H N T CK P PKC
Sbjct: 192 CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 251
Query: 140 HTRCTNDNYGRGFFQDKY---QINGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLF 190
+C + NYG+ + DKY Q+ + + GP +F YT L+
Sbjct: 252 VKKC-DKNYGKSYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASF-----EVYTDFLY 305
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
T G V+ S + VK++GWG + G PYW +++ +G+ G +ILRG NE
Sbjct: 306 YTGGIYKHVAGS--MGGGHAVKVLGWGIDQGVPYWLAANSWNTDWGEDGYFRILRGVNEC 363
Query: 251 IIESLVNGALPK 262
IES + +PK
Sbjct: 364 GIESGIIAGIPK 375
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 93/194 (47%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + RG+VTG + +++GC+P FPPC H N T CK P PKC
Sbjct: 58 CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 117
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
+C + NYG+ + DKY G +Y GP +F YT
Sbjct: 118 VKKC-DKNYGKSYKADKYY--GQSVYNVESNVESIQKEIMTLGPVEASF-----EVYTDF 169
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L+ T G V+ S + VK++GWG + G PYW +++ +G+ G +ILRG N
Sbjct: 170 LYYTGGIYKHVAGS--MGGGHAVKVLGWGIDQGVPYWLAANSWNTDWGEDGYFRILRGVN 227
Query: 249 EAIIESLVNGALPK 262
E IES + +PK
Sbjct: 228 ECGIESGIIAGIPK 241
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 88/193 (45%), Gaps = 17/193 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W K GLVTGG++ + GC+P S PC P C P PKC
Sbjct: 154 CEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKC 213
Query: 140 HTRCTN-DNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPL 189
CT+ +NY + QDK Y + GP AF T Y
Sbjct: 214 VDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAF-----TVY-EDF 267
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q VY +A A + +A VKI+GWG +NG PYW + +++ +G+KG +I+RG NE
Sbjct: 268 YQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNE 326
Query: 250 AIIESLVNGALPK 262
IE +P
Sbjct: 327 CGIEHSAVAGIPD 339
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 92/197 (46%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 155 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKC 214
Query: 140 HTRCTNDN-YGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
CT++N Y + QDK+ QI L GP AF T Y
Sbjct: 215 VEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEIL----KNGPVEVAF-----TVY 265
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
+Q VY ++ A + +A VKI+GWG +NG PYW + +++ +G+KG +I+R
Sbjct: 266 -EDFYQYTTGVYVHTSGASLGGHA-VKILGWGVDNGTPYWLVANSWNVNWGEKGYFRIIR 323
Query: 246 GRNEAIIESLVNGALPK 262
G NE IE +P
Sbjct: 324 GLNECGIEHSAVAGIPD 340
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 93/200 (46%), Gaps = 33/200 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W + + GLVTGG + +N GC+P S PC H + S P C T P PKC
Sbjct: 154 CNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
C YG+ + DK HFG + S K + NG V A
Sbjct: 212 VHLCRK-GYGKDYQDDK------------HFGK--KVYSISSDEKQIQTEIFKNGPVEAD 256
Query: 199 ---------------VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
S +++ ++I+GWG ENG PYW +++ E +GD G KI
Sbjct: 257 FIVLADFLSYKSGVYQHHSDDVIGGHAIRILGWGTENGTPYWLAANSWNEDWGDHGYFKI 316
Query: 244 LRGRNEAIIESLVNGALPKD 263
LRG++E IE +N +PK+
Sbjct: 317 LRGKDECGIEEDINAGIPKN 336
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 92/194 (47%), Gaps = 20/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+VTGG+ ++TGCQP FP C H + PEC + +PKC
Sbjct: 159 CQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPYPFPKCEH-HTKGRYPECGEIIYMKPKC 217
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
H +C Y + +DKY + + H GP +F +
Sbjct: 218 HQKCQK-GYKTPYEKDKYYGKVSYNLLKNEDSIKKEIMMH-GPVEASF------RVHSDF 269
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Y +I ++ V+I+GWG E PYW I +++ E +G+KG ++LRG++E
Sbjct: 270 LNYKSGIYKHMTGIDIGSH-VVRIIGWGVEKETPYWLIANSWNEDWGEKGYFRMLRGKDE 328
Query: 250 AIIESLVNGALPKD 263
IES V LP+D
Sbjct: 329 CGIESAVTSGLPRD 342
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 91/193 (47%), Gaps = 18/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++ K G TGG++ + GC+P S PC + P C P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPAC 209
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+CTN NY + DK+ +++ + H GP AF T Y
Sbjct: 210 VNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAH-GPVEAAF-----TVY-EDF 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q VY + E+ +A ++I+GWG +NG PYW + +++ +G+ G +I+RG NE
Sbjct: 263 YQYKTGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321
Query: 250 AIIESLVNGALPK 262
IE V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 93/191 (48%), Gaps = 17/191 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + GLVTGG ++S+ GCQP P C+H +P C PKC
Sbjct: 164 CQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQP-CPKEEAKTPKC 222
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
+C NY + DK Y ++ + GP AF T Y L
Sbjct: 223 SKKC-EANYNVTYKDDKHYGKNSYSVDSVEKIMTEIMTNGPVEAAF-----TVYEDFLSY 276
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+G VY E+ +A VKI+GWGE+NG PYW + +++ +G++G ILRG++E
Sbjct: 277 KSG-VYQHRTGQELGGHA-VKILGWGEDNGTPYWIVANSWNPDWGNQGFFNILRGKDECG 334
Query: 252 IESLVNGALPK 262
IES + LPK
Sbjct: 335 IESQIVAGLPK 345
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 92/193 (47%), Gaps = 22/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 150 CNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCSGEGGDTPKC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
C Y + +DK+ G Y P GP AF + Y+
Sbjct: 209 SKIC-EPGYSPSYKEDKH--FGCDTYSVPSDEKEIMVEIYKNGPVEAAF-----SVYSDF 260
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G V+ E+V V+I+GWG ENG PYW + +++ +GD G KILRGR+
Sbjct: 261 LLYKSGVYQHVTG--EMVGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGRD 318
Query: 249 EAIIESLVNGALP 261
IES + +P
Sbjct: 319 HCGIESEIVAGIP 331
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 91/196 (46%), Gaps = 23/196 (11%)
Query: 77 IWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
++ C+ G W + + G+VTGG + + GCQP PPC + E + Q
Sbjct: 150 VFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQ 203
Query: 137 P-----KCHTRCTNDN---YGRGFFQ--DKYQINGLGLYFDPH-FGPFWPAFWRSFCTKY 185
P KC +C D+ Y + ++ D Y + + D +GP +F
Sbjct: 204 PTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPIEASF------DV 257
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
VY + +A + VK++GWG E G PYW +V+++GEQ+GDKG KILR
Sbjct: 258 YDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKGMFKILR 317
Query: 246 GRNEAIIESLVNGALP 261
G +E IES +P
Sbjct: 318 GTDECGIESSCTAGVP 333
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 92/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G SS W + K GLV+GG ++S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPYTISPCEH-HVNGSRPPCTGEGGDTPEC 206
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+RC Y + QDK Y + G GP AF T Y +
Sbjct: 207 ISRC-EAGYSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAF-----TVYEDFVM 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VS S ++ +K++GWGEE+G PYW +++ +GD G KILRG N
Sbjct: 261 YKSGVYQHVSGS--VLGGHAIKVLGWGEEDGIPYWLCANSWNTDWGDNGFFKILRGSNHC 318
Query: 251 IIESLVNGALPK 262
IES + +PK
Sbjct: 319 GIESEIVAGIPK 330
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + +GLV+GG + S++GCQP PC H T +P + TP KC
Sbjct: 150 CNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQPCAEGGRTP--KC 207
Query: 140 HTRCTNDNYGRGFFQD-KYQINGLGLYFDP--------HFGPFWPAFWRSFCTKYTRPLF 190
H C N+NY + +D + + + DP GP AF + Y+ +
Sbjct: 208 HRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAF-----SVYSDFMN 262
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V S ++ ++I+GWG E G PYW + +++ +GDKGT KILRG +
Sbjct: 263 DKSGVYRHVKGS--LLGGHAIRILGWGVEKGTPYWLVANSWNTDWGDKGTFKILRGSDHC 320
Query: 251 IIESLVNGALPK 262
IE V LP+
Sbjct: 321 GIEGSVVTGLPR 332
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 84/193 (43%), Gaps = 19/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 138
C G +W + + G V+GG ++SN GCQP + PPC N C T + P
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C +C N NY F D Y+ G P+ GP F+ R L
Sbjct: 190 CEKKCYNPNYYTSFRTDIYK--GKYYKLSPYMAMKDIFDNGPITTQFYM------YRDLV 241
Query: 191 QTNGRVYAVSASAEIVAYA--TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
VY ++ + +VKI GWGEENG PYW + ++FG +G GT KI RG +
Sbjct: 242 DYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENGVPYWLVANSFGTDWGYNGTFKISRGND 301
Query: 249 EAIIESLVNGALP 261
+ + LP
Sbjct: 302 GCFFQEKMYAGLP 314
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 92/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H P C T P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y + N + + +GP AF Y L
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ S IV ++I+GWG E G+PYW I +++ E +G+KG +++RGR+E
Sbjct: 272 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDEC 329
Query: 251 IIESLVNGALPK 262
IES V L K
Sbjct: 330 SIESHVVAGLIK 341
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 89/193 (46%), Gaps = 23/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
C+ G W + + G+VTGG + + GCQP PPC + E + QP
Sbjct: 153 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 206
Query: 138 ---KCHTRCTNDN---YGRGFFQ--DKYQINGLGLYFDPH-FGPFWPAFWRSFCTKYTRP 188
KC +C D+ Y + ++ D Y + + D +GP +F
Sbjct: 207 RNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVYGPIEASF------DVYDD 260
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
VY + +A + VK++GWG E G PYW +V+++GEQ+GDKG KILRG +
Sbjct: 261 FMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRGTD 320
Query: 249 EAIIESLVNGALP 261
E IES +P
Sbjct: 321 ECGIESSCTAGVP 333
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 25/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI + +W + + G+VTGG + TGC P FP C+H T P C P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLG-------LYFDPHFGPFWPAFWRSFCTKYT 186
+C + Y + + QDK Y + G + P G F+ + F +
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPVDGIFY--MFEDFLVYKS 271
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
T GR +V ++++GWG ENG YW I +++ E +G+KG ++ RG
Sbjct: 272 GIYHYTTGR---------LVGGHAIRVIGWGVENGVKYWLIANSWNEGWGEKGYFRMRRG 322
Query: 247 RNEAIIESLVNGALP 261
NE IE+ +N LP
Sbjct: 323 NNECGIEARINAGLP 337
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 19/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G ++ W + + GLV+ G + + GC+P S PC H + S P C T P PKC
Sbjct: 154 CDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C YG+ + DK Y I+ GP F T Y L
Sbjct: 212 VHLCRK-GYGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADF-----TVYADFLS 265
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY S +++ ++I+GWG ENG PYW + +++ E +GD G KILRG++E
Sbjct: 266 YKSG-VYQ-HHSGDVLGGHAIRILGWGTENGTPYWLVANSWNEDWGDHGYFKILRGKDEC 323
Query: 251 IIESLVNGALPKD 263
IE +N +PKD
Sbjct: 324 GIEDDINAGIPKD 336
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 94/194 (48%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 154 CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209
Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 210 H-RCTRMCYGNQELDFKEDHHWTRDAYYLTYTTIQKDVMAYGPIEASFDVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + +A + VK++GWGEE G PYW +V+++ +Q+GD+G KILRG
Sbjct: 263 PNYKSG--VYMKTENASYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKILRGT 320
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 321 NECGIDNSTTGGVP 334
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 90/193 (46%), Gaps = 18/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++ K G TGG++ + GC+P S PC + P+C P C
Sbjct: 150 CDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPAC 209
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+CTN Y + DK+ ++ + H GP AF T Y
Sbjct: 210 VNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAH-GPVEAAF-----TVY-EDF 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q VY + E+ +A ++I+GWG +NG PYW + +++ +G+ G +I+RG NE
Sbjct: 263 YQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321
Query: 250 AIIESLVNGALPK 262
IE V G +PK
Sbjct: 322 CGIEHAVVGGVPK 334
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 93/195 (47%), Gaps = 24/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLV+GG + S+ GC+P S PC H + S P C P+C
Sbjct: 148 CNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEH-HVNGSRPPCTGEGGDTPQC 206
Query: 140 HTRCTNDNYGRGFFQDK------YQING------LGLYFDPHFGPFWPAFWRSFCTKYTR 187
+C Y G+ QDK Y ++ L +Y + GP AF T Y
Sbjct: 207 TKKC-EAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKN---GPVEGAF-----TVYED 257
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L G V+ SA V +K++GWGEENG PYW +++ +GD G KILRG
Sbjct: 258 FLLYKTGVYQHVTGSA--VGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGFFKILRGS 315
Query: 248 NEAIIESLVNGALPK 262
+ IES + +PK
Sbjct: 316 DHCGIESEIVAGIPK 330
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 68/194 (35%), Positives = 96/194 (49%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W + K GLVTGG + SN GC+P S PPC H + + P C+ PKC
Sbjct: 147 CFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPYSIPPCEH-HVNGTRPPCQGEGD-TPKC 204
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
T+C D Y + +DKY G Y P GP AF + Y
Sbjct: 205 QTKCI-DGYTPAYEKDKY--FGKKTYSVPSKQEQIMTELYKNGPVEAAF-----SVYEDF 256
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VY + +++ +KI+GWG+EN PYW +++ +G++G KILRG +
Sbjct: 257 LLYKSG-VYQ-HLTGDMLGGHAIKILGWGKENNTPYWLAANSWNTDWGNQGFFKILRGGD 314
Query: 249 EAIIESLVNGALPK 262
E IES V +P+
Sbjct: 315 ECGIESEVVAGIPQ 328
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 88/195 (45%), Gaps = 18/195 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + K G+VTG GC+P FPPC H + T CK P PKC
Sbjct: 190 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 249
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C + + + +DK+ + + H GP AF +
Sbjct: 250 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTH-GPVEVAF------EVYEDF 302
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G +Y V +I VK++GWG E G PYW + +++ +G+ G +I+RG +E
Sbjct: 303 LMYDGGIY-VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNTDWGEDGFFRIIRGIDE 361
Query: 250 AIIESLVNGALPKDN 264
IES V G LPK N
Sbjct: 362 CGIESSVVGGLPKLN 376
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 88/195 (45%), Gaps = 18/195 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + K G+VTG GC+P FPPC H + T CK P PKC
Sbjct: 149 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 208
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C + + + +DK+ + + H GP AF +
Sbjct: 209 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTH-GPVEVAF------EVYEDF 261
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G +Y V +I VK++GWG E G PYW + +++ +G+ G +I+RG +E
Sbjct: 262 LMYDGGIY-VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNTDWGEDGFFRIIRGIDE 320
Query: 250 AIIESLVNGALPKDN 264
IES V G LPK N
Sbjct: 321 CGIESSVVGGLPKLN 335
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 27/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
C+ G W + + G+VTGG +++ GCQP PPC + E + QP
Sbjct: 153 CNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPC------VRDDEGHNSCSGQPTE 206
Query: 138 ---KCHTRCTND---NYGRGFFQ--DKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYT 186
KC +C D NY + ++ D Y ++ + D +GP +F + F T Y
Sbjct: 207 RNHKCSKKCYGDETINYKKNHYKTKDAYYLSNTTMQKDTMVYGPIEASFDVYDDF-TSYE 265
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
++Q + +A + VK++GWG E G PYW +V+++GEQ+GDKG KILRG
Sbjct: 266 SGVYQK-------TENASYLGGHAVKMIGWGVEEGTPYWLMVNSWGEQWGDKGMFKILRG 318
Query: 247 RNEAIIESLVNGALP 261
+E +ES +P
Sbjct: 319 TDECGVESSCTAGVP 333
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 96/198 (48%), Gaps = 23/198 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P CK PKC
Sbjct: 150 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCKGEGGETPKC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
C Y + +DK+ G Y P GP AF + YT
Sbjct: 209 SKTC-EPGYSPSYKEDKHY--GYSSYGVPSSEQEIMAEIYKNGPVEGAF-----SVYTDF 260
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VY E+ +A ++I+GWG ENG PYW +++ +GD G KILRG++
Sbjct: 261 LVYKSG-VYQHVTGEEVGGHA-IRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGQD 318
Query: 249 EAIIESLVNGALPK-DNY 265
IES + +P+ D Y
Sbjct: 319 HCGIESEIVAGIPRTDQY 336
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 97/201 (48%), Gaps = 35/201 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W + G+VTGG + ++ GCQP FPPC H + P C T P P+C
Sbjct: 153 CNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPQC 210
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
C Y + + +DK Y L D T+ +F+ NG V A
Sbjct: 211 VRDCRK-GYEKSYSEDKHYAKKVYTLSADE--------------TQIKTEIFK-NGPVEA 254
Query: 199 -VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
+ A+ V+Y + ++I+GWG ENG PYW + +++ E +GDKG K
Sbjct: 255 DFTVYADFVSYKSGVYQRHSDDALGGHAIRILGWGTENGVPYWLVANSWNEDWGDKGYFK 314
Query: 243 ILRGRNEAIIESLVNGALPKD 263
ILRG +E IE +N +PK+
Sbjct: 315 ILRGNDECGIEDDINAGIPKE 335
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W++ + G+VTG ++ GCQP FPPC H + P C+ PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPYEFPPCEH-HVVGPRPSCEG-DVETPKC 218
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
T C + YG+ ++ + H GP F + F Y
Sbjct: 219 KTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEH-GPVEVDFEVYADF-PNYKSG 276
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ V+++GWGEENG PYW I +++ +GD G KI+RGRN
Sbjct: 277 VYQ--------HVSGGLLGGHAVRLLGWGEENGVPYWLIANSWNSDWGDNGYFKIIRGRN 328
Query: 249 EAIIESLVNGALPK 262
E IES VN +PK
Sbjct: 329 ECGIESDVNAGIPK 342
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 91/195 (46%), Gaps = 25/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI + +W + + G+VTGG + TGC P FP C+H T P C P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYF-------DPHFGPFWPAFWRSFCTKYT 186
+C + Y + + QDK Y + F P G F+ + F +
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPVDGIFY--MFEDFLVYKS 271
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
T GR +V ++++GWG ENG YW I +++ E +G+KG ++ RG
Sbjct: 272 GIYHYTTGR---------LVGGHAIRVIGWGVENGVKYWLIANSWNEGWGEKGYFRMRRG 322
Query: 247 RNEAIIESLVNGALP 261
NE IE+ +N LP
Sbjct: 323 NNECGIEARINAGLP 337
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/192 (34%), Positives = 96/192 (50%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W +GLVTGG + S+ GCQP C+H +P CK +P PKC
Sbjct: 73 CNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLKP-CKG-DSPTPKC 130
Query: 140 HTRCT---NDNYG--RGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PLF 190
+C N +Y + F Q Y + GP AF T Y P +
Sbjct: 131 ERKCEAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAF-----TVYADFPTY 185
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
++ VY ++ + + +A +KI+GWGEENG PYW + +++ +GD+G KI RG +E
Sbjct: 186 KSG--VYQHTSGSALGGHA-IKILGWGEENGTPYWLVANSWNSDWGDEGFFKIKRGNDEC 242
Query: 251 IIESLVNGALPK 262
IES + G LPK
Sbjct: 243 GIESGIVGGLPK 254
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/192 (34%), Positives = 91/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 96 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 154
Query: 140 HTRCTNDNYGRGFFQDKY--------QINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK+ Q N + D +GP AF Y L
Sbjct: 155 KQTCQK-GYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPVEAAF-----DVYEDFLN 208
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ S IV ++I+GWG E PYW I +++ E +G+KG +I+RGR+E
Sbjct: 209 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDEC 266
Query: 251 IIESLVNGALPK 262
IES V L K
Sbjct: 267 SIESNVVAGLIK 278
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 93/195 (47%), Gaps = 24/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + K GLV+GG + S+ GC+P + PPC H + S P C P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGDTPQC 206
Query: 140 HTRCT---------NDNYGR---GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
++C + +YG+ D+ +I Y GP AF T Y
Sbjct: 207 LSQCEAGYTPSYREDKHYGKTSYSVLSDEAEIQ----YEIYKNGPVEGAF-----TVYED 257
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+ +G VS SA V +K++GWGEENG PYW +++ +GD G K LRG
Sbjct: 258 FVLYKSGVYQHVSGSA--VGGHAIKVLGWGEENGVPYWLCANSWNTDWGDNGFFKFLRGS 315
Query: 248 NEAIIESLVNGALPK 262
+ IES + +PK
Sbjct: 316 DHCGIESEIVAGIPK 330
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W++ + G+VTG +++ GCQP FPPC H + P C PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPYEFPPCEH-HVVGPRPSCGG-DVETPKC 218
Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
T C + YG+ ++ + H GP F + F Y
Sbjct: 219 KTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDH-GPVEVDFEVYADF-PNYKSG 276
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ V+++GWGEENG PYW I +++ +GD G KI+RGRN
Sbjct: 277 VYQ--------HVSGGLLGGHAVRLLGWGEENGVPYWLIANSWNSDWGDNGYFKIIRGRN 328
Query: 249 EAIIESLVNGALPK 262
E IES VN +PK
Sbjct: 329 ECGIESDVNAGIPK 342
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 90/192 (46%), Gaps = 21/192 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 135
C+ G W + + G+VTGG +++ GCQP PPC N + +P P
Sbjct: 153 CNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGHNSCSGQP-----TEP 207
Query: 136 QPKCHTRCTND---NYGRGFFQDK--YQINGLGLYFDP-HFGPFWPAFWRSFCTKYTRPL 189
KC C D +Y +G ++ K Y +N + D +GP +F
Sbjct: 208 NHKCSRSCYGDKTCDYKKGHYKTKNAYYLNIDTMQKDTIAYGPIEASF------DVYDDF 261
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
VY + A+ + VK++GWGEE+G PYW +V+++GEQ+G G KILRG NE
Sbjct: 262 VNYESGVYQKTEDAKYLGGHAVKMIGWGEEDGTPYWLMVNSWGEQWGANGMFKILRGTNE 321
Query: 250 AIIESLVNGALP 261
IE +P
Sbjct: 322 CGIEGSPTAGVP 333
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 90/190 (47%), Gaps = 15/190 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ--P 137
C G S+ W ++ ++G+ TGG + +T C+P FPPC+H +P TPQ
Sbjct: 158 CKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQPCGPIQPTPQCVK 217
Query: 138 KCHTRCTNDNYGRGF------FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
+C++ T + Y + + K + + H GP +F K
Sbjct: 218 ECNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAH-GPVQASF------KVAADFLT 270
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY + + +VKI+GWG+E PYW I +++ E +G+KG ++LRGRNE
Sbjct: 271 YKSGVYIRNPKLKYEGGHSVKIIGWGKEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECG 330
Query: 252 IESLVNGALP 261
IE+ + LP
Sbjct: 331 IEAQIVAGLP 340
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/250 (30%), Positives = 112/250 (44%), Gaps = 33/250 (13%)
Query: 27 SCIEARAVATATPLAFAVC--RSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
SC A ++ VC + K++VE ++ ++ C C+ G
Sbjct: 104 SCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGM---------GCNGGY 154
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
S W + + GLV+GG + S+ GC+P S PPC H + S P CK PKC +C
Sbjct: 155 PSGAWQFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKCVKQC- 212
Query: 145 NDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRPLFQTN 193
+ Y + DK+ G Y P GP A F PL+++
Sbjct: 213 EEGYSPAYGTDKHF--GTTSYGVPTSEKEIMAEIYKNGPVEGA----FLVYADFPLYKSG 266
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
VY E+ +A +KI+GWG ENG PYW +++ +GD G KILRG++ IE
Sbjct: 267 --VYQHETGEELGGHA-IKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIE 323
Query: 254 SLVNGALPKD 263
S + +PK+
Sbjct: 324 SEIVAGVPKN 333
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 94/195 (48%), Gaps = 22/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + + GLV+GG + S+ GC+P S PPC H + S P CK PKC
Sbjct: 150 CNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
+C D Y + DK+ G Y P GP AF P
Sbjct: 209 VKQC-EDGYAPVYGSDKHF--GATSYGVPSSEKEIMAEIYKNGPVEGAF----LVYADFP 261
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++++ VY E+ +A +KI+GWG ENG PYW +++ +GD G KILRG++
Sbjct: 262 MYKSG--VYQHETGEELGGHA-IKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 249 EAIIESLVNGALPKD 263
IES + +PK+
Sbjct: 319 HCGIESEIVAGIPKN 333
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 95/195 (48%), Gaps = 23/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + G+VTGG + + GCQP FPPC H + P C T P P+C
Sbjct: 32 CNGGYPSAAWQFYKDEGIVTGGLYGTEDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPEC 89
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
C + Y + + +DK Y I+ GP F + F Y
Sbjct: 90 AKTC-REGYEKSYTRDKHFGKKVYSISSDETQIKTEICKNGPVEADFNVYADF-PSYKSG 147
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q + S E++ ++I+GWG E+G PYW + +++ E +GDKG KI RG +
Sbjct: 148 VYQRH--------SKEMLGGHAIRILGWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGND 199
Query: 249 EAIIESLVNGALPKD 263
E IE+ +N +PK+
Sbjct: 200 ECGIENDINAGIPKE 214
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 92/190 (48%), Gaps = 18/190 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
+C Y + QDK Y + N + + +GP AF Y L
Sbjct: 218 KQKCQK-GYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ S IV ++I+GWG E G+PYW I +++ E +G+ G +++RGR+E
Sbjct: 272 YKSGIYRHVAGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGENGLFRMVRGRDEC 329
Query: 251 IIESLVNGAL 260
IES V L
Sbjct: 330 SIESHVVAGL 339
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 104/252 (41%), Gaps = 40/252 (15%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A ++ VC S ++F F + C W C+ G
Sbjct: 101 SCGSCWAFGAVEAMSDRVCIHSNGE---SNFHFSSDDLVSCCWTCGM-----GCNGGYPG 152
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+ W + ++GLV+GG + + GC+P PPC H + S P C PKC C
Sbjct: 153 AAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEH-HTNGSRPACDASEGNTPKCAKSC--- 208
Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-AVSASAEI 205
+ Y+IN D HFG A+ S K + NG V A S A+
Sbjct: 209 -------ESNYKIN---YSNDLHFGS--KAYSISSDVKQIQAEILQNGPVEGAFSVYADF 256
Query: 206 VAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
V Y T ++I GWG EN PYW I +++ +GD GT KILRG +
Sbjct: 257 VNYKTGVYQHIKGQFLGGHAIRIFGWGVENNTPYWLIANSWNTDWGDSGTFKILRGSDHC 316
Query: 251 IIESLVNGALPK 262
IES + LPK
Sbjct: 317 GIESGIVAGLPK 328
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 91/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDKY--------QINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK+ Q N + D +GP AF Y L
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAF-----DVYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ S IV ++I+GWG E PYW I +++ E +G+KG +++RGR+E
Sbjct: 272 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRMVRGRDEC 329
Query: 251 IIESLVNGALPK 262
IES V L K
Sbjct: 330 SIESDVVAGLIK 341
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 114/265 (43%), Gaps = 37/265 (13%)
Query: 7 SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQR 66
S+IRD S SC AV+ A ++ +C +S T A
Sbjct: 114 SKIRDQS-------------SCGSCWAVSAAETISDRICIASNGK---TQLSISADDINA 157
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE 126
C +V C+ G W K+G VTGG++ TGC+P +PPC H T
Sbjct: 158 CCGMVCGNG----CNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHY 213
Query: 127 PECKTLATPQPKCHTRC--------TND-NYGRGFFQDKYQINGLGLYFDPHFGPFWPAF 177
C + P KC C T D ++G+ + ++ + H GP AF
Sbjct: 214 KPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMTH-GPVEVAF 272
Query: 178 WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGD 237
+G VY +A A + +A VK++GWG +NG PYW +++ E +G+
Sbjct: 273 ------SVYEDFEHYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDWGE 325
Query: 238 KGTIKILRGRNEAIIESLVNGALPK 262
G +I+RG NE IES V G +PK
Sbjct: 326 NGYFRIIRGVNECGIESGVVGGIPK 350
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 90/195 (46%), Gaps = 26/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLVTGG + S+ GC+P S PPC H + + P C P+C
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEH-HVNGTRPPCTGEEGDTPQC 206
Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
+C Y G+ QDK+ QI L P G F T Y
Sbjct: 207 SNQCET-GYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNGPVEGAF---------TVYE 256
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VS SA V +K++GWGEE G PYW +++ +G+ G KILRG
Sbjct: 257 DFLLYKSGVYQHVSGSA--VGGHAIKVLGWGEEGGTPYWLAANSWNTDWGENGFFKILRG 314
Query: 247 RNEAIIESLVNGALP 261
++ IES + +P
Sbjct: 315 KDHCGIESEMVAGVP 329
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 93/195 (47%), Gaps = 22/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + + GLV+GG + S+ GC+P S PPC H + S P CK PKC
Sbjct: 150 CNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPSCKGEEGDTPKC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
C + Y + DK+ G Y P GP AF P
Sbjct: 209 MKTC-EEGYTPAYGSDKHF--GATSYGVPSSEKEIMADIYKNGPVEGAF----VVYADFP 261
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L+++ VY E+ +A +KI+GWG ENG PYW +++ +GD G KILRG++
Sbjct: 262 LYKSG--VYQHETGEELGGHA-IKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKD 318
Query: 249 EAIIESLVNGALPKD 263
IES V +PK+
Sbjct: 319 HCGIESEVVAGIPKN 333
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 94/192 (48%), Gaps = 24/192 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W + +GLVTGG S GC+P + PC H + S P C+ PKC
Sbjct: 148 CFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAPCEH-HVNGSRPPCQG-EVETPKC 205
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
T+C N+ Y + +DK+ G Y P GP AF + Y
Sbjct: 206 VTQC-NNGYSLSYPKDKH--FGQRSYSIPSQQEQIMTELYKNGPVEAAF-----SVYADF 257
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L NG V+ +++ VKI+GWGEENG PYW + +++ +GDKG KI RG +
Sbjct: 258 LLYKNGVYQHVTG--DMLGGHAVKILGWGEENGTPYWLVANSWNSDWGDKGFFKIKRGND 315
Query: 249 EAIIES-LVNGA 259
E IES +V GA
Sbjct: 316 ECGIESEMVAGA 327
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 89/195 (45%), Gaps = 17/195 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W ++ K G+ TGG++ S GC+P S PPC + P C +P P C
Sbjct: 148 CEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSC 207
Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPL 189
+CT + + G D+ + + + D GP F Y L
Sbjct: 208 EKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATF-----EVYDDFL 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
T G ++ + + + +V+I+GWG G PYW +++G Q+G+ GT ++LRG NE
Sbjct: 263 QYTTGIYVHLTGNKQ--GHLSVRIIGWGVWQGVPYWLCANSWGRQWGENGTFRVLRGTNE 320
Query: 250 AIIESLVNGALPKDN 264
+ES +PK N
Sbjct: 321 CGLESNCVSGMPKLN 335
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 90/208 (43%), Gaps = 29/208 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
C GI S W+WVH +G+ TGG + + + GC P FPPC H + P+C +
Sbjct: 210 CDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 269
Query: 134 TPQPKCHTRCTNDNYGRGFFQDK----------YQINGLGLYFDPHFGPFWPAFWRSFCT 183
P C +C N Y D+ Y +N GP P ++
Sbjct: 270 YETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTD-GPVGPIYFCDPSV 328
Query: 184 KYTR---------PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQ 234
+ + VY ++ E+ +A VKI+GWGEE G+ YW +V+++ E
Sbjct: 329 NFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHA-VKIIGWGEETGQAYWLVVNSWNED 387
Query: 235 FGDKGTIKILRGRNEAIIESLVNGALPK 262
+GD G KI G E I+ + G PK
Sbjct: 388 WGDNGLFKIALGNCE--IDDDLLGGTPK 413
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 93/191 (48%), Gaps = 17/191 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W++ + GLVTGG ++S+ GCQP + C+H +P C P PKC
Sbjct: 158 CEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPYTIKACDHHVVGKLQP-CSKDIGPTPKC 216
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
C Y + +DK Y ++G+ GP AF T Y Q
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGMSAYSVHGVEKIMTEIMTNGPVEGAF-----TVYAD-FPQ 269
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY + + +A +KI+GWG ENG YW + +++ +GD+G KILRG++E
Sbjct: 270 YKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECG 328
Query: 252 IESLVNGALPK 262
IES ++ PK
Sbjct: 329 IESQISAGEPK 339
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 90/194 (46%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLV+GG + S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEH-HVNGSRPPCTGEGGDTPEC 206
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
+C + Y + QDK+ G Y P GP AF T Y
Sbjct: 207 VRQCES-GYTPSYIQDKHY--GKTSYSVPSDEQQIQTEIYKNGPVEGAF-----TVYEDF 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L G VS SA V +K++GWGEENG PYW +++ +GD G KILRG +
Sbjct: 259 LLYKTGVYQHVSGSA--VGGHAIKVLGWGEENGTPYWLCANSWNTDWGDNGYFKILRGSD 316
Query: 249 EAIIESLVNGALPK 262
IES + +PK
Sbjct: 317 HCGIESEIVAGIPK 330
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 95/194 (48%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG + S+ GC+P S PPC H T P C P+C
Sbjct: 140 CNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGT-RPPCSGEGGETPEC 198
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
+C D Y + QDK+ G+ Y P GP AF Y+
Sbjct: 199 VKKC-EDGYTPAYKQDKHY--GVTSYGIPRSEKEIMAEIYKNGPVEGAF-----VVYSDF 250
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VY + E+ +A ++I+GWG +NG PYW +++ +G+ G +ILRG++
Sbjct: 251 LMYKSG-VYQHVSGEEVGGHA-IRILGWGVDNGTPYWLAANSWNTDWGEDGFFRILRGQD 308
Query: 249 EAIIESLVNGALPK 262
IES + +PK
Sbjct: 309 HCGIESEIVAGIPK 322
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 96/195 (49%), Gaps = 25/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G + W + + GLVTGG ++S+ GC+P S PC H + + P C PKC
Sbjct: 144 CSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSIAPCEH-HVNGTRPPCSG-EQDTPKC 201
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR- 187
C Y + QDK+ G +Y P GP AF T Y
Sbjct: 202 TGVCI-PKYSVPYKQDKH--FGSKVYNVPSDQQQIMTELYTNGPVEAAF-----TVYEDF 253
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
PL+++ VY + + +A VKI+GWGEENG P+W + +++ +GD G KILRG
Sbjct: 254 PLYKSG--VYQHLTGSALGGHA-VKILGWGEENGTPFWLVANSWNSDWGDNGYFKILRGH 310
Query: 248 NEAIIESLVNGALPK 262
+E IES + LPK
Sbjct: 311 DECGIESEMVAGLPK 325
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 91/195 (46%), Gaps = 27/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G SS W + K+GLVTGG S GC+P S PC H T P T TP KC
Sbjct: 146 CSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQGTQETP--KC 203
Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
+C D Y + +DK+ QI LY + GP AF T Y
Sbjct: 204 EKKCI-DGYLTSYLKDKHFGKRSYSLPSQQEQIM-TELYKN---GPVEAAF-----TVYA 253
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L G V+ E++ +KI+GWGEE+G PYW +++ +GDKG KI RG
Sbjct: 254 DFLLYKTGVYQHVTG--EVLGGHAIKILGWGEESGTPYWLAANSWNGDWGDKGFFKIKRG 311
Query: 247 RNEAIIESLVNGALP 261
+E IES + P
Sbjct: 312 NDECGIESEMVAGTP 326
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 91/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + +GLV+GG + S+ GCQP PC H T +P + TP KC
Sbjct: 149 CNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKPCAEGGRTP--KC 206
Query: 140 HTRCTNDNYGRGFFQD-KYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
H C N NY + +D + + + DP GP AF + Y+ +
Sbjct: 207 HKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAAF-----SVYSDFMS 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V S ++ ++I+GWG E G PYW + +++ +GD GT KILRG +
Sbjct: 262 YKSGVYRHVKGS--LLGGHAIRILGWGMEKGTPYWLVANSWNTDWGDNGTFKILRGSDHC 319
Query: 251 IIESLVNGALPK 262
IE V LP+
Sbjct: 320 GIEDSVVAGLPR 331
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 89/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + K GLV+GG + S+ GC+P + PPC H + S P C P+C
Sbjct: 64 CNGGYPSAAWDFWTKDGLVSGGLYDSHIGCRPYTIPPCEH-HVNGSRPSCSGEGGETPQC 122
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
RC Y + QDK Y + D GP AF T Y +
Sbjct: 123 VYRC-EAGYTPSYKQDKHYGKTSYSVSSDEDDIKHEIYKNGPVEGAF-----TVYEDFVL 176
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
G V+ SA + +KI+GWGEENG PYW +++ +G+ G KILRG N
Sbjct: 177 YKTGVYQHVTGSA--LGGHAIKILGWGEENGIPYWLCANSWNTDWGNNGFFKILRGSNHC 234
Query: 251 IIESLVNGALPK 262
IES + +P
Sbjct: 235 GIESEIVAGIPN 246
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 92/196 (46%), Gaps = 23/196 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI W + G+VTGG++ ++TGCQP FP C H + + + C+ P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
+ C D Y + DKY G Y+ GP F+ +
Sbjct: 221 YQTCQPD-YAIQYENDKYY--GKSSYYVTSDEVSIMKEILLNGPVEATFYV-----FDDF 272
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRG 246
L G V+ S ++ ++I+GWG N PYW +++ +Q+GDKG KILRG
Sbjct: 273 LNYKTGVYKYVTGS--LLGGHAIRIIGWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRG 330
Query: 247 RNEAIIESLVNGALPK 262
NE IES+V LPK
Sbjct: 331 SNECGIESMVTAGLPK 346
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 97/198 (48%), Gaps = 23/198 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + + P+C PKC
Sbjct: 150 CNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEH-HVNGTRPKCTGEGGDTPKC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
C Y + +DKY G Y P GP AF + ++
Sbjct: 209 SKTC-EPGYSPSYKEDKYY--GYSSYSVPSTEKEIMAEIYKNGPVEAAF-----SVFSDF 260
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VY + E++ ++I+GWG+ENG PYW + +++ +GD G KILRG +
Sbjct: 261 LTYKSG-VYK-HVAGEVLGGHAIRILGWGKENGVPYWLVGNSWNVDWGDNGFFKILRGED 318
Query: 249 EAIIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 319 HCGIESEVVAGIPRTDQY 336
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 93/193 (48%), Gaps = 22/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 173 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGSTPKC 231
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
+R Y + +DK+ G Y P GP AF + Y+
Sbjct: 232 -SRICEAGYTPSYKEDKHF--GCSSYSVPSSETEIMAEIYKNGPVEAAF-----SVYSDF 283
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G V+ E++ V+I+GWG E+G PYW + +++ +GD G KILRG++
Sbjct: 284 LLYKSGVYQHVTG--EMMGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDSGFFKILRGQD 341
Query: 249 EAIIESLVNGALP 261
IES + LP
Sbjct: 342 HCGIESEIVAGLP 354
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 89/190 (46%), Gaps = 18/190 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H P C T P+C
Sbjct: 121 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 179
Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y + N + + +GP AF Y L
Sbjct: 180 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLN 233
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ S IV ++I+GWG E PYW I +++ E +G+KG +I+RGR+E
Sbjct: 234 YKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDEC 291
Query: 251 IIESLVNGAL 260
IES V L
Sbjct: 292 SIESHVVAGL 301
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
W + KRG+VTGG+ ++TGCQP FP C H P C T P+C C
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223
Query: 147 NYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
Y + QDK Y + N + + +GP AF Y L +G
Sbjct: 224 GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
V+ S IV ++I+GWG E G+PYW I +++ E +G+KG +++RGR+E IES V
Sbjct: 279 HVTGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVV 336
Query: 258 GALPK 262
L K
Sbjct: 337 AGLIK 341
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 89/194 (45%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +G+VTG +++ GCQP FPPC H N P C P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-NTLGPLPVCDG-DVETPPC 221
Query: 140 HTRCT--------NDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
C ND YG+ ++ K + H GP F + F Y
Sbjct: 222 KRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ V+++GWGEEN PYW I +++ +GD G KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 249 EAIIESLVNGALPK 262
E IES VN +PK
Sbjct: 332 ECGIESDVNAGIPK 345
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 88/193 (45%), Gaps = 18/193 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTG +N GC+P FPPC H + T C+ P PKC
Sbjct: 190 CEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKC 249
Query: 140 HTRCT--------NDN--YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C +D+ YGR + K + + H GP AF +
Sbjct: 250 SKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTH-GPVEVAF------EVYEDF 302
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
G +Y V ++ VK++GWG + G PYW I +++ +G++G +ILRG +E
Sbjct: 303 LHYAGGIY-VHTGGKLGGGHAVKLIGWGIDQGTPYWLIANSWNTDWGEEGFFRILRGVDE 361
Query: 250 AIIESLVNGALPK 262
IES V G +PK
Sbjct: 362 CGIESGVVGGIPK 374
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +G+VTG +++ GCQP FPPC H + P C P C
Sbjct: 163 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HVIGPLPSCDG-DVETPSC 220
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAF--WRSFCTKYTRP 188
T C Y + +DK Y ++ +P GP F + F Y
Sbjct: 221 KTNC-QPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVEVDFEVYADF-PNYKSG 278
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ V+++GWGEEN PYW I +++ +GDKG KI+RG+N
Sbjct: 279 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNSDWGDKGYFKIVRGKN 330
Query: 249 EAIIESLVNGALPK 262
E IES VN +PK
Sbjct: 331 ECGIESDVNAGIPK 344
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 94/191 (49%), Gaps = 17/191 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W++ K GLVTGG ++S+ GC P + C+H +P K++ P PKC
Sbjct: 158 CEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPYTIKACDHHVVGKLQPCSKSIG-PTPKC 216
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
C Y + +DK Y ++G+ GP AF T Y Q
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGSSAYSVHGVEKIMTEIMTNGPVEGAF-----TVYAD-FPQ 269
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY + + +A +KI+GWG ENG YW + +++ +GD+G KILRG++E
Sbjct: 270 YKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSWNPDWGDQGFFKILRGQDECG 328
Query: 252 IESLVNGALPK 262
IES ++ PK
Sbjct: 329 IESQISAGEPK 339
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 109/247 (44%), Gaps = 30/247 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
SC A ++ VC +S K+H FRF A C + C+ G
Sbjct: 109 SCGSCWAFGAVEAMSDRVCVASGGKIH-----FRFSAEDLVSCCHTCG-----FGCNGGF 158
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
+ W++ ++GLV+GG SN GCQP + PC H + + P C+ PKC +C
Sbjct: 159 PGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPKCVKKC- 216
Query: 145 NDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGR 195
++Y + +DK Y I GP AF T Y L G
Sbjct: 217 QESYNVPYQKDKRFGASSYSIARHEAQIQKEIMTNGPVEGAF-----TVYEDLLHYKEGV 271
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
V+ +++ ++I+GWG ENG YW I +++ +GD G KILRG + IES
Sbjct: 272 YQHVTG--KMLGGHAIRILGWGVENGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESS 329
Query: 256 VNGALPK 262
++ LPK
Sbjct: 330 ISAGLPK 336
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 90/191 (47%), Gaps = 18/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLVTGG ++S+ GC+P + PC H + S P C P C
Sbjct: 148 CNGGYPSAAWDFWTTDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206
Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+C + ++G+ + NG+ + GP AF T Y L
Sbjct: 207 DMKCEPGYSPLYKEDKHFGKTSYSVPSNQNGIMAELFKN-GPVEAAF-----TVYEDFLL 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +S SA + +KI+GWGEENG PYW +++ +GD G KILRG +
Sbjct: 261 YKSGVYQHMSGSA--LGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHC 318
Query: 251 IIESLVNGALP 261
IES + +P
Sbjct: 319 GIESEIVAGIP 329
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 89/193 (46%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W ++ G+ TGG + ++ C+P FPPC+H + P C + P PKC
Sbjct: 156 CQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDH-HVVGQYPPCGPIK-PTPKC 213
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
+C + + + QD + + + Y P+ GP +F +
Sbjct: 214 VKQCNSQYTEKTYQQDLHHPSKV--YQLPNNAEAIQREIMAHGPVQASF------RVASD 265
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
VY + +VKI+GWG E G PYW I +++ E +G+ G K+LRG+N
Sbjct: 266 FLTYKSGVYIRDPKLKYEGGHSVKIIGWGVEQGTPYWLIANSWNEDWGENGLFKMLRGKN 325
Query: 249 EAIIESLVNGALP 261
E IE+ V LP
Sbjct: 326 ECGIEAEVVAGLP 338
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 23/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + +VTGG + + GCQP FPPC H P C T P P+C
Sbjct: 7 CNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEHHT-VGPLPNC-TGIKPTPEC 64
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
C + Y + + +DK Y I+ GP F + F Y
Sbjct: 65 AKTC-REGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADF-PSYKSG 122
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q + S E++ ++I+GWG E+G PYW + +++ E +GDKG KI RG +
Sbjct: 123 VYQRH--------SEEMLGGHAIRILGWGTEDGVPYWLVANSWNEDWGDKGYFKIRRGND 174
Query: 249 EAIIESLVNGALPKD 263
E IE +N +PK+
Sbjct: 175 ECGIEDDINAGIPKE 189
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 87/183 (47%), Gaps = 18/183 (9%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
W + KRG+VTGG+ ++TGCQP FP C H P C T P+C C
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223
Query: 147 NYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
Y + QDK Y + + +GP AF Y L +G
Sbjct: 224 GYKTPYKQDKHYGDESYNVISNEKAIQKEIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
V+ S IV ++I+GWG E G+PYW I +++ E +G+KG +++RGR+E IES V
Sbjct: 279 HVTGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVV 336
Query: 258 GAL 260
L
Sbjct: 337 AGL 339
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/193 (34%), Positives = 91/193 (47%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + G+VTGG +HS+ GCQP P C H + K L P PKC
Sbjct: 151 CNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIPKCEHHVKGPFKACGKEL--PTPKC 208
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
+C Y + F QDK Y I GP AF T Y P
Sbjct: 209 SQKC-QPGYNKTFNQDKHFGKKSYSITNNIQQIQKEIMMNGPVEAAF-----TVYADFPS 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+++ VY + + +A VKI+GWG EN PYW I +++ +GDKG KI+RG++E
Sbjct: 263 YKSG--VYQHTTGGPLGGHA-VKILGWGTENNTPYWLIANSWNPTWGDKGYFKIIRGKDE 319
Query: 250 AIIESLVNGALPK 262
IES + +PK
Sbjct: 320 CGIESSIVAGMPK 332
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 108/237 (45%), Gaps = 24/237 (10%)
Query: 28 CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
C + AV+ ++ +C S K VE ++ I+ K + C G++
Sbjct: 115 CASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCKNCGSG----------CDGGVT 164
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
+W + K G+VTGG+ ++TGC+P FP C+H C P+C C
Sbjct: 165 GYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQCKQTCQK 223
Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
Y + QDK Y + G+ + P ++ Y L +G +Y
Sbjct: 224 -GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPV--EAYLQIYEDFLNYKSG-IYRY 279
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+ I +A V+++GWG ENG YW +T+ E +G+KG +I+RGR+E +IES +
Sbjct: 280 TTGKYISGHA-VRLIGWGVENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 89/181 (49%), Gaps = 18/181 (9%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
W + KRG+VTGG+ ++TGCQP FP C H P C T P+C C Y
Sbjct: 168 WDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GY 225
Query: 149 GRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+ QDK Y + N + + +GP AF Y L +G V
Sbjct: 226 KTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYRHV 280
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+ S IV ++I+GWG E G+PYW I +++ E +G+KG +++RGR+E IES V
Sbjct: 281 TGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAG 338
Query: 260 L 260
L
Sbjct: 339 L 339
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 91/195 (46%), Gaps = 24/195 (12%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G + W+ WVHK GLV+GG SN GCQP + PC H + + P C+ PK
Sbjct: 152 CNGGFPGAAWSYWVHK-GLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPK 209
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR 187
C +C D+Y + +DK G Y P GP AF T Y
Sbjct: 210 CVKKC-QDSYTVPYAKDKRY--GSKSYSIPRHEDQIRKEIMTNGPVEGAF-----TVYED 261
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L G V+ +++ ++I+GWG EN YW I +++ +GD G KILRG
Sbjct: 262 LLHYKEGVYQHVTG--KMLGGHAIRILGWGVENNTKYWLIANSWNSDWGDNGFFKILRGE 319
Query: 248 NEAIIESLVNGALPK 262
+ IES + LPK
Sbjct: 320 DHLGIESSIAAGLPK 334
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 96/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W ++ ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 94/194 (48%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W K GLVTGG ++S GCQP PPC Y + C+ P K
Sbjct: 154 CHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYGNNT--CR--GKPAEKN 209
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y +N + D +GP ++ + F
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHRYTRDAYYLNYQIIQNDLMTYGPIEASYDVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 263 PNYKSG--VYMKTENASYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 321 NECGIDNSTTGGVP 334
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 22/192 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W ++ + G+VTGG ++S C+ FPPC+H P+C T PKC
Sbjct: 138 CQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCSHG-IEGQYPQCSTKPPVVPKC 196
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAF--WRSFCTKYTRP 188
T C + Y + +D+Y+ + + + GP +F + F T +
Sbjct: 197 ETTC-QEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYEDFMTYKSGI 255
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
G+ + TVKI+GWGEENG YW V+++ ++G+ G +I G N
Sbjct: 256 YHHVEGK---------FMNLHTVKIIGWGEENGEAYWKAVNSWNSEWGENGLFRIRLGTN 306
Query: 249 EAIIESLVNGAL 260
E IES V G L
Sbjct: 307 ECTIESQVEGGL 318
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PPC Y + K P K
Sbjct: 154 CSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209
Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + +D +GP +F + F
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHHYTRDAYYLTYGTIQYDVLAYGPIEASFEVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 321 NECGIDNSTTGGVP 334
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 89/193 (46%), Gaps = 22/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLVTGG ++S+ GC+P + PC H + S P C P C
Sbjct: 148 CNGGYPSAAWDFWATEGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCSGEGGDTPNC 206
Query: 140 HTRCTNDNYGRGFFQDKY-----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+C Y + QDK+ Q + + F GP AF T Y
Sbjct: 207 DMKC-EPGYSPSYKQDKHFGKTSYSVPSNQNSIMAELFKN--GPVEGAF-----TVYEDF 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G +S S V +KI+GWGEENG PYW +++ +GD G KILRG +
Sbjct: 259 LLYKSGVYQHMSGSP--VGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGED 316
Query: 249 EAIIESLVNGALP 261
IES + +P
Sbjct: 317 HCGIESEIVAGIP 329
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 88/201 (43%), Gaps = 29/201 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + K GLVTGG+ S GC+P S PC + PEC + PKC
Sbjct: 145 CEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKC 204
Query: 140 HTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL--------- 189
CT N++Y + QDK HFG A RS T L
Sbjct: 205 EHHCTGNNSYPIPYDQDK------------HFGASAYAIGRSAKQIQTEILAHGPVEVGF 252
Query: 190 ------FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ +Y A E+ +A VK++GWG +NG PYW +++ +G+KG +I
Sbjct: 253 IVYEDFYLYKTGIYTHVAGGELGGHA-VKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRI 311
Query: 244 LRGRNEAIIESLVNGALPKDN 264
LRG +E IES +P N
Sbjct: 312 LRGVDECGIESAAVAGMPDLN 332
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 89/204 (43%), Gaps = 43/204 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W + G+V+GG + S+ GC+P PPC H + + + P+CK + PKC
Sbjct: 154 CNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEH-HTSGNRPDCKG-NSKTPKC 211
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+C F KYQ D HF + + + VY
Sbjct: 212 QRQCVES------FDGKYQA-------DKHFAS------NVYNVRASEEDIMNEILVYG- 251
Query: 200 SASAEIVAYA---------------------TVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
A+ + YA VKI+GWGEENG PYW +++ +GD
Sbjct: 252 PVEADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWGEENGVPYWLCANSWNTDWGDG 311
Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
G KILRG N IE+ +N +PK
Sbjct: 312 GFFKILRGYNHCKIEADINAGIPK 335
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
C+ G S W + +G+VTG +++ GCQP FPPC H + P CK
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223
Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
T Q + ND YG+ ++ K + H GP F + F Y
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ V+++GWGEEN PYW I +++ +GD G KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 249 EAIIESLVNGALPK 262
E IES VN +PK
Sbjct: 332 ECGIESDVNAGIPK 345
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 89/193 (46%), Gaps = 20/193 (10%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G S W+ WVHK G+VTGG + S+ GC P C+H T P C P P+
Sbjct: 190 CNGGFPGSAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKTIPPTPR 247
Query: 139 CHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPL 189
C R Y F DK Y + GP F T Y L
Sbjct: 248 C-VRMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADF-----TVYEDFL 301
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY + + +A ++++GWG ENG PYW +++ ++GDKG KILRG +E
Sbjct: 302 HYKSG-VYQRHTDSALGGHA-IRLLGWGVENGVPYWLAANSWNTEWGDKGFFKILRGSDE 359
Query: 250 AIIESLVNGALPK 262
IES + LPK
Sbjct: 360 CGIESDIVAGLPK 372
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
C+ G S W + +G+VTG +++ GCQP FPPC H + P CK
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223
Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
T Q + ND YG+ ++ K + H GP F + F Y
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ V+++GWGEEN PYW I +++ +GD G KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 249 EAIIESLVNGALPK 262
E IES VN +PK
Sbjct: 332 ECGIESDVNAGIPK 345
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 91/196 (46%), Gaps = 25/196 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + K+GLV+GG++ S GC+P S PC + P+C P+C
Sbjct: 147 CDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPEC 206
Query: 140 HTRCTN-DNYGRGFFQDKYQINGLGLYFDP-------------HFGPFWPAFWRSFCTKY 185
+ CT+ +Y + +DK+ GL Y P GP F
Sbjct: 207 ASHCTSKSSYSVAYEKDKHY--GLSAY--PVGRKEAQIQTEILQHGPVEAGFL------V 256
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
++ +Y + E+ +A VKI+GWG ENG YW + +++ +G+KG +ILR
Sbjct: 257 YSDFYRYKSGIYTHVSGQELGGHA-VKILGWGVENGTKYWLVANSWNINWGEKGYFRILR 315
Query: 246 GRNEAIIESLVNGALP 261
GRNE IES V +P
Sbjct: 316 GRNECGIESAVVAGIP 331
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
C+ G S W + +G+VTG +++ GCQP FPPC H + P CK
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223
Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
T Q + ND YG+ ++ K + H GP F + F Y
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ V+++GWGEEN PYW I +++ +GD G KI+RG+N
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKN 331
Query: 249 EAIIESLVNGALPK 262
E IES VN +PK
Sbjct: 332 ECGIESDVNAGIPK 345
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 96/203 (47%), Gaps = 33/203 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + K+GLV+GG + S+ GC+P S PPC H + + P+C PKC
Sbjct: 150 CNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGTRPQCTGEGGDTPKC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
C Y + +DK HFG + ++ S K NG V A
Sbjct: 209 SKTC-EPGYSPSYKEDK------------HFG--YDSYSVSSNEKEIMAEIYKNGPVEGA 253
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ ++ + Y T ++I+GWG+ENG PYW + +++ +GD G KI
Sbjct: 254 FTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWGKENGVPYWLVGNSWNVDWGDSGFFKI 313
Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
+RG + IES + +P+ D Y
Sbjct: 314 VRGEDHCGIESEIVAGIPRTDQY 336
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 94/190 (49%), Gaps = 20/190 (10%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S+ W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 SAAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKSC-E 212
Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + +DK Y + G+ GP AF + Y+ L +G
Sbjct: 213 PGYSSSYKEDKHYGYSSYSVPGIEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 267
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES +
Sbjct: 268 QHVTG--EMMGGHAIRILGWGTENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEI 325
Query: 257 NGALPK-DNY 265
+P+ D Y
Sbjct: 326 VAGIPRTDQY 335
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 90/199 (45%), Gaps = 26/199 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
C+ G +S W+WVH +G+ TGG + + + GC P FPPC H + P+C +
Sbjct: 89 CNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 148
Query: 134 TPQPKCHTRCTNDNYGRGFFQDK----------YQINGLGLYFDPHFGPFWPAFWRSFCT 183
P C +C N Y D+ Y +N GP +F T
Sbjct: 149 YETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQYSVNDAKNAIRTD-GPVSASF-----T 202
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y L +G VY S E + VKI+GWGEE+G+ YW +V+++ E +GD G KI
Sbjct: 203 VYEDFLAYKSG-VYK-HTSGEYLGGHAVKIIGWGEESGQAYWLVVNSWNEDWGDHGLFKI 260
Query: 244 LRGRNEAIIESLVNGALPK 262
G I+ + G PK
Sbjct: 261 ALGN--CGIDDYLLGGTPK 277
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 92/191 (48%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + +DK Y + GP AF + Y+ L
Sbjct: 208 SKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V S EI+ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALP 261
IES + +P
Sbjct: 320 GIESEIVAGMP 330
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 112/266 (42%), Gaps = 39/266 (14%)
Query: 7 SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQR 66
S+IRD S SC AV+ A ++ +C +SK T A
Sbjct: 114 SKIRDQS-------------SCGSCWAVSAAETISDRICIASKGQ---TQVSISADDINA 157
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE 126
C + C+ G W K G VTGG++ TGC+P +PPC H T
Sbjct: 158 CCGMACGNG----CNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHY 213
Query: 127 PECKTLATPQPKCHTRCTNDNYGRGFFQD------KYQINGLGLYFDPHF---GPFWPAF 177
C + P KC C Y + QD Y ++ GP AF
Sbjct: 214 KPCPSDMYPTDKCERSC-QAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMTNGPVEVAF 272
Query: 178 WRSFCTKYTRPLFQT-NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG 236
T Y F+ +G VY +A A + +A VK++GWG +NG PYW +++ E +G
Sbjct: 273 -----TVYAD--FEVYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDWG 324
Query: 237 DKGTIKILRGRNEAIIESLVNGALPK 262
+ G +I+RG NE IE V G +PK
Sbjct: 325 ENGYFRIIRGVNECGIEHGVVGGIPK 350
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 108/237 (45%), Gaps = 24/237 (10%)
Query: 28 CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
C + AV+ ++ +C S K VE ++ I+ K + C G++
Sbjct: 115 CASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKNCGSG----------CDGGVT 164
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
+W + K G+VTGG+ ++TGC+P FP C+H C P+C C
Sbjct: 165 GYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQCKQTCQK 223
Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
Y + QDK Y + G+ + P ++ Y L +G +Y
Sbjct: 224 -GYNTSYEQDKHYGEFSYNVIGVESVIQKEIMMYGPV--EAYLHIYEDFLNYKSG-IYRY 279
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+ I +A V+++GWG ENG YW +T+ E +G+KG +I+RGR+E +IES +
Sbjct: 280 TTGQFISGHA-VRLIGWGVENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 93/190 (48%), Gaps = 20/190 (10%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S W + K+GLV+GG + S+ GC+P S PPC H + S P C T P+C C
Sbjct: 156 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 212
Query: 146 DNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + +DK+ GP AF T Y+ L +G V
Sbjct: 213 PGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPVEGAF-----TVYSDFLMYKSG-V 266
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
Y + +I+ ++I+GWGEENG PYW + +++ +GDKG KILRG++ IES +
Sbjct: 267 YQ-HTTGDIMGGHAIRILGWGEENGVPYWLVANSWNTDWGDKGFFKILRGQDHCGIESEI 325
Query: 257 NGALPK-DNY 265
+P+ D Y
Sbjct: 326 VAGIPRTDQY 335
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 113/255 (44%), Gaps = 31/255 (12%)
Query: 23 PYALSCIEARAVATATPLAFAVCRSSKMHV--ECTSFRFIAGVKQRCAWLVSRWMTIWVC 80
P ++ C + A ++ +C + HV E ++ + C C
Sbjct: 6 PLSIPCRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGD---------GC 56
Query: 81 SSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCH 140
+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 57 NGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCS 114
Query: 141 TRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQ 191
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 115 KIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLLY 168
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 169 KSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCG 226
Query: 252 IESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 227 IESEVVAGIPRTDQY 241
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 89/194 (45%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +RGLV+GG + S+ GC+P + PPC H + S P C P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGETPRC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
C Y + +DK+ G+ Y P GP AF Y
Sbjct: 209 SRHC-EPGYSPSYKEDKHY--GITSYGVPRSEKEIMAEIYKNGPVEGAF-----IVYEDF 260
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VS E V ++I+GWG ENG PYW +++ +GD G KILRG +
Sbjct: 261 LMYKSGVYQHVSG--EQVGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGED 318
Query: 249 EAIIESLVNGALPK 262
IES + +P+
Sbjct: 319 HCGIESEIVAGVPR 332
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 92/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + + GLVTG + +N+ CQ +F PC H + P C T P P C
Sbjct: 161 CDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPC-TGELPTPPC 219
Query: 140 HTRC-TNDNYGRGFFQDKYQ-INGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPL 189
C +N + + +D ++ G+ D GP A T Y L
Sbjct: 220 INSCDSNSTHTIPYSKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVAL-----TVYEDFL 274
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
G VY E+ +A VK+VGWG ENG PYWTIV+++ E +GDKGT KILRG+NE
Sbjct: 275 TYKTG-VYQHVTGDELGGHA-VKMVGWGVENGTPYWTIVNSWNESWGDKGTFKILRGKNE 332
Query: 250 AIIESLVNGALP 261
IES ALP
Sbjct: 333 CGIESSCVTALP 344
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 89/183 (48%), Gaps = 12/183 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G++ +W + K G+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
C Y + QDK Y + G+ + P ++ Y L +
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGR+E +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRDECLIE 332
Query: 254 SLV 256
S +
Sbjct: 333 SFI 335
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 94/192 (48%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + K GLV+GG ++S+ GC+P + PPC H + S P C PKC
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPYTIPPCEH-HVNGSRPHCSGEGGDTPKC 206
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
C + +YG+ + + + + + GP AF Y +
Sbjct: 207 VHSCEAGYSPTYTKDKHYGKSSYSVEASVEQIQAEISQN-GPVEGAF-----IVYEDFVM 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY + + + +A +K++GWGEE+G PYW +++ +G+ G KILRG +
Sbjct: 261 YKSG-VYQHTTGSALGGHA-IKVLGWGEEDGVPYWLCANSWNTDWGENGFFKILRGSDHC 318
Query: 251 IIESLVNGALPK 262
IES + +PK
Sbjct: 319 GIESEIVAGIPK 330
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 72 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 129
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 130 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 183
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 184 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 241
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 242 GIESEVVAGIPRTDQY 257
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 69/194 (35%), Positives = 92/194 (47%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W++ K GLVTGG + ++ GC+P F PCNH + T P C P P C
Sbjct: 171 CQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGP-CSHDLEPTPVC 229
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
C + Y + +DKY GL Y + GP AF Y
Sbjct: 230 KKACQS-TYKIQYNKDKYY--GLKAYSLHNKASDLQKELMMNGPMEVAF-----EVYEDF 281
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L G VY + + +A V+++GWGEENG PYW + +++ ++GDKG KI RGRN
Sbjct: 282 LLYKTG-VYQHHTGSVLGGHA-VRLLGWGEENGVPYWLLANSWNTEWGDKGFFKIYRGRN 339
Query: 249 EAIIESLVNGALPK 262
E IES L K
Sbjct: 340 ECGIESEAVAGLYK 353
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 12/188 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + G+V+GG+++S+ GCQP + PC H T +P C TP +C
Sbjct: 162 CNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKP-CGEGDTP--RC 218
Query: 140 HTRCT---NDNYG--RGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
RC + YG R F + Y + G PA + T Y L G
Sbjct: 219 VKRCEEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPA--EAALTVYDDFLHYRTG 276
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VS A + V+++GWG E+G PYW + +++ +GD G +ILRG++E IES
Sbjct: 277 VYQHVSGGA--LGGHAVRLLGWGVEDGTPYWLLANSWNYDWGDNGYFRILRGQDECGIES 334
Query: 255 LVNGALPK 262
+NG LPK
Sbjct: 335 DINGGLPK 342
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 93/197 (47%), Gaps = 27/197 (13%)
Query: 78 WVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
+ C G W++ + G+VTGG + S GC P PPC SE + QP
Sbjct: 157 FACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPC------FSEEDGNNTCRGQP 210
Query: 138 -KCHTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTK 184
+ H RCT YG F +D Y + + D +GP + + F
Sbjct: 211 MEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYASIQKDVMTYGPIEASMEVYDDF--- 267
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
P +++ VY S +A + VK++GWGEE+G PYW +V+++ E +GDKG KI
Sbjct: 268 ---PSYKSG--VYEKSENATYLGGHAVKLIGWGEEDGVPYWLMVNSWSEMWGDKGLFKIR 322
Query: 245 RGRNEAIIESLVNGALP 261
RG NE +++ + +P
Sbjct: 323 RGTNECSVDNSMTAGVP 339
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 71 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 128
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 129 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 182
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 183 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 240
Query: 251 IIESLVNGALPKDN 264
IES V +P+ +
Sbjct: 241 GIESEVVAGIPRTD 254
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 89/189 (47%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + G+ P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 97/199 (48%), Gaps = 20/199 (10%)
Query: 77 IWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
++ C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T
Sbjct: 84 LFSCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDT 141
Query: 137 PKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
PKC C Y + QDK Y N + GP AF + Y+
Sbjct: 142 PKCSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSD 195
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L +G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG+
Sbjct: 196 FLLYKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQ 253
Query: 248 NEAIIESLVNGALPK-DNY 265
+ IES V +P+ D Y
Sbjct: 254 DHCGIESEVVAGIPRTDQY 272
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 92/195 (47%), Gaps = 22/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G+ S W + ++G+VTGG + + GCQP S + P L +P P C
Sbjct: 152 CKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPINDL-SPMPPC 210
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
C +YG+ + +DK Y ++G GP F + F + Y
Sbjct: 211 KRECRK-SYGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFYS-YKSG 268
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q + RV S + ++I+GWG ENG PYW +++ E +GDKG KI RG N
Sbjct: 269 VYQAHSRVRCGSHA--------IRILGWGTENGVPYWLAANSWTEHWGDKGYFKIRRGNN 320
Query: 249 EAIIESLVNGALPKD 263
E IE +N +PK+
Sbjct: 321 ECGIEEDINAGIPKE 335
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 73 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 130
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 131 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 184
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 185 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 242
Query: 251 IIESLVNGALPKDN 264
IES V +P+ +
Sbjct: 243 GIESEVVAGIPRTD 256
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 112/267 (41%), Gaps = 41/267 (15%)
Query: 7 SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQR 66
S+IRD S SC AV+ A ++ +C +S T A
Sbjct: 114 SKIRDQS-------------SCGSCWAVSAAETISDRICIASNGK---TQISISADDINA 157
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE 126
C +V C+ G W K+G VTGG++ +GC+P +PPC H T
Sbjct: 158 CCGMVCGNG----CNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHY 213
Query: 127 PECKTLATPQPKCHTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWP 175
C + P KC C T D + F Q Y ++ GP
Sbjct: 214 KPCPSNMYPTDKCEHSCQAGYPLTYTQDLH---FGQSAYAVSKKPAEIQKEIMTHGPVEV 270
Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
AF T Y +G VY +A A + +A VK++GWG +NG PYW +++ E +
Sbjct: 271 AF-----TVY-EDFEHYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDW 323
Query: 236 GDKGTIKILRGRNEAIIESLVNGALPK 262
G+ G +I+RG NE IES V G PK
Sbjct: 324 GENGYFRIIRGVNECGIESGVVGGTPK 350
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 93/197 (47%), Gaps = 27/197 (13%)
Query: 78 WVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
+ C G W++ + G+VTGG + S GC P PPC SE + QP
Sbjct: 157 FACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPC------FSEEDGNNTCRGQP 210
Query: 138 -KCHTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTK 184
+ H RCT YG F +D Y + + D +GP + + F
Sbjct: 211 MEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYASIQKDVMTYGPIEASMEVYDDF--- 267
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
P +++ VY S +A + VK++GWGEE+G PYW +V+++ E +GDKG KI
Sbjct: 268 ---PSYKSG--VYEKSENATYLGGHAVKLIGWGEEDGVPYWLMVNSWSEMWGDKGLFKIR 322
Query: 245 RGRNEAIIESLVNGALP 261
RG NE +++ + +P
Sbjct: 323 RGTNECSVDNSMTAGVP 339
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 88/191 (46%), Gaps = 17/191 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + HK G+V+GG + S GCQP S PC H+ + +S P C + T PKC
Sbjct: 151 CLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIHGSS-PACGGV-TDTPKC 208
Query: 140 HTRCTND---NYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQ 191
+C Y + F+ Q Y I GP +F LF
Sbjct: 209 KKQCEKGYSIPYDKAFYYGQPGYAIPNDAQKIQAEILKNGPIVASFL------VYEDLFS 262
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY + E + +KI GWG ENG PYW + +++ +G+ G KI RG++E
Sbjct: 263 YKEGVYQ-HVAGEFLGGHVIKIFGWGIENGTPYWLVANSWNTDWGNNGFFKIPRGKDECG 321
Query: 252 IESLVNGALPK 262
IE V+ LP+
Sbjct: 322 IEIDVSAGLPR 332
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 134 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 191
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 192 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 245
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 246 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 303
Query: 251 IIESLVNGALPKDN 264
IES V +P+ +
Sbjct: 304 GIESEVVAGIPRTD 317
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 24/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W + ++GLVTGG ++S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPEC 206
Query: 140 HTRC---------TNDNYGR---GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
T+C + +YG+ G ++ QI P G F + F
Sbjct: 207 VTQCEAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAF--IVYEDF------ 258
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + + +A +K++GWGEENG PYW +++ +GD G KILRG
Sbjct: 259 PSYKSG--VYQHVTGSALGGHA-IKMIGWGEENGVPYWLCANSWNTDWGDNGFFKILRGS 315
Query: 248 NEAIIESLVNGALPK 262
N IES V +PK
Sbjct: 316 NHCGIESEVVAGIPK 330
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 22 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 79
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 80 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 133
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 134 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 191
Query: 251 IIESLVNGALPKDN 264
IES V +P+ +
Sbjct: 192 GIESEVVAGIPRTD 205
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 94/194 (48%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 20 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 77
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 78 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 131
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 132 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 189
Query: 251 IIESLVNGALPKDN 264
IES V +P+ +
Sbjct: 190 GIESEVVAGIPRTD 203
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 97/194 (50%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 6 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 63
Query: 140 HTRCTNDNYGRGFFQDK-YQINGL-------GLYFDPH-FGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N G+ + + GP AF + Y+ L
Sbjct: 64 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAF-----SVYSDFLL 117
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 118 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 175
Query: 251 IIESLVNGALPKDN 264
IES V +P+ +
Sbjct: 176 GIESEVVAGIPRTD 189
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 86/187 (45%), Gaps = 7/187 (3%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+VTGG + TGCQP F C+H + C P P C
Sbjct: 155 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 214
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
C Y + + QDK+ N H ++ + T +FQ G VY
Sbjct: 215 ARACQT-GYNKTYEQDKFYGNS-SYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG-VYRS 271
Query: 200 S----ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+ + + V+++GWG ENG YW + +++ E++G+ G +++RGRNE IES
Sbjct: 272 GIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESE 331
Query: 256 VNGALPK 262
V +P+
Sbjct: 332 VVAGMPR 338
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 94/193 (48%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +G+VTGG ++S+ GCQP + P C+H + P +L P PKC
Sbjct: 148 CNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPYAIPACDHHVPHSKNPCNGSL--PTPKC 205
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTR-PL 189
C Y + DK Y + + D + GP AF T + P
Sbjct: 206 EKVCEK-GYNITYKNDKHYGVTSYSINNDQNEIMREIMTNGPVEAAF-----TVFADFPN 259
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+++ VY + E+ +A +KI+GWG EN PYW + +++ +GD G KILRG +E
Sbjct: 260 YKSG--VYQHVSGEELGGHA-IKILGWGVENNTPYWLVANSWNPSWGDNGFFKILRGSDE 316
Query: 250 AIIESLVNGALPK 262
IE V LPK
Sbjct: 317 CGIEDEVVAGLPK 329
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 71/192 (36%), Positives = 91/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + + GLVTG + +N+ CQ S PC H + P C T P P C
Sbjct: 161 CDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAYSLAPCAHHVTSDVYPPC-TGELPTPPC 219
Query: 140 HTRC-TNDNYGRGFFQD------KYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPL 189
C +N Y + +D Y I+ GP AF T Y L
Sbjct: 220 VKSCDSNSTYTIPYPKDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAF-----TVYEDFL 274
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY +E+ +A VK+VGWG ENG PYW IV+++ E +GDKGT KILRG+NE
Sbjct: 275 TYKSG-VYQHVTGSELGGHA-VKMVGWGVENGTPYWIIVNSWNESWGDKGTFKILRGQNE 332
Query: 250 AIIESLVNGALP 261
IES ALP
Sbjct: 333 CGIESECVTALP 344
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 88/183 (48%), Gaps = 18/183 (9%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
W + KRG+VTGG+ ++TGCQP FP C H P C T P+C C
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223
Query: 147 NYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
Y + QDK Y + N + + +GP AF Y L +G
Sbjct: 224 GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
V+ S IV ++I+GWG E G+PYW I +++ E +G+ G +++RGR+E IES V
Sbjct: 279 HVAGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVV 336
Query: 258 GAL 260
L
Sbjct: 337 AGL 339
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 93/196 (47%), Gaps = 26/196 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 152 CNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEGGDTPKC 210
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAF--WRSFCTKYT 186
+ +C Y + DK+ G Y P GP AF + F +Y
Sbjct: 211 NKKC-EAGYSPDYKDDKHY--GTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADF-LQYK 266
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
++Q + +++ ++++GWG E+G PYW +++ +GD G KILRG
Sbjct: 267 SGVYQ--------HVTGDMLGGHAIRVLGWGVEDGVPYWLAANSWNTDWGDNGFFKILRG 318
Query: 247 RNEAIIESLVNGALPK 262
++ IES + +P+
Sbjct: 319 KDHCGIESEMVAGIPR 334
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 86/187 (45%), Gaps = 7/187 (3%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+VTGG + TGCQP F C+H + C P P C
Sbjct: 63 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 122
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
C Y + + QDK+ N H ++ + T +FQ G VY
Sbjct: 123 ARACQT-GYNKTYEQDKFYGNS-SYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG-VYRS 179
Query: 200 S----ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+ + + V+++GWG ENG YW + +++ E++G+ G +++RGRNE IES
Sbjct: 180 GIYHHVAGKFIGRHAVRMIGWGVENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESE 239
Query: 256 VNGALPK 262
V +P+
Sbjct: 240 VVAGMPR 246
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 88/183 (48%), Gaps = 18/183 (9%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
W + KRG+VTGG+ ++TGCQP FP C H P C T P+C C
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK- 223
Query: 147 NYGRGFFQDK------YQI--NGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
Y + QDK Y + N + + +GP AF Y L +G
Sbjct: 224 GYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAF-----DVYEDFLNYKSGIYR 278
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
V+ S IV ++I+GWG E G+PYW I +++ E +G+ G +++RGR+E IES V
Sbjct: 279 HVAGS--IVGGHAIRIIGWGVEKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVV 336
Query: 258 GAL 260
L
Sbjct: 337 AGL 339
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 88/197 (44%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + K GLVTGG++ S GC+P S PC + P+C PKC
Sbjct: 153 CEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKC 212
Query: 140 HTRCT-NDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKY 185
CT N +Y + +DK+ QI L GP F T Y
Sbjct: 213 VDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEIL----KNGPVEVGF-----TVY 263
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
+Q VY A E+ +A VK++GWG +NG PYW +++ +G+ G +ILR
Sbjct: 264 AD-FYQYKSGVYVHVAGPELGGHA-VKLLGWGVDNGTPYWLAANSWNTNWGENGYFRILR 321
Query: 246 GRNEAIIESLVNGALPK 262
G NE IES V +P
Sbjct: 322 GVNECGIESQVVAGMPD 338
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 88/205 (42%), Gaps = 38/205 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G S W+WVH G+ TGG + + GC P FPPC H T P+C +
Sbjct: 128 CDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPPCAHHINDTKYPKCPKGS 187
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
P C +C N Y D++ + L P+ + + +T+
Sbjct: 188 YETPNCVEQCHNPKYSTSLKNDRHYM----LESSPY----------QYSVNNAKNAIRTD 233
Query: 194 GRVYAVSASAE-IVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGD 237
G V A E +AY + VKI+GWGEENG YW +V+++ E +GD
Sbjct: 234 GPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEENGEAYWLVVNSWNEDWGD 293
Query: 238 KGTIKILRGRNEAIIESLVNGALPK 262
G KI G N I + L+ G PK
Sbjct: 294 HGLFKIALG-NCQIDDDLL-GGTPK 316
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 321 NECGIDNSTTGGVP 334
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
C Y + QDK Y + G+ P ++ Y L +
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 93/197 (47%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + RGLVTGG + S GC+P PPC + +E P+ K
Sbjct: 157 CNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPREKN 212
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
H RCT YG F +D Y + + D +GP +F + F
Sbjct: 213 H-RCTRTCYGNQDLDYNDDHRFTRDSYYLTYSSIQKDVMRYGPIEASFDMYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY S +A + VK++GWGEE+G YW +V+++ E +GD G KI RG
Sbjct: 266 PSYKSG--VYVRSENASYLGGHAVKLIGWGEEHGVLYWLMVNSWNEGWGDNGLFKIRRGT 323
Query: 248 NEAIIESLVNGALPKDN 264
NE I++ G +P N
Sbjct: 324 NECGIDNSTTGGVPVAN 340
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 91/194 (46%), Gaps = 20/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTG + ++TGCQP FP C H + P C PKC
Sbjct: 159 CQGGFPGAAWDYWVEEGIVTGSSKENHTGCQPYPFPKCEH-HTKGKYPACGEKIYKTPKC 217
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C Y + +DKY + + + H GP AF T Y+ L
Sbjct: 218 QQKCQK-GYKTPYKKDKYYGKLSYNVLSKEDAIKKEIMMH-GPVEAAF-----TVYSDFL 270
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G +Y I +A V+I+GWG E PYW I +++ E +G+KG +ILRG++
Sbjct: 271 NYKSG-IYKHMKGTVIGGHA-VRIIGWGVEKKTPYWLIANSWNEDWGEKGYFRILRGKDV 328
Query: 250 AIIESLVNGALPKD 263
IES V LP +
Sbjct: 329 CGIESAVTAGLPHN 342
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 91/191 (47%), Gaps = 17/191 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G S+ W+W G+VTGG ++S+ GCQP S P C+H + + P C P P C
Sbjct: 159 CSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDH-HVSGQYPACSGEG-PTPAC 216
Query: 140 HTRCT---NDNYG--RGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQ 191
C N+ Y + F Y + G GP AF T Y L
Sbjct: 217 KKSCEAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAF-----TVYEDLLTY 271
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+G VY + +++ +KI+GWG E+G YW + +++ +GD G KI +G +E
Sbjct: 272 KSG-VYQ-HTTGQVLGGHAIKIIGWGVESGVDYWWVANSWNNDWGDNGFFKIKKGVDECG 329
Query: 252 IESLVNGALPK 262
IES + +PK
Sbjct: 330 IESQIVAGMPK 340
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPAEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 262 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 320 GIESEVVAGIPRTDQY 335
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 266 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 323
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 324 NECGIDNSTTGGVP 337
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 220 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 273
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G ++ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 274 YKSGVYQHITG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 331
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 332 GIESEVVAGIPRTDQY 347
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 90/190 (47%), Gaps = 17/190 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W K GLVTGG + S GCQP PPC Y + K + +C
Sbjct: 153 CNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYGNNTCHGKPMEKNH-RC 211
Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTRPLFQ 191
C D N + +D Y + + D +GP +F + F P ++
Sbjct: 212 TRMCYGDQDLDFNNDHHYTRDAYYLTYGTIQNDVLTYGPIEASFEVYDDF------PSYK 265
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+ VY + +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG NE
Sbjct: 266 SG--VYVKTENASYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECG 323
Query: 252 IESLVNGALP 261
I++ G +P
Sbjct: 324 IDNSTTGGVP 333
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 92/194 (47%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W + + GLVTGG + SN GC+P S PC H + + P C T PKC
Sbjct: 148 CMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEH-HVNGTRPPC-TGEGDTPKC 205
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
+ C N Y + +DK G Y P GP AF + Y
Sbjct: 206 VSEC-NAGYTPSYKKDKR--FGKQTYSVPPKEQQIMTELYKNGPVEAAF-----SVYEDF 257
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L G V+ +++ +KI+GWG+EN PYW + +++ +GD G KILRG++
Sbjct: 258 LLYKTGVYQHVTG--QMLGGHAIKILGWGKENNTPYWLVANSWNTDWGDNGFFKILRGKD 315
Query: 249 EAIIESLVNGALPK 262
E IES + +P+
Sbjct: 316 ECGIESEIVAGIPR 329
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 266 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 323
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 324 NECGIDNSTTGGVP 337
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 90/191 (47%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W + GLV+GG+++S GC+P PPC H P C T PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPGNRLP-CSG-DTKTPKC 208
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+C DNY + QDK Y + G + GP AF T Y L
Sbjct: 209 IKKC-EDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAF-----TVYADLLS 262
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A + +A +KI+GWG ENG YW I +++ +GD G KILRG +
Sbjct: 263 YKSG-VYKHVAGDALGGHA-IKIMGWGVENGNKYWLIANSWNSDWGDNGFFKILRGEDHC 320
Query: 251 IIESLVNGALP 261
IES + P
Sbjct: 321 GIESSIVAGEP 331
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 92/194 (47%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDILAYGPIEASFEVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 321 NECGIDNSTTGGVP 334
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/189 (35%), Positives = 90/189 (47%), Gaps = 16/189 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G +S ++ K GLVTG +++ CQ SF PC H T P C T P PKC
Sbjct: 160 CNGGYPASAMSYYVKTGLVTGDLYNTTGWCQAYSFAPCAHHVDTPLYPAC-TGELPTPKC 218
Query: 140 HTRCTNDNYGRGFFQDK----YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQT 192
C + G+ + K Y + GP AF T Y L
Sbjct: 219 AKTC-DSGSGQTYTVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAF-----TVYEDFLNYK 272
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
+G V+ A + +KIVGWG EN PYW +V+++ + +GD GT KILRG+NE I
Sbjct: 273 SGVYKHVTGKA--LGGHAIKIVGWGVENNTPYWIVVNSWNQTWGDNGTFKILRGKNECGI 330
Query: 253 ESLVNGALP 261
E+ V ALP
Sbjct: 331 EAQVVTALP 339
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 91/194 (46%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PPC Y + K P K
Sbjct: 154 CSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGNNTCSGK----PTEKN 209
Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 321 NECGIDNSTTGGVP 334
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + +DK Y ++ GP AF T Y+ L
Sbjct: 208 SKIC-EPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGPVEAAF-----TVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY + E++ V+I+GWG E+G PYW + +++ +GD G KILRGR+
Sbjct: 262 YKSG-VYQ-HVTGEMMGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDNGFFKILRGRDHC 319
Query: 251 IIESLVNGALP 261
IES + +P
Sbjct: 320 GIESEIVAGIP 330
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 94/194 (48%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W + K GLVTGG + S+ GC+P + PPC H + + P C P+C
Sbjct: 148 CNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
+C + +YG+ + + N + + GP AF + F P
Sbjct: 207 INQCESGYTPSYKKDKHYGKTSYSVEANENQIQTEIYKN-GPVEGAFMVYEDF------P 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++++ VY S ++ +KI+GWG E+G PYW +++ +GD G KILRG +
Sbjct: 260 MYKSG--VYQ-HVSGSLIGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGYFKILRGSD 316
Query: 249 EAIIESLVNGALPK 262
IES V +PK
Sbjct: 317 HCGIESEVVAGIPK 330
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 91/193 (47%), Gaps = 22/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGDTPKC 208
Query: 140 HTRCTNDNYGRGFFQDKY-----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
C Y + +DK+ + + F GP AF T Y+
Sbjct: 209 SKIC-EPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIFKN--GPVEAAF-----TVYSD- 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
Q VY A + +A V+I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 260 FLQYKSGVYQHVAGDMMGGHA-VRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQD 318
Query: 249 EAIIESLVNGALP 261
IES + +P
Sbjct: 319 HCGIESEIVAGIP 331
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/194 (33%), Positives = 88/194 (45%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLVTGG ++S+ GC+P + PC H + S P C P C
Sbjct: 148 CNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
C Y + QDK+ G Y P GP AF T Y
Sbjct: 207 DMSC-EPGYSPSYKQDKHF--GKTSYSVPSNQKDIMKELYKNGPVEGAF-----TVYEDF 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VS A + +KI+GWGEENG PYW +++ +GD G KILRG +
Sbjct: 259 LSYKSGVYQHVSGPA--LGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGED 316
Query: 249 EAIIESLVNGALPK 262
IES + +P+
Sbjct: 317 HCGIESEIVAGIPQ 330
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 89/200 (44%), Gaps = 31/200 (15%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C G W WVH GLVTGG++ S GC+P S PC + P+C P+
Sbjct: 147 CEGGYPIQAWRYWVHN-GLVTGGSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPE 205
Query: 139 CHTRCTN-DNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL-------- 189
C +CT+ +Y + QDK H+G A ++ T +
Sbjct: 206 CVKQCTSKSDYAVPYDQDK------------HYGSSAYAIRQNVAQIQTEIMRNGPVEVG 253
Query: 190 -------FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
+Q +Y A E+ +A VKI+GWG ENG PYW +++ +G+KG +
Sbjct: 254 FLVYSDFYQYKSGIYKHVAGRELGGHA-VKILGWGVENGTPYWLAANSWNVNWGEKGYFR 312
Query: 243 ILRGRNEAIIESLVNGALPK 262
I RG NE IES V +P
Sbjct: 313 IRRGTNECGIESSVVAGIPD 332
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 107/241 (44%), Gaps = 20/241 (8%)
Query: 28 CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
C + AV+ ++ +C S K VE ++ I+ K + C G++
Sbjct: 82 CASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKNCGS----------GCDGGVT 131
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
+W + G+VTGG+ ++TGC+P FP C+H C P+C C
Sbjct: 132 GYSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQCKQTCQK 190
Query: 146 DNYGRGFFQDK----YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
Y + QDK + N L + ++ Y L +G +Y +
Sbjct: 191 -GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSG-IYRYTT 248
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IES + L
Sbjct: 249 GQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLI 307
Query: 262 K 262
K
Sbjct: 308 K 308
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 220 SKSC-EPGYTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAF-----SVYSDFLL 273
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 274 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 331
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 332 GIESEVVAGIPRTDQY 347
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 88/192 (45%), Gaps = 20/192 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGGA++S+ GC+ S PC H S P+C +L P+C
Sbjct: 156 CDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPEC 215
Query: 140 HTRC--TNDNYGRGF--------FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
C ++ +Y F ++ Q+ L GP AF T Y L
Sbjct: 216 VRSCYESSLDYTESLTFGQQVSTFTNEKQMQLEIL----KNGPIEAAF-----TVYNDFL 266
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY +A E V +K++GWG E G YW I +++ +GD G K LRG +
Sbjct: 267 SYKSG-VYQATAQDESVGGHAIKVLGWGVEEGTKYWLIANSWNTDWGDNGYFKFLRGVDH 325
Query: 250 AIIESLVNGALP 261
IES +LP
Sbjct: 326 CGIESETAASLP 337
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 93/197 (47%), Gaps = 31/197 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC---NHANYTTSEPECKTLATPQ 136
CS G W K GLVTGG + S GC+P PPC + N T S P
Sbjct: 94 CSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNNTCS-------GQPM 146
Query: 137 PKCHTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTK 184
K H RCT YG + +D Y + G+ D ++GP +F + F
Sbjct: 147 EKNH-RCTRMCYGDQDLDFDEDHRYTRDHYYLTYRGIQKDVINYGPIEASFDVYDDF--- 202
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
P +++ +Y S +A + +VK++GWGEE G YW +V+++ +GDKG KI
Sbjct: 203 ---PSYKSG--IYVKSENASYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWGDKGLFKIR 257
Query: 245 RGRNEAIIESLVNGALP 261
RG NE +++ G +P
Sbjct: 258 RGTNECGVDNSTTGGVP 274
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 89/190 (46%), Gaps = 21/190 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W++ G+ TGG + S GCQP S PC H + ++ +C TL P C
Sbjct: 149 CEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEH-HTEGNKVQCSTLDYDTPSC 207
Query: 140 HTRCTND--------NYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
+C + +G G ++ Y + + + GP AF + F Y +
Sbjct: 208 KHKCDDSALNYKSELTFGSGSVRNFYSVANIQKEILTN-GPVEAAFDVYSDF-VNYKSGV 265
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q + E + V+I+GWGEE+G PYW + +++ E +GDKG KI RG NE
Sbjct: 266 YQ--------HVAGEYLGGHAVRILGWGEESGVPYWLVANSWNEDWGDKGLFKIRRGNNE 317
Query: 250 AIIESLVNGA 259
+ E + A
Sbjct: 318 SGFEDSIVAA 327
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 93/195 (47%), Gaps = 27/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
C+ G W GLVTGG + S GC+P PPC + + + K + QP
Sbjct: 155 CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY------DKDGKNTCSGQPME 208
Query: 138 ---KCHTRCTND-----NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYT 186
KC +C D N + +D Y + G+ D ++GP +F + F
Sbjct: 209 SNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTYRGIQKDVINYGPIETSFDVYDDF----- 263
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
P +++ +Y S +A + +VK++GWGEE G YW +V+++ +GDKG KI RG
Sbjct: 264 -PNYKSG--IYVKSENASYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWGDKGLFKIRRG 320
Query: 247 RNEAIIESLVNGALP 261
NE +++ G +P
Sbjct: 321 TNECRVDNSTTGGVP 335
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 91/191 (47%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + DK Y ++ GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VS E++ ++I+GWG EN PYW + +++ +GDKG KILRG++
Sbjct: 262 YKSGVYQHVSG--EMMGGHAIRILGWGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHC 319
Query: 251 IIESLVNGALP 261
IES + +P
Sbjct: 320 GIESEIVAGMP 330
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 89/189 (47%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + PA ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPA--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 91/191 (47%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + DK Y ++ GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VS E++ ++I+GWG EN PYW + +++ +GDKG KILRG++
Sbjct: 262 YKSGVYQHVSG--EMMGGHAIRILGWGVENDTPYWLVGNSWNTDWGDKGFFKILRGQDHC 319
Query: 251 IIESLVNGALP 261
IES + +P
Sbjct: 320 GIESEIVAGMP 330
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 87/194 (44%), Gaps = 18/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + +G+V+GG+ SN GC+P PC H + + P C P C
Sbjct: 149 CNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPYEIAPCEH-HVNGTRPPCTGDDNKTPSC 207
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+C + N+G+ + ++ + + GP AF + L
Sbjct: 208 KQQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTN-GPVEGAF------EVYEDLL 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
VY E + ++I+GWG E G PYW I +++ +GD GT KILRG +
Sbjct: 261 SYKKGVYQ-HVKGEALGGHAIRILGWGTEKGTPYWLIANSWNSDWGDNGTFKILRGEDHC 319
Query: 251 IIESLVNGALPKDN 264
IES + +PKD+
Sbjct: 320 GIESSIVAGIPKDS 333
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 89/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++GLV+GG S+ GCQP + PC H + S P C+ PKC
Sbjct: 157 CNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPCEH-HVNGSRPSCEGEGGKTPKC 215
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+C +Y + +DK Y I GP AF T Y L
Sbjct: 216 VKKC-QASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAF-----TVYEDLLN 269
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
G + V +++ ++I+GWG E+G YW I +++ +GD G KILRG +
Sbjct: 270 YKEGVYHHVHG--KMLGGHAIRILGWGVEDGTKYWLIANSWNSDWGDNGFFKILRGEDHL 327
Query: 251 IIESLVNGALPK 262
IES + LPK
Sbjct: 328 GIESSIAAGLPK 339
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 95/194 (48%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + + GLVTGG + S+ GCQP PC H + S P C L P P+C
Sbjct: 149 CNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEH-HINGSRPACGKL-EPTPRC 206
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR-P 188
C + Y F +DK+ ++ + + + GP AF T Y P
Sbjct: 207 KKSCES-GYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTN-GPVEAAF-----TVYADFP 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+++ VY + AE+ +A VK++GWG E PYW I +++ +G+ G KILRG++
Sbjct: 260 HYKSG--VYQHESGAELGGHA-VKMIGWGTEGSTPYWLIANSWNTDWGNMGFFKILRGQD 316
Query: 249 EAIIESLVNGALPK 262
E IE + PK
Sbjct: 317 ECGIERDIVAGEPK 330
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 97/199 (48%), Gaps = 26/199 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDKY------------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
C Y + +DK+ + +Y + GP AF T Y+
Sbjct: 208 SKFC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKN---GPVEAAF-----TVYSD 258
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L +G VY + E++ V+I+GWG ENG PYW + +++ +GD G KILRGR
Sbjct: 259 FLLYKSG-VYQ-HVTGEMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGR 316
Query: 248 NEAIIESLVNGALP-KDNY 265
+ IES + +P D Y
Sbjct: 317 DHCGIESEIVAGIPCTDQY 335
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC A+ + P C T PKC
Sbjct: 77 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC-EAHVNGARPPC-TGEGDTPKC 134
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 135 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 188
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 189 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 246
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 247 GIESEVVAGIPRTDQY 262
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 88/195 (45%), Gaps = 24/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + K GLV+GG + S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIAPCEH-HVNGSRPSCTGEGGDTPQC 206
Query: 140 HTRCT---------NDNYGRG---FFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
T+C + ++G+ D+ QI P G F Y
Sbjct: 207 ITKCEAGYTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAF---------IVYED 257
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+ +G VS SA V +KI+GWG E+G PYW +++ +GD G K LRG
Sbjct: 258 FVLYKSGVYQHVSGSA--VGGHAIKILGWGVEDGVPYWLCANSWNTDWGDNGFFKFLRGS 315
Query: 248 NEAIIESLVNGALPK 262
+ IES V +PK
Sbjct: 316 DHCGIESEVVAGIPK 330
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 95/196 (48%), Gaps = 20/196 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 141 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 198
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y + + + GP AF + Y L
Sbjct: 199 SKSC-EPGYSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAF-----SVYADFLL 252
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 253 YKSGVYQHVTG--EMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 310
Query: 251 IIESLVNGALPK-DNY 265
IES V +P+ D Y
Sbjct: 311 GIESEVVAGIPRTDQY 326
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 88/199 (44%), Gaps = 32/199 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +G+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 33 CQGGFPGQAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF----------------GPFWPAFWRSFCT 183
C Y + QDK+ Y D + GP AF
Sbjct: 92 KQTCQK-GYKTPYEQDKH-------YGDESYNVISNEKAIQKEIMMNGPVEAAF-----D 138
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y L +G V+ S IV ++I+GWG E PYW I +++ E +G+KG +I
Sbjct: 139 VYEDFLNYKSGIYRHVTGS--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRI 196
Query: 244 LRGRNEAIIESLVNGALPK 262
+RGR+E IES V L K
Sbjct: 197 VRGRDECSIESHVVAGLIK 215
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 88/181 (48%), Gaps = 21/181 (11%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPP----CNHANYTTSEPECKTLATPQPKCHTR 142
S W ++ G+V+GG ++SN GCQP FPP H +T + + H R
Sbjct: 147 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPPIANILTHLQHTCDDHCYGNTSINYNHDHVR 206
Query: 143 CTNDNYGR-GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
N R G+ Q + Q +GP F C + L +G VY S
Sbjct: 207 VRNYYTIRTGYIQKEVQT----------YGPVAVQF--KVCDDF---LLYKSG-VYVKSD 250
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+A+++ K++GWG ENG YW +++++G ++G KG KI RG N+ +ES+V +P
Sbjct: 251 NAKVIRTQYAKLIGWGVENGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVESVVYAGVP 310
Query: 262 K 262
+
Sbjct: 311 E 311
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 93/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + + GLVTGG ++S+ GCQP + C+H +P C P C
Sbjct: 183 CNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVGKLQP-CSKKEEHTPVC 241
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTR-PLF 190
C + Y + +DK Y + G+ GP AF T Y P +
Sbjct: 242 KHECES-GYNVSYTKDKHYGATAYSVRGVQQIMTEIMTNGPVEGAF-----TVYADFPQY 295
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
++ VY + + + +A +KI+GWG E G YW + +++ +G++GT KILRGR+E
Sbjct: 296 KSG--VYKHTTGSPLGGHA-IKIMGWGTEGGDDYWLVANSWNPDWGNQGTFKILRGRDEC 352
Query: 251 IIESLVNGALPK 262
IES + PK
Sbjct: 353 GIESQIAAGEPK 364
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 91/193 (47%), Gaps = 22/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ + K GLV+GG + S+ GC+P S PPC H + + P CK P+C
Sbjct: 148 CNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPYSIPPCEH-HVNGTRPPCKGEEGDTPQC 206
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
+C Y G+ QDK+ G Y P GP AF T Y
Sbjct: 207 TNQC-EPGYTPGYKQDKHF--GKRSYSVPSDEKEIMKELYKNGPVEGAF-----TVYEDF 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VS SA V +K++GWGEE G PYW +++ +G+ G KI+RG +
Sbjct: 259 LLYKSGVYRHVSGSA--VGGHAIKVLGWGEEGGIPYWLAANSWNTDWGENGFFKIVRGED 316
Query: 249 EAIIESLVNGALP 261
IES + +P
Sbjct: 317 HCGIESEMVAGIP 329
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 90/194 (46%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PPC Y + K P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320
Query: 248 NEAIIESLVNGALP 261
NE ++ G +P
Sbjct: 321 NECGTDNSTTGGVP 334
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/193 (32%), Positives = 93/193 (48%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+VTGG + ++ GC P P C+H T P + P PKC
Sbjct: 157 CNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 214
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
R Y F DK Y ++ GP AF T Y PL
Sbjct: 215 -VRLCRKGYNIDFKDDKHYGKSSYSVSSNETQIQMEIMKNGPVEGAF-----TVYADFPL 268
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+++ VY S S + + ++I+GWG ENG P+W + +++ ++GDKG KILRG NE
Sbjct: 269 YKSG--VYK-SHSTDALGGHAIRILGWGVENGVPFWLVANSWNTEWGDKGYFKILRGSNE 325
Query: 250 AIIESLVNGALPK 262
IE + +PK
Sbjct: 326 CGIEEDIVAGIPK 338
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 93/195 (47%), Gaps = 25/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W++ +G+VTGG + ++ GC P P C+H T P + P PKC
Sbjct: 158 CNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 215
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTR- 187
R Y F DK+ G Y P GP AF T Y
Sbjct: 216 -VRLCRKGYNVDFKDDKHY--GKSSYSVPSNETQIQMEIMKNGPVEGAF-----TVYADF 267
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
PL+++ VY S S + + ++I+GWG EN PYW + +++ ++GDKG KILRG
Sbjct: 268 PLYKSG--VYK-SHSTDALGGHAIRILGWGVENDVPYWLVANSWNTEWGDKGYFKILRGS 324
Query: 248 NEAIIESLVNGALPK 262
NE IE + +PK
Sbjct: 325 NECGIEEDIVAGIPK 339
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 22/192 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W W G+V+GG + +N GC P S P C+H +TT + + P PKC
Sbjct: 154 CNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDH--HTTGKYQPCPAVVPTPKC 211
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAF--WRSFCTKYTRPL 189
+C Y + + DK Y + G+ GP AF + F + T
Sbjct: 212 EKKCLT-GYPKSYSNDKTRGKKSYGVRGVQSIMQELVDNGPVTAAFDVYSDFLSYKTGVY 270
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
T G A VKI+G+G E+G+ YW + +++ E +GDKG KI +G++E
Sbjct: 271 RHTTGSYEGGHA---------VKIIGYGTESGQDYWLVANSWNEDWGDKGFFKIAKGKDE 321
Query: 250 AIIESLVNGALP 261
IES + P
Sbjct: 322 CGIESSIVAGDP 333
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 82 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 139
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+ C Y + +DK Y ++ GP AF T ++ L
Sbjct: 140 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 193
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +++ ++I+GWG ENG PYW + +++ +GD G KILRG N
Sbjct: 194 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 251
Query: 251 IIESLVNGALPK 262
IES + +P+
Sbjct: 252 GIESEIVAGIPR 263
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 87/183 (47%), Gaps = 18/183 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +GLVTG N+ C+P +FPPC+H C + P P C
Sbjct: 143 CNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVDDGKYGPCGD-SQPTPAC 201
Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
CT + GR + DK + I+ + FGP +F T Y L
Sbjct: 202 VKSCTAQS-GRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEASF-----TVYEDFLT 255
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A A + +A VKI+GWG E PYW +V+++ E +G+ G KILRG N
Sbjct: 256 YKSG-VYQNVAGANLGGHA-VKIIGWGVEKNVPYWLVVNSWNEGWGENGLFKILRGSNHV 313
Query: 251 IIE 253
IE
Sbjct: 314 GIE 316
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYIEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 91/197 (46%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W GLVTGG + S GC+P PPC H +E P K
Sbjct: 157 CNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHH----AEGNNSCSDKPMEKN 212
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D ++GP +F + F
Sbjct: 213 H-RCTRMCYGDQDLDFDDDHRYTRDSYYLTYGSIQKDVMNYGPIEASFDVYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY S +A + VK++GWGEE+G PYW +V+++ +GDKG KI RG
Sbjct: 266 PSYKSG--VYIRSDNASYLGGHAVKLIGWGEESGVPYWLMVNSWNTDWGDKGLFKIQRGT 323
Query: 248 NEAIIESLVNGALPKDN 264
NE +++ +P N
Sbjct: 324 NECGVDNSTTAGVPVTN 340
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 94/192 (48%), Gaps = 24/192 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +RG+ TGG + SN GC P PPC + + + L +P
Sbjct: 155 CQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207
Query: 140 HT-RCTNDNYGRGFFQDKYQINGLGLYFDPH---------FGPFWPAFWRSFCTKYTRPL 189
H +C YG +++Y++ + + D +GP +F Y +
Sbjct: 208 HNHKCPRACYGNSTVENRYKVESIYV-LDSFKTIEQDIRTYGPVEASF-----DVYDDFI 261
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G +Y + +A V +VK++GWGEE+G PYW +V+++ + +G++GT +I++GRNE
Sbjct: 262 TYKSG-IYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNE 320
Query: 250 AIIESLVNGALP 261
IE +P
Sbjct: 321 CGIERSATAGIP 332
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 88/181 (48%), Gaps = 18/181 (9%)
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
++ KRG+VTGG+ ++TGCQP FP C H P C T P+C +C Y
Sbjct: 9 YLVKRGIVTGGSKENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQKCQK-GYKT 66
Query: 151 GFFQDK------YQI--NGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
+ QDK Y + N + + GP AF Y L +G V+
Sbjct: 67 PYEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAF-----DVYEDFLNYKSGIYRHVTG 121
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
S IV ++I+GWG E PYW I +++ E +G+KG +I+RGR+E IES V L
Sbjct: 122 S--IVGGHAIRIIGWGVEKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGLI 179
Query: 262 K 262
K
Sbjct: 180 K 180
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 93/192 (48%), Gaps = 24/192 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + + RGLVTGG + + CQP + C H + P C T PKC
Sbjct: 163 CNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEH-HVPGDRPPC-TEGGGTPKC 220
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTR- 187
+C D + + DK ++G Y P H+GP AF T Y+
Sbjct: 221 SHQCIPDYTTKAYKDDK--VHGHKAYSVPNDVGKIQQEIMHYGPVEAAF-----TVYSDF 273
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY ++ +E+ +A +KI+GWG E G YW I +++ +GDKGT KILRG
Sbjct: 274 PSYKSG--VYRHTSGSELGGHA-IKIIGWGTEGGDDYWLINNSWNSDWGDKGTFKILRGS 330
Query: 248 NEAIIESLVNGA 259
NE IE V A
Sbjct: 331 NECGIEGEVVAA 342
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 89/185 (48%), Gaps = 19/185 (10%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 77 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133
Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + +DK Y + GP AF + Y+ L +G
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 188
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
V S EI+ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES +
Sbjct: 189 QHV--SGEIMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEI 246
Query: 257 NGALP 261
+P
Sbjct: 247 VAGMP 251
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 91/193 (47%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +GLVTGG ++S+ GCQP + C H P + TPQ C
Sbjct: 152 CNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGDIVDTPQ--C 209
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
C Y + DKY Q + + + GP AF T Y +
Sbjct: 210 VHMCEK-GYNVSYRADKYFGKKSYSIDEQEDQIKTEISTN-GPVEAAF-----TVYADFV 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY E+ +A V+I+GWG E+G PYW + +++ +GDKG KILRG +E
Sbjct: 263 TYKSG-VYRHVTGEEMGGHA-VRILGWGTESGTPYWLVANSWNTDWGDKGYFKILRGSDE 320
Query: 250 AIIESLVNGALPK 262
IES + LPK
Sbjct: 321 CGIESSIVAGLPK 333
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 94/192 (48%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG + S+ GC+P PC H + + P C+ P+C
Sbjct: 157 CNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+C T+ ++G + ++ + H GP AF T Y +
Sbjct: 216 QHKCQASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMTH-GPVEGAF-----TVYEDLIL 269
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY E+ +A ++I+GWG E PYW + +++ +G+ G KILRG++
Sbjct: 270 YKDG-VYEHVHGKELGGHA-IRIIGWGVEKDIPYWLVANSWNTDWGNNGFFKILRGKDHC 327
Query: 251 IIESLVNGALPK 262
IES ++ LPK
Sbjct: 328 GIESSISAGLPK 339
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMVHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 91/195 (46%), Gaps = 28/195 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + + G+VTGG + S GCQP S PC T E + T P C
Sbjct: 160 CDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDT-----PDC 214
Query: 140 HTR-CTNDNYGRGFFQDKYQINGL------------GLYFDPHFGPFWPAFWRSFCTKYT 186
+ CTN NY + + D + ++ + LY + GP AF+ YT
Sbjct: 215 SIKTCTNSNYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKN---GPVQAAFY-----VYT 266
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
++ +G VY+ + +I +KI+GWG ++G YW +++ +G+ G +ILRG
Sbjct: 267 DFMYYKSG-VYSYT-RGQIEGGHAIKILGWGVDDGTKYWLCANSWSRSWGENGLFRILRG 324
Query: 247 RNEAIIESLVNGALP 261
NE IE V +P
Sbjct: 325 NNECHIEDRVIAGMP 339
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+ C Y + +DK Y ++ GP AF T ++ L
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +++ ++I+GWG ENG PYW + +++ +GD G KILRG N
Sbjct: 262 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 319
Query: 251 IIESLVNGALPK 262
IES + +P+
Sbjct: 320 GIESEIVAGIPR 331
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+ C Y + +DK Y ++ GP AF T ++ L
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +++ ++I+GWG ENG PYW + +++ +GD G KILRG N
Sbjct: 262 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 319
Query: 251 IIESLVNGALPK 262
IES + +P+
Sbjct: 320 GIESEIVAGIPR 331
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 91/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T Q +T D + GF + + + GP ++ Y L
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
+IES + L K
Sbjct: 330 LIESEIAAGLIK 341
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 88/193 (45%), Gaps = 22/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG + S+ GC+P S PPC H + S P C P+C
Sbjct: 150 CNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGETPRC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
C Y + +DK+ G+ Y P GP AF Y
Sbjct: 209 SRHC-EPGYSPSYKEDKHY--GITSYGVPRSEKEIMAEIYKNGPVEGAF-----IVYEDF 260
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G V+ E V ++++GWG +NG PYW +++ +GD G KILRG +
Sbjct: 261 LMYKSGVYQHVTG--EQVGGHAIRLLGWGVDNGTPYWLAANSWNTDWGDNGFFKILRGED 318
Query: 249 EAIIESLVNGALP 261
IES + +P
Sbjct: 319 HCGIESEIVAGIP 331
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/193 (32%), Positives = 95/193 (49%), Gaps = 16/193 (8%)
Query: 67 CAWLVSRWMTIWVC--SSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
CA+ M+ C S G + + V G+VTG + +NTGC+P FP C H +T
Sbjct: 118 CAFGAVEAMSERSCIQSGGKQNVELSAVDLEGIVTGSSKENNTGCEPYPFPKCEH--FTK 175
Query: 125 SE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
+ P C + P+C T C Y + QDK++ + +GP +F T
Sbjct: 176 GQYPPCGSKIYKTPRCKTTCQK-RYKTSYAQDKHRAIQKEIM---KYGPVEASF-----T 226
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y L +G +Y + E + ++I+GWG EN PYW I +++ E +G+ G +I
Sbjct: 227 VYEDFLNYKSG-IYK-HITGETLGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRI 284
Query: 244 LRGRNEAIIESLV 256
+RGR+E IES V
Sbjct: 285 VRGRDECSIESEV 297
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 86/185 (46%), Gaps = 24/185 (12%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCH-TRCTND 146
W ++ K GL TGG + SN GCQP S PC +AN + E E P+C+ +CTN+
Sbjct: 165 WKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENE------DTPQCYKDQCTNN 218
Query: 147 NYGRGFFQDKYQINGL-GLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVY 197
NY D Y + + P GP A K G +Y
Sbjct: 219 NYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAM------KVYDDFLCYKGGIY 272
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
+ +A VKI+GWGE++G YW +T+G +G G KI RGRNE IE+ +
Sbjct: 273 QYTTGGLKGDHA-VKIMGWGEDDGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRIT 331
Query: 258 GALPK 262
G LPK
Sbjct: 332 GGLPK 336
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 114/264 (43%), Gaps = 37/264 (14%)
Query: 7 SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVK 64
S+IRD S C + AV+ ++ +C S K VE ++ I+
Sbjct: 107 SQIRDQS-------------QCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISC-- 151
Query: 65 QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
C + S C G +W + RG+VTGG+ ++TGC+P FP C+H
Sbjct: 152 --CKYCGSG------CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKG 202
Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFW 178
C P+C+ C Y + QDK Y + + P
Sbjct: 203 KYRACGDKLYKTPQCNQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-- 259
Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
++ Y L +G +Y + I +A V+++GWG ENG YW +T+ E +G+K
Sbjct: 260 EAYLEIYEDFLNYKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEK 317
Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
G +I+RGRNE +IES + L K
Sbjct: 318 GYFRIVRGRNECLIESEIAAGLIK 341
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + + P C T PKC
Sbjct: 77 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 134
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+ C Y + +DK Y ++ GP AF T ++ L
Sbjct: 135 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 188
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +++ ++I+GWG ENG PYW + +++ +GD G KILRG N
Sbjct: 189 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNADWGDNGFFKILRGENHC 246
Query: 251 IIESLVNGALPK 262
IES + +P+
Sbjct: 247 GIESEIVAGIPR 258
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 89/199 (44%), Gaps = 32/199 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +G+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 33 CQGGFPGVAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF----------------GPFWPAFWRSFCT 183
+C Y + QDK+ Y D + GP AF
Sbjct: 92 KQKCQK-GYKTPYKQDKH-------YGDESYNVISNEKAIQKEIMMNGPVEAAF-----D 138
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y L +G V+ S IV ++I+GWG + PYW I +++ E +G+KG +I
Sbjct: 139 VYEDFLNYKSGIYRHVTGS--IVGGHAIRIIGWGVKKRTPYWLIANSWNEDWGEKGLFRI 196
Query: 244 LRGRNEAIIESLVNGALPK 262
+RGR+E IES V L K
Sbjct: 197 VRGRDECSIESNVVAGLIK 215
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + + P C T PKC
Sbjct: 71 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 128
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+ C Y + +DK Y ++ GP AF T ++ L
Sbjct: 129 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 182
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +++ ++I+GWG ENG PYW + +++ +GD G KILRG N
Sbjct: 183 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNADWGDNGFFKILRGENHC 240
Query: 251 IIESLVNGALPK 262
IES + +P+
Sbjct: 241 GIESEIVAGIPR 252
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 91/192 (47%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T Q +T D + GF + + + GP ++ Y L
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
+IES + L K
Sbjct: 330 LIESEIAAGLIK 341
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 93/194 (47%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
C+ G+ + W + GLV+GG+++S+ GC+P PPC H N + KT
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 210
Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
PKCH C + +Y + +DK Y ++ + GP AF T Y+
Sbjct: 211 PKCHKTCES-SYNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAF-----TVYSD 264
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L NG VY + + +A +KI+GWG ENG YW I +++ +GD G KILRG
Sbjct: 265 LLNYKNG-VYKHTVGNALGGHA-IKILGWGVENGNKYWLIANSWNSDWGDNGFFKILRGE 322
Query: 248 NEAIIESLVNGALP 261
+ IES + P
Sbjct: 323 DHCGIESSIVAGEP 336
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 91/194 (46%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W K GLVTGG + S GCQP PC Y + C+ P K
Sbjct: 152 CHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
H RCT YG F +D Y + + D +GP ++ + F
Sbjct: 208 H-RCTRMCYGNQDLDFKKDHHFTRDAYYLTFGIIQRDVMAYGPIEASYDVYDDF------ 260
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + +A + VK++GWGEE G PYW +V+++ +Q+GDKG KI RG
Sbjct: 261 PSYKSG--VYVRTENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNDQWGDKGLFKIRRGT 318
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 319 NECGIDNSTTGGVP 332
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 114/264 (43%), Gaps = 37/264 (14%)
Query: 7 SRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVK 64
S+IRD S C + AV+ ++ +C S K VE ++ I+
Sbjct: 74 SQIRDQS-------------QCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISC-- 118
Query: 65 QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
C + S C G +W + RG+VTGG+ ++TGC+P FP C+H
Sbjct: 119 --CKYCGSG------CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKG 169
Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFW 178
C P+C+ C Y + QDK Y + + P
Sbjct: 170 KYRACGDKLYKTPQCNQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-- 226
Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
++ Y L +G +Y + I +A V+++GWG ENG YW +T+ E +G+K
Sbjct: 227 EAYLEIYEDFLNYKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEK 284
Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
G +I+RGRNE +IES + L K
Sbjct: 285 GYFRIVRGRNECLIESEIAAGLIK 308
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 95/203 (46%), Gaps = 34/203 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W++ K+GLV+GG ++S+ GC P + PPC H + S P C T P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
+ C Y + +DK HFG + ++ S K NG V A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 252
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ ++ + Y + ++I+GWG ENG PYW +++ +GD G KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKI 312
Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
LRG N IES + +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 92/194 (47%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W K GLVTGG + S+ GCQP PC Y + C+ P K
Sbjct: 152 CHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
H RCT YG F +D Y + + D +GP ++ + F
Sbjct: 208 H-RCTRMCYGDQDRDFKEDHRFTRDAYYLTYGTIQKDVMTYGPIEASYEVYDDF------ 260
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 261 PSYKSG--VYVRTENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNDQWGDRGLFKIRRGT 318
Query: 248 NEAIIESLVNGALP 261
NE I++ G +P
Sbjct: 319 NECGIDNSTTGGVP 332
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 93/194 (47%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + GLVTGG + S GCQP PC H + S P C + P P+C
Sbjct: 149 CHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEH-HINGSRPACGKI-EPTPRC 206
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR-P 188
C + Y F +DK+ ++ + + + GP AF T Y P
Sbjct: 207 KKTCES-GYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTN-GPVEAAF-----TVYADFP 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+++ VY + AE+ +A VK++GWG E PYW I +++ +GD G KILRG++
Sbjct: 260 HYKSG--VYQHESGAELGGHA-VKMIGWGMEGSTPYWLIANSWNSDWGDMGFFKILRGQD 316
Query: 249 EAIIESLVNGALPK 262
E IE + P+
Sbjct: 317 ECGIERDIVAGEPR 330
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 94/191 (49%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLF 190
C Y + +DK Y I N + + + GP AF T Y+
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAF-----TVYSD-FL 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
Q VY + +++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 261 QYKSGVYQ-HVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALP 261
IES + +P
Sbjct: 320 GIESEIVAGIP 330
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 84/190 (44%), Gaps = 15/190 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGG+ GC+ FP C+H + P C PKC
Sbjct: 149 CQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHG-SKKYPPCPHRIYDTPKC 207
Query: 140 HTRCTNDNYG------RGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQT 192
+C N R Q + + + + GP AF + F
Sbjct: 208 VPKCDTPNIDYETDKTRANITYNVQRSQMAIMKEIMINGPVEAAF------EVYEDFFGY 261
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
VY ++ E + ++I+GWGEENG PYW I +++ E +G+ G K+LRG+NE I
Sbjct: 262 KQGVY-FHSTGEFIGGHAIRILGWGEENGTPYWLIANSWNEGWGEDGYFKMLRGKNECGI 320
Query: 253 ESLVNGALPK 262
E V LP+
Sbjct: 321 EDEVTAGLPE 330
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 92/194 (47%), Gaps = 26/194 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 138
C+ G + WA+ + G+VTGG + + GC+ + PPC H +T + P C + P P+
Sbjct: 154 CNGGWPAEAWAYWAETGIVTGGKYETKDGCKAYTVPPCEH--HTEGDLPACGDI-VPTPQ 210
Query: 139 CHTRCT--------NDNYGRGFFQ---DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
C C +D +Q D+ QI + P F + F Y
Sbjct: 211 CKKECDAGVDIEYKSDLRKGSAYQTSSDESQIQTEIMTNGPVEADF--DVYEDF-LNYKS 267
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
++Q YA + +KI+GWG E+G PYW +++ E +GDKG KILRG+
Sbjct: 268 GVYQQTTGNYAGGHA--------IKILGWGVEDGTPYWLAANSWNEDWGDKGYFKILRGQ 319
Query: 248 NEAIIESLVNGALP 261
NE IES + G +P
Sbjct: 320 NECGIESDIIGGIP 333
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 94/191 (49%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLF 190
C Y + +DK Y I N + + + GP AF T Y+
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAF-----TVYSD-FL 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
Q VY + +++ ++I+GWG ENG PYW + +++ +GD G KILRG++
Sbjct: 261 QYKSGVYQ-HVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 319
Query: 251 IIESLVNGALP 261
IES + +P
Sbjct: 320 GIESEIVAGIP 330
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 91/193 (47%), Gaps = 20/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + GLVTGG ++S GCQP C+H +P C + P+C
Sbjct: 145 CNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP-CASKEEHTPRC 203
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
C Y F +DK Y + GP AF T Y P
Sbjct: 204 SKTC-EAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAF-----TVYADFPT 257
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+++ VY ++ A + +A ++I+GWG ENG PYW + +++ E +G G KI+RG+++
Sbjct: 258 YKSG--VYQHTSGAMLGGHA-IRILGWGTENGTPYWLVANSWNEDWGAMGYFKIIRGKDD 314
Query: 250 AIIESLVNGALPK 262
IES + +PK
Sbjct: 315 CGIESQITAGMPK 327
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+ C Y + +DK Y ++ GP AF T ++ L
Sbjct: 208 NKMCEA-GYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +++ ++I+GWG ENG PYW + +++ +GD G KILRG N
Sbjct: 262 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHC 319
Query: 251 IIESLVNGALPK 262
IES + +P+
Sbjct: 320 GIESEIVAGIPR 331
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 93/190 (48%), Gaps = 32/190 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE--------PECKT 131
C G+ + W + K G+V+GG + S GCQP + PPCNH + E P+CK
Sbjct: 153 CEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKN 212
Query: 132 L-ATPQ--------PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPH-FGPFWP 175
+ P+ P+C +C N NY + +DK Y++ ++ + + +GP
Sbjct: 213 IPVIPEQCKYIPITPECEKKC-NKNYKVCYSKDKHRGKSVYRVKKSEIFKEIYEYGPV-- 269
Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
S+ T Y L G +Y + S + + +VKI+GWGEE G YW ++F +
Sbjct: 270 ---TSYFTVYEDFLNYKEG-IYNYT-SGQKLGLHSVKIIGWGEERGIKYWLAANSFNTDW 324
Query: 236 GDKGTIKILR 245
GDKG KI+R
Sbjct: 325 GDKGFFKIIR 334
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 92/195 (47%), Gaps = 25/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + GLV+GG + ++ GC+P S PC H T P C P PKC
Sbjct: 191 CNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP-CSGEG-PTPKC 248
Query: 140 HTRC---------TNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
C + N+G + D+ QI + P G F + F
Sbjct: 249 ERTCEKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAF--TVYADF------ 300
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + E+ +A ++++GWG E+G PYW + +++ +GD G KILRG+
Sbjct: 301 PTYKSG--VYQHVSGGELGGHA-IRVLGWGVEDGTPYWLVANSWNSDWGDNGFFKILRGQ 357
Query: 248 NEAIIESLVNGALPK 262
NE IE + LPK
Sbjct: 358 NECGIEGEIVAGLPK 372
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 94/192 (48%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + + P C T PKC
Sbjct: 133 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 190
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+ C Y + +DK Y ++ GP AF T ++ L
Sbjct: 191 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAF-----TVFSDFLT 244
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +++ ++I+GWG ENG PYW + +++ +GD G KILRG N
Sbjct: 245 YKSG-VYKHEA-GDVMGGHAIRILGWGIENGVPYWLVANSWNADWGDNGFFKILRGENHC 302
Query: 251 IIESLVNGALPK 262
IES + +P+
Sbjct: 303 GIESEIVAGIPR 314
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 69/251 (27%), Positives = 118/251 (47%), Gaps = 33/251 (13%)
Query: 21 RRPYALS-CIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTI 77
+R Y S C + A+A+ ++ +C + + VE ++ ++ +CA
Sbjct: 101 KRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELVSCC-SKCAV-------- 151
Query: 78 WVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S S W + + GLVTG ++ +N+GC P FP C+H + + S P C + P
Sbjct: 152 -GCNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDHGS-SDSYPMCGYVVYTPP 209
Query: 138 KCHTRCT-------NDN--YGRGFFQDKYQINGLGLYFDPHFGPFWPA-FWRSFCTKYTR 187
C+ C ND+ +G+ +Q K + + +GP + F Y
Sbjct: 210 VCNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREI-MLYGPVEASIFIYDDFVDYKS 268
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+++ + ++ +V+I+GWG ENG PYW +++ E++G G KILRG
Sbjct: 269 GVYK--------HLTGRLITIQSVRIIGWGIENGIPYWLCANSWNEEWGLNGFFKILRGS 320
Query: 248 NEAIIESLVNG 258
NE IE+ VN
Sbjct: 321 NECEIEAFVNA 331
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 89/194 (45%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + ++G+V+GG++ S +GC+P FPPC H T C P C
Sbjct: 162 CDGGFPYAAWNYWVEKGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTC 221
Query: 140 HTRC--------TNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
+C TND YG + ++ + H GP A+ + F Y +
Sbjct: 222 EHKCQSGYATAYTNDKRYGAKAYTVAARVKAIQKEIMLH-GPVEVAYDVYEDF-EHYLKG 279
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+++ Y + VK++GWG ENG PYW +++ +G+ G +ILRG +
Sbjct: 280 IYKHTAGSY--------LGGHAVKMIGWGTENGIPYWICSNSWNSDWGENGFFRILRGTD 331
Query: 249 EAIIESLVNGALPK 262
E IES V LPK
Sbjct: 332 ECGIESGVVAGLPK 345
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 85/186 (45%), Gaps = 18/186 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + GLV+ CQP FPPC H+ + P C + PKC
Sbjct: 159 CDGGYPDEAWLYFTESGLVS-------DYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKC 211
Query: 140 HTRCTNDNYG--RGFFQDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
+ CT+ R F + Y + G Y + GPF AF T Y L +G
Sbjct: 212 NATCTDKRIPVVRYFASESYSLQGEEDYKRELYLRGPFEVAF-----TVYEDFLAYESG- 265
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + + +A V++VGWGE NG PYW I +++ +G+ G + RG++E IES
Sbjct: 266 VYKHVSGGPVGGHA-VRVVGWGERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIESQ 324
Query: 256 VNGALP 261
+ P
Sbjct: 325 GSAGTP 330
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 94/185 (50%), Gaps = 19/185 (10%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC+ C
Sbjct: 29 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCNKTC-E 85
Query: 146 DNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + +DK Y + N + + + GP AF + Y+ L +G
Sbjct: 86 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 140
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
V S EI+ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES +
Sbjct: 141 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 198
Query: 257 NGALP 261
+P
Sbjct: 199 VAGMP 203
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 87/191 (45%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
C Y + +DK Y N + GP AF
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAF------SVFSDFL 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
Q VY + E++ V+I+GWG EN PYW + +++ +GD G KILRGR+
Sbjct: 261 QYKSGVYQ-HVTGEMMGGHAVRILGWGVENDTPYWLVGNSWNTDWGDHGFFKILRGRDHC 319
Query: 251 IIESLVNGALP 261
IES V +P
Sbjct: 320 GIESEVVAGIP 330
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 88/190 (46%), Gaps = 14/190 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
C G + W + +G+VTGG + SN GCQP PC+H +S C +L Q
Sbjct: 134 CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 192
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
C +C N NY + D Y+ + + + + P +F Y +
Sbjct: 193 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMGY 250
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
G VY S + E++ Y VK++GWG +E G YW ++++ +G+ G KILRG N
Sbjct: 251 KEG-VYK-STAGELIGYHHVKLIGWGVDEAGIEYWLAMNSWNSNWGNDGLFKILRGYNFC 308
Query: 251 IIESLVNGAL 260
IE LV L
Sbjct: 309 SIELLVMAGL 318
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 92/185 (49%), Gaps = 19/185 (10%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 77 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133
Query: 146 DNYGRGFFQDKY--------QINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + +DK+ N + + + GP AF + Y+ L +G
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 188
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
V S EI+ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES +
Sbjct: 189 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 246
Query: 257 NGALP 261
+P
Sbjct: 247 VAGMP 251
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 87/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + F P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSGESVFQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE IE
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 93/185 (50%), Gaps = 19/185 (10%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212
Query: 146 DNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + +DK Y + N + + + GP AF + Y+ L +G
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 267
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
V S EI+ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES +
Sbjct: 268 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 325
Query: 257 NGALP 261
+P
Sbjct: 326 VAGMP 330
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 94/198 (47%), Gaps = 27/198 (13%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G + + WVH G+V+GG+ +S GCQP PC H + P+C PK
Sbjct: 149 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKCSE-GGGTPK 205
Query: 139 CHTRCTN----------DNYGRGF--FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
C RC N + G+ + +D+ QI Y GP AF T Y
Sbjct: 206 CVKRCENGYTVDYESDLHHGGKAYSIMKDEDQIK----YEIMKNGPVEGAF-----TVYV 256
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VY + +A ++I+GWGEENG PYW +++ +GD G KILRG
Sbjct: 257 DFLHYKSG-VYQHRHGLPLGGHA-IRILGWGEENGTPYWLCANSWNTDWGDNGLFKILRG 314
Query: 247 RNEAIIESLVNGALPKDN 264
+ IES ++ LPK N
Sbjct: 315 SDHCGIESEISAGLPKLN 332
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 110/246 (44%), Gaps = 30/246 (12%)
Query: 28 CIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
C + AV++ ++ +C S K VE ++ I+ K + C G
Sbjct: 115 CASSWAVSSVGAMSDRICIQSGGKQSVELSAIDLISCCKNCGSG----------CDGGYF 164
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECKTLATPQ 136
+W + G+VTGG+ ++TGC+P FP C+H + P+CK T Q
Sbjct: 165 LPSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQ--TCQ 222
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
+T D + GF + + + GP ++ Y L +G +
Sbjct: 223 KGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLNYKSG-I 276
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 277 YRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEI 335
Query: 257 NGALPK 262
L K
Sbjct: 336 AAGLIK 341
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 92/191 (48%), Gaps = 22/191 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+ TGG + SN GC P PPC + + + L +P
Sbjct: 155 CQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207
Query: 140 HT-RCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
H +C YG +++Y++ + + +GP +F Y +
Sbjct: 208 HNHKCPRACYGNSTVENRYKVKSIYVLDSSKTIEQDIRKYGPVEASF-----DVYDDFIT 262
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + +A V +VK++GWGEE+G PYW +V+++ + +G++GT +I++GRNE
Sbjct: 263 YKSG-IYQKTPNAFYVGGHSVKLIGWGEEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNEC 321
Query: 251 IIESLVNGALP 261
IE +P
Sbjct: 322 GIERSATAGVP 332
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 90/199 (45%), Gaps = 32/199 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W++ K G+VTGG + S+ GC P C+H T P C P P+C
Sbjct: 163 CNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYPIKACDHHVNGTLGP-CDKKIPPTPRC 221
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
C +G+ D + D H+G ++ K + TNG V A
Sbjct: 222 VHMCR-----KGYDVDYHD--------DKHYGK--SSYSVPSEEKQIQAEIMTNGPVEAD 266
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ ++ V Y + ++++GWG ENG PYW +++ ++GDKG KI
Sbjct: 267 FTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGVENGVPYWLAANSWNTEWGDKGFFKI 326
Query: 244 LRGRNEAIIESLVNGALPK 262
LRG +E IE V LPK
Sbjct: 327 LRGSDECGIEDDVVAGLPK 345
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 90/197 (45%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W KRGLVTGG + S GC+P PPC + +E P+
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY S +A + VK++GWGEE G PYW +V+++ +GD G KI RG
Sbjct: 266 PSYKSG--VYVKSENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGT 323
Query: 248 NEAIIESLVNGALPKDN 264
NE I++ +P N
Sbjct: 324 NECGIDNSTTAGVPVTN 340
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/195 (32%), Positives = 87/195 (44%), Gaps = 26/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +S W + G+V+GG + S GCQP S PC H + S P C P C
Sbjct: 150 CDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAPCEH-HVPGSRPACSG-GGDTPDC 207
Query: 140 HTRCTNDNYGRGFFQDKY------------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
+C ++ G + QD Y QI L GP AF T Y
Sbjct: 208 RNQC-DEGSGISYDQDHYYGETVYTLDEAKQIQAEIL----KNGPVEAAF-----TVYED 257
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L G VY A + +A +KI+GWG EN PYW + +++ +G+ G KILRG
Sbjct: 258 LLNYKEG-VYQHVAGEALGGHA-IKILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGS 315
Query: 248 NEAIIESLVNGALPK 262
+E IE + LP+
Sbjct: 316 DECGIEDQIVAGLPR 330
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 92/193 (47%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + + GLVTGG + S TGC P PC H + P+C P C
Sbjct: 182 CNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPYQIKPCEH-HVPGDRPKCSE-GGGTPSC 239
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTR-PL 189
++C N + QDK Y ++ + DP GP AF T Y P
Sbjct: 240 VSKCKG-NTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAF-----TVYADFPT 293
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+++ VY + ++ ++I+GWG ENG YW + +++ +GDKG KILRG +E
Sbjct: 294 YKSG--VYK-HVTGGVLGGHAIRILGWGSENGVAYWLVANSWNTDWGDKGYFKILRGSDE 350
Query: 250 AIIESLVNGALPK 262
IES V +P+
Sbjct: 351 CGIESSVVAGIPQ 363
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 80/194 (41%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG+ TGC+P FP C H + P C P PKC
Sbjct: 155 CDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 213
Query: 140 HTRC-----------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
C T N Q + I L P +F P
Sbjct: 214 VKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGP--------VEATFEVHEDFP 265
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+++ +A S V ++I+GWGEENG PYW I +++ E +G+KG ++ LRG N
Sbjct: 266 EYKSGIYFHAWGGS---VGGHAIRILGWGEENGVPYWLIANSWNEDWGEKGYLRFLRGHN 322
Query: 249 EAIIESLVNGALPK 262
E IE LP
Sbjct: 323 ECGIEEEATAGLPD 336
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 93/185 (50%), Gaps = 19/185 (10%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212
Query: 146 DNYGRGFFQDK------YQI--NGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + +DK Y + N + + + GP AF + Y+ L +G
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAF-----SVYSDFLLYKSGVY 267
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
V S EI+ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES +
Sbjct: 268 QHV--SGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 325
Query: 257 NGALP 261
+P
Sbjct: 326 VAGMP 330
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 87/192 (45%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + + W + HK G+V+GG + S GCQP S PC H+ S P C+ + PKC
Sbjct: 150 CLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHS-IPGSRPACEGVRD-TPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
+C YG + D Y G + D GP + LF
Sbjct: 208 KKQCEK-GYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPIVASIL------VYEDLF 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
VY + E++ +KI+GWG EN PYW + +++ +G+ G KILRG +E
Sbjct: 261 SYKAGVYQ-HVAGEVLGGHVIKILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGSDEC 319
Query: 251 IIESLVNGALPK 262
IE + +P+
Sbjct: 320 GIEDQIVAGIPR 331
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 90/197 (45%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W KRGLVTGG + S GC+P PPC + +E P+
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLTYGSIQKDVMTYGPIEASFDVYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY S +A + VK++GWGEE G PYW +V+++ +GD G KI RG
Sbjct: 266 PSYKSG--VYVKSENATYLGGHAVKLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIRRGT 323
Query: 248 NEAIIESLVNGALPKDN 264
NE I++ +P N
Sbjct: 324 NECGIDNSTTAGVPVTN 340
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + WA+ ++G+V+GG + S+ GC+P PC H + + P C P C
Sbjct: 161 CNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCDGEHGKTPSC 219
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
C T+ ++G + K + + + GP AF T Y +
Sbjct: 220 RHECQKSYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQN-GPVEGAF-----TVYEDLIL 273
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY E+ +A ++I+GWG EN PYW I +++ +G+ G K+LRG +
Sbjct: 274 YKDG-VYQHVHGRELGGHA-IRILGWGVENKTPYWLIANSWNTDWGNNGFFKMLRGEDHC 331
Query: 251 IIESLVNGALPK 262
IES + LPK
Sbjct: 332 GIESAIAAGLPK 343
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 86/191 (45%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 155 CSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPHQYYPTPEC 213
Query: 140 HTRCTNDNYGRGFFQDKYQIN-GLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C D G + +DK + N +Y GP F T Y
Sbjct: 214 VQHC--DTPGIDYVKDKTRANMSYNIYSSEILIMKEIMLRGPVEAVF-----TVYED-FL 265
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
Q VY S A + +A ++I+GWGEE PYW I +++ E +G+KG +K LRG NE
Sbjct: 266 QYKFGVYFHSWGAPLSEHA-IRILGWGEEGDVPYWLIANSWNEDWGEKGYMKFLRGLNEC 324
Query: 251 IIESLVNGALP 261
IE V LP
Sbjct: 325 GIEDDVTAGLP 335
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 104/255 (40%), Gaps = 59/255 (23%)
Query: 28 CIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
C AVA+A ++ C + M V+ ++ I+ K + C G S
Sbjct: 109 CDSGWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG-----------CQIGFS 157
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC-- 143
+W + K GLVTG TGC P FP C+H + + S P+C + P C C
Sbjct: 158 EFSWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPCTKTCRS 212
Query: 144 -------TNDNYGRGFF---------QDKYQING---LGLYFDPHFGPFWPAFWRSFCTK 184
+ +YGR + + + +NG G++ F + +R
Sbjct: 213 GYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHI--- 269
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
+ ++V +V+I+GWG EN PYW +++ E +G G KIL
Sbjct: 270 -----------------TGQLVTIHSVRIIGWGIENDIPYWLCANSWNEDWGLNGYFKIL 312
Query: 245 RGRNEAIIESLVNGA 259
RG NE IES VN
Sbjct: 313 RGSNECEIESFVNAG 327
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 88/186 (47%), Gaps = 18/186 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI W + K G+VTG + ++TGC+P FP C H + P C + P+C
Sbjct: 163 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 221
Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK++ + + D +GP +F T Y L
Sbjct: 222 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASF-----TVYEDFLN 275
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G ++ A + ++I+GWG EN PYW I +++ E +G+ G +I+RGR+E
Sbjct: 276 YKSGIYKHITGEA--LGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 333
Query: 251 IIESLV 256
IES V
Sbjct: 334 FIESEV 339
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 18/186 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI W + K G+VTG + ++TGC+P FP C H + P C + P+C
Sbjct: 158 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216
Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK++ + + D +GP +F T Y L
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASF-----TVYEDFLN 270
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + E + ++I+GWG EN PYW I +++ E +G+ G +I+RGR+E
Sbjct: 271 YKSG-IYK-HITGEALGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 328
Query: 251 IIESLV 256
IES V
Sbjct: 329 FIESEV 334
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 87/189 (46%), Gaps = 24/189 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G WA+ G+ +G CQP FP C+H +T+ P+C L P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210
Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
+ CT+ + ++ Y ++G LYF GPF F LF
Sbjct: 211 NPACTDSTISKKKYRGLKSYSLSGEEDFRRELYFR---GPFQAVF------DVWSDLFAY 261
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
VY A I A+A V+IVGWG ++G PYW I +++ ++GD+G +LRG NE I
Sbjct: 262 KHGVYKHVGGAFIGAHA-VRIVGWGNQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGI 320
Query: 253 ESLVNGALP 261
E + +P
Sbjct: 321 EDSGSAGVP 329
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 88/195 (45%), Gaps = 20/195 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +G+ TGG + + GC P PPC + P +
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNT-----CGGKPMERN 208
Query: 140 HTRCTNDNYGRGFFQDKYQ------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQ 191
H +C YG+ QD+Y+ IN + +GP +F Y
Sbjct: 209 H-QCPKTCYGKTTVQDRYKTKNEYVINSIETIEQDLMTYGPVEASF-----DVYDDFSVY 262
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+G +Y + A+ ++KI+GWGEENG PYW V+++ + +GD GT KI++GRNE
Sbjct: 263 KSG-IYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAVNSWSKFWGDHGTFKIIKGRNECG 321
Query: 252 IESLVNGALPKDNYG 266
IE V +P + G
Sbjct: 322 IERAVTAGIPSTSRG 336
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 94/194 (48%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG ++S+ GC P + PPC H + S P+C T PKC
Sbjct: 150 CNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEH-HVNGSRPQC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + +DK Y ++ GP AF T ++ L
Sbjct: 208 TKSC-EAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPVEGAF-----TVFSDFLT 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY A +I+ ++I+GWG EN PYW + +++ +GD G KILRG +
Sbjct: 262 YKSG-VYKHEA-GDIMGGHAIRILGWGVENSVPYWLVANSWNVDWGDNGLFKILRGEDHC 319
Query: 251 IIESLVNGALPKDN 264
IES + +P+ +
Sbjct: 320 GIESEIVAGIPRTD 333
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/183 (31%), Positives = 86/183 (46%), Gaps = 12/183 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKMYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDKY----QINGLG--LYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+C Y + DK+ IN + L + P ++ + L +
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGIAINVIKNELAIQKEIMMYGPV--EAYLLIFEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + + V V+I+GWG ENG YW +T+ E +G+KG +I+RGRNE IE
Sbjct: 275 G-IYKYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
Query: 254 SLV 256
S+V
Sbjct: 333 SVV 335
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 93/195 (47%), Gaps = 33/195 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP---- 135
C G + W +RG+VTGG + TGC+P PCN N C L TP
Sbjct: 153 CDGGFPYRAFQWWARRGVVTGG-DYLGTGCKPYPIRPCNSDN-------CVNLQTPPCRL 204
Query: 136 --QPKCHTRCTND-NYGRGFFQDKYQINGL--GLYFDPHFGPFWPAF--WRSFCTKYTRP 188
QP T TND NYG + + + +Y++ GP AF + F KY
Sbjct: 205 SCQPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYN---GPVVAAFIVYEDF-EKYKSG 260
Query: 189 LFQ-TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+++ GR A VK++GWG E G PYW V+++G Q+G+ GT +ILRG
Sbjct: 261 IYRHIAGRSKGGHA---------VKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGV 311
Query: 248 NEAIIESLVNGALPK 262
+E IES + LP+
Sbjct: 312 DECGIESRIVAGLPR 326
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 90/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + G+VTGG +S+ GCQP C+H T P C+ P P+C
Sbjct: 173 CNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEG-PTPEC 230
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAF--WRSFCTKYTRP 188
+C +Y + QDK Y ++ + +P GP F + F T +
Sbjct: 231 KHKC-EASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKSGV 289
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
T G V A +KI+GWG E G YW + +++ ++GD G KILRG N
Sbjct: 290 YQHTTGGVLGGHA---------IKILGWGVEEGTKYWLVANSWNNEWGDNGFFKILRGSN 340
Query: 249 EAIIESLVNGALPK 262
E IES +N +PK
Sbjct: 341 ECGIESDINFGIPK 354
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 97/196 (49%), Gaps = 27/196 (13%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G + + WVH G+V+GGA +S GCQP PC H + + P+C + PK
Sbjct: 148 CNGGFPGAAFQYWVHS-GIVSGGAFNSTQGCQPYEIAPCEH-HVSGPRPKCAEGGS-TPK 204
Query: 139 CHTRCTND---------NYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
CH C ++ ++G + +D+ QI Y GP AF T Y
Sbjct: 205 CHKNCESNYVVDYESDLHHGSKHYSVDKDETQIK----YDIMTNGPVEGAF-----TVYV 255
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VY + + +A ++++GWGEE+G PYW +++ +GD G KILRG
Sbjct: 256 DFLHYKSG-VYQHTHGLPLGGHA-IRVLGWGEEDGTPYWLCANSWNTDWGDNGYFKILRG 313
Query: 247 RNEAIIESLVNGALPK 262
+ IES ++ LPK
Sbjct: 314 SDHCGIESEISAGLPK 329
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 90/198 (45%), Gaps = 23/198 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C P C
Sbjct: 151 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEEGDTPTC 209
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
+C + Y + DK G Y P GP AF + Y
Sbjct: 210 RKKC-EEGYSTQYKDDKNY--GSTSYSVPSSEQEIMAEIYKNGPVEGAF-----SVYEDF 261
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G V + E++ ++I+GWG ENG YW +++ +GD G K LRG+N
Sbjct: 262 LHYKSGVYQHV--AGEMLGGHAIRILGWGVENGIRYWLAANSWNIDWGDNGFFKFLRGKN 319
Query: 249 EAIIESLVNGALPK-DNY 265
IES + +P+ D Y
Sbjct: 320 HCGIESEIIAGIPRTDQY 337
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 88/186 (47%), Gaps = 18/186 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI W + K G+VTG + ++TGC+P FP C H + P C + P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216
Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK++ + + D +GP F T Y L
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGF-----TVYEDFLN 270
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + E + ++I+GWG EN PYW I +++ E +G+ G +I+RGR+E
Sbjct: 271 YKSG-IYK-HITGETLGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 328
Query: 251 IIESLV 256
IES V
Sbjct: 329 SIESEV 334
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+ Q +T D + GF + + + GP ++ Y L
Sbjct: 219 QIC--QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 330 SIESEIAAGLIK 341
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 88/194 (45%), Gaps = 19/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + K GL TGG++ S GC+P S PC + P C P P C
Sbjct: 100 CGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 159
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
+CT+ N G DK + G + P+ GP F Y
Sbjct: 160 EKKCTSKN-GYPVDIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETTF-----EVYDDF 213
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L T G ++ + + + +V+I+GWG G PYW + +++G+++G+ GT + LRG N
Sbjct: 214 LQYTTGIYVHLTGNKQ--GHLSVRILGWGMYEGVPYWLLANSWGKEWGENGTFRALRGTN 271
Query: 249 EAIIESLVNGALPK 262
E +E+ +PK
Sbjct: 272 ECGLEANCVSGMPK 285
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 88/190 (46%), Gaps = 20/190 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +G+ TGG +++ GC P PPC + E C P +
Sbjct: 154 CGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQ---GENICD--EQPMERN 208
Query: 140 HTRCTNDNYGRGFFQDKYQ------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQ 191
H +C YG+ Q++Y+ IN + +GP +F L
Sbjct: 209 H-QCPKTCYGKTTVQNRYKTKSEYYINSIKTIEQDIKTYGPVEASF------DCYDDLSV 261
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+Y S +A+ ++KI+GWG+E+G PYW V+++ + +GD GT KI++GRNE
Sbjct: 262 YKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTPYWLAVNSWSKFWGDHGTFKIIKGRNECG 321
Query: 252 IESLVNGALP 261
IE V +P
Sbjct: 322 IERAVTAGIP 331
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 87/193 (45%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + +G+ TGG +S+ GCQP P C H + T P C + PKC
Sbjct: 65 CDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEH-HTTGDRPPCSDIVD-TPKC 122
Query: 140 HTRCT---NDNY--GRGFFQDKYQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRPL 189
C N +Y + F + Y I L GP AF + F Y +
Sbjct: 123 VHLCEKGYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDF-INYKSGV 181
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q + S E + ++++GWG EN PYW +++ +GDKG KILRG +E
Sbjct: 182 YQHH--------SGESLGGHAIRVLGWGYENDVPYWLCANSWNTDWGDKGYFKILRGSDE 233
Query: 250 AIIESLVNGALPK 262
IES + +PK
Sbjct: 234 CGIESSIVAGIPK 246
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+ Q +T D + GF + + + GP ++ Y L
Sbjct: 219 QIC--QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 330 SIESEIAAGLIK 341
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T Q +T D + GF + + + GP ++ Y L
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 330 SIESEIAAGLIK 341
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 93/195 (47%), Gaps = 24/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++G+V+GG + S+ GC+P PC H + + P C+ P+C
Sbjct: 157 CNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215
Query: 140 HTRC---------TNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
+C T+ ++G + ++ I G + P G F T Y
Sbjct: 216 QHKCQASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTNGPVEGAF---------TVYED 266
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+ +G VY E+ +A ++I+GWG E PYW I +++ +G+ G KILRG+
Sbjct: 267 LILYKDG-VYEHVHGKELGGHA-IRIIGWGVEKDTPYWLIANSWNTDWGNNGFFKILRGK 324
Query: 248 NEAIIESLVNGALPK 262
+ IES ++ LPK
Sbjct: 325 DHCGIESSISAGLPK 339
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 95/200 (47%), Gaps = 28/200 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W++ K+GLV+GG ++S+ GC P + PPC H + S P C T P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207
Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
+ C Y + +DK+ +I DP G F T ++
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEGAF---------TVFS 257
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VY A + +A ++I+GWG NG PYW +++ +GD G KILRG
Sbjct: 258 DFLTYKSG-VYKHEAGDMMGGHA-IRILGWGVGNGVPYWLAANSWNLDWGDNGFFKILRG 315
Query: 247 RNEAIIESLVNGALPK-DNY 265
N IES + +P+ D Y
Sbjct: 316 ENHCGIESEIVAGIPRTDQY 335
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 95/192 (49%), Gaps = 24/192 (12%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
+ W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 AEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPAC-TGEGDTPKCSKTC-E 212
Query: 146 DNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRPLFQTNG 194
Y + +DK+ G Y P GP AF + Y+ L +G
Sbjct: 213 PGYSPTYKEDKH--FGYTSYSLPTNEWEIMAEIYKNGPVEGAF-----SVYSDFLLYKSG 265
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY + +++ ++I+GWGEENG PYW + +++ +GD G +ILRG++ IES
Sbjct: 266 -VYQ-HLTGDMMGGHAIRILGWGEENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIES 323
Query: 255 LVNGALPK-DNY 265
V +P+ D Y
Sbjct: 324 EVVAGIPRTDQY 335
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 89/194 (45%), Gaps = 18/194 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + K GL TGG++ + GC+P S PC + P C P P C
Sbjct: 144 CGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 203
Query: 140 HTRCTNDN-YGRGFFQDKY-------QINGLGLYFDPHF---GPFWPAFWRSFCTKYTRP 188
+CT+ N Y +D++ Q+ + GP F Y
Sbjct: 204 EKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIETTF-----EVYDDF 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L T G ++ + + + +V+I+GWG G PYW + +++G+++G+ GT + LRG N
Sbjct: 259 LQYTTGIYVHLTGNKQ--GHLSVRILGWGMYEGVPYWLLANSWGKEWGENGTFRALRGTN 316
Query: 249 EAIIESLVNGALPK 262
E +E+ A+PK
Sbjct: 317 ECGLEANCVSAMPK 330
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+ Q +T D + GF + + + GP ++ Y L
Sbjct: 219 QIC--QKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 330 SIESEIAAGLIK 341
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T Q +T D + GF + + + GP ++ Y L
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 330 SIESEIAAGLIK 341
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 88/193 (45%), Gaps = 17/193 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + K GLVTGG + S GC+P PPC + + K + +C
Sbjct: 157 CHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKGNNTCAGKPIEKNH-RC 215
Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTRPLFQ 191
C D N F +D Y + + D +GP +F + F P ++
Sbjct: 216 TRMCYGDQDLDYNDDHRFTRDFYYLTYGSIQKDVMTYGPIEASFDVYDDF------PSYK 269
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+ VY + +A + VK++GWG E G PYW +V+++ Q+GDKG KI RG NE
Sbjct: 270 SG--VYEKTENASYLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDKGLFKIRRGTNECG 327
Query: 252 IESLVNGALPKDN 264
I++ +P N
Sbjct: 328 IDNSTTAGVPVTN 340
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 88/202 (43%), Gaps = 39/202 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN---YTTSEPECKTLATPQ 136
C+ G W + K GLVTGG + S+ GCQP P CNH Y E KT
Sbjct: 149 CAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCTGEGKT----- 203
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
P+C C R + Y+ D H+G A R + + TNG V
Sbjct: 204 PQCERTC------RSGYTTSYEA-------DLHYGEKAYAVHRE--VEAIQTEIMTNGPV 248
Query: 197 ------------YAVSASAEIVAYA----TVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
Y +V +A ++I+GWG ENG PYW I +++ +GDKG
Sbjct: 249 EGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGTENGVPYWLIANSWNPSWGDKGY 308
Query: 241 IKILRGRNEAIIESLVNGALPK 262
K++RG+++ IES + PK
Sbjct: 309 FKMIRGKDDCGIESNIVAGTPK 330
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 101/202 (50%), Gaps = 32/202 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAH-----HSNTGCQPVSFPPCNHANYTTSEPECKTLAT 134
C+ G ++ W + K GLV+G + +S T CQP SFPPC+H + C L
Sbjct: 158 CNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSH-HVQGEYQACTDL-- 214
Query: 135 PQ---PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRS 180
PQ PKC+T C + + QD ++ G+ Y P +G +F
Sbjct: 215 PQFNTPKCYTECNSQYTQNSYEQDLHK--GVSSYSVPKSEEQIKAEIYQYGSTTASF--- 269
Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
Y+ L ++G VY ++ + + +A +K++GWG ENG PYW +++ +G+ G
Sbjct: 270 --NVYSDFLTYSSG-VYQNTSGSYMGGHA-IKMLGWGVENGTPYWLCANSWNSSWGENGF 325
Query: 241 IKILRGRNEAIIES-LVNGALP 261
KILRG NE IES +V G +P
Sbjct: 326 FKILRGSNECGIESGMVAGFVP 347
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T Q +T D + GF + + + GP ++ Y L
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 330 SIESEIAAGLIK 341
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 89/194 (45%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W K GLVTGG + S GCQP PC Y + K P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYGNNTCSGK----PAEKN 209
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLTYGTIQNDVLAYGPIEASFEVYDDF------ 262
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY +A + VK++GWGEE G PYW +V+++ +Q+GD+G KI RG
Sbjct: 263 PSYKSG--VYTKMENATYLGGHAVKLIGWGEEYGVPYWLLVNSWNDQWGDQGLFKIRRGT 320
Query: 248 NEAIIESLVNGALP 261
NE ++ G +P
Sbjct: 321 NECGTDNSTTGGVP 334
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/253 (26%), Positives = 104/253 (41%), Gaps = 59/253 (23%)
Query: 30 EARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSS 87
E AVA+A ++ C + M V+ ++ I+ K + C G S
Sbjct: 28 ELWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG-----------CQIGFSEF 76
Query: 88 TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC---- 143
+W + K GLVTG TGC P FP C+H + + S P+C + P C C
Sbjct: 77 SWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPCTKTCRSGY 131
Query: 144 -----TNDNYGRGFF---------QDKYQING---LGLYFDPHFGPFWPAFWRSFCTKYT 186
+ +YGR + + + +NG G++ F + +R
Sbjct: 132 PIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHI----- 186
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+ ++V +V+I+GWG EN PYW +++ E +G G KILRG
Sbjct: 187 ---------------TGQLVTIHSVRIIGWGIENDIPYWLCANSWNEDWGLNGYFKILRG 231
Query: 247 RNEAIIESLVNGA 259
NE IES VN
Sbjct: 232 SNECEIESFVNAG 244
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 86/189 (45%), Gaps = 24/189 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G WA+ G+ +G CQP FP C+H +T+ P+C L P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210
Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
+ CT+ + ++ Y +G LYF GPF F LF
Sbjct: 211 NPACTDSTISKKKYRGLKSYSFSGEEDFRRELYFR---GPFQAVF------DVWSDLFAY 261
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
VY A I A+A V+IVGWG ++G PYW I +++ ++GD+G +LRG NE I
Sbjct: 262 KHGVYKHVGGAFIGAHA-VRIVGWGNQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGI 320
Query: 253 ESLVNGALP 261
E + +P
Sbjct: 321 EDSGSAGVP 329
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 87/189 (46%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE I+
Sbjct: 275 G-IYRYTTGKYISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSID 332
Query: 254 SLVNGALPK 262
S + L K
Sbjct: 333 SEIAAGLIK 341
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++TGC+P FP C+H + P+CK
Sbjct: 146 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCK 205
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T Q +T D + GF + + + GP ++ Y L
Sbjct: 206 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 258
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 259 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 316
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 317 SIESEIAAGLIK 328
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 110/251 (43%), Gaps = 35/251 (13%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHV-ECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
+C AVAT++ A +C ++ E S I C + C+ G
Sbjct: 110 NCGSCWAVATSSAFADRLCVATNADFNELLSAEEITFCCHTCGF---------GCNGGYP 160
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
W K+GLVTGG + S GC+P PPC + + + T A + + RCT
Sbjct: 161 IKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNN-----TCAGKPMESNHRCTR 215
Query: 146 DNYGRG---------FFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTRPLFQTN 193
YG + +D Y + + D +GP +F + F P +++
Sbjct: 216 MCYGDQDLDFDEDHRYTRDYYYLTYGSIQKDVMTYGPIEASFDVYDDF------PSYKSG 269
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
VY S +A + VK++GWGEE G PYW +V+++ E +GD G KI RG NE ++
Sbjct: 270 --VYVKSENASYLGGHAVKLIGWGEEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVD 327
Query: 254 SLVNGALPKDN 264
+ +P N
Sbjct: 328 NSTTAGVPVTN 338
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 86/186 (46%), Gaps = 9/186 (4%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
+C D G+ +DK + N + R + T Y L ++G
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG- 270
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + A + +A V+I+GWGE PYW I +++ E +G++G +K LRG NE IE
Sbjct: 271 VYFHALGAPMSGHA-VRILGWGELGNVPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDD 329
Query: 256 VNGALP 261
V LP
Sbjct: 330 VTAGLP 335
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 20/190 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +G+ TGG + S GC P PPC + P +
Sbjct: 154 CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLFQ 191
H +C YG Q +Y++ + P+ +GP +F L
Sbjct: 209 H-QCPKTCYGSTTVQKRYKVKNEYVLNSPNTMEQDLIKYGPIEASF------NLFDDLSA 261
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+Y + A+ ++ ++KI+GWG+ENG PYW V+++ + +G++GT +I++GRNE
Sbjct: 262 YKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECG 321
Query: 252 IESLVNGALP 261
IE +P
Sbjct: 322 IERSATAGIP 331
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 87/177 (49%), Gaps = 13/177 (7%)
Query: 88 TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-ND 146
W ++ GLV+GG +++N GCQP PP + E C + +C+ T N
Sbjct: 167 VWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLYENTC------EKRCYGNNTINY 220
Query: 147 NYGRGFFQDKYQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEI 205
N ++ Y I + + ++GP AF R F + F VY + ++E
Sbjct: 221 NQDHVKIKNHYDIEYEDIQREVQNYGPVSMAF-RVFDNDF----FLYKSGVYEKTTNSEF 275
Query: 206 VAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+ + K++GWG ENG YW +V+++G ++G G KI RG +E IE+ V+ P+
Sbjct: 276 IQWQYAKLIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 108/242 (44%), Gaps = 31/242 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
+C AV+TA+ L+ +C +S ++HV T G +C + C+
Sbjct: 26 NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGYG---------CNG 74
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G + + K+G VTGG + + +GC+P F PC H T EC AT PKC +
Sbjct: 75 GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 133
Query: 143 C-----TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
C + R +D Y++ GP AF T Y + G
Sbjct: 134 CQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAF-----TVYEDFSYYKKG 188
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+Y +A +A +KI+GWG+ENG PYW I +++ +G+ G +ILRG N IE
Sbjct: 189 -IYKHTAGKARGGHA-IKIIGWGKENGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEE 246
Query: 255 LV 256
V
Sbjct: 247 NV 248
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 93/190 (48%), Gaps = 27/190 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + G VTGG ++S+ GCQP P C H + +P C+ + P PKC
Sbjct: 148 CNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP-CEG-SEPTPKC 205
Query: 140 HTRCTNDNYGRGFFQDKYQINGL------------GLYFDPHFGPFWPAFWRSFCTKYTR 187
C + Y + DK++++ +Y + GP AF T Y+
Sbjct: 206 KRSC-REGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLN---GPVEAAF-----TVYSD 256
Query: 188 -PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
P +++ VY + + +A +KI+GWG EN PYW + +++ +GDKG KILRG
Sbjct: 257 FPNYKSG--VYKYTTGNALGGHA-IKILGWGVENNVPYWLVANSWNPDWGDKGFFKILRG 313
Query: 247 RNEAIIESLV 256
NE IE+ V
Sbjct: 314 SNECGIEASV 323
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 84/195 (43%), Gaps = 21/195 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + GLVTGG + S GC+P PPC TS P K
Sbjct: 156 CNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTS----SCAGQPIEKN 211
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPL 189
H RCT YG F +D Y + + D ++GP +F
Sbjct: 212 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLTYGSIQKDVMNYGPIEASF------DVYDDF 264
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+ VY + +A + VK++GWG E G PYW +V+++ Q+GD G KI RG +E
Sbjct: 265 YSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDE 324
Query: 250 AIIESLVNGALPKDN 264
I+S +P N
Sbjct: 325 CGIDSATTAGVPVTN 339
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 87/194 (44%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +RGLV+GG + S+ GC+ + PPC H + S P C P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEH-HVNGSRPPCTGEGGETPRC 208
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
C Y + +DK+ G+ Y P GP AF Y
Sbjct: 209 SRHC-EPGYSPSYKEDKHY--GITSYGVPRSEKEIMAEIYKNGPVEGAF-----IVYEDF 260
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L +G VS E V ++I+GWG ENG PYW +++ +G G KILRG +
Sbjct: 261 LMYKSGVYQHVSG--EQVGGHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRGED 318
Query: 249 EAIIESLVNGALPK 262
IES + +P+
Sbjct: 319 HCGIESEIVAGVPR 332
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 92/195 (47%), Gaps = 27/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + + GLV+GG +H TGCQP + PC H + P C PKC
Sbjct: 165 CNGGFPQAAWEYWVQNGLVSGGLYHG-TGCQPYAIEPCEH-HTEGDRPPCTGEEGTTPKC 222
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAF--WRSFCTKYT 186
+C D Y F QDK+ G Y P GP AF + F
Sbjct: 223 SHKCV-DGYTGNFAQDKHY--GSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDF----- 274
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
P +++ VY+ + + +A ++++GWGEENG YW +++ +G+ G KI RG
Sbjct: 275 -PTYKSG--VYSHHTGSALGGHA-IRVLGWGEENGEKYWLCGNSWNTDWGNNGFFKIKRG 330
Query: 247 RNEAIIESLVNGALP 261
NE IES + G +P
Sbjct: 331 VNECGIESEMVGGIP 345
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 88/192 (45%), Gaps = 17/192 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W K+G VTGG++ TGC+P +PPC H T C + P KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226
Query: 140 HTRCTNDNYGRGFFQD------KYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + QD Y ++ GP AF T Y
Sbjct: 227 ERSC-QAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAF-----TVY-EDFE 279
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY +A A + +A VK++GWG +NG PYW +++ E +G+ G +I+RG NE
Sbjct: 280 HYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNEC 338
Query: 251 IIESLVNGALPK 262
IE V G +PK
Sbjct: 339 GIEGGVVGGIPK 350
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 96/233 (41%), Gaps = 65/233 (27%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTL- 132
C+ G +S W+WVH +G+ TGG + + + GC P FPPC H T PEC +
Sbjct: 605 CNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPECPKVS 664
Query: 133 -------ATPQ-------------PKCHTRCTNDNYGRGFFQDK----------YQINGL 162
AT + P C +C N Y D+ Y +N
Sbjct: 665 CSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDA 724
Query: 163 --GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYAT---------- 210
+ D GP + FC P + + S + +AY +
Sbjct: 725 KNAIRTDGPVGPIY------FCD----PNVNFDQVSASFSVYEDFLAYKSGVYKHTSGEY 774
Query: 211 -----VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
VKI+GWGEE+G+ YW +V+++ E +GD G KI G N I ++L+ G
Sbjct: 775 LGGHAVKIIGWGEESGQAYWIVVNSWNEDWGDHGLFKIALG-NCGIDDNLLGG 826
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 90/197 (45%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + GLVTGG + S GC+P PPC + + P+ K
Sbjct: 156 CNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR----NEDGKSSCAGKPKEKN 211
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
H RCT YG F +D Y + + D ++GP +F + F
Sbjct: 212 H-RCTRMCYGNQDLDYDDDHRFTRDFYYLTYGSIQKDVLNYGPIEASFDVYDDF------ 264
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + +A + VK++GWG E G PYW +V+++ Q+GD G KI RG
Sbjct: 265 PSYKSG--VYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGT 322
Query: 248 NEAIIESLVNGALPKDN 264
+E I+S +P N
Sbjct: 323 DECRIDSATTAGVPVTN 339
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 87/191 (45%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI S W + G+V+GG ++S+ GC P PPC H P C T PKC
Sbjct: 151 CNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPYEIPPCEHHVPGNRIP-CNG-ETSTPKC 208
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
H C + Y + DK Y + G + GP AF T Y L
Sbjct: 209 HRSCRKE-YTNSYKSDKKYGKHVYSVGGGEEHIKAEIFKNGPVEGAF-----TVYADLLT 262
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY + + +A +KI+GWG ENG YW I +++ +GD G KILRG +
Sbjct: 263 YKSG-VYKHTEGEALGGHA-IKIMGWGVENGNKYWLIANSWNSDWGDNGFFKILRGEDHC 320
Query: 251 IIESLVNGALP 261
IES + P
Sbjct: 321 GIESSIVAGEP 331
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 20/187 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPL 189
+C Y + DK+ G+ + + +GP A+ F
Sbjct: 218 KRKCQK-GYTTPYEHDKH-YGGISINVIKNESAIQKEIMMYGPV-EAYLLIF-----EDF 269
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Y + + V V+I+GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 270 LNYKSGIYRYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNE 328
Query: 250 AIIESLV 256
IES+V
Sbjct: 329 CSIESVV 335
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 20/187 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPL 189
+C Y + DK+ G+ + + +GP A+ F
Sbjct: 218 KRKCQK-GYTTPYEHDKH-YGGISINVIKNESAIQNEIMMYGPV-EAYLLIF-----EDF 269
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Y + + V V+I+GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 270 LNYKSGIYRYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNE 328
Query: 250 AIIESLV 256
IES+V
Sbjct: 329 CSIESVV 335
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 89/197 (45%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W K GLVTGG + S GC+P PPC + E T A +
Sbjct: 157 CNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY-----DESGNNTCAGKPMEA 211
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
+ RCT YG + +D Y + + D +GP +F + F
Sbjct: 212 NHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYGSIQKDVLTYGPVEASFDVYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY S +A + K++GWGEE G PYW +V+++ +GD G KI RG
Sbjct: 266 PSYKSG--VYIRSENASYLGGHAAKLIGWGEEYGVPYWLMVNSWNADWGDNGLFKIQRGT 323
Query: 248 NEAIIESLVNGALPKDN 264
NE I++ G +P N
Sbjct: 324 NECGIDNSTTGGVPITN 340
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 85/215 (39%), Gaps = 36/215 (16%)
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC--NHANYTT 124
C L+S W C W GL TGG + GC+P S PC N+ N TT
Sbjct: 153 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYPNGTT 212
Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
S P C TP C CT N + + QDK HFG +
Sbjct: 213 SVP-CPGYHTP--PCEDHCTSNITWPIAYKQDK------------HFGKAHYNVGKKMTD 257
Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
T TNG V A V + + KI+GWG +NG PYW
Sbjct: 258 IQTE--IMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLC 315
Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V +G FG+ G ++ILRG NE IE V ALP
Sbjct: 316 VHQWGTDFGENGFVRILRGVNEVNIEHQVLAALPD 350
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 94/203 (46%), Gaps = 34/203 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W++ K+GLV+GG ++S+ GC P + PPC H + S P C T +C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTHRC 207
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
+ C Y + +DK HFG + ++ S K NG V A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 252
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ ++ + Y + ++I+GWG ENG PYW +++ +GD G KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKI 312
Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
LRG N IES + +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 56/187 (29%), Positives = 84/187 (44%), Gaps = 20/187 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 133 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-KGKYPSCGDKMYKTPQC 191
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPL 189
+C Y + DK+ G+ + + +GP A+ F
Sbjct: 192 KRKCQK-GYKTPYEHDKH-YGGISINVIKNESAIQKEIMMYGPV-EAYLLIF-----EDF 243
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Y + + V V+I+GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 244 LNYKSGIYRYT-TGSFVGEHYVRIIGWGIENGTAYWLAANTWNEDWGEKGYFRIVRGRNE 302
Query: 250 AIIESLV 256
+ES+V
Sbjct: 303 CSVESVV 309
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 86/186 (46%), Gaps = 18/186 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI W + K G+VTG + ++ GC+P FP C H + P C + P+C
Sbjct: 72 CEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 130
Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK++ + + D +GP F T Y L
Sbjct: 131 KQTC-QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGF-----TVYEDFLN 184
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G ++ E + ++I+GWG EN PYW I +++ E +G+ G +I+RGR+E
Sbjct: 185 YKSGIYKHITG--ETLGGHAIRIIGWGVENKAPYWLIANSWNEDWGENGYFRIVRGRDEC 242
Query: 251 IIESLV 256
IES V
Sbjct: 243 SIESEV 248
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 89/197 (45%), Gaps = 25/197 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + G+VTGG + S GC+P PPC E + P K
Sbjct: 157 CNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ----DEEGKSSCAGKPIEKN 212
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFD-PHFGPFWPAF--WRSFCTKYTR 187
H RCT YG F +D Y + + D ++GP +F + F
Sbjct: 213 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLTYGSIQKDVMNYGPIEASFDVYDDF------ 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ VY + +A + VK++GWG E G PYW +V+++ Q+GD G KI RG
Sbjct: 266 PSYKSG--VYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGT 323
Query: 248 NEAIIESLVNGALPKDN 264
+E I+S +P N
Sbjct: 324 DECGIDSAATAGVPVTN 340
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 85/178 (47%), Gaps = 14/178 (7%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-N 145
S W ++ G+V+GG ++SN GCQP FPP + P+ T C+ T N
Sbjct: 141 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPP------IANIPKHLHKHTCDDHCYGNSTIN 194
Query: 146 DNYGRGFFQDKYQINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
N+ ++ Y I + + +GP F C + F VYA S A+
Sbjct: 195 YNHDHVRVRNYYTIRTRDIQKEVQTYGPVVVRF--MVCDDF----FLYKSGVYAKSDKAK 248
Query: 205 IVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+ K++GWG ENG YW +++++G ++G KG KI G N+ +ES V LP+
Sbjct: 249 GIRTQYAKLIGWGVENGVDYWLVINSWGHEWGQKGLFKIKSGTNQCGVESFVYAGLPE 306
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 92/189 (48%), Gaps = 19/189 (10%)
Query: 79 VCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
+ + G+ S + + K G+ TGG + + CQP S PC+ +YT S P CK Q
Sbjct: 338 ILACGMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTASTPSCKY--DCQAD 395
Query: 139 CHTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH-FGPFWPAF--WRSFCTKYTRPLFQTN 193
++D + + + Y + N + + + GP F + F T Y ++Q
Sbjct: 396 YDIPISDDKF---YASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDF-TYYISGIYQQT 451
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
V A+ A ++I+GWGEENG PYW I +++ FG+KG +I RG NE IE
Sbjct: 452 TYV-AMGGHA-------IRIIGWGEENGIPYWLIANSWNTTFGEKGFFRIRRGTNECRIE 503
Query: 254 SLVNGALPK 262
S V +PK
Sbjct: 504 SEVYTGIPK 512
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 42/95 (44%), Gaps = 6/95 (6%)
Query: 69 WLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE 128
+++SR + C SG + + + + GLVTGG + C P S PC P+
Sbjct: 59 FVISRIAALVGCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTMCRPYMLAPK 118
Query: 129 CKTLATPQPKCHTRCTNDN-YGRGFF---QDKYQI 159
C+ T Q + D YG+ + QD++ I
Sbjct: 119 CQR--TCQASYNLSLKRDKYYGKSHYYVNQDEFDI 151
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 25/195 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W+ K+G+VTGG +S+ GCQP P C H + T P C PKC
Sbjct: 158 CNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEH-HTTGDRPPCSE-GGGTPKC 215
Query: 140 HTRCTNDNYGRGFFQDKY----------QINGLGLYFDPHFGPFWPAF--WRSFCTKYTR 187
C D Y + QD + ++ + L + GP A + F T +
Sbjct: 216 LKTC-EDGYTVDYTQDLHYGASSYSVHKRMEDIQLEI-MNNGPVEGALTVYEDFPTYKSG 273
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G+ A ++I+GWG E G PYW I +++ +GD G IK+LRG+
Sbjct: 274 VYQHVHGKALGGHA---------IRILGWGVEEGVPYWLIANSWNTDWGDNGYIKLLRGK 324
Query: 248 NEAIIESLVNGALPK 262
+ IES + LPK
Sbjct: 325 DHCGIESQITAGLPK 339
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 85/183 (46%), Gaps = 12/183 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
+ C Y + QDK Y + + P ++ Y L +
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV--EAYLEIYEDFLNYKS 274
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE IE
Sbjct: 275 G-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIE 332
Query: 254 SLV 256
S +
Sbjct: 333 SEI 335
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 87/187 (46%), Gaps = 8/187 (4%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDK----YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
+ C Y + QDK + N L + ++ Y L +G
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y + I +A V+++G G ENG YW +T+ E +G+KG +I+RGRNE +IES
Sbjct: 276 IYRYTTGKYISGHA-VRLIGCGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESE 334
Query: 256 VNGALPK 262
+ L K
Sbjct: 335 IAAGLIK 341
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 89/186 (47%), Gaps = 19/186 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G+ W + GLV+GG+++S+ GC+P PPC H P C T PKC
Sbjct: 152 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+C + Y + QDK Y ++G + GP AF T Y+ L
Sbjct: 210 TKKCES-GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAF-----TVYSDLLS 263
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY + + +A VKI+GWG EN YW I +++ +GD G KILRG +
Sbjct: 264 YKSG-VYKHTQGDALGGHA-VKILGWGVENDNKYWLIANSWNSDWGDNGFFKILRGEDHC 321
Query: 251 IIESLV 256
IES +
Sbjct: 322 GIESSI 327
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 88/186 (47%), Gaps = 18/186 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI W + K G+VT + ++TGC+P FP C H + P C + P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYNTPRC 216
Query: 140 HTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
C Y + QDK++ + + D +GP +F T Y L
Sbjct: 217 KQTCQR-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASF-----TVYEDFLN 270
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + E + ++I+GWG EN PYW I +++ E +G+ G +I+RGR+E
Sbjct: 271 YKSG-IYK-HITGEALGGHAIRIIGWGVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 328
Query: 251 IIESLV 256
IES V
Sbjct: 329 SIESEV 334
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 84/177 (47%), Gaps = 13/177 (7%)
Query: 88 TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-ND 146
W ++ GLV+GG +++N GCQP PP + E C + +C+ T N
Sbjct: 167 VWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLYENTC------EKRCYGNNTINY 220
Query: 147 NYGRGFFQDKYQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEI 205
N ++ Y I + + ++GP AF + F VY + ++E
Sbjct: 221 NQDHVKIKNHYDIEYEDIQREVQNYGPVSMAF-----KVFDNDFFLYKSGVYEKTTNSEF 275
Query: 206 VAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+ + K++GWG ENG YW +V+ +G ++G G KI RG +E IE+ V+ P+
Sbjct: 276 IQWQYAKLIGWGVENGVDYWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 89/192 (46%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---------ANYTTSEPECK 130
C G +W + RG+VTGG+ ++T C+P FP C+H + P+CK
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKCDHFVKGKYRACGDKLYETPQCK 218
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T Q +T D + GF + + + GP ++ Y L
Sbjct: 219 Q--TCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPV-----EAYLEIYEDFLN 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G +Y + I +A V+++GWG ENG YW +T+ E +G+KG +I+RGRNE
Sbjct: 272 YKSG-IYRYTTGQFISGHA-VRLIGWGVENGTAYWLAANTWNEDWGEKGYFRIVRGRNEC 329
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 330 SIESEIAAGLIK 341
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 94/198 (47%), Gaps = 27/198 (13%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G + + WVH G+V+GG+ +S GCQP PC H + + P+C PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVSGPRPKCSE-GGGTPK 204
Query: 139 CHTRCT--------NDNYGRG----FFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
C C +D + G +D+ QI Y + GP AF T Y
Sbjct: 205 CAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIK----YEIMNNGPVEGAF-----TVYV 255
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VY + +A ++++GWGEENG PYW +++ +GD G KILRG
Sbjct: 256 DFLHYKSG-VYQHRHGLPLGGHA-IRVLGWGEENGTPYWLCANSWNTDWGDNGLFKILRG 313
Query: 247 RNEAIIESLVNGALPKDN 264
+ IES ++ LPK N
Sbjct: 314 SDHCGIESEISAGLPKVN 331
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 89/207 (42%), Gaps = 37/207 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGA------HHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
CS G ++W ++H G+V+GG + GC P +FP C H + C
Sbjct: 174 CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYNFPKCAHHQKESDYKPCAKEI 233
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
P C + C N YG F +D++ L F FG T + TN
Sbjct: 234 YDTPSCSSSCPNAKYGTAFDKDRHYTESL---FPSRFGS----------TSSIKKEIMTN 280
Query: 194 GRVYAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGD 237
G A S + ++Y + V+I+GWG E G YW +++++ E++GD
Sbjct: 281 GPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGD 340
Query: 238 KGTIKILRGRNEAIIESLVNGALPKDN 264
GT KI++G + I+ ++ P N
Sbjct: 341 HGTFKIVQG--DCGIDDMILAGTPAIN 365
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 90/196 (45%), Gaps = 26/196 (13%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G + W+ WVHK G+VTGG + S+ GC P C+H T P C P P+
Sbjct: 158 CNGGFPGAAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKSIPPTPR 215
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR 187
C R Y F DK+ G Y P GP F T Y
Sbjct: 216 C-VRMCRKGYNVDFADDKHY--GKKSYSVPSNVTQIQVEIMTNGPVEADF-----TVYAD 267
Query: 188 -PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
PL+++ VY + +A ++++GWG E G PYW +++ ++GDKG KILRG
Sbjct: 268 FPLYKSG--VYQRHTDQALGGHA-IRLLGWGVEKGVPYWLAANSWNTEWGDKGFFKILRG 324
Query: 247 RNEAIIESLVNGALPK 262
+E IE V +P+
Sbjct: 325 SDECGIEDDVVAGIPR 340
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 94/199 (47%), Gaps = 26/199 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDKY------------QINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
C Y + +DK+ + +Y + GP AF + Y+
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKN---GPVEAAF-----SVYSD 258
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L +G V+ E++ V+I+GWG ENG PYW + +++ +GD G KILRG+
Sbjct: 259 FLMYKSGVYQHVTG--EMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQ 316
Query: 248 NEAIIESLVNGALP-KDNY 265
+ IES + +P D Y
Sbjct: 317 DHCGIESEIVAGIPCTDQY 335
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 85/185 (45%), Gaps = 9/185 (4%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
+C D G+ +DK + N + R + T Y L ++G
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG- 270
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + A + +A V+I+GWGE PYW I +++ E +G++G +K LRG NE IE
Sbjct: 271 VYFHALGAPMSGHA-VRILGWGELGNVPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDD 329
Query: 256 VNGAL 260
V L
Sbjct: 330 VTAVL 334
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 90/194 (46%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 136
C+ G + W++ ++GLV+GG + S+ GCQP + PC H T P E KT
Sbjct: 152 CNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGPCNGEGKT----- 206
Query: 137 PKCHTRCT---NDNYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRP 188
PKC +C N Y + F + Y I GP AF T Y
Sbjct: 207 PKCVKKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAF-----TVYEDL 261
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L G VY +A + +A ++I+GWG EN +W I +++ +GD G KILRG +
Sbjct: 262 LNYKEG-VYQHTAGKMLGGHA-IRILGWGVENDTKFWLIANSWNSDWGDNGYFKILRGSD 319
Query: 249 EAIIESLVNGALPK 262
IES + LPK
Sbjct: 320 HLGIESSIAAGLPK 333
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 92/203 (45%), Gaps = 34/203 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC P + PPC H + S P C T P+C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
+ C Y + +DK HFG + ++ S K NG V A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 252
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ ++ + Y + ++I+ WG ENG PYW +++ +GD G KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWLAANSWNLDWGDNGFFKI 312
Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
LRG N IES + +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 95/198 (47%), Gaps = 22/198 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C G + W + G+VTG + +++GC+P FPPC +H+N T EP CK P PK
Sbjct: 190 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 248
Query: 139 CHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPL 189
C+ +C + NY + + DKY + D GP +F YT L
Sbjct: 249 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF-----EVYTDFL 302
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK---GTIKILRG 246
T+G V+ S + VKI+GWG + G YW +++ +G+ G +ILRG
Sbjct: 303 HYTSGIYKHVAGS--VGGGHAVKILGWGIDQGVSYWLAANSWNNDWGEDVFSGYFRILRG 360
Query: 247 RNEAIIESLVNGALPKDN 264
+E IES + +P+ +
Sbjct: 361 ADECGIESGIVAGIPRKD 378
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 87/191 (45%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W + GLV+GG+++S+ GC+P PPC H P C T PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRLP-CSG-DTKTPKC 208
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C + Y + QDK Y + G + GP AF T Y L
Sbjct: 209 VKECES-GYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAF-----TVYADLLS 262
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V+ A + +KI+GWG ENG YW I +++ +GD G KILRG +
Sbjct: 263 YKSGVYKHVTGDA--LGGHAIKIMGWGVENGNKYWLIANSWNSDWGDNGFFKILRGEDHC 320
Query: 251 IIESLVNGALP 261
IES + P
Sbjct: 321 GIESSIVAGEP 331
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 106/242 (43%), Gaps = 31/242 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
+C AV+TA+ L+ +C +S ++HV T G +C + C+
Sbjct: 114 NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGY---------GCNG 162
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G + + K+G VTGG + + +GC+P F PC H T EC AT PKC +
Sbjct: 163 GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 221
Query: 143 CTNDNY-----GRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
C R +D Y++ GP AF T Y + G
Sbjct: 222 CQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAF-----TVYEDFSYYKKG 276
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+Y +A +A +KI+GWG+E G PYW I +++ +G+ G +ILRG N IE
Sbjct: 277 -IYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEE 334
Query: 255 LV 256
V
Sbjct: 335 NV 336
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 107/242 (44%), Gaps = 31/242 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
+C AV+TA+ L+ +C +S ++HV T G +C + C+
Sbjct: 26 NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGYG---------CNG 74
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G + + K+G VTGG + + +GC+P F PC H T EC AT PKC +
Sbjct: 75 GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 133
Query: 143 C-----TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
C + R +D Y++ GP AF T Y + G
Sbjct: 134 CQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAF-----TVYEDFSYYKKG 188
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+Y +A +A +KI+GWG+E G PYW I +++ +G+ G +ILRG N IE
Sbjct: 189 -IYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEE 246
Query: 255 LV 256
V
Sbjct: 247 NV 248
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 83/193 (43%), Gaps = 31/193 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+T CT ND+Y +D ++ LYF+ GPF AF Y+
Sbjct: 211 NTTCTDKAIPLIKYRGNDSYVLLHGEDDFKRE---LYFN---GPFVVAF-----QVYSDF 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L G VS + + V+IVGWG+ NG PYW I +++ +G G ILRG N
Sbjct: 260 LAYKTGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGNN 317
Query: 249 EAIIESLVNGALP 261
E IES LP
Sbjct: 318 ECGIESTGYAGLP 330
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 91/194 (46%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
C+ G+ + W + GLV+GG+++S+ GC+P PPC H N + KT
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207
Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
PKC C + NY + +DK + ++ + GP AF T Y+
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAF-----TVYSD 261
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L G VY + + +A VKI+GWG ENG YW I +++ +GD G KILRG
Sbjct: 262 LLNYKTG-VYKHTIGDALGGHA-VKILGWGVENGNKYWLIANSWNSDWGDNGFFKILRGE 319
Query: 248 NEAIIESLVNGALP 261
+ IES + P
Sbjct: 320 DHCGIESSIVAGEP 333
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 83/199 (41%), Gaps = 34/199 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W G+VTGG + GC+ SF PC H + P C P P C
Sbjct: 154 CNGGMPAMAWLHWTVNGIVTGGNYEDTNGCKAYSFAPCEH-HVDGDLPPCGP-TKPTPDC 211
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
C + G +G DP+ K + TNG V A
Sbjct: 212 KKEC---DSGSSLTYQNDLTHGSNYGIDPY-------------PKQIQTEIMTNGPVEAS 255
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
S + ++Y + +KI+GWG EN PYW + +++ E +GDKG KI
Sbjct: 256 FSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTPYWLVANSWNEDWGDKGYFKI 315
Query: 244 LRGRNEAIIESLVNGALPK 262
LRG NE IE + +P+
Sbjct: 316 LRGSNECGIEGSIVAGIPE 334
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 91/199 (45%), Gaps = 25/199 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--------CKT 131
CS G +++ W ++ K+G+VTGG + SN GCQP PCN A+ T ++P C
Sbjct: 165 CSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCN-ASTTAADPSSVLGPHGVCGG 223
Query: 132 LATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP--------HFGPFWPAFWRSFCT 183
PKC C N + + D + + FD GP+
Sbjct: 224 DPATTPKCDLSCYNARHEGKYLDDIIKAKKV-FTFDGCSARKNLRKHGPYVVTM-----R 277
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y L +G + V + + + +V+++GWG E G+ +W + +++G +GDKG KI
Sbjct: 278 VYEDFLAYKSGVYHHV--TGDYLGLLSVRMIGWGLEGGQAFWLLANSWGTSWGDKGFFKI 335
Query: 244 LRGRNEAIIESLVNGALPK 262
R NE IE+ +P
Sbjct: 336 RRFVNECWIENFRYAGVPN 354
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 86/191 (45%), Gaps = 15/191 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + K G+ TGG++ S GC+P S PC + P C P P C
Sbjct: 157 CAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTC 216
Query: 140 HTRC-------TNDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQ 191
+C + + G D+ + + D GP + Y L
Sbjct: 217 EKKCKPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPV-----EATMEIYDDFLQY 271
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
T G ++ + + + +V+I+GWG G PYW + +++G+++G+ GT ++LRG NE
Sbjct: 272 TTGIYVHLAGNKQ--GHLSVRILGWGMFEGVPYWLLANSWGKEWGENGTFRVLRGVNECG 329
Query: 252 IESLVNGALPK 262
+E+ +PK
Sbjct: 330 LEANCISGMPK 340
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 91/194 (46%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
C+ G+ + W + GLV+GG+++S+ GC+P PPC H N + KT
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207
Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
PKC C + NY + +DK + ++ + GP AF T Y+
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAF-----TVYSD 261
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L G VY + + +A VKI+GWG ENG YW I +++ +GD G KILRG
Sbjct: 262 LLNYKTG-VYKHTIGDALGGHA-VKILGWGVENGNKYWLIANSWNSDWGDNGFFKILRGE 319
Query: 248 NEAIIESLVNGALP 261
+ IES + P
Sbjct: 320 DHCGIESSIVAGEP 333
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 90/194 (46%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 136
C+ G+ + W + GLV+GG+++S GC+P PPC H N + KT
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPGNRVPCNGDSKT----- 210
Query: 137 PKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR 187
PKCH C +Y + +DK Y ++ + GP AF T Y+
Sbjct: 211 PKCHKTCEA-SYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNGPVEGAF-----TVYSD 264
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L NG VY + + +A +KI+GWG ENG Y I +++ +GD G KILRG
Sbjct: 265 LLNYKNG-VYKHTVGNALGGHA-IKILGWGVENGNKYRLIANSWNSDWGDNGFFKILRGE 322
Query: 248 NEAIIESLVNGALP 261
+ IES + P
Sbjct: 323 DHCGIESSIVAGEP 336
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 104/240 (43%), Gaps = 30/240 (12%)
Query: 33 AVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWAWV 92
AV++A+ ++ +C S R A C S ++ C GI S T+
Sbjct: 111 AVSSASVMSDRICIQSDQK---NQLRISAADMIECC--ESCTFSVDGCHGGIPSFTFTEW 165
Query: 93 HKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-------- 144
G V+GG ++S GC P CN P CKTL P C C
Sbjct: 166 KDSGFVSGGEYNSTNGCMSYPLPRCN--------PSCKTLYDA-PTCKKECDKGSPLKYE 216
Query: 145 -NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASA 203
+ +Y + ++ ++ GP +F T Y + +G VY +
Sbjct: 217 EDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASF-----TVYADFIHYLSG-VYKFDGES 270
Query: 204 EIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+++ V+I+GWG ENG PYW + +++ E++GD+G KI RG+NE IE + LP+
Sbjct: 271 KLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 330
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 86/191 (45%), Gaps = 17/191 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +S W + G+V+GG + S GCQP S PC H + P C + P C
Sbjct: 150 CDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAPCEH-HVPGPRPACSGEGS-TPDC 207
Query: 140 HTRC---TNDNYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQ 191
+C + +Y + + + Y + GP AF T Y +
Sbjct: 208 RNQCDKRSGISYDKDLYYGESAYSLEDEAKQIQAEILKNGPVEAAF-----TVYEDLVNY 262
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
G V+ S ++ +KI+GWG EN PYW + +++ +G+ G KILRG++E
Sbjct: 263 KEGVYQHVAGS--VLGGHAIKILGWGVENDTPYWLVANSWNTDWGNNGFFKILRGKDECG 320
Query: 252 IESLVNGALPK 262
IE V+ LP+
Sbjct: 321 IEIDVSAGLPR 331
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 84/194 (43%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTG + + +GC+P +PPC H +C P C
Sbjct: 163 CEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTC 222
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRP 188
+C DNY + +DK Y + G + GP F + F Y+
Sbjct: 223 EYKC-QDNYTISYDEDKHYGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYEDF-EHYSSG 280
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+++ + E V VK++GWG ENG YW +++ +G+ G +ILRG N
Sbjct: 281 IYK--------HMAGEYVGVHAVKMLGWGTENGVDYWICANSWNSDWGENGFFRILRGEN 332
Query: 249 EAIIESLVNGALPK 262
E IES V PK
Sbjct: 333 ECGIESNVVAGKPK 346
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 107/252 (42%), Gaps = 37/252 (14%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHV-ECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
+C A+AT++ A +C ++ + S I +C + C+ G
Sbjct: 112 NCGSCWAIATSSAFADRLCVATNADFNQLLSAEEITFCCHKCGY---------GCNGGYP 162
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQPKCHTR 142
W K GLVTGG + S GC+P PPC + N T S P + H R
Sbjct: 163 IKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCS-------GKPMEQNH-R 214
Query: 143 CTNDNYGRGFF---------QDKYQINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQT 192
CT YG +D Y + + D +GP +F Y L
Sbjct: 215 CTRMCYGDQDLDFDDDHRHTRDSYYLTIGSIQKDVMTYGPIEASF-----DVYDDFLSYK 269
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
+G VY S +A + VK++GWGEE G PYW +++++ +GD+G KI RG NE +
Sbjct: 270 SG-VYVRSENASYLGGHAVKLIGWGEEYGTPYWLMMNSWNADWGDEGLFKIRRGTNECGV 328
Query: 253 ESLVNGALPKDN 264
++ +P N
Sbjct: 329 DNSTTAGVPVTN 340
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 86/181 (47%), Gaps = 24/181 (13%)
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
WV GLV+G ++S+ GC+P F PC++ + E K PKC C N Y R
Sbjct: 173 WV-DAGLVSGAPYNSSEGCKPYPFEPCSYP-FVGCHHEKK-----NPKCLHHCIN-GYDR 224
Query: 151 GFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
+ +DK Y+I GP F + + + VY
Sbjct: 225 KYRKDKFFGATAYKIPNDARMIQLEIMTNGPVATGF------EVFEDFYFYHSGVYKHVV 278
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
++ +A ++IVGWG ENG PYW I +++G+ +GDKG K+LRG N IES V LP
Sbjct: 279 GKKVGMHA-IRIVGWGTENGTPYWLIANSYGDTWGDKGFFKMLRGSNHLGIESTVIAGLP 337
Query: 262 K 262
+
Sbjct: 338 Q 338
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 19/182 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W + GLV+GG ++S+ GC+P PPC H P C T PKC
Sbjct: 112 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 169
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C + +Y F +DK Y ++G + GP AF T Y+ L
Sbjct: 170 QKNCES-SYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAF-----TVYSDLLS 223
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
NG VY + + +A +KI+GWG EN YW I +++ +GD G KILRG +
Sbjct: 224 YKNG-VYKHTEGNALGGHA-IKIIGWGVENNNKYWLIANSWNSDWGDNGFFKILRGEDHC 281
Query: 251 II 252
I
Sbjct: 282 GI 283
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 86/194 (44%), Gaps = 23/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + GLVTG +++ C+P SFPPC H +P TPQ C
Sbjct: 157 CQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPYSFPPCEHHVVGPRKPCTGDPTTPQ--C 214
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
+C + Y + + DK Y + ++ D A R T PL + + VYA
Sbjct: 215 VKKCQPE-YPKTYENDKWYGLKAYSIHSDQE------AIMRDLMT--YGPL-EVDFEVYA 264
Query: 199 VSASAEIVAY----------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
S Y V++VGWG E+G YW I +++ +GD G KI RG N
Sbjct: 265 DFPSYSSGVYRHVAGGLLGGHAVRLVGWGVEDGADYWLIANSWNTDWGDGGYFKIRRGVN 324
Query: 249 EAIIESLVNGALPK 262
E IES N PK
Sbjct: 325 ECGIESDANAGHPK 338
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 89/184 (48%), Gaps = 19/184 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W + GLV+GG ++S+ GC+P PPC H P C T PKC
Sbjct: 113 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 170
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C + +Y F +DK Y ++G GP AF T Y+ L
Sbjct: 171 EKTCES-SYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAF-----TVYSDLLS 224
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY + + +A +KI+GWG ENG YW I +++ +GD G +KILRG +
Sbjct: 225 YKSG-VYQHTHGNALGGHA-IKILGWGVENGSKYWLIANSWNSDWGDNGFLKILRGEDHC 282
Query: 251 IIES 254
IES
Sbjct: 283 GIES 286
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 93/194 (47%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ RG+V+GG+++S GC+P PC H + P C + +TP C
Sbjct: 157 CNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEH-HVDGPRPPCHSGSTPH--C 213
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+C NY + +DK Y IN GP AF T Y +
Sbjct: 214 KHQC-QPNYSVDYEKDKHFGASSYSINRNPRNIQREIMTNGPVEGAF-----TVYEDLIL 267
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGE--ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
G VY ++ +A ++I+GWG E+ PYW I +++ +GD G +ILRG++
Sbjct: 268 YKTG-VYQHVHGKQLGGHA-IRIIGWGVWGESKVPYWLIANSWNTDWGDNGFFRILRGKD 325
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 HCGIESQISAGLPK 339
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 88/207 (42%), Gaps = 37/207 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGA------HHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
CS G ++W ++H G+V+GG + GC P SFP C H + C
Sbjct: 53 CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEI 112
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
P C + C N YG F +D++ L F FG T + TN
Sbjct: 113 YDTPSCSSSCPNAKYGTAFDKDRHYTESL---FPSRFGS----------TSSIKKEIMTN 159
Query: 194 GRVYAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGD 237
G A S + ++Y + V+I+GWG E G YW +++++ E++GD
Sbjct: 160 GPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGD 219
Query: 238 KGTIKILRGRNEAIIESLVNGALPKDN 264
GT KI++G + I+ + P N
Sbjct: 220 HGTFKIVQG--DCGIDDTILAGTPAMN 244
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 86/192 (44%), Gaps = 23/192 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 138
C G + + G+VTGG ++ GC P SFPPC + S P CKT
Sbjct: 165 CQGGYTIEAMKYWMNSGVVTGG-DYNGAGCMPYSFPPCKKSPCVEFSTPSCKT------T 217
Query: 139 CHTRCTNDNY--GRGFFQDKYQINGLG------LYFDPHFGPFWPAFWRSFCTKYTRPLF 190
C + T +Y + F Y+++ Y H GP A +R F +
Sbjct: 218 CQEKYTTADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPV-EASYRVF-----EDFY 271
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
Q VY S +V VKI+GWG ENG YW + +++G FG+KG KI RG NE
Sbjct: 272 QYKSGVYH-HVSGNLVGGHAVKIIGWGTENGVDYWLVANSWGTSFGEKGFFKIRRGTNEC 330
Query: 251 IIESLVNGALPK 262
IES + L K
Sbjct: 331 QIESNIVAGLAK 342
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 93/198 (46%), Gaps = 27/198 (13%)
Query: 80 CSSGISSSTWA-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G + + WVH G+V+GG+ +S GCQP PC H + P+C + PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKC-SEGGGTPK 204
Query: 139 CHTRCT--------NDNYGRG----FFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
C C +D + G +D+ QI Y GP AF T Y
Sbjct: 205 CAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIK----YEIMKNGPVEGAF-----TVYV 255
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VY + +A ++++GWGEENG PYW +++ +GD G KILRG
Sbjct: 256 DFLHYKSG-VYQHRHGLPLGGHA-IRVLGWGEENGTPYWLCANSWNTDWGDNGLFKILRG 313
Query: 247 RNEAIIESLVNGALPKDN 264
+ IES ++ LPK N
Sbjct: 314 SDHCGIESEISAGLPKLN 331
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 87/192 (45%), Gaps = 23/192 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W W + G+V+ ++GC P +FP C+H T CK +P P C
Sbjct: 202 CDGGQPDSAWRWFSEHGVVS----ELDSGCWPYNFPECSHHVETKGMEPCKG-NSPSPVC 256
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAFWRSFCTKYTRPLF 190
T C N ++ F D++ G D GP AF T Y L+
Sbjct: 257 STTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNGPVAAAF-----TVYEDFLY 311
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY +E+ +A VKI+GWG + YW +++++ +GD+G KI G E
Sbjct: 312 YKSG-VYKHVNGSELGGHA-VKIIGWGTDQNEQYWLVMNSWNVNWGDQGIFKIAIG--EC 367
Query: 251 IIESLVNGALPK 262
I+S V +PK
Sbjct: 368 GIDSEVTAGIPK 379
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 92/200 (46%), Gaps = 31/200 (15%)
Query: 80 CSSGISSSTW-AWVHK---RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP 135
C G ++ W W K G+VTGG + SN GCQP + P C+H E + +TP
Sbjct: 153 CDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPGPYENCSGSQSTP 212
Query: 136 QPKCHTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPFWPAFWRSFC 182
C C + +Y + + DK+ I + P G F + + F
Sbjct: 213 S--CKRSCIS-SYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAF--SVYADFP 267
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
T YT ++Q + + +KI+GWG ENG PYW + +++ +GD G K
Sbjct: 268 T-YTSGVYQ--------HTTGSFLGGHAIKILGWGTENGVPYWLVANSWNPSWGDSGFFK 318
Query: 243 ILRGRNEAIIESLVNGALPK 262
I+RG++E IES + +P+
Sbjct: 319 IIRGKDECGIESSIVAGMPE 338
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 110/245 (44%), Gaps = 38/245 (15%)
Query: 27 SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
+C AV+TA+ L+ +C S +MH+ +S F++ + C++ C
Sbjct: 118 NCGSCWAVSTASALSDRICIESNGETQMHI--SSIDFVSCC-ESCSY---------GCDG 165
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G + + G VTGG + S GC+P F PC H T EC A PKC R
Sbjct: 166 GWPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRR 224
Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQ 191
C +Y + ++ DK G Y PH GP AF T Y +
Sbjct: 225 CQR-SYKKAYYMDKSY--GEDAYEVPHSVKAIQREIMKNGPVVGAF-----TVYEDFSYY 276
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
G +Y +A +A +KI+GWG EN PYW I +++ +G++G +++RG NE
Sbjct: 277 KKG-IYKHTAGQARGGHA-IKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECG 334
Query: 252 IESLV 256
IE V
Sbjct: 335 IEQEV 339
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 83/180 (46%), Gaps = 12/180 (6%)
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G + W + K G+VTG + ++T CQP FP C H + P C P C
Sbjct: 139 GFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEH-HTKGKYPACFEEIYKTPNCENT 197
Query: 143 CTNDNYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
C +Y + QDK Y + + P + Y L +G +
Sbjct: 198 CQK-SYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYGPV--EANFIVYEDFLNYKSG-I 253
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
Y + ++V++ ++I+GWG EN PYW I +++ E +G+ G +ILRGR+E IES V
Sbjct: 254 YK-HITGKLVSWHAIRIIGWGVENNTPYWLIPNSWNEDWGENGNFRILRGRHECSIESEV 312
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 87/197 (44%), Gaps = 26/197 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN----YTTSEPECKTLATP 135
C G W ++ G+VTGG ++ + C+P SFPPC+H N Y+ E + L
Sbjct: 145 CDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTEV 204
Query: 136 QPKCHTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH--------FGPFWPAF--WRSFCT 183
P C +C + + R + DK + N L D GP F + F
Sbjct: 205 TPSCTKKC-HPQFSRTYDVDKIRSRENPYKLIKDQEQIKNEIYLNGPVQAVFTVFDDFLN 263
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ QT G+ A VKI+GWG ENG PYW ++++ + +G G KI
Sbjct: 264 YKSGVYQQTTGQRRGKHA---------VKIIGWGTENGVPYWEAINSWNDGWGINGKFKI 314
Query: 244 LRGRNEAIIESLVNGAL 260
LRG N IE V ++
Sbjct: 315 LRGFNHLDIEGEVYASI 331
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 93/190 (48%), Gaps = 19/190 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W ++ + G+VTGG ++S+ GC P C+H +P CK P P+C
Sbjct: 155 CNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLPYEIKACDHHVVGKLQP-CKGDG-PTPRC 212
Query: 140 HTRCT---NDNYGRGFFQDK--YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTR-PLFQ 191
C N+ Y + K + + G+ GP AF T Y+ P ++
Sbjct: 213 KKECESGYNNTYSKDEHHAKTVHAVEGVEQIMTEIMTNGPVEAAF-----TVYSDFPTYK 267
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+ VY + + +A +K +GWG E+G+ YW + +++ +GD G KILRGR+E
Sbjct: 268 SG--VYEHKSGGPLGGHA-IKTLGWGNEDGKDYWLVANSWNPDWGDNGFFKILRGRDECG 324
Query: 252 IES-LVNGAL 260
IES +V G +
Sbjct: 325 IESNIVAGMM 334
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 84/191 (43%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W + GLV+GG ++S+ GC P PPC H P C T PKC
Sbjct: 32 CNGGMPTLAWEYWKHMGLVSGGNYNSSQGCSPYVIPPCEHHVPGNRLP-CNG-DTKTPKC 89
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C N Y + +DK Y + G + GP AF T Y L
Sbjct: 90 SKTCEN-GYNVLYKKDKRYGKHVYAVRGGEDHIKAELFKNGPVEAAF-----TVYADLLA 143
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G V A + +KI+GWG ENG YW I +++ +G+ G KILRG +
Sbjct: 144 YKSGVYKHVEGDA--LGGHAIKIIGWGVENGNKYWLIANSWNTDWGNNGFFKILRGEDHC 201
Query: 251 IIESLVNGALP 261
IES + P
Sbjct: 202 GIESSIVAGEP 212
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 110/247 (44%), Gaps = 29/247 (11%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A ++ VC S +V +FRF A C + C+ G
Sbjct: 112 SCGSCWAFGAVEAMSDRVCIHSNGNV---NFRFSADDLVSCCHTCG-----FGCNGGFPG 163
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC--- 143
+ W++ ++G+V+GG + S TGC+P PC H T P C + PKC +C
Sbjct: 164 AAWSYWTRKGIVSGGRYGSKTGCRPYEIAPCEHHVNGTRAP-CNH-DSKTPKCQHQCEAG 221
Query: 144 ------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
+ ++G + + + + + GP AF T Y + +G VY
Sbjct: 222 YNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTN-GPVEGAF-----TVYEDLILYKSG-VY 274
Query: 198 AVSASAEIVAYATVKIVGWGE--ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
E+ +A ++I+GWG + PYW I +++ + +GDKG +ILRG + IES
Sbjct: 275 QHEHGKELGGHA-IRILGWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIESS 333
Query: 256 VNGALPK 262
++ LPK
Sbjct: 334 ISAGLPK 340
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 109/245 (44%), Gaps = 38/245 (15%)
Query: 27 SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
+C AV+TA+ L+ +C S +MH+ +S F++ + C + C
Sbjct: 118 NCGSCWAVSTASALSDRICIESNGETQMHI--SSIDFVSCC-ESCGY---------GCDG 165
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G + + G VTGG + S GC+P F PC H T EC A PKC R
Sbjct: 166 GWPILAFDFYTYEGAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRR 224
Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQ 191
C +Y + ++ DK G Y PH GP AF T Y +
Sbjct: 225 CQR-SYKKAYYMDKSY--GEDAYEVPHSVKAIQREIMKNGPVVGAF-----TVYEDFSYY 276
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
G +Y +A +A +KI+GWG EN PYW I +++ +G++G +++RG NE
Sbjct: 277 KKG-IYKHTAGQARGGHA-IKIIGWGVENDVPYWLIANSWHNDWGEEGYFRMIRGINECG 334
Query: 252 IESLV 256
IE V
Sbjct: 335 IEQEV 339
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 86/193 (44%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGG+ + TGC+ FP C+H + P C P C
Sbjct: 149 CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207
Query: 140 HTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
+C T+ + K + N + + GP AF + F +
Sbjct: 208 VQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMIN-GPVEAAFQVYEDFLGYKSGVY 266
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
F ++G + A ++I+GWGEENG YW I +++ + +G+ G K+LRG+NE
Sbjct: 267 FHSDGTLLGGHA---------IRILGWGEENGVAYWLIANSWNDGWGEDGYFKMLRGKNE 317
Query: 250 AIIESLVNGALPK 262
IE V LP+
Sbjct: 318 CGIEDEVTAGLPE 330
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 94/206 (45%), Gaps = 20/206 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C +G W + G+VTGG+ +GC+ FP C H P C P P+C
Sbjct: 120 CGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRR-KGRYPPCPRHIYPTPEC 178
Query: 140 HTRCTNDNYGRGFFQDKYQIN--------GLGLYFDPHF-GPFWPAFWRSFCTKYTRPLF 190
+C D + +DK + N + + + GP +F
Sbjct: 179 IKQC--DEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPVEASF------GIYADFL 230
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+ NG VY I +A ++I+GWGE++G PYW I +++ E +G+KG ++ LRG NE
Sbjct: 231 EYNGGVYFHCWGGPISRHA-IRILGWGEDDGVPYWLIANSWNEDWGEKGYVRFLRGHNEC 289
Query: 251 IIESLVNGALPKDNYGVEFGEESGER 276
IE V A+P D + + ++S R
Sbjct: 290 GIEEEVT-AVPIDWFLRQMIKQSTLR 314
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 36/52 (69%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
++I+GWGEE+G PYW + +++ E +G+KG +++LR RNE I V LP
Sbjct: 895 IRILGWGEEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPD 946
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 28/64 (43%), Gaps = 1/64 (1%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + G+VTGG+ TGC+ FP C H P C P P+C
Sbjct: 708 CRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 766
Query: 140 HTRC 143
RC
Sbjct: 767 IKRC 770
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 85/214 (39%), Gaps = 36/214 (16%)
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
C L+S W C W GL TGG ++ GC+P S PC+ +AN TT
Sbjct: 154 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTT 213
Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
S P C TP C CT N + + QDK HFG +
Sbjct: 214 SVP-CPGYHTP--TCEEHCTSNITWPIAYKQDK------------HFGKAHYNVGKKMTD 258
Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
TNG V A V + + KI+GWG +NG PYW
Sbjct: 259 IQIE--IMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLC 316
Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V +G FG+ G ++ LRG NE IE V ALP
Sbjct: 317 VHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 84/191 (43%), Gaps = 15/191 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + K GL TGG++ S GC+P S PC+ + P C P C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248
Query: 140 HTRCT-------NDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQ 191
+C + + G D+ + + D GP S + Q
Sbjct: 249 EKKCKSGYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPI------SATMEVYDDFLQ 302
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+Y V + + +V+I+GWG G PYW + +++G+Q+G+ GT ++LRG NE
Sbjct: 303 YTTGIY-VHLTGNKQGHLSVRILGWGMYEGVPYWLLANSWGKQWGENGTFRVLRGVNECG 361
Query: 252 IESLVNGALPK 262
+E+ +P+
Sbjct: 362 LEANCVSGMPR 372
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 87/192 (45%), Gaps = 31/192 (16%)
Query: 82 SGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 141
G S W V GLV+G A++S GC+P F PC + + PE P C
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNSTDGCKPYPFKPCLYP-FVGCHPE------KTPSCTH 209
Query: 142 RCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLF 190
CT + Y + +DKY G Y P+ GP F + L+
Sbjct: 210 HCT-EGYDGTYRRDKYY--GSAAYKLPNDERMIQLEIMTNGPVESGF------SVYQDLY 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
VY E+ +A V+++GWG+E G PYW I +++GE +G+ G K LRG N
Sbjct: 261 LYKTGVYQHVVGREVGKHA-VRLIGWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHL 319
Query: 251 IIESLVNGALPK 262
IES+V LPK
Sbjct: 320 GIESVVIAGLPK 331
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 87/180 (48%), Gaps = 19/180 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C G + W + G+VTG + +++GC+P FPPC +H+N T EP CK P PK
Sbjct: 146 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 204
Query: 139 CHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPL 189
C+ +C + NY + + DKY + D GP +F YT L
Sbjct: 205 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASF-----EVYTDFL 258
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
T+G V+ S + VKI+GWG + G YW +++ +G+ G +ILRG +E
Sbjct: 259 HYTSGIYKHVAGS--VGGGHAVKILGWGIDQGVSYWLAANSWNNDWGEDGYFRILRGADE 316
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 85/198 (42%), Gaps = 29/198 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGG------AHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G++ W +++K G+ TGG + + GC P +FP C H + C +
Sbjct: 157 CKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPYNFPRCAHYQKKSKYGPCPKKS 216
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR------ 187
P C RC N+ YG +D++ F P+W RS + +
Sbjct: 217 YETPSCLDRCPNEKYGTPLDKDRH--------FTARAVPYWFNGIRSIKKEIMKHGPTSA 268
Query: 188 ------PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
F VY ++ A V + TV+++GWG E G YW + + E++ D GT
Sbjct: 269 SFFTYEDFFSYKSGVYKYTSGA-YVEFHTVELIGWGTEKGVDYWLAKNDWNEEWADLGTF 327
Query: 242 KILRGRNEAIIESLVNGA 259
KI +G + I LV GA
Sbjct: 328 KIAQG--DCGINDLVLGA 343
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 85/201 (42%), Gaps = 32/201 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G+ + W+++ G+ T G+ + GC P +FP C H + C P C
Sbjct: 133 CKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKKSKYEPCSKKLYDTPSC 192
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
RC N+ YG +D+ HF P + T + TNG A
Sbjct: 193 LDRCPNEKYGIPLDKDR------------HFTAHSPDLFEG--TDNIKKEIMTNGPTSAT 238
Query: 200 -SASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
S + V+Y +V+I+GWG E G YW +++++ E +GD GT KI
Sbjct: 239 FSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKI 298
Query: 244 LRGRNEAIIESLVNGALPKDN 264
+G + I+ V G+ P N
Sbjct: 299 AQG--DCGIDDAVLGSPPAMN 317
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 86/193 (44%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGG+ + TGC+ FP C+H + P C P C
Sbjct: 40 CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 98
Query: 140 HTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
+C T+ + K + N + + GP AF + F +
Sbjct: 99 VQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMIN-GPVEAAFQVYEDFLGYKSGVY 157
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
F ++G + A ++I+GWGEENG YW I +++ + +G+ G K+LRG+NE
Sbjct: 158 FHSDGTLLGGHA---------IRILGWGEENGVAYWLIANSWNDGWGEDGYFKMLRGKNE 208
Query: 250 AIIESLVNGALPK 262
IE V LP+
Sbjct: 209 CGIEDEVTAGLPE 221
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 88/195 (45%), Gaps = 20/195 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTG + +++GC+P FPPC H N T CK P PKC
Sbjct: 205 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 264
Query: 140 HTRCTNDNYGRGFFQDKYQ-------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLF 190
+C + NY + + DKY N + L GP +F YT L
Sbjct: 265 DRQC-DKNYKKPYKADKYYGEQAYNVENDVELIQKEIMTLGPVEASF-----EVYTDFLH 318
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK---GTIKILRGR 247
G V+ S + VKI+GWG + G YW +++ +G+ G +ILRG
Sbjct: 319 YIGGIYKHVAGS--VGGGHAVKILGWGIDQGVSYWLAANSWNTDWGEDVFSGYFRILRGV 376
Query: 248 NEAIIESLVNGALPK 262
+E IES + +P+
Sbjct: 377 DECGIESGIVAGIPR 391
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 80/187 (42%), Gaps = 9/187 (4%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W G+VTGG+ TGC+ FP C H P C P P+C
Sbjct: 155 CRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 213
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
RC D + +DK + N + R + Y L +G
Sbjct: 214 IKRC--DTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGV 271
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+ V + ++I+GWGEE+G PYW + +++ E +G+KG +++LR RNE I
Sbjct: 272 YFHVWGGH--LGEHGIRILGWGEEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQ 329
Query: 256 VNGALPK 262
V LP
Sbjct: 330 VTAGLPD 336
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 101/244 (41%), Gaps = 24/244 (9%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A+AT + ++ +C S +FR C + + C G
Sbjct: 113 SCGSCWAIATTSVMSDRLCIGSN---GVMNFRLSGLDMLSCCAICG-----FACQGGYPG 164
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+ WA+ ++GLV+GG + S GCQP + PC+H+ S P C +C C
Sbjct: 165 AAWAYWARKGLVSGGDYGSQQGCQPYTIEPCDHSG-NGSRPVCTVGGG--VRCQHLC-EP 220
Query: 147 NYGRGFFQDK------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
+Y F +DK Y I+ L P ++ T Y L G Y +
Sbjct: 221 SYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNGPV--QAILTVYEDFLSYKTGVYYHL- 277
Query: 201 ASAEIVAYATVKIVGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
E V V+I+GWG + PYW + +++G +GD G I RG N IE +
Sbjct: 278 -EGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYIMA 336
Query: 259 ALPK 262
LPK
Sbjct: 337 GLPK 340
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 87/186 (46%), Gaps = 19/186 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ + W + G+V+GG+++S GC P PPC H P C T PKC
Sbjct: 153 CNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHHVPGNRLP-CNG-DTKTPKC 210
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y F +DK Y ++G GP AF T Y+ L
Sbjct: 211 QKTCEA-GYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNGPVEGAF-----TVYSDLLS 264
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY + + + +A VKI+GWG ENG YW I +++ +GD G KILRG +
Sbjct: 265 YKSG-VYQHTDGSALGGHA-VKILGWGVENGSKYWLIANSWNSDWGDNGFFKILRGEDHC 322
Query: 251 IIESLV 256
IES +
Sbjct: 323 GIESSI 328
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/193 (32%), Positives = 83/193 (43%), Gaps = 31/193 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CKGGAPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+T CT N++Y +D Y+ LYF+ GPF F Y+
Sbjct: 211 NTTCTDKAIPLIKYRGNNSYMLLNGEDDYKRE---LYFN---GPFVVDF-----GVYSDF 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L G VS +++ V+IVGWG+ NG PYW I +++ +G G ILRG N
Sbjct: 260 LAYKTGVYRHVSG--DVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGNN 317
Query: 249 EAIIESLVNGALP 261
E IES LP
Sbjct: 318 ECGIESTGYAGLP 330
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 85/192 (44%), Gaps = 30/192 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPGTAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210
Query: 140 HTRCTND---------NYGRGFF-QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+T CT+ N+ G +D Y+ LYF+ GPF AF Y+ L
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYGLDGEDDYKRE---LYFN---GPFVVAF-----QVYSDFL 259
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
G VS +++ V+IVGWG+ NG PYW I +++ +G G ILRG++E
Sbjct: 260 AYKTGVYRHVSG--DVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGKDE 317
Query: 250 AIIESLVNGALP 261
IES LP
Sbjct: 318 CGIESEGYAGLP 329
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 94/194 (48%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ +G+V+GG++ S GC+P PC H + + P C + +T P+C
Sbjct: 155 CNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPYEVEPCEH-HVNGTRPPCHSGST--PRC 211
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+C + Y + +DK Y +N L GP AF T Y +
Sbjct: 212 MHKCES-GYSVDYAKDKHFGAKAYSVNRNPLDIQREIMTNGPVEGAF-----TVYEDLIL 265
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
G VY ++ +A ++I+GWG +N PYW I +++ +GD G +ILRG +
Sbjct: 266 YKTG-VYQHVHGRQLGGHA-IRILGWGVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGED 323
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 324 HCGIESAISAGLPK 337
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 89/191 (46%), Gaps = 18/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W G+VTGG ++S GCQP C+H +P CK P+C
Sbjct: 28 CNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAACDHHVVGKLKP-CKGDGK-TPRC 85
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
+C Y F DK Y ++ + + GP AF T Y+ Q
Sbjct: 86 EKKCEA-GYNVTFKDDKHYGQRSYSVSSVNDIMEELVTRGPVEAAF-----TVYSD-FLQ 138
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+ VY + + + +A VKI+G+G ENG YW + +++ +GD+G KILRG +E
Sbjct: 139 YHSGVYRHTTGSALGGHA-VKILGYGVENGDKYWLVANSWNPDWGDQGFFKILRGVDECG 197
Query: 252 IESLVNGALPK 262
IE + PK
Sbjct: 198 IEGQIVAGEPK 208
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 90/191 (47%), Gaps = 18/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++ W + +G+VTGG + SN GCQP S C H +P C + P P C
Sbjct: 160 CNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKP-CGDIV-PTPAC 217
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQ 191
C Y + DK Y + G+ GP AF T Y+ L
Sbjct: 218 KRSC-RQGYNVTYPNDKHFGASSYGVRGVDQIATEIMTNGPVEAAF-----TVYSDFLSY 271
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+G VY ++ + +A +KI+GWG ++G YW + +++ + +G+ G I +G +E
Sbjct: 272 KSG-VYQHTSGQPLGGHA-IKIIGWGVQDGTDYWIVANSWNDSWGNDGFFWIKKGTDECG 329
Query: 252 IESLVNGALPK 262
IES V LPK
Sbjct: 330 IESQVVAGLPK 340
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 85/193 (44%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG+ + TGC+ FP C+H + P C P C
Sbjct: 149 CQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207
Query: 140 HTRC--------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
+C T+ + K + N + + GP AF + F +
Sbjct: 208 VQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMIN-GPVEAAFQVYEDFLGYKSGVY 266
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
F ++G + A ++I+GWGEENG YW I +++ + +G+ G K+LRG+NE
Sbjct: 267 FHSDGTLLGGHA---------IRILGWGEENGVAYWLIANSWNDGWGEDGCFKMLRGKNE 317
Query: 250 AIIESLVNGALPK 262
IE V LP+
Sbjct: 318 CGIEDEVTAGLPE 330
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 90/198 (45%), Gaps = 26/198 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G S W+WVH G+ TGG + + GC P FPPC H P C A
Sbjct: 211 CRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYDFPPCAHFFKDPKYPACPKFA 270
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKY-QINGLGLYFDPHF--------GPFWPAFWRSFCTK 184
+C ++ + +F D+Y + + +F GP F+
Sbjct: 271 RVNLRCVSKLRH--MMVVYFSDRYFMVESVPYHFSADDAKNAIRTDGPVSATFYV----- 323
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
Y L +G VY ++ + + A+A VKI+GWGE+ G YW +V+++ E +GD G KI
Sbjct: 324 YEDFLAYKSG-VYKHTSGSLLGAHA-VKIIGWGEDGGEAYWLVVNSWNEGWGDHGLFKIA 381
Query: 245 RGRNEAIIESLVNGALPK 262
G + I++ + G PK
Sbjct: 382 LG--DCGIDNELLGGTPK 397
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 86/196 (43%), Gaps = 9/196 (4%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S + G VTGG + + GC P SF PC ++ P CKT K
Sbjct: 162 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCKTTCQSSYKT 220
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-HFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
+ +YG ++ + + + H+GP ++ K + VY
Sbjct: 221 EEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASY------KVYEDFYHYKSGVYH 274
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
+ S ++V VKI+GWG ENG YW I +++G FG+KG KI RG NE IE V
Sbjct: 275 YT-SGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVA 333
Query: 259 ALPKDNYGVEFGEESG 274
+ K E E+ G
Sbjct: 334 GIAKLGTHSETYEDDG 349
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 89/213 (41%), Gaps = 39/213 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTG---------------------CQPVSFPPCN 118
C+ G SS W + GLV+GG + S+ G C+P + PPC
Sbjct: 148 CNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCE 207
Query: 119 HANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF-- 170
H + S P C P+C RC Y + QDK Y ++
Sbjct: 208 H-HVNGSRPSCSGEGGDTPECIFRC-EAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYK 265
Query: 171 -GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVS 229
GP AF T Y + +G VS SA + +K++GWGEENG PYW +
Sbjct: 266 NGPVEGAF-----TVYEDFVLYKSGVYQHVSGSA--LGGHAIKMLGWGEENGVPYWLCAN 318
Query: 230 TFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
++ +GD G KILRG + IES + PK
Sbjct: 319 SWNTDWGDNGFFKILRGADHCGIESEIVAGNPK 351
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 87/187 (46%), Gaps = 14/187 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHA---NYTTSEPECKTLATPQ 136
C G W + +G+ TGG + + GC P PPC + N +P + P+
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPK 213
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFD--PHFGPFWPAFWRSFCTKYTRPLFQTNG 194
C+ + T N R + +Y IN + +GP +F Y +G
Sbjct: 214 T-CYGKTTVQN--RYKTKSEYSINSIKTIEQDLKTYGPVEASF-----DVYDDFSVYKSG 265
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+Y + A+ ++KI+GWG+ENG YW V+++ + +G+ GT KI++GRNE IE
Sbjct: 266 -IYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAVNSWSKFWGEHGTFKIIKGRNECGIER 324
Query: 255 LVNGALP 261
V +P
Sbjct: 325 AVTAGIP 331
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 84/195 (43%), Gaps = 26/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++GLV+GG S+ GC+P + PC H P CK TP KC
Sbjct: 152 CDGGAPGAGWKHWIEKGLVSGGPFGSDQGCRPYTIEPCVHVENGAQSP-CKDSITP--KC 208
Query: 140 HTRCT---------NDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
+C + ++G+ + D+ QI P F + F + Y
Sbjct: 209 IKKCLPGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPVEATF--TVFDDFAS-YKH 265
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
++Q S + V+I+GWG ENG YW +++ +GD G KILRG
Sbjct: 266 GIYQ--------HTSGNLAGEHAVRILGWGVENGTKYWLAANSWNSDWGDNGYFKILRGS 317
Query: 248 NEAIIESLVNGALPK 262
N IES + LPK
Sbjct: 318 NHVDIESAIVAGLPK 332
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 100/248 (40%), Gaps = 30/248 (12%)
Query: 27 SCIEARAVATATPLAFAVC-RSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGIS 85
SC AVA ++ +C S H+ R AG C L + C G
Sbjct: 137 SCASCWAVAPTDVMSDRICIHSGSRHI----VRLSAGNLLSCCKLCGK-----GCKGGFP 187
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPKCHTRCT 144
W K G+VTGG++ S+ GCQ F PC S + +C +C C
Sbjct: 188 GGAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLECRETCR 247
Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV----- 199
+Y + + QD Y G +Y P+ A P+ Q N R+Y
Sbjct: 248 T-SYNKSYKQDLYY--GESVYRIPN-----DARAIQLEIMENGPV-QANLRIYEDFLHYK 298
Query: 200 -----SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+ + Y VKI GWG E G PYW + + +++G+ G KILRG N A IE
Sbjct: 299 FGVYRHVHGQGLEYHAVKIFGWGTEGGTPYWLAANPWSKRWGNGGFFKILRGSNHAEIED 358
Query: 255 LVNGALPK 262
V +PK
Sbjct: 359 HVMAGIPK 366
>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 145
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 71/152 (46%), Gaps = 14/152 (9%)
Query: 116 PCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD-----KYQINGLGLYFDPH- 169
PC H P C P+C +C N +YG + +D +Y+I G + +
Sbjct: 1 PCQHTESAVENP-CSNKTFFTPECKVQCYNPDYGTRYVKDNHKGTQYRIPGYTAMKEIYE 59
Query: 170 FGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVS 229
GP +F+ + VYA + S + V VKI+GWGEENG PYW +
Sbjct: 60 NGPITASFYMY------QDFVNYQSGVYAFN-SGKYVTTQAVKILGWGEENGTPYWLAAN 112
Query: 230 TFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+F +GD G +KILRG NE IE + LP
Sbjct: 113 SFNTYWGDNGFVKILRGANECYIEEFMYAGLP 144
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 86/191 (45%), Gaps = 16/191 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL-ATPQP 137
C G + W +G+VTGG SN GCQP PC+H Y S C +L T
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLF 190
C +C N NY + D ++ + + + + P +F Y +
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMG 249
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
G +Y S + E++ Y VK++GWG + +G YW ++++ +G+ G KILRG N
Sbjct: 250 YKEG-IYK-STTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGLFKILRGYNF 307
Query: 250 AIIESLVNGAL 260
IE LV +
Sbjct: 308 CSIELLVMAGI 318
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 87/192 (45%), Gaps = 31/192 (16%)
Query: 82 SGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 141
G S W V GLV+G A+++ GC+P F PC + + PE P C
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNNTDGCKPYPFKPCLYP-FVGCHPE------KTPSCTH 209
Query: 142 RCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLF 190
CT + Y + +DKY G Y P+ GP F + L+
Sbjct: 210 HCT-EGYDGTYRRDKYY--GSAAYKLPNDERMIQLEIMTNGPVESGF------SVYQDLY 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
VY E+ +A V+++GWG+E G PYW I +++GE +G+ G K LRG N
Sbjct: 261 LYKTGVYQHVVGREVGKHA-VRLIGWGKERGVPYWLIANSYGEDWGEHGYFKFLRGSNHL 319
Query: 251 IIESLVNGALPK 262
IES+V LPK
Sbjct: 320 GIESVVIAGLPK 331
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 80/197 (40%), Gaps = 31/197 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
CS G W WV K G+ TGG + + C+P +F PC + C + P P+C
Sbjct: 155 CSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRC 214
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
C Y + + +DK+ ++W K R NG V A
Sbjct: 215 EKFCQR-GYIKPYKKDKFYAK--------------KSYWLPNDEKEIRLDIMKNGPVQAA 259
Query: 200 SASAEIVAY----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
E VKI+GWG++NG YW I +++ + +G+ G ++
Sbjct: 260 FDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKDNGTDYWLIANSWSKDWGESGFFRM 319
Query: 244 LRGRNEAIIESLVNGAL 260
+RG N+ IE ++ +
Sbjct: 320 VRGENDCEIEDMITAGI 336
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 87/199 (43%), Gaps = 33/199 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G ++ W +RG+V+GG + + GC+P S PC + + P C + P+C
Sbjct: 154 CKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEY-HTKCRIPNCIPIVH-TPEC 211
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
C Y + + +DK HFG + R K + TNG V A
Sbjct: 212 VHHCRK-GYDKDYQEDK------------HFGQKVYSISRD--EKQIQTEIFTNGPVEAD 256
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ + Y + ++I+GWG ENG PYW +++ E +GDKG KI
Sbjct: 257 FHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGTENGTPYWLAANSWNENWGDKGYFKI 316
Query: 244 LRGRNEAIIESLVNGALPK 262
LR NE IE + +PK
Sbjct: 317 LRRTNECGIEEHIYAGIPK 335
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 91/195 (46%), Gaps = 23/195 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + GLV+GG + + C+ PPC H + + P C+ A P PKC
Sbjct: 169 CNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEH-HVNGTRPPCEGDA-PTPKC 226
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAF--WRSFCTKYTRP 188
C + Y + +DK Y + ++ + GP F + F T Y
Sbjct: 227 KNVC-QEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPT-YKSG 284
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q S ++ +K++GWGEE+G PYW +++ +G+ G KILRG+N
Sbjct: 285 VYQ--------HVSGALLGGHAIKLMGWGEEDGVPYWLCANSWNTDWGEGGFFKILRGKN 336
Query: 249 EAIIESLVNGALPKD 263
IES + +P++
Sbjct: 337 HCGIESDIVAGIPQN 351
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 89/187 (47%), Gaps = 14/187 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHA---NYTTSEPECKTLATPQ 136
C G W + +G+ TGG + + GC+P PC + N +P + P+
Sbjct: 154 CEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKNTCGGKPMERNHQCPK 213
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTNG 194
C+ + T+ R + +Y IN + +GP +F Y +G
Sbjct: 214 T-CYGKTTDQK--RYKTKSEYVINSIKTIEQDIKTYGPVEASF-----DVYDDFSVYKSG 265
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+Y + +A+ +VKI+GWG+ENG PYW V+++ + +GD GT KI++G+NE IE
Sbjct: 266 -IYRKTPNAKYQNGHSVKIIGWGQENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIER 324
Query: 255 LVNGALP 261
V +P
Sbjct: 325 AVTAGIP 331
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 87/197 (44%), Gaps = 11/197 (5%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN-YTTSEPECKTLATPQPK 138
C G S + G VTGG ++ N GC P SF PC + ++ P CKT
Sbjct: 163 CQGGYSIEAMRFWKSNGAVTGGDYNGN-GCMPYSFAPCQKSPCVESTTPTCKTTCQSSYT 221
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL--YFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
T+ +YG ++ N + Y H GP ++ K +Q V
Sbjct: 222 TANYTTDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEASY------KVYEDFYQYKSGV 275
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
Y S ++V VKI+GWG EN YW + +++G +FG+ G KI RG NE IES V
Sbjct: 276 YHY-VSGKLVGGHAVKIIGWGTENDVDYWLVANSWGIKFGEGGFFKIRRGTNECQIESNV 334
Query: 257 NGALPKDNYGVEFGEES 273
+ K E G++
Sbjct: 335 VAGVAKLGTHAEKGDDD 351
>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 203
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 88/186 (47%), Gaps = 24/186 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL 132
C+ G +++ G+VTG G GC P F CNH SE P+CK +
Sbjct: 12 CNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNHVPTENSEYPKCKDV 71
Query: 133 A-TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAFWRSFC 182
A P P C T CTN Y + +D ++ F+ GP + AF
Sbjct: 72 AHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVFNDAQSIKQEIFDNGPVFSAF----- 126
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
Y + +G VY V + E++++ VKI+GWG ++ + YW ++++ E++GD G IK
Sbjct: 127 KMYEDFRYYKSG-VY-VPTTKEVLSFHLVKIIGWGADSVQEYWLAMNSWNEEWGDHGLIK 184
Query: 243 ILRGRN 248
+ G+N
Sbjct: 185 MAFGKN 190
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 80/185 (43%), Gaps = 16/185 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + + G+VTGG + + C+P PPC H T C +A P C
Sbjct: 163 CDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCTQIAD-TPDC 221
Query: 140 HTRC-----TNDNYGRGFFQDKYQINGLGLYFDPH---FGPFWPAFWRSFCTKYTRPLFQ 191
T C + + + F +D Y I +GP AF F
Sbjct: 222 VTTCQAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAAF------IVYEDFFH 275
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+ +Y + E +A V+I+GWGEE G YW + +++ +G+ G +ILRG NE
Sbjct: 276 YHRGIYKHVSGGEEGGHA-VRILGWGEEKGTAYWLVANSWNTDWGENGYFRILRGSNECG 334
Query: 252 IESLV 256
IE V
Sbjct: 335 IEENV 339
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 87/202 (43%), Gaps = 40/202 (19%)
Query: 79 VCSSGISSST------WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL 132
+ SGI S+ W + K+GLV+GG +++N GCQP PP
Sbjct: 144 ISCSGIKSNAMADDQAWKFFKKQGLVSGGKYNTNDGCQPSKIPP--------------IF 189
Query: 133 ATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPH------------FGPFWPAFWRS 180
P+ + C N YG Y + + + + H +GP F
Sbjct: 190 NLPKKIYNRTCDNFCYGNSLID--YNHDHVKVSYTYHVLYKNIQREVQTYGPVSAYF--- 244
Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
+ Y T+G VYA + ++ V Y + K++GWG ENG YW +V+++G ++G G
Sbjct: 245 --SLYDDLFLYTSG-VYARTEKSKFVRYQSAKLIGWGVENGVDYWLLVNSWGNEWGQNGL 301
Query: 241 IKILRGRNEAIIESLVNGALPK 262
KI RG +E +PK
Sbjct: 302 FKIKRGTDECQFGRHTYAGVPK 323
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 82/182 (45%), Gaps = 24/182 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G ++W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLG-----LYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
+T CT+ ++ Y+++G LYF+ GPF FW Y+ L
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYEVHGEDDYKRELYFN---GPFVVVFW-----VYSDFLAYK 262
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
G VS + + V+IVGWG+ NG PYW I +++ +G G + LRG NE I
Sbjct: 263 TGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGI 320
Query: 253 ES 254
E+
Sbjct: 321 EA 322
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 105/248 (42%), Gaps = 33/248 (13%)
Query: 27 SCIEARAVATATPLAFAVC--RSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
SC AV++A ++ +C +S + V + ++ C + C G
Sbjct: 113 SCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYG---------CQGGW 163
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
+ W+ + G+VTGG + C+P +F PC H C P PKC C
Sbjct: 164 PIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDPYYGPCPGGLWPTPKCRKTCQ 223
Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQTN 193
Y + + +DK+ Y+ P+ GP AF Y +
Sbjct: 224 R-KYNKSYQEDKH--FATRAYYLPNNERNIRQEIYKNGPVVAAF-----RVYQDFSYYKK 275
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + A+A VK+VGWG EN YW I +++ +G+ G +I+RG NE IE
Sbjct: 276 G-IYVHKWGGQTGAHA-VKVVGWGRENATDYWLIANSWNTDWGESGYFRIVRGTNECGIE 333
Query: 254 S-LVNGAL 260
+ +V GA+
Sbjct: 334 AQMVGGAM 341
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 82/191 (42%), Gaps = 19/191 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI S W + G+V+GG ++S GC+P PPC H P C T PKC
Sbjct: 152 CNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C N Y + +DK Y ++ + GP AF T Y L
Sbjct: 210 QKNCEN-GYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAF-----TVYADLLA 263
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G + A + +KI+GWG EN YW + +++ +GD G KILRG N
Sbjct: 264 YKSGVYKHIQGDA--LGGHAIKILGWGVENDNKYWLVANSWNTDWGDNGFFKILRGENHC 321
Query: 251 IIESLVNGALP 261
IE + P
Sbjct: 322 GIEGSIIAGEP 332
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 83/194 (42%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTGG + TGC P FP C H + C P P C
Sbjct: 145 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPRYTYPTPSC 204
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLY-FDPH----------FGPFWPAFWRSFCTKYTRP 188
+ C Y + + +DK + G Y D H GP F YT
Sbjct: 205 YPYC-QAGYDKTYEKDK--VYGKTSYNVDRHEYTIMEEIMKNGPVEAGF-----IVYTDF 256
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G + V S ++I+GWG ENG YW +++ +G+ G +ILRG +
Sbjct: 257 AVYKSGIYHHV--SGRYAGKHAIRIIGWGVENGVKYWLTANSWNVGWGENGYFRILRGTD 314
Query: 249 EAIIESLVNGALPK 262
E IES+V +P+
Sbjct: 315 ECRIESIVVAGMPR 328
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 82/182 (45%), Gaps = 24/182 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G ++W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLG-----LYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
+T CT+ ++ Y+++G LYF+ GPF FW Y+ L
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYEVHGEDDYKRELYFN---GPFVVVFW-----VYSDFLAYK 262
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
G VS + + V+IVGWG+ NG PYW I +++ +G G + LRG NE I
Sbjct: 263 TGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGI 320
Query: 253 ES 254
E+
Sbjct: 321 EA 322
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 91/203 (44%), Gaps = 34/203 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC P + PPC H + S P C + +C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPCEH-HVNGSRPPCTGEGDTR-RC 207
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
+ C Y + +DK HFG + ++ S K NG V A
Sbjct: 208 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKKIMAEIYKNGPVEGA 252
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ ++ + Y + ++I+ WG ENG PYW +++ +GD G KI
Sbjct: 253 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGVENGVPYWAAANSWNLDWGDNGFFKI 312
Query: 244 LRGRNEAIIESLVNGALPK-DNY 265
LRG N IES + +P+ D Y
Sbjct: 313 LRGENHCGIESEIVAGIPRTDQY 335
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 85/192 (44%), Gaps = 19/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + G+V+GG + S GCQP S PC H + S P C+ C
Sbjct: 150 CFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPYSIAPCEH-HIPGSRPPCRGEGH-TADC 207
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+C + +Y + + + + + GP AF+ L
Sbjct: 208 RKQCEKGYSIPYDKDLHYAEFVYSTERDVKEIQTEILKN-GPVEAAFF------VYEDLL 260
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
VY A A + +A +KI+GWG ENG PYW I +++ +G+ G KILRG +E
Sbjct: 261 TYKEGVYKHVAGAPVGGHA-IKILGWGVENGTPYWLIANSWNTDWGNNGFFKILRGSDEC 319
Query: 251 IIESLVNGALPK 262
IE V+ LP+
Sbjct: 320 GIEIDVSAGLPR 331
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 90/186 (48%), Gaps = 15/186 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G+ S+ W + + G+ +GGA+ S+ GCQ F C + L QP
Sbjct: 204 CDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCKPQEIFAPHVDLICLRQCQPGY 263
Query: 140 HTRCTND-NYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
+T D ++GR + +D+ +I LY +FGP +F T YT Q
Sbjct: 264 NTTYLEDKHFGRVAYSVPRDEDRI----LYELFYFGPVQASF-----TVYT-DFIQYKSG 313
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + + + +VKIVGWG ENG +W +++G ++G+ G KI+RG + +ES
Sbjct: 314 VYRHTYGVRVGDH-SVKIVGWGVENGTKFWLCANSWGAEWGENGFFKIIRGEDHLSVESN 372
Query: 256 VNGALP 261
V LP
Sbjct: 373 VVAGLP 378
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 107/261 (40%), Gaps = 40/261 (15%)
Query: 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
T S IRD S + + A++ +EA + T R S H+ S F+ G+
Sbjct: 113 TISEIRDQSNCGSCW-----AIAAVEAMSDRYCTVAGITDLRVSTGHL--LSCCFVCGMG 165
Query: 65 QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
C GI + W W GL ++ CQP FPPC H
Sbjct: 166 ---------------CQGGIPTMAWLWWVWVGL-------TSEVCQPYPFPPCGHHTDGG 203
Query: 125 SEPECKTLATPQPKCHTRCTNDNYG--RGFFQDKYQINGLGLYFDP--HFGPFWPAFWRS 180
P C + P C++ C + + + + Y + G Y +GPF AF
Sbjct: 204 KYPACPSTIYDTPTCNSTCADSHTALTKHKGEKSYSLRGEREYMIELMTYGPFEVAF--- 260
Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
Y + +G VY+ + + +A VK+VGWG +NG PYW I +++ +GD G
Sbjct: 261 --DVYADFVSYKSG-VYSHTTGERLGGHA-VKLVGWGVQNGTPYWKIANSWNSDWGDNGY 316
Query: 241 IKILRGRNEAIIESLVNGALP 261
I RG +E IES LP
Sbjct: 317 FLIRRGTDECGIESTGVAGLP 337
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 84/215 (39%), Gaps = 36/215 (16%)
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
C L+S W C W GL TGG + GC+P S PC+ + N TT
Sbjct: 156 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTT 215
Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
S P C TP C CT N + + QDK HFG +
Sbjct: 216 SVP-CPGYHTP--TCEEHCTSNITWPIAYKQDK------------HFGKAHYNVGKKMTD 260
Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
T TNG V A V + + KI+GWG ++G PYW
Sbjct: 261 IQTE--IMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDSGVPYWLC 318
Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V +G FG+ G ++ LRG NE IE V ALP
Sbjct: 319 VHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALPD 353
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 84/215 (39%), Gaps = 36/215 (16%)
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
C L+S W C W GL TGG + GC+P + PC+ + N TT
Sbjct: 156 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTT 215
Query: 125 SEPECKTLATPQPKCHTRCT-NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT 183
S P C TP C RCT N + + QDK HFG +
Sbjct: 216 SVP-CPGYHTP--VCEERCTSNITWPISYKQDK------------HFGKAHYNVGKKMTD 260
Query: 184 KYTRPLFQTNGRVYA----------------VSASAEIVAYATVKIVGWGEENGRPYWTI 227
T NG V A V + + KI+GWG +NG PYW
Sbjct: 261 IQTE--IMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGVDNGVPYWLC 318
Query: 228 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V +G FG+ G ++ILRG NE IE V A P
Sbjct: 319 VHQWGTDFGENGFVRILRGVNEVNIEHQVLAAQPD 353
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 93/192 (48%), Gaps = 18/192 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP--QP 137
C+ G + W++ ++G+V+GG + S GC+P PC H + + P C +TP Q
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPSCQH 213
Query: 138 KCHTR-----CTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
KC + N+G + + + + + GP AF T Y +
Sbjct: 214 KCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTN-GPVEGAF-----TVYEDLILYK 267
Query: 193 NGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY E+ +A ++I+GWG E+ PYW I +++ +GD G +ILRG++
Sbjct: 268 SG-VYQHEHGKELGGHA-IRILGWGVWGESKVPYWLIGNSWNTDWGDNGFFRILRGQDHC 325
Query: 251 IIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 GIESSISAGLPK 337
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 85/200 (42%), Gaps = 34/200 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL-ATPQP 137
C G + W +G+VTGG SN GCQP PC+H Y S C +L T
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
C +C N NY + D ++ + + W + K + T+G V
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTS-----------IVYMTSWTN--VKQIQQEIMTHGPVT 238
Query: 198 AV----------------SASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGT 240
A S + E++ Y VK++GWG + +G YW ++++ +G+ G
Sbjct: 239 AFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGL 298
Query: 241 IKILRGRNEAIIESLVNGAL 260
KILRG N IE LV +
Sbjct: 299 FKILRGYNFCSIELLVMAGI 318
>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
Length = 188
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 75/185 (40%), Gaps = 26/185 (14%)
Query: 95 RGLVTGGAHHSNT-------GCQPVSFPPCNHANYTTSEPECKTLATPQ-PKCHTRCTND 146
RG++TG GCQP + PPC N C T + P C +C N
Sbjct: 11 RGIITGDMGLCQVEIITPTQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNP 70
Query: 147 NYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
NY F D Y+ G P+ GP F+ R L VY
Sbjct: 71 NYYTSFRTDIYK--GKYYKLSPYMAMKDIFDNGPITTQFYMY------RDLVDYKSGVYQ 122
Query: 199 VSASAEIVAYA--TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
++ + +VKI GWGEENG PYW + ++FG +G GT KI RG + + +
Sbjct: 123 YDEQSDFDFFTVHSVKIFGWGEENGVPYWLVANSFGTDWGYNGTFKISRGNDGCFFQEKM 182
Query: 257 NGALP 261
LP
Sbjct: 183 YAGLP 187
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 24/181 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + GL +++ CQP FP C+H +P C PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKC 210
Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
+T CT+ ++ Y+++G LYF+ GPF AF + F
Sbjct: 211 NTTCTDKAIPLIKYRGNHSYEVHGEEDYKRELYFN---GPFVVAF------QVYSDFFAY 261
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
VY S +++ V+IVGWG+ NG PYW I +++ +G G ILRG++E I
Sbjct: 262 KTGVYR-HVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGKDECGI 320
Query: 253 E 253
E
Sbjct: 321 E 321
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 86/195 (44%), Gaps = 36/195 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHH----SNTGCQPVSFPPCNHANYTTSEPECKTLATP 135
C G + W + + GLVTGG ++ + CQP P C H + S+P C +
Sbjct: 147 CEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPYPLPSCEH-HINGSKPACPSKIAK 205
Query: 136 QPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
P+C C + Y + QD H+G + R T TNG
Sbjct: 206 TPECVHTC-HAGYPTSYEQDL------------HYGESAYSVRRRVAEIQTE--IMTNGP 250
Query: 196 VYAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
V A + A+ AY + VK++GWGEE+G PYW I +++ +GD G
Sbjct: 251 VEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEEDGIPYWLIANSWNSDWGDHG 310
Query: 240 TIKILRGRNEAIIES 254
KI+RG++E IES
Sbjct: 311 YFKIVRGQDECGIES 325
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 86/194 (44%), Gaps = 25/194 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W GLVTGG + S GC+P PPC + + P K
Sbjct: 13 CHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNN----TCAGKPMEKN 68
Query: 140 HTRCTNDNYG---------RGFFQDKYQINGLGLYFDPH-FGPFWPAF--WRSFCTKYTR 187
H RCT YG + +D Y + + D +GP +F + F
Sbjct: 69 H-RCTRICYGDQELDFDEDHRYTRDYYYLTYGSIQKDVMTYGPIEASFDVYSDF------ 121
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
P +++ +Y + +A + VK++GWGE+ G PYW +V+++ E +GD G KI RG
Sbjct: 122 PSYKSG--IYERTENATYLGGHAVKLIGWGEQYGIPYWLMVNSWNEDWGDNGLFKIRRGT 179
Query: 248 NEAIIESLVNGALP 261
NE +++ +P
Sbjct: 180 NECGVDNSTTAGVP 193
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 86/183 (46%), Gaps = 30/183 (16%)
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
WV K G V+GG H+SN GCQP S C H + P C+ P+ C C ++ YG+
Sbjct: 157 WVTK-GFVSGGRHNSNEGCQPYSVEECEH-HIEGPRPPCEG-DMPELVCSETC-HEEYGK 212
Query: 151 GFFQD-KYQINGLGLYFDPHF-----------GPFWPAF--WRSFCTKYTRPLFQTNGRV 196
+ +D +Y GL Y P GP AF + F + Y ++Q
Sbjct: 213 TYEEDLEY---GLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLS-YKSGVYQ----- 263
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+ + Y V+++GWGEE G PYW + +++ +GD G KILRG +E E +
Sbjct: 264 ---HETGLLDGYHAVRVIGWGEEEGTPYWLVANSWNTDWGDNGLFKILRGSDECEFEGDM 320
Query: 257 NGA 259
A
Sbjct: 321 AAA 323
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 84/181 (46%), Gaps = 20/181 (11%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
W + +G+ TGG + + GC P PPC + + C P + H +C Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216
Query: 149 GRGFFQDKYQ------INGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
G+ Q++Y+ IN + +GP +F L +Y +
Sbjct: 217 GKTTVQNRYKTKSEYVINSIKTIERDIMTYGPVEASF------DVYDDLSAYKSGIYRKT 270
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
A+ ++KI+GWG++NG PYW V+++ + +G+ GT KI++GRNE IE V +
Sbjct: 271 PKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGI 330
Query: 261 P 261
P
Sbjct: 331 P 331
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 93/194 (47%), Gaps = 22/194 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG S GC+P PC H + + P C + +T P+C
Sbjct: 157 CNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPYEIEPCEH-HVNGTRPPCSSGST--PRC 213
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C + +Y + +DK Y I L GP AF T Y +
Sbjct: 214 QHVCES-SYKVDYKKDKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAF-----TVYEDLIL 267
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGE--ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG + PYW I +++ +GD G +I+RG++
Sbjct: 268 YKSG-VYEHVHGKELGGHA-IRILGWGVWGDEKIPYWLIANSWNTDWGDNGFFRIVRGKD 325
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 HCGIESSISAGLPK 339
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 85/196 (43%), Gaps = 32/196 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++++ G+ T G+ + GC P +FP C H + C P C
Sbjct: 159 CTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYNFPKCGHHQQDSKYQPCPEKNYDTPPC 218
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
RC N NYG +D+ +F HF P+ + T + TNG A
Sbjct: 219 LDRCPNKNYGTPLDKDR--------HFTAHFSPY-----QLKGTDNIKKEIMTNGPTSAA 265
Query: 200 -SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
S + ++Y + V+I+GWG + G YW +++++ E +G GT KI
Sbjct: 266 FSMYDDFLSYESGVYKHTSGTLMGEHGVEIIGWGTKQGVDYWLVMNSWNEGWGVHGTFKI 325
Query: 244 LRGR---NEAIIESLV 256
+G N+ IE +
Sbjct: 326 AQGDCGINDMAIERFM 341
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 91/200 (45%), Gaps = 35/200 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W + G+VTGG + ++ GCQP FPPC H + P C T P PKC
Sbjct: 94 CFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEH-HTKGPLPNC-TDTKPTPKC 151
Query: 140 HTRCTNDNYGRGFFQDKYQINGL-GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
C Y + + +DKY + L+ D T+ +++ NG V A
Sbjct: 152 LQVCRK-GYEKSYSEDKYFAKTVYSLHSDE--------------TQIKTEIYK-NGPVEA 195
Query: 199 -VSASAEIVAY--------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
S + +AY A + +GW + R W + +++ + +GDKG KI
Sbjct: 196 DFSVYTDFLAYKSGVYQRHSYELWEARHQNLGWALKR-RSVWLVANSWNQDWGDKGYFKI 254
Query: 244 LRGRNEAIIESLVNGALPKD 263
RG NE IE+ +N +PK+
Sbjct: 255 RRGNNECGIENDINAGIPKE 274
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 92/194 (47%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG + SN GC+P PC H T P AT PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGAT--PKC 213
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
C + ++G + + + + + GP AF T Y +
Sbjct: 214 SHVCQSSYTVDYAKDKHFGSKSYSVRRNVRDIQEEIMTN-GPVEGAF-----TVYEDLIL 267
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG + PYW I +++ +GD+G +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGDEKIPYWLIGNSWNTDWGDQGFFRILRGQD 325
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 HCGIESSISAGLPK 339
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 86/185 (46%), Gaps = 15/185 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G+ S+ W + + G+ +GGA S+ GCQ F C + + P C P
Sbjct: 200 CDGGVPSAVWHYWVENGITSGGAFGSHEGCQSYPFDVCKKSGDSNDTPRCLRFCQPGYNV 259
Query: 140 HTRCTNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
T + +YGR + +D+ +I +Y +FGP F T YT Q V
Sbjct: 260 -TYPEDKHYGRVAYTVPKDEERI----MYEVFNFGPAQATF-----TMYT-DFVQYKSGV 308
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
Y + + + +VK++GWG EN YW +++G Q+GD G KI+RG + E+ V
Sbjct: 309 YRHTFGVRVGTH-SVKVMGWGVENDVKYWLCANSWGAQWGDGGFFKIVRGEDHLSFETNV 367
Query: 257 NGALP 261
LP
Sbjct: 368 VAGLP 372
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 88/193 (45%), Gaps = 20/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G +W + +G+VTG +++ C+P FP C H + P+C + PKC
Sbjct: 148 CDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPACAHHEASPDYPDCPSTDYSTPKC 207
Query: 140 HTRC----TNDNYGRGFF--QDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTR-PL 189
C T + Y Q Y + GP AF T Y+ P
Sbjct: 208 TKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAF-----TVYSDFPT 262
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+++ VY ++ + + +A + IVGWG E+G PYW + +++ +GD G KILRG +
Sbjct: 263 YRSG--VYKHTSGSVLGGHA-ISIVGWGTESGSPYWLVKNSWNPSWGDGGFFKILRG--D 317
Query: 250 AIIESLVNGALPK 262
I + V G LPK
Sbjct: 318 CGINNDVVGGLPK 330
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 91/194 (46%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG + SN GC+P PC H + + P C PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEIAPCEH-HVNGTRPPCGH-GGGTPKC 213
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
C + ++G + K + + + GP AF T Y +
Sbjct: 214 SHVCESGYTVDYAKDKHFGSKSYSVKRNVRDIQEEIMTN-GPVEGAF-----TVYEDLIL 267
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG E PYW I +++ +GD G +ILRG++
Sbjct: 268 YKDG-VYQHQHGKELGGHA-IRILGWGVWGEEKIPYWLIGNSWNTDWGDNGFFRILRGQD 325
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 HCGIESSISAGLPK 339
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 91/192 (47%), Gaps = 27/192 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + RG+VTGG +GC+P F PCN + PE KT P C
Sbjct: 106 CEGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPCN----SYKCPEEKT-----PTC 155
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + +DK + ++ + + GP AF T Y ++
Sbjct: 156 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF-----TMY-EDMY 208
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+ VY +A + +A +KI+GWG +NG PYW I +++G +G+ G +K+ RG NE
Sbjct: 209 KYKSGVYRHTAGRLLGGHA-IKIIGWGTQNGIPYWLIANSWGADWGENGFLKMRRGVNEC 267
Query: 251 IIESLVNGALPK 262
IES V +PK
Sbjct: 268 GIESAVVAGMPK 279
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 95/193 (49%), Gaps = 22/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ +G+V+GG+++SN GC+P PC H + + P CK TP C
Sbjct: 159 CNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPYEIEPCEH-HVNGTRPPCKNGRTPS--C 215
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLF 190
+C + +Y + +DK + + +P GP AF T Y +
Sbjct: 216 KHQCES-SYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNGPVEGAF-----TVYEDLIL 269
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG ++ PYW I +++ +GD G +I+RG +
Sbjct: 270 YKSG-VYKHVHGKELGGHA-IRILGWGVWGDSKVPYWLIGNSWNTDWGDNGFFRIVRGED 327
Query: 249 EAIIESLVNGALP 261
IES ++ LP
Sbjct: 328 HCGIESAISAGLP 340
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 90/189 (47%), Gaps = 12/189 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W GLVTGG ++S GC+P PP N N ++S+ + C
Sbjct: 157 CHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHC 216
Query: 140 HTRCTND-NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTRPLFQTNGR 195
+ + D N + +D Y + + D +GP +F + F P +++
Sbjct: 217 YGNQSIDFNDDHRYTRDYYYLTYGSIQKDVLTYGPIEASFDVYDDF------PSYKSG-- 268
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY S +A + VK++GWGEE+G PYW +V+++ Q+GD G KI RG NE +++
Sbjct: 269 VYVKSDNASYLGGHAVKLIGWGEEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVDNS 328
Query: 256 VNGALPKDN 264
+P N
Sbjct: 329 TTAGVPVTN 337
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 83/185 (44%), Gaps = 24/185 (12%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDN 147
+ W+ + +VTGG + C+P +F PC NH N P C P PKC C
Sbjct: 165 YRWMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGP-CPRGLWPTPKCRKACQR-K 222
Query: 148 YGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
Y + + +DKY Y+ P GP AF K + G +
Sbjct: 223 YNKSYNEDKY--FATRSYYLPSNERSIREEIYKNGPVVAAF------KVYQDFSYYRGGI 274
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE-SL 255
Y + A+A VK+VGWG ENG YW I +++ +G+ G +I RG NE IE +
Sbjct: 275 YVHKWGGQTGAHA-VKVVGWGRENGTDYWLIANSWNTDWGENGYFRIARGSNECGIEGQM 333
Query: 256 VNGAL 260
V+G +
Sbjct: 334 VSGVM 338
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 105/242 (43%), Gaps = 31/242 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
+C AV+TA+ L+ +C +S ++HV T G +C + C+
Sbjct: 26 NCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCG--NQCGYG---------CNG 74
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G + + K+G VTGG + + +GC+P F PC H T EC AT PKC +
Sbjct: 75 GWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKCVRK 133
Query: 143 C-----TNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLFQTNG 194
C + R +D Y+ GP AF T Y + G
Sbjct: 134 CQKSYKKSYKKDRSIGKDAYEEPNAEKATQREIMKNGPVVGAF-----TVYEDFSYYKKG 188
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+Y +A +A +KI+GWG+E G PYW I +++ +G+ G +IL G N IE
Sbjct: 189 -IYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIANSWHNDWGENGYFRILCGSNHCGIEE 246
Query: 255 LV 256
V
Sbjct: 247 NV 248
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 80/192 (41%), Gaps = 39/192 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W +G+VTGG +H GC+P PC N PE KT P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PAC 205
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---------- 189
C Y + +DK HFG A RS T +
Sbjct: 206 SLSC-QSGYSTAYAKDK------------HFGASAYAVARSVAAIQTEIMTNGPVEAAFT 252
Query: 190 -----FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
++ VY +A + +A +KI+GWG E+G PYW + +++G +G+ G KIL
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGTNWGESGFFKIL 311
Query: 245 RGRNEAIIESLV 256
RG ++ IE V
Sbjct: 312 RGDDQCGIEGAV 323
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 86/199 (43%), Gaps = 33/199 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W+W H G+ TGG + S C FP C+H + P C P P+C
Sbjct: 138 CNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDH-HVEGKYPPCGE-TQPTPEC 195
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
+C + Y + +DK HF F A+ + + TNG +
Sbjct: 196 VEKC-QEGYPVEYKKDK------------HF--FGEAYHVPSNVEAIKTELMTNGPIEVD 240
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
S + + Y + VK+VGWG E+G YW I +++ E +G+ G +I
Sbjct: 241 FSVYEDFMTYKSGIYQHVAGKYLGGHAVKLVGWGVEDGVEYWKIANSWNEDWGENGYFRI 300
Query: 244 LRGRNEAIIESLVNGALPK 262
+ G+NE IES +P+
Sbjct: 301 IAGKNECGIESDGVAGIPE 319
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 89/204 (43%), Gaps = 40/204 (19%)
Query: 79 VCSSGISSSTWAWVHK---------RGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEP 127
+ SGI +S WV GLV+GG+ +++N GCQP PP CN
Sbjct: 146 ISCSGIKASANGWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN--------- 196
Query: 128 ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGP---------FWPAFW 178
P C + YG KY + + + + H P + P
Sbjct: 197 ------LPTKINKRTCVDYCYGNDTI--KYNHDHVKVRYYYHVKPKDIQKEVQTYGPV-- 246
Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
+ +F VY ++ +A+ V VK++GWG ENG YW +V+++G ++G
Sbjct: 247 -TAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLVNSWGNEWGQN 305
Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
G +KI RG+ +ES V A+PK
Sbjct: 306 GLLKIKRGKYGCAVESFVYAAVPK 329
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 89/212 (41%), Gaps = 25/212 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S + G VTGG + + GC P SF PC ++ P CKT K
Sbjct: 100 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCKTTCQSSYKT 158
Query: 140 HTRCTNDNYGRGFFQ--DKYQ--INGLGLYFDP-------------HFGPFWPAFWRSFC 182
+ +YG + +++Q +N Y H+GP ++
Sbjct: 159 EEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASY----- 213
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
K + VY + S ++V VKI+GWG ENG YW I +++G FG+KG K
Sbjct: 214 -KVYEDFYHYKSGVYHYT-SGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFK 271
Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEESG 274
I RG NE IE V + K E E+ G
Sbjct: 272 IRRGTNECQIEGNVVAGIAKLGTHSETYEDDG 303
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 91/194 (46%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG + SN GC+P PC H + + P C + PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAN-GSGTPKC 213
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
C + ++G + K + + + GP AF T Y +
Sbjct: 214 SHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTN-GPVEGAF-----TVYEDLIL 267
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG PYW I +++ +GD G +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGNEKIPYWLIGNSWNTDWGDHGFFRILRGQD 325
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 HCGIESSISAGLPK 339
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 86/181 (47%), Gaps = 20/181 (11%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
W + +G+ TGG + + GC P PPC + + C P + H +C Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216
Query: 149 GRGFFQDKYQ------INGLGLYFD--PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
G+ Q++Y+ +N + +GP +F Y +G +Y +
Sbjct: 217 GKTTVQNRYKTKSEYVMNSIKTIEQDLKTYGPVEASF-----DVYDDFSVYKSG-IYRKT 270
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
A+ ++KI+GWG++NG PYW V+++ + +G+ GT KI++GRNE IE V +
Sbjct: 271 PKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGI 330
Query: 261 P 261
P
Sbjct: 331 P 331
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 74/165 (44%), Gaps = 18/165 (10%)
Query: 105 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDK-------- 156
++TGCQP FP C H P C T P+C C Y F QDK
Sbjct: 184 NHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKTPFEQDKPFGEGSSN 241
Query: 157 YQINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVG 215
Q N D +GP AF Y L +G V+ S IV ++I+G
Sbjct: 242 VQNNEKVFQRDIMMYGPVEAAF-----DVYEDFLNSKSGISRHVTGS--IVGGHPIRIIG 294
Query: 216 WGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
WG E G PYW I +++ E +G+ G +++RGR+E IES V L
Sbjct: 295 WGVEKGNPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 11/180 (6%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 147
W + G+VTGG + + C P FPPC H SE P C P+C + C
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223
Query: 148 YGRGFFQDKYQIN-GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRVYAVSAS 202
Y + DK + + LY W + T ++ G VY +
Sbjct: 224 YATKYEDDKIRASTSYNLYRS--VTAIQKEIWMRGPVEATMNVYTDFANYAGGVYK-HTT 280
Query: 203 AEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
E++ ++++GWG EE+G PYW +++ +G+KG +ILRG + IES V+ LP
Sbjct: 281 GELLGGHAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 89/204 (43%), Gaps = 40/204 (19%)
Query: 79 VCSSGISSSTWAWVHK---------RGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEP 127
+ SGI +S WV GLV+GG+ +++N GCQP PP CN
Sbjct: 91 ISCSGIKASANGWVRDGLAWEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN--------- 141
Query: 128 ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGP---------FWPAFW 178
P C + YG KY + + + + H P + P
Sbjct: 142 ------LPTKINKRTCVDYCYGNDTI--KYNHDHVKVRYYYHVKPKDIQKEVQTYGPV-- 191
Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
+ +F VY ++ +A+ V VK++GWG ENG YW +V+++G ++G
Sbjct: 192 -TAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDYWLLVNSWGNEWGQN 250
Query: 239 GTIKILRGRNEAIIESLVNGALPK 262
G +KI RG+ +ES V A+PK
Sbjct: 251 GLLKIKRGKYGCAVESFVYAAVPK 274
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 81/180 (45%), Gaps = 11/180 (6%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 147
W + G+VTGG + + C P FPPC H SE P C P+C + C
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223
Query: 148 YGRGFFQDKYQIN-GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRVYAVSAS 202
Y + DK + + LY W + T ++ G VY +
Sbjct: 224 YATKYEDDKIRASTSYNLYRS--VTTIQKEIWMRGPVEATMNVYTDFANYAGGVYK-HTT 280
Query: 203 AEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
E++ ++++GWG EE+G PYW +++ +G+KG +ILRG + IES V+ LP
Sbjct: 281 GELLGGHAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 105/247 (42%), Gaps = 32/247 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A ++ VC S +F F A C W + C+ G
Sbjct: 115 SCGSCWAFGAVEAMSDRVCIHSN---GTKNFHFSAENLVSCCWTCG-----FGCNGGFPG 166
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC--- 143
+ W + +G+V+GG + SN GC P PC H T P CK P C +C
Sbjct: 167 AAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKCEEG 224
Query: 144 ------TNDNYGRGFFQDKYQINGL--GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
+ ++G+ + + ++ + +Y + GP AF T Y + G
Sbjct: 225 YKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN---GPVEGAF-----TVYEDFIAYRAG- 275
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY A + +A ++I+GWG +NG PYW + +++ +G G KILRG +E IE
Sbjct: 276 VYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEG 334
Query: 255 LVNGALP 261
+N LP
Sbjct: 335 QINAGLP 341
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 80/171 (46%), Gaps = 7/171 (4%)
Query: 96 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 155
G+VTGG++ +GCQP P C++ + + +C P+C C D Y + + D
Sbjct: 172 GIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQCTNEC-QDGYNKTYDDD 229
Query: 156 KYQ----INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATV 211
K+ N G D + + T L +G VY + + + + T+
Sbjct: 230 KFYGERIYNVYGTQEDIQKEILMNGPVIASISVNTDFLVYKSG-VYLPTPRSRNLGWITL 288
Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+I+GWG E PYW +++ E++GD G +KI RG IES V +PK
Sbjct: 289 RIIGWGYEGKIPYWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAPIPK 339
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 90/194 (46%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG + SN GC+P PC H + + P C PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 213
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C + Y + +DK Y + GP AF T Y +
Sbjct: 214 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAF-----TVYEDLIL 267
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG E PYW I +++ +GD G +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQD 325
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 HCGIESSISAGLPK 339
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 83/199 (41%), Gaps = 31/199 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W++ G+VTG + S +GC+P +PPC H +C P C
Sbjct: 164 CDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTC 223
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV-YA 198
+C QD Y I+ D H+G A + + + TNG V A
Sbjct: 224 EYKC----------QDGYSIS---YNSDKHYGASVYAVAQDVAS--IQKEIMTNGPVEVA 268
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ Y++ VK++GWG ENG YW +++ +G+ G +I
Sbjct: 269 FDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICANSWNSDWGENGFFRI 328
Query: 244 LRGRNEAIIESLVNGALPK 262
LRG +E IES V PK
Sbjct: 329 LRGVDECQIESSVVAGEPK 347
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 103/244 (42%), Gaps = 42/244 (17%)
Query: 24 YALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSG 83
YA + + A + AT + S++ + C +G+K+R V+R +
Sbjct: 117 YAATGVFADRMCIATNGNYNQLLSTEELISC------SGIKEREDGYVNRVLV------- 163
Query: 84 ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC---- 139
W + GLV+GG +++N GCQP P ++ + C +
Sbjct: 164 -----WEYFKTHGLVSGGKYNTNEGCQPSKVPTVYNSQTKIYKRTCVEYCYGKDTINYNH 218
Query: 140 -HTRCTNDNYGR-GFFQDKYQING-LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
H + +N + R Q + Q G + ++FD H LF V
Sbjct: 219 DHVKVSNHYFIRIKDIQKEVQTYGPVSVFFDLH-----------------DDLFLYKSGV 261
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
YA + ++ Y K++GWG ENG YW +V+++G ++G G KI RG +E +ES V
Sbjct: 262 YAKTEKSKDKRYHHAKLIGWGVENGVDYWLLVNSWGYEWGQNGLFKIKRGTDECSVESHV 321
Query: 257 NGAL 260
L
Sbjct: 322 YAGL 325
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 90/194 (46%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG + SN GC+P PC H T P T PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGGT--PKC 213
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
C + ++G + K + + + GP AF T Y +
Sbjct: 214 SHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTN-GPVEGAF-----TVYEDLIL 267
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG + PYW I +++ +GD G +ILRG++
Sbjct: 268 YKDG-VYQHEHGKELGGHA-IRILGWGVWGDEKIPYWLIGNSWNTDWGDHGFFRILRGQD 325
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 326 HCGIESSISAGLPK 339
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 81/193 (41%), Gaps = 31/193 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+T CT ND+Y +D ++ LYF+ GPF AF ++
Sbjct: 211 NTTCTDKAIPLIEYRGNDSYVLLHGEDDFKRE---LYFN---GPFVVAF-----QVFSDF 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L G VS + + V+IVGWG+ NG PYW I +++ +G G LRG N
Sbjct: 260 LAYKTGVYRHVSG--DFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLFLRGNN 317
Query: 249 EAIIESLVNGALP 261
E IE LP
Sbjct: 318 ECGIEFEGYAGLP 330
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 90/194 (46%), Gaps = 21/194 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W++ ++G+V+GG + SN GC+P PC H + + P C PKC
Sbjct: 146 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 203
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C + Y + +DK Y + GP AF T Y +
Sbjct: 204 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAF-----TVYEDLIL 257
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY E+ +A ++I+GWG E PYW I +++ +GD G +ILRG++
Sbjct: 258 YKDG-VYQHEHGKELGGHA-IRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQD 315
Query: 249 EAIIESLVNGALPK 262
IES ++ LPK
Sbjct: 316 HCGIESSISAGLPK 329
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 112/282 (39%), Gaps = 58/282 (20%)
Query: 10 RDMSYGATV----------YNRRPYALSC-IEARAVATATPLAFAVCRSSKMHVECTSFR 58
+D YG V Y R P LS EA + +P RS + SF+
Sbjct: 274 KDHIYGKDVGSHTDEVCIFYERVPLGLSFPKEATKEISGSPGE----RSQEWRQLIQSFK 329
Query: 59 FIAG--VKQRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPP 116
+AG + R A L +I V S I A RG +T G GC P FPP
Sbjct: 330 KLAGGRPRDRTALLSIDRSSIEVQPSRICGDYVA----RGNLTKG-----DGCWPYDFPP 380
Query: 117 CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPA 176
C H T P+C + P C +C N Y D++ + L P+
Sbjct: 381 CAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRHYM----LESSPY------- 429
Query: 177 FWRSFCTKYTRPLFQTNGRVYAVSASAE-IVAYAT---------------VKIVGWGEEN 220
+ + +T+G + A E +AY + VKI+GWGEEN
Sbjct: 430 ---QYSVNNAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWGEEN 486
Query: 221 GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
G YW +V+++ E +GD+G KI G E I+ + G PK
Sbjct: 487 GEAYWLVVNSWNEDWGDQGLFKIALGNCE--IDDDLLGGTPK 526
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 81/186 (43%), Gaps = 18/186 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + + G+VTGG + + C+P PPC T C T P C
Sbjct: 135 CDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNC-TQEIDTPDC 193
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
T C + YG+ + ++ + +GP AF T Y F
Sbjct: 194 KTTCQAGYPISYDDDKTYGKTAYSVSNSVHAIQKEI-MTYGPVVAAF-----TVYDD-FF 246
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+Y + AE +A V+I+GWG++ G PYW + +++ +G+ G +ILRG +E
Sbjct: 247 HYKTGIYKHVSGAEAGGHA-VRILGWGQQGGVPYWLVANSWNTDWGENGYFRILRGSDEC 305
Query: 251 IIESLV 256
IE V
Sbjct: 306 GIEDGV 311
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 79/184 (42%), Gaps = 10/184 (5%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 138
C G S + G VTGG ++ GC P SF PC + + P CKT K
Sbjct: 167 CQGGYSIEALRFWKSSGAVTGG-DYNGAGCMPYSFAPCKKDSCAQGTTPSCKTTCQSSYK 225
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
+ ++G ++ + + H GP +F K ++ VY
Sbjct: 226 TAEYTKDKHFGTTAYKITNSVAAIQTEI-YHNGPVEASF------KVYEDFYKYKSGVYQ 278
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
+ S ++V VKI+GWG ENG YW I +++G FGD G K+ RG NE IE V
Sbjct: 279 YT-SGKLVGGHAVKIIGWGTENGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIEGNVVA 337
Query: 259 ALPK 262
K
Sbjct: 338 GTAK 341
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/188 (30%), Positives = 82/188 (43%), Gaps = 31/188 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W +G+VTGG +H GC+P PC N PE KT P C
Sbjct: 155 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 204
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
C + Y + +DK+ G+ Y P GP AF
Sbjct: 205 SMSCQS-GYSTAYAKDKH--FGVSAYAVPKNAASIQAEIYANGPVEAAF------SVYED 255
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++ VY +A + +A +KI+GWG E+G PYW + +++G +G+ G KI RG +
Sbjct: 256 FYKYKSGVYKHTAGKYLGGHA-IKIIGWGTESGSPYWLVANSWGVNWGESGFFKIYRGDD 314
Query: 249 EAIIESLV 256
+ IES V
Sbjct: 315 QCGIESAV 322
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 82/198 (41%), Gaps = 33/198 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + + G+V+GG + TGC P FP C+H T C PKC
Sbjct: 155 CEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKCSHLEETPGLAPCPRELYATPKC 214
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQ--TNGRVY 197
+C Y + +DK I G Y + + T + + TNG V
Sbjct: 215 EKQC-QAGYSKTSEEDK--IKGKSSY--------------NVGDRETDIMMEIITNGPVS 257
Query: 198 AVSASAE--------IVAYATVK------IVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+ E I Y + I+GWG ENG YW +++ E +G+ G +I
Sbjct: 258 TIYYIFEDFTVYKSGIYQYTSGSLMGGHGIIGWGVENGVKYWLAANSWNEGWGENGYFRI 317
Query: 244 LRGRNEAIIESLVNGALP 261
RG NE IES +N LP
Sbjct: 318 RRGTNECGIESRINAGLP 335
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 86/192 (44%), Gaps = 20/192 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + +G+V+GG + S GC P PC H T P CK P C
Sbjct: 158 CNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPAC 215
Query: 140 HTRCTNDNYGRGFFQDKYQ---INGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLF 190
+C D Y + QD ++ LG D GP AF T Y +
Sbjct: 216 VKKC-EDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAF-----TVYEDFIA 269
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNE 249
G VY A + +A ++I+GWG +NG PYW + +++ +G G KILRG +E
Sbjct: 270 YRAG-VYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSWNSDWGSDGFFKILRGSDE 327
Query: 250 AIIESLVNGALP 261
IE +N LP
Sbjct: 328 CGIEGQINAGLP 339
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 87/194 (44%), Gaps = 29/194 (14%)
Query: 80 CSSG-ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
C+ G + +++ + GLV+GGA++S GC+P F PC + P PK
Sbjct: 156 CNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPYPFKPCEY-------PFNDCHVEISPK 208
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTR 187
C C D R + +DK + G Y P GP F Y
Sbjct: 209 CTHHC-RDGVDRHYSKDK--LFGKVAYSVPRDERAIRYEIMTNGPVEAGF-----DVYED 260
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
L +G VY +I +A V+I+GWG + G PYW I +++G+ +GD G K +RG
Sbjct: 261 VLLYKSG-VYRHVYGEQIGKHA-VRIIGWGRDGGIPYWLIANSYGDDWGDHGYFKFVRGS 318
Query: 248 NEAIIESLVNGALP 261
N IES + LP
Sbjct: 319 NHLGIESKIITGLP 332
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 102/245 (41%), Gaps = 29/245 (11%)
Query: 27 SCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
+C AV+TA ++ +C ++K V ++ + C + C G
Sbjct: 109 NCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPTCGF---------GCDGGW 159
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC- 143
S W + GLV+GG + S C+P PC H T EC A+ P C +C
Sbjct: 160 SIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEAS-TPSCKKKCQ 218
Query: 144 --------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
+ YG FQ + + + GP SF L+++
Sbjct: 219 PGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKN-GPVTA----SFAVYEDFSLYKSG-- 271
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y +A E+ Y VK++GWG EN YW I +++ + +G+ G +I+RG N+ IE
Sbjct: 272 IYRHTA-GELRGYHAVKMIGWGTENRTDYWLIANSWHDDWGENGYFRIIRGINDCGIEEN 330
Query: 256 VNGAL 260
V L
Sbjct: 331 VAAGL 335
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 77/175 (44%), Gaps = 22/175 (12%)
Query: 99 TGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQ 158
+GG SN GC P PC H + + P C PKC C +Y + QDK
Sbjct: 5 SGGPFGSNQGCHPYKIAPCEH-HVNGTRPACNGEEGKTPKCIKHC-QASYTVAYEQDKSY 62
Query: 159 INGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVA 207
G Y PH GP AF T Y L Q VY + +++
Sbjct: 63 --GAKSYSVPHHVAQIQKEIMTNGPVEGAF-----TVY-EDLVQYKDGVYQ-HVTGKMLG 113
Query: 208 YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
++I+GWG EN PYW I +++ +G+ G KILRG + IES ++ +PK
Sbjct: 114 GHAIRILGWGVENDVPYWLIANSWNTDWGNNGFFKILRGSDHCGIESQISAGIPK 168
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 109/265 (41%), Gaps = 36/265 (13%)
Query: 10 RDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRC 67
RD+ T + R A +C AV+TA ++ +C +SK V ++ + + +C
Sbjct: 94 RDVWKNCTTFYIRDQA-NCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQC 152
Query: 68 AWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
C G W + G+V+GG + + C+P PC H T
Sbjct: 153 GD---------GCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYG 203
Query: 128 ECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLG---LYFDPHFGPFWP 175
EC+ A P P C +C + YG+ + K + + L P F
Sbjct: 204 ECRGTA-PTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASF-- 260
Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
A + F Y +++ + E+ Y VK++GWG EN +W I +++ +
Sbjct: 261 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKMIGWGNENNTDFWLIANSWHNDW 311
Query: 236 GDKGTIKILRGRNEAIIESLVNGAL 260
G+KG +I+RG N+ IE + +
Sbjct: 312 GEKGYFRIIRGTNDCGIEGTIAAGI 336
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 89/193 (46%), Gaps = 23/193 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHH---SNTGCQPVSFPPCNH--ANYTTSEPECKTLAT 134
C+ G S W W K G+VTGG + + T C+P F PC H + P C
Sbjct: 367 CNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEY 426
Query: 135 PQPKCHTRCTNDNYGRGFF-QDK------YQINGL-GLYFDP-HFGPFWPAFWRSFCTKY 185
P P+C + C+ N+ G + +DK Y + G+ + D +G AF
Sbjct: 427 PTPECLSECSETNFSGGSYGEDKKMAREAYSLAGIENIQRDMMKYGSVTAAF------SV 480
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKI 243
+G VY + + + +A VK++GWG E +G YW I +++ +G+ G +I
Sbjct: 481 FSDFLTYSGGVYTHESGSFMGGHA-VKMIGWGTDEVSGEDYWLIANSWNPSWGEGGLFRI 539
Query: 244 LRGRNEAIIESLV 256
LRG NE IE +
Sbjct: 540 LRGVNECGIEGQI 552
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/187 (29%), Positives = 81/187 (43%), Gaps = 31/187 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+ +++GCQP FP C H ++ C PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+ CT N Y ++ Y+ LYF+ GPF F+ YT
Sbjct: 211 NATCTDKSIPLVKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFF-----VYTD- 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
LF VY + + + V+IVGWG+ NG PYW + +++ +G G + ILRG N
Sbjct: 259 LFAYKSGVYR-NVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNN 317
Query: 249 EAIIESL 255
E IE L
Sbjct: 318 ECNIEHL 324
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 109/265 (41%), Gaps = 36/265 (13%)
Query: 10 RDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRC 67
RD+ T + R A +C AV+TA ++ +C +SK V ++ + + +C
Sbjct: 94 RDVWKNCTTFYIRDQA-NCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQC 152
Query: 68 AWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
C G W + G+V+GG + + C+P PC H T
Sbjct: 153 GD---------GCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYG 203
Query: 128 ECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLG---LYFDPHFGPFWP 175
EC+ A P P C +C + YG+ + K + + L P F
Sbjct: 204 ECRGTA-PTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASF-- 260
Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
A + F Y +++ + E+ Y VK++GWG EN +W I +++ +
Sbjct: 261 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKMIGWGNENNTDFWLIANSWHNDW 311
Query: 236 GDKGTIKILRGRNEAIIESLVNGAL 260
G+KG +I+RG N+ IE + +
Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAGI 336
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 21/191 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + K GLVTGG +S GCQP FPPC T C + KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206
Query: 140 HTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH------FGPFWPAFWRSFCTKYTRPLFQ 191
+C N + D+ + + L +D +GP +F
Sbjct: 207 QKKCFG-NTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSF------DVYDDFIS 259
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY S +A + +VK +GWG E YW +++++ +GD G KI RG NE
Sbjct: 260 YKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNNTWGDGGNFKIRRGTNECQ 319
Query: 252 IESLVNGALPK 262
+E +P+
Sbjct: 320 VEDSSTAGMPE 330
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 81/192 (42%), Gaps = 36/192 (18%)
Query: 90 AWVH--KRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 147
AW H K G+V+GG++ S GCQP PPC H + C T P P C C
Sbjct: 128 AWDHWVKHGIVSGGSYGSKEGCQPYHLPPCEH-HRAGPRRNC-TKYGPTPSCARVC---- 181
Query: 148 YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE--- 204
Q Y+I+ D HFG W A K R NG V A A+ E
Sbjct: 182 ------QPDYKIS---YEDDLHFGKQWYAL-APHNEKIIRTEIFHNGPVEATMAAYEDFY 231
Query: 205 -------------IVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V VKI+GWG ++ PYW + ++F +G+ G KI RG NE
Sbjct: 232 TYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLVANSFNTDWGEYGFFKIKRGVNE 291
Query: 250 AIIESLVNGALP 261
IE+ + +P
Sbjct: 292 CGIENKITAGIP 303
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 88/192 (45%), Gaps = 30/192 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C + W +K+G+VTGG + +GC+P F PC T SE P+C
Sbjct: 145 CKGASPLQAFRWWNKKGVVTGG-DYRGSGCKPYPFAPCTALPCTKSE---------TPRC 194
Query: 140 HTRCTNDNYGRGFFQDKY-----QINGL---GLYFDPHFGPFWPAF--WRSFCTKYTRPL 189
C Y + + +DKY I G+ + + GP AF + F Y +
Sbjct: 195 SLNC-QPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEITNGPVEAAFIVYDDF-NHYRSGV 252
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
++ + ++V VKI+GWG +NG PYW + +++G +G+ G K+LRG +E
Sbjct: 253 YR--------HVAGKLVGGHAVKIIGWGIQNGAPYWLMANSWGPYWGENGFFKMLRGVDE 304
Query: 250 AIIESLVNGALP 261
IES + P
Sbjct: 305 CGIESTIVAGKP 316
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 80/172 (46%), Gaps = 25/172 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 135
C+ G W GLVTGG + S GC+P PPC + N + +P P
Sbjct: 97 CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQP-----MEP 151
Query: 136 QPKCHTRCTND-----NYGRGFFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
KC +C D N + +D Y + G+ D ++GP +F + F
Sbjct: 152 NHKCSKKCYGDEDIDFNKDHRYTRDDYYLTYRGIQKDVINYGPIEASFDVYDDF------ 205
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
P +++ +Y S +A + +VK++GWGEE G YW +V+++ +GDKG
Sbjct: 206 PNYKSG--IYVKSENASYLGGHSVKLIGWGEEYGVLYWLMVNSWNADWGDKG 255
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 84/197 (42%), Gaps = 31/197 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S+ W + GLVTGG +SN GC P C+H +P C + P P C
Sbjct: 286 CEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP-CGDI-QPTPAC 343
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF---WRSFCTK-YTRPLFQTNGR 195
C N+ D HFG + +S T+ YT + +
Sbjct: 344 ANSCQNNATWSS---------------DKHFGASSYSVGTDQQSIMTEIYTNGPVEASYD 388
Query: 196 VYA--VSASAEIVAYAT--------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
VYA VS + + + T VKI+GWG + PYW + +++ +G+ G ILR
Sbjct: 389 VYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGVDGSTPYWIVANSWNNDWGNNGFFNILR 448
Query: 246 GRNEAIIESLVNGALPK 262
G +E IE + +PK
Sbjct: 449 GSDECGIEDGIVAGIPK 465
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 108/265 (40%), Gaps = 36/265 (13%)
Query: 10 RDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRC 67
RD+ T + R A +C AV+TA ++ +C +SK V ++ + + +C
Sbjct: 94 RDVWKNCTTFYIRDQA-NCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQC 152
Query: 68 AWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
C G W + G+V+GG + + C+P PC H T
Sbjct: 153 GD---------GCEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYG 203
Query: 128 ECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLG---LYFDPHFGPFWP 175
EC+ A P P C C + YG+ + K + + L P F
Sbjct: 204 ECRGTA-PTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASF-- 260
Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
A + F Y +++ + E+ Y VK++GWG EN +W I +++ +
Sbjct: 261 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKMIGWGNENNTDFWLIANSWHNDW 311
Query: 236 GDKGTIKILRGRNEAIIESLVNGAL 260
G+KG +I+RG N+ IE + +
Sbjct: 312 GEKGYFRIIRGTNDCGIEGTIAAGI 336
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 107/241 (44%), Gaps = 30/241 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSSKM--HVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
+C AV+TA+ L+ +C +SK V +S F++ C + C G
Sbjct: 118 NCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFVSCC-DSCGFG---------CEGGW 167
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
+ + +G+VTGG + S TGC+P F PC H T EC + P+C +C
Sbjct: 168 PIDAFEYYSYQGVVTGGDYGSKTGCRPYPFHPCGHHGNETYYGECPKEES-TPECVKQCQ 226
Query: 145 ---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
+ +G +++ + + + GP +F T Y + G
Sbjct: 227 KGYKNSYRRDKTWGEDYYEVENSVKAIQREI-MRSGPVVSSF-----TVYDDFSYYVKG- 279
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y +A ++A +KI+GWG E PYW I +++ +G+KG +++RG N IE
Sbjct: 280 IYKHTAGKARGSHA-IKIIGWGTEKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEED 338
Query: 256 V 256
V
Sbjct: 339 V 339
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 70/162 (43%), Gaps = 16/162 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K G+VTG H +N GC+P FP C H + T CK P PKC
Sbjct: 43 CNGGDPLSAWKFWVKEGIVTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKC 102
Query: 140 HTRCTNDNYGRGFFQDKY---QINGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLF 190
C R + +DKY G+ + + +GP AF +
Sbjct: 103 EKSCQATFGERTYKEDKYFGRSAYGVKNHMEAIQKEIITYGPVEVAF------EVYEDFL 156
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFG 232
G +Y A +A VK++GWG +NG PYW + T G
Sbjct: 157 NYAGGIYVHQGGALGGGHA-VKMIGWGIDNGVPYWXHLPTHG 197
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 81/192 (42%), Gaps = 39/192 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W +G+VTGG +H GC+P PC N PE KT P C
Sbjct: 192 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 241
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---------- 189
C + Y + +DK HFG A R + T +
Sbjct: 242 SLSCQS-GYTTAYAKDK------------HFGTSAYAVARKVASIQTEIMTNGPVEAAFT 288
Query: 190 -----FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
++ VY +A + +A +KI+GWG E+G PYW + +++G +G+ G +I
Sbjct: 289 VYEDFYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGNSWGESGFFRIF 347
Query: 245 RGRNEAIIESLV 256
RG ++ IES V
Sbjct: 348 RGDDQCGIESAV 359
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 85/196 (43%), Gaps = 31/196 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C S+ A + R LV + GCQP S PPC P C T P PKC
Sbjct: 156 CDGRCHCSSVAILQGRRLVPEPVR-TEDGCQPYSLPPC--------VPNC-THPEPTPKC 205
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP---------HFGPFWPAF--WRSFCTKYTRP 188
C Y + + +DK+ + GP AF + F Y
Sbjct: 206 QHVCRK-GYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADF-PSYKSG 263
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
++Q + + + +KI+GWG E+G PYW + +++ +GDKG KILRG++
Sbjct: 264 VYQQH--------MIKFMGVHAIKILGWGTEDGVPYWLVANSWNVGWGDKGYFKILRGKD 315
Query: 249 EAIIESLVNGALPKDN 264
E IE +++ +P ++
Sbjct: 316 ECGIEEVIDAGIPMED 331
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 82/194 (42%), Gaps = 31/194 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+ CT N Y ++ Y+ LYF+ GPF F+ YT
Sbjct: 212 NATCTDKSVPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
LF VY + + + VK+VGWG+ NG PYW + +++ +G G + ILRG N
Sbjct: 260 LFAYKSGVYR-NVDGDFLGGTAVKVVGWGKLNGTPYWKVANSWDTDWGMDGYLLILRGNN 318
Query: 249 EAIIESLVNGALPK 262
E IE L P+
Sbjct: 319 ECNIEHLGFAGTPE 332
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 80/172 (46%), Gaps = 19/172 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y N + GP AF + Y+ L
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 261
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
+G V + E++ ++I+GWG ENG PYW + +++ +GD G K
Sbjct: 262 YKSGVYQHV--TGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 311
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 78/182 (42%), Gaps = 29/182 (15%)
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
WV GLV+GGA++S GC+P F PC + P PKC C + +
Sbjct: 174 WVDA-GLVSGGAYNSTEGCKPYPFKPCLY-------PFTDCHREESPKCKHHCQH-GVDK 224
Query: 151 GFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+ +DK + G Y P GP F +F VY
Sbjct: 225 RYARDK--VFGSVAYSVPRDERVIRYEIMTNGPVEGGF------DVYEDVFLYKSGVYR- 275
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
E V V+I+GWG E G PYW I +++GE +GD G KI+RG N IES V
Sbjct: 276 HVYGEHVGKHAVRIIGWGREGGIPYWLISNSYGEDWGDHGYFKIVRGINHLGIESKVITG 335
Query: 260 LP 261
LP
Sbjct: 336 LP 337
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 21/191 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + K GLVTGG +S GCQP FPPC T C + KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206
Query: 140 HTRCTNDNYGRGFFQDKYQI--NGLGLYFDPH------FGPFWPAFWRSFCTKYTRPLFQ 191
+C N + D+ + + L +D +GP +F
Sbjct: 207 QKKCFG-NTSISYRGDRRYVERSPYVLAYDNMQNDIMTYGPIESSF------DVYDDFIS 259
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY S +A + +VK +GWG E YW +++++ +GD G KI RG NE
Sbjct: 260 YKSGVYFKSPNATYLGGHSVKCIGWGVERNVSYWLMMNSWNSTWGDGGYFKIRRGTNECQ 319
Query: 252 IESLVNGALPK 262
+E +P+
Sbjct: 320 VEDSSTAGVPE 330
>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 280
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 81/208 (38%), Gaps = 27/208 (12%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
+C + A++ A+ + +C S E + A C +L + C G
Sbjct: 87 NCRSSYAISVASAVTDRICIHSN---ETKNPIMSAQQIISCCYLCG-----YGCDGGSQF 138
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PKCHTRCTN 145
+W + + G V+GG ++SN GCQP PPC N + C T + P C +C N
Sbjct: 139 ESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRHSCTTYNREETPACEIKCNN 198
Query: 146 DNYGRGFFQDKYQINGLGLY--------FDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
NY F D Y+ +Y FD GP F+ R L VY
Sbjct: 199 PNYYSSFKTDIYKGKYYQVYPFMAMKEIFDN--GPITTQFYM------YRDLIDYKSGVY 250
Query: 198 AVSAS--AEIVAYATVKIVGWGEENGRP 223
+ KI+GWGEENG P
Sbjct: 251 QYDEGFYGDFFTVQGXKIIGWGEENGDP 278
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 83/177 (46%), Gaps = 22/177 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTGG + S GCQP S P T + + T P C
Sbjct: 45 CDGGSPEAAWYFFMRHGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDT-----PDC 99
Query: 140 HTR-CTNDNYGRGFFQDKYQINGL--------GLYFDPH-FGPFWPAFWRSFCTKYTRPL 189
R CTN NY +G+ D + ++ + + D + GP AF+ YT +
Sbjct: 100 SIRTCTNSNYTKGYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAAFY-----VYTDFM 154
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+ +G VY+ + +I +KI+GWG ++ YW +++ +G+ G +ILRG
Sbjct: 155 YYKSG-VYSYT-RGQIEGGHAIKILGWGVDDNTKYWLCANSWSRSWGENGLFRILRG 209
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 84/187 (44%), Gaps = 31/187 (16%)
Query: 96 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 155
G VTGG + + GC+P SF PC++ + + P C Q KC + T NY +G D
Sbjct: 156 GAVTGGDYKGD-GCKPYSFAPCSNCVESKTTPSC------QSKCQSTYTVTNY-KG---D 204
Query: 156 KYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRV-YAVSASAEIVAYAT 210
K+ G + H + +R + P+ Q NG V A + + Y +
Sbjct: 205 KHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPIIQNEIYQNGPVEVAYTVYDDFYHYKS 264
Query: 211 ---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VKI+GWG E G YW + +++G FGDKG KI RG NE IES
Sbjct: 265 GVYHHVTGKDTGGHAVKIIGWGTEKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGIESN 324
Query: 256 VNGALPK 262
V + K
Sbjct: 325 VVAGMAK 331
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 71/158 (44%), Gaps = 24/158 (15%)
Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-----------NDNYGRGFFQDKY 157
CQP FP C H ++ C PKC+ CT N Y ++ Y
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPLVKYRGNATYLLLHGEEDY 239
Query: 158 QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG 217
+ LYF+ GPF F+ YT LF VY + +I+ V+IVGWG
Sbjct: 240 KRE---LYFN---GPFVAVFF-----VYTD-LFAYKSGVYR-NVDGDILGGQAVRIVGWG 286
Query: 218 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+ NG PYW + +T+ +G G + ILRG NE IE L
Sbjct: 287 KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 90/228 (39%), Gaps = 59/228 (25%)
Query: 80 CSSGISSSTWAWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
C GI+ + W+++ G+VTG G+ + GC P SFP C H + C +
Sbjct: 133 CQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQEDSKYEPCPEVR 192
Query: 134 TP--------------------QPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHFGP 172
P P C RC N+ YG +D+ + L F+
Sbjct: 193 VPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEG---- 248
Query: 173 FWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE----------------IVAYATVKIVGW 216
T + TNG A ++ E + +V+I+GW
Sbjct: 249 ----------TDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGW 298
Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
G E G YW +++++ E +GD GT KI +G + I+ V G+LP N
Sbjct: 299 GTEKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPAMN 344
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 89/192 (46%), Gaps = 29/192 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + RG+VTGG +GC+P F PC S PE KT P C
Sbjct: 151 CKGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTC 198
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C Y + +DK + ++ + + GP AF T Y ++
Sbjct: 199 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF-----TMY-EDMY 251
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+ VY +A + +A +KI+GWG +NG PYW I +++G +G+ G +K+ RG NE
Sbjct: 252 KYKSGVYRHTAGRLLGGHA-IKIIGWGTQNGIPYWLIANSWGANWGENGFLKMRRGVNEC 310
Query: 251 IIESLVNGALPK 262
IE V +P+
Sbjct: 311 GIERAVVAGMPR 322
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 79/171 (46%), Gaps = 7/171 (4%)
Query: 96 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 155
G+VTGG++ +GCQP P C++ + + +C P+C C D Y + + D
Sbjct: 111 GIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQCTNEC-QDGYNKTYDDD 168
Query: 156 KYQ----INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATV 211
K+ N G D + + T L +G VY + + + + T+
Sbjct: 169 KFYGERIYNVYGTQEDIQKEILMNGPVIASISVNTDFLVYKSG-VYLPTPRSRNLGWITL 227
Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+I+GWG E PYW +++ E++G G +KI RG IES V +PK
Sbjct: 228 RIIGWGYEGKIPYWLCANSWNEEWGANGYVKIQRGVQAGYIESYVRAPIPK 278
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 81/192 (42%), Gaps = 31/192 (16%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+ W++ G+VTG + S +GC+P +PPC H +C P C +C
Sbjct: 56 AAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKC--- 112
Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV-YAVSASAEI 205
QD Y I+ D H+G A + + + TNG V A +
Sbjct: 113 -------QDGYSIS---YNSDKHYGASVYAVAQDVAS--IQKEIMTNGPVEVAFDVYEDF 160
Query: 206 VAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
Y++ VK++GWG ENG YW +++ +G+ G +ILRG +E
Sbjct: 161 EHYSSGIYKHTTGDYLGGHAVKMLGWGTENGTDYWICANSWNSDWGENGFFRILRGVDEC 220
Query: 251 IIESLVNGALPK 262
IES V PK
Sbjct: 221 EIESGVVAGEPK 232
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 79/167 (47%), Gaps = 20/167 (11%)
Query: 91 WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 143
++ GLVTGG + ++ GC P FP CNH S+ P C + P C T C
Sbjct: 231 FMKNHGLVTGGEYKPPEELGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTC 289
Query: 144 TNDNYGRGFFQDKYQINGLGLYFDPHFGP-------FWPAFWRSFCTKYTRPLFQTNGRV 196
N YG +D ++ G GP F + T Y F +G V
Sbjct: 290 PNKAYGTSMQKDTHRAKSWGRL---PIGPEKIKQEIFDNGPVAAMMTLYEDFRFYKSG-V 345
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y V + +++A T+K++GWG E+G+ YW V+ + E++GD G IK+
Sbjct: 346 Y-VHKTGQMLAAHTLKLIGWGVESGQEYWLAVNAWNEEWGDHGMIKL 391
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 81/194 (41%), Gaps = 31/194 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+ CT N Y ++ Y+ LYF+ GPF F+ YT
Sbjct: 212 NATCTDKAIPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
LF VY + + VK+VGWG+ NG PYW + +++ +G G + ILRG N
Sbjct: 260 LFAYKSGVYR-HVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWDTDWGMGGYLLILRGNN 318
Query: 249 EAIIESLVNGALPK 262
E IE L P+
Sbjct: 319 ECNIEHLGFAGTPE 332
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 81/194 (41%), Gaps = 31/194 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+ CT N Y ++ Y+ LYF+ GPF F+ YT
Sbjct: 212 NATCTDKAIPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
LF VY + + VK+VGWG+ NG PYW + +++ +G G + ILRG N
Sbjct: 260 LFAYKSGVYR-HVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWDTDWGMGGYLLILRGNN 318
Query: 249 EAIIESLVNGALPK 262
E IE L P+
Sbjct: 319 ECNIEHLGFAGTPE 332
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 81/194 (41%), Gaps = 31/194 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+ CT N Y ++ Y+ LYF+ GPF F+ YT
Sbjct: 212 NATCTDKAIPLIKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFY-----VYTD- 259
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
LF VY + + VK+VGWG+ NG PYW + +++ +G G + ILRG N
Sbjct: 260 LFAYKSGVYR-HVDGDFLGGTAVKVVGWGKLNGTPYWKLANSWDTDWGMGGYLLILRGNN 318
Query: 249 EAIIESLVNGALPK 262
E IE L P+
Sbjct: 319 ECNIEHLGFAGTPE 332
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/187 (29%), Positives = 80/187 (42%), Gaps = 45/187 (24%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
C Y + QDK H+G Y+V
Sbjct: 208 SKIC-EPGYSPTYKQDK------------HYG----------------------YNSYSV 232
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
S S + + K NG PYW + +++ +GD G KILRG++ IES V
Sbjct: 233 SNSEKDIMAEIYK-------NGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 285
Query: 260 LPK-DNY 265
+P+ D Y
Sbjct: 286 IPRTDQY 292
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 76/185 (41%), Gaps = 32/185 (17%)
Query: 88 TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 147
W + G+VTGG + C+P FPPC EC A PKC C
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAK-TPKCQKTCQ--- 56
Query: 148 YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVA 207
RG+ + + D HFG A+ K + NG V A E A
Sbjct: 57 --RGYLKAYKE--------DKHFGK--SAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFA 104
Query: 208 Y----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+ VKI+GWG+E G PYW I +++ + +G+KG +++RG N
Sbjct: 105 HYKSGIYKHTAGRMTGGHAVKIIGWGKEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCR 164
Query: 252 IESLV 256
IE +V
Sbjct: 165 IEEMV 169
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 97/226 (42%), Gaps = 31/226 (13%)
Query: 33 AVATATPLAFAVCRSSKMHVE--CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
AV+ A ++ +C SK V+ + +A + C C+ G+ W
Sbjct: 125 AVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGR---------GCNGGMDHKAWE 175
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 149
+V + G+VTGG + C+P PC NH S P + TP C C YG
Sbjct: 176 YVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA--CKKYCQY-GYG 232
Query: 150 RGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
+ + +DK + + + + GP AF Y F T G +Y +
Sbjct: 233 KRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----ITYEDFSFYTKG-IYVHT 286
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+ A+A VK+VGWG ENG YW + +++ +G+ G +ILRG
Sbjct: 287 RGRQRGAHA-VKVVGWGVENGTKYWNVANSWSTDWGEDGYFRILRG 331
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 97/226 (42%), Gaps = 31/226 (13%)
Query: 33 AVATATPLAFAVCRSSKMHVE--CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
AV+ A ++ +C SK V+ + +A + C C+ G+ W
Sbjct: 125 AVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGR---------GCNGGMDHKAWE 175
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 149
+V + G+VTGG + C+P PC NH S P + TP C C YG
Sbjct: 176 YVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA--CKKYCQY-GYG 232
Query: 150 RGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
+ + +DK + + + + GP AF Y F T G +Y +
Sbjct: 233 KRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----ITYEDFSFYTKG-IYVHT 286
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+ A+A VK+VGWG ENG YW + +++ +G+ G +ILRG
Sbjct: 287 RGRQRGAHA-VKVVGWGVENGTKYWNVANSWSTDWGENGYFRILRG 331
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 77/169 (45%), Gaps = 21/169 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G S W+WVH +G+ TGG + + + GC P FPPC H T P+C ++
Sbjct: 47 CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKCPKVS 106
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
H + Y K I G P +F T Y L +
Sbjct: 107 CSGDDRHFMLESSPYHYSVNDAKNAIRTDG--------PVSASF-----TVYEDFLAYRS 153
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
G VY ++ + + +A VKI+GWGE++G+ YW V+++ E +GD G +
Sbjct: 154 G-VYKHTSGSYLGGHA-VKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFR 200
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 82/186 (44%), Gaps = 27/186 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W +G+VTGG +H GC+P PC + S PE KT P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C + Y + +DK Y + GP AF T Y +
Sbjct: 206 SLSCQS-GYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAF-----TVY-EDFY 258
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+ VY +A + +A +KI+GWG E+G PYW + +++G +G+ G KI RG ++
Sbjct: 259 KYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQC 317
Query: 251 IIESLV 256
IES V
Sbjct: 318 GIESAV 323
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 80/187 (42%), Gaps = 31/187 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------TSSQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210
Query: 140 HTRCT-----------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+ CT N Y ++ Y+ LYF+ GPF F+ YT
Sbjct: 211 NATCTDKSIPLVKYRGNATYLLLHGEEDYKRE---LYFN---GPFVAVFF-----VYTD- 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
LF VY + + + V+IVGWG+ NG PYW + +++ +G G + ILRG N
Sbjct: 259 LFAYKSGVYR-NVDGDFLGGQAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYMLILRGNN 317
Query: 249 EAIIESL 255
E IE L
Sbjct: 318 ECNIEHL 324
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 80/177 (45%), Gaps = 20/177 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C+ G+ W +V + G+VTGG + C+P PC NH S P + TP
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTP--A 222
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPL 189
C C YG+ + +DK + + + + GP AF Y
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----ITYEDFS 276
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
F T G +Y + + A+A VK+VGWG ENG YW + +++ +G+ G +ILRG
Sbjct: 277 FYTKG-IYVHTRGRQRGAHA-VKVVGWGVENGTKYWNVANSWSTDWGENGYFRILRG 331
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 81/182 (44%), Gaps = 23/182 (12%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC----- 143
W++ K+G+ +GG + SN GC P PP P+ +P C TRC
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD-------EPNCSTRCNAGYN 193
Query: 144 -TNDNYGRGFFQDKYQI--NGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
T D R F + Y I + + D GP F ++ + +G VY
Sbjct: 194 VTEDLRDRRFGRVAYSIPADERKIMEDIFVNGPVQAVF------QWYEDIVNYSGGVYR- 246
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
S + VK++GWG E+G YW + +++G +GD G K++RG N IE V+
Sbjct: 247 HQSGRLKGGHAVKLIGWGVEDGTKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAG 306
Query: 260 LP 261
LP
Sbjct: 307 LP 308
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/186 (28%), Positives = 80/186 (43%), Gaps = 14/186 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+SG+ + + ++G+ +GG + + C+P F PC + + C P P C
Sbjct: 164 CTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPYPFYPCGYHAHLPYYGPCPDGMWPTPTC 223
Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQT 192
C +D N R F + G F GP + T Y +
Sbjct: 224 EKACQSDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNNGPLVATY-----TVYEDFAYYK 278
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
NG +Y A+A VKI+GWGEENG YW I +++ +G+ G ++LRG N I
Sbjct: 279 NG-IYMTGLGRATGAHA-VKIIGWGEENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDI 336
Query: 253 ESLVNG 258
E G
Sbjct: 337 ELSATG 342
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 96/245 (39%), Gaps = 47/245 (19%)
Query: 33 AVATATPLAFAVCRSSK----MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSST 88
A ATA LA +C ++ + F G+K + + V
Sbjct: 116 AYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVR------------GDDV 163
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
W ++ GLV+GG +++N GCQP PP N T C RC +N
Sbjct: 164 WEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYGNNT 213
Query: 149 GRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
++ D +++ Y++ +GP F + F VY
Sbjct: 214 IH-YYHDHVKVSH---YYNIKSNEDIQKEVQTYGPVSVKF------RVYDDFFLYKSGVY 263
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
+ + V K++GWG ENG YW +V+++G ++G G KI RG NE +E V
Sbjct: 264 VKTEKSLYVRRHFAKLIGWGVENGVDYWLLVNSWGNEWGQNGLFKIKRGTNEVHVEDYVY 323
Query: 258 GALPK 262
P+
Sbjct: 324 AGEPE 328
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 78/183 (42%), Gaps = 13/183 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S + G+V+GG +SN GC+P + + C+ +
Sbjct: 151 CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPYTADAHDQGQTPACTKSCRNGYSTSYSA 210
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+++Y D+ Q + GP F + + + VY
Sbjct: 211 DKHYGSNDYVVSSVIDQIQYEVMTN------GPIIVNF------EVFQDFYNYVSGVYR- 257
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
S E V + VKIVGWG ENG PYW I +++G +GD G K+LRG+NE IE+
Sbjct: 258 HVSGESVGFHVVKIVGWGVENGVPYWLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAV 317
Query: 260 LPK 262
+P+
Sbjct: 318 MPR 320
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 43/188 (22%)
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
WV K G+V+GG ++SN GCQP Y S L + PKC T+C N Y
Sbjct: 163 WVAK-GIVSGGDYNSNEGCQP----------YEGSA----FLNSVTPKCSTKCLNSKYTT 207
Query: 151 GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYAT 210
+ +DK H+G + + + V + + +Y +
Sbjct: 208 PYAKDK------------HYGTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKS 255
Query: 211 ---------------VKIVGWGEENGRPYWTIVSTFGEQFGD-KGTIKILRGRNEAIIES 254
VKI+GWG E G PYW I +++G ++ D G KILRG+N IE+
Sbjct: 256 GVYQHVSGNSMGGHAVKIIGWGTEKGVPYWLIANSWGAKWADLDGFYKILRGKNHCKIET 315
Query: 255 LVNGALPK 262
+ G P+
Sbjct: 316 YIYGGTPQ 323
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 69/158 (43%), Gaps = 24/158 (15%)
Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-----------NDNYGRGFFQDKY 157
CQP FP C H ++ C PKC+ CT N Y ++ Y
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIPLVKYRGNATYLLLHGEEDY 239
Query: 158 QINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG 217
+ LYF+ GPF F+ YT LF VY + + VK+VGWG
Sbjct: 240 KRE---LYFN---GPFVAVFY-----VYTD-LFAYKSGVYR-HVDGDFLGGTAVKVVGWG 286
Query: 218 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+ NG PYW + +T+ +G G + ILRG NE IE L
Sbjct: 287 KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 90/188 (47%), Gaps = 20/188 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC-KTLATPQPK 138
C+ G T+ + G+ TGG + SN C+P PPC++ + T + P+C K+ + P
Sbjct: 359 CNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPCSNCSETRT-PKCSKSCISTYPL 417
Query: 139 CHTRCTNDNYGRGFFQ----DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
+ +YG ++Q +K + + LY GP + Y L G
Sbjct: 418 SLNE--DRHYGSTYYQFWLGEKSMMKDISLY-----GPIVAGM-----SVYEDFLHYKEG 465
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY + + +A V+I+GWGE++ PYW + +++ FG+ G KI RG +E IES
Sbjct: 466 -VYTQESGIFLGGHA-VRIIGWGEQDNIPYWLVANSWNTTFGEDGLFKIRRGFDECGIES 523
Query: 255 LVNGALPK 262
V+ K
Sbjct: 524 YVSAGRAK 531
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 86/199 (43%), Gaps = 30/199 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+ TGG + + C+P +F PC +Y +C + P PKC
Sbjct: 159 CGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYG----KCPKDSFPTPKC 214
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
C Y + + DKY N Y P GP +F Y
Sbjct: 215 RKICQY-KYSKKYADDKYYANSA--YRIPQNETWIKLEIMRNGPVTASF-----RIYPDF 266
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD-KGTIKI 243
F G VY S E+ +A +KI+GWG E NG PYW I +++G +G+ G +I
Sbjct: 267 GFYEKG-VYVTSGGRELGGHA-IKIIGWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRI 324
Query: 244 LRGRNEAIIESLVNGALPK 262
LRG+N IE V + K
Sbjct: 325 LRGQNHCQIEQKVIAGMIK 343
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 81/186 (43%), Gaps = 27/186 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W +G+VTGG +H GC+P PC + S PE KT P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + +DK Y + GP AF T Y +
Sbjct: 206 SLSC-QPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAF-----TVY-EDFY 258
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+ VY +A + +A +KI+GWG E+G PYW + +++G +G+ G KI RG ++
Sbjct: 259 KYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPYWLVANSWGTSWGESGFFKIFRGDDQC 317
Query: 251 IIESLV 256
IES V
Sbjct: 318 GIESAV 323
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 77/180 (42%), Gaps = 27/180 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G S W W+H G+VTGG + + + GC P PPC H +T P+C
Sbjct: 207 CHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPCAHYTNSTLYPKCPKTK 266
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF----------GPFWPAFWRSFCT 183
P C C N Y +D++ + L GP ++
Sbjct: 267 YDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALRSIDAIKKEIMTNGPVSASY-----L 321
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y L +G VY ++ + +A VKI+GWGE+ YW +V+++ + +GD G KI
Sbjct: 322 VYDDFLTYKSG-VYKRTSHNALGGHA-VKIIGWGED----YWLVVNSWNKNWGDNGMFKI 375
>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
Length = 193
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 85/200 (42%), Gaps = 24/200 (12%)
Query: 67 CAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTT 124
C L+S W C W GL TGG + GC+P + PC+ + N TT
Sbjct: 5 CVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTT 64
Query: 125 SEPECKTLATPQPKCHTRCTND-----------NYGRGFFQDKYQINGLGLYFDPHFGPF 173
S P C TP C RCT++ ++G+ + ++ + + GP
Sbjct: 65 SVP-CPGYHTPV--CEERCTSNITWPISYKQVKHFGKAHYNVGKKMTDIQTEIMRN-GPV 120
Query: 174 WPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGE 233
+F + +Y +A + T KI+GWG +NG PYW V +G
Sbjct: 121 IASF------IIYDDFWDYKSGIYVHTAGDQEGGMDT-KIIGWGVDNGVPYWLCVHQWGT 173
Query: 234 QFGDKGTIKILRGRNEAIIE 253
FG+ G ++ILRG NE IE
Sbjct: 174 DFGENGFMRILRGVNEVHIE 193
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 78/184 (42%), Gaps = 26/184 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+ +++ CQP FP C H +P C P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------TSSQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQC 211
Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFW-RSFCTKYTRPLFQ 191
+ CT+ + ++ Y++ G LYF+ GPF F S Y ++Q
Sbjct: 212 NATCTDKSVPLIKYRGNHSYEVRGEEDYKRELYFN---GPFVVRFQVHSDFLAYKSGVYQ 268
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+ + V+IVGWG+ NG PYW + +++ +G G ILRG NE
Sbjct: 269 --------HVAGNFLGGKAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECN 320
Query: 252 IESL 255
IE L
Sbjct: 321 IEHL 324
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 73/164 (44%), Gaps = 9/164 (5%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGG+ +GC+ FP C+H + P C P P+C
Sbjct: 155 CRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDH-HVQGHYPPCPRQIYPTPEC 213
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
C D G+ +DK + N + R + T Y Q R
Sbjct: 214 VQDC--DTPELGYLEDKTRANISYNIYASEISIMKEIMLRGPVEAVFTVY-EDFLQYKSR 270
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
VY + A + +A ++I+GWGEE PYW I +++ E +G+KG
Sbjct: 271 VYFHAWGAPMSGHA-IRILGWGEEGDVPYWLIANSWNEDWGEKG 313
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 80/193 (41%), Gaps = 21/193 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W G+ TGG + S C SFP C H P ++ TP+ C
Sbjct: 138 CDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGESQETPE--C 195
Query: 140 HTRCTND-----NYGRGFFQDKYQINGLGLYFDPHF---GPFWPAF--WRSFCTKYTRPL 189
+C + FF + Y + G GP +F + F T Y +
Sbjct: 196 VKQCQEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLT-YKSGI 254
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+Q + + + VK+VGWG E+G YW I +++ E +G+ G +I+ G+ E
Sbjct: 255 YQ--------HVAGKYLGGHAVKLVGWGVEDGIEYWKIANSWNEDWGENGYFRIVAGKGE 306
Query: 250 AIIESLVNGALPK 262
IE G +PK
Sbjct: 307 CGIEVGPIGGIPK 319
>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 228
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 84/184 (45%), Gaps = 24/184 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTL 132
C G ++ GLVTGG + ++ GC P FP CNH S+ P C +
Sbjct: 39 CMFGSVPEGLNFMKNHGLVTGGEYKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQV 98
Query: 133 ATPQPKCHTRCTNDNYGRGFFQDKYQINGLG-LYFDPH--------FGPFWPAFWRSFCT 183
P C T C N YG +D ++ G L P GP + T
Sbjct: 99 RD-LPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIFDNGPV-----AAMMT 152
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y + +G VY V + +++A T+K++GWG E+G+ YW ++ + E++GD G IK+
Sbjct: 153 LYEDFRYYKSG-VY-VHKTGQLLAAHTLKLIGWGVESGQEYWLAMNAWNEEWGDHGMIKL 210
Query: 244 LRGR 247
G+
Sbjct: 211 AVGK 214
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 68/155 (43%), Gaps = 18/155 (11%)
Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG----- 163
CQP FP C H ++ C P+C+T CT+ ++ K L
Sbjct: 180 CQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIPLIKYRGKDAYMLLPGEEEF 239
Query: 164 ---LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN 220
LYF+ GPF + YT LF VY + + VK+VGWG+ N
Sbjct: 240 KRELYFN---GPF-----VAILFVYTD-LFAYKSGVYR-NVDGSYMGVTAVKVVGWGKLN 289
Query: 221 GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
G PYW + +T+ +G G + ILRG NE IE L
Sbjct: 290 GTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 84/192 (43%), Gaps = 28/192 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +RG+ +GG ++S GC P C+ A+ P+C KC
Sbjct: 158 CQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHSADEDADTPKCTR------KC 211
Query: 140 HT--RCTNDNYGRGFFQDKYQINGLGLYFDPHF---GPFWPAF-----WRSFCTKYTRPL 189
+ TN + R F + Y ++ GP +F ++++ T R +
Sbjct: 212 QSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTGVYRHV 271
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
F + VK++GWG ENG YW +++GE +G++G KI+RG N
Sbjct: 272 F------------GPMEGGHAVKMIGWGVENGTKYWLCSNSWGEDWGERGFFKIVRGENH 319
Query: 250 AIIESLVNGALP 261
IES V+ LP
Sbjct: 320 CGIESDVHAGLP 331
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 69/179 (38%), Gaps = 18/179 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H + P C + PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
+T C ++ Y + G GP +
Sbjct: 219 NTTCERSEMDLVKYKGSTSYSVKGEKELMIELMTNGPL------ELTMQVYSDFVGYKSG 272
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY E + VK+VGWG ++G PYW + +++ +GDKG I RG NE IES
Sbjct: 273 VYK-HVLGEFLGGHAVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIES 330
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 86/181 (47%), Gaps = 29/181 (16%)
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 150
W + RG+VTGG +GC+P F PC S PE KT P C C Y
Sbjct: 162 WWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTCSLSC-QFGYST 208
Query: 151 GFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
+ +DK + ++ + + GP AF T Y +++ VY +A
Sbjct: 209 AYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAF-----TMY-EDMYKYKSGVYRHTA 262
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+ +A +KI+GWG +NG PYW I +++G +G+ G +K+ RG NE IE V +P
Sbjct: 263 GRLLGGHA-IKIIGWGTQNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMP 321
Query: 262 K 262
+
Sbjct: 322 R 322
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 68/155 (43%), Gaps = 18/155 (11%)
Query: 109 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG----- 163
CQP FP C H ++ C P+C+T CT+ ++ K L
Sbjct: 180 CQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIPLIKYRGKDAYMLLPGEEEF 239
Query: 164 ---LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN 220
LYF+ GPF + YT LF VY + + VK+VGWG+ N
Sbjct: 240 KRELYFN---GPF-----VAILFVYTD-LFAYKSGVYR-NVDGSYMGVTAVKVVGWGKLN 289
Query: 221 GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
G PYW + +T+ +G G + ILRG NE IE L
Sbjct: 290 GTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHL 324
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 79/190 (41%), Gaps = 48/190 (25%)
Query: 80 CSSGISSSTWAWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G ++ G+VTG G S GC P FP C HA Y++
Sbjct: 112 CQGGNLLEGLNFLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKHAGYSS--------- 162
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
P C T+CTN Y QD ++ G PA ++ + +F TN
Sbjct: 163 ---PACQTKCTNKAYKTSLQQDLHRAKSFGRL---------PAIPQNI----KQEIF-TN 205
Query: 194 G------------RVYA----VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGD 237
G RVY V + T+KI+GWG E+G+ YW V+++ E++GD
Sbjct: 206 GPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGD 265
Query: 238 KGTIKILRGR 247
G IK+ GR
Sbjct: 266 HGMIKLAVGR 275
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 70/179 (39%), Gaps = 18/179 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H + P C + PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
+T C + ++ Y + G GP +
Sbjct: 219 NTTCERNEMDLVKYKGSTSYSVKGEKELMIELMTNGPL------ELTMQVYSDFVGYKSG 272
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY + + VK+VGWG ++G PYW + +++ +GDKG I RG NE IES
Sbjct: 273 VYK-HVLGDFLGGHAVKLVGWGTQDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIES 330
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 95/245 (38%), Gaps = 47/245 (19%)
Query: 33 AVATATPLAFAVCRSSK----MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSST 88
A ATA LA +C ++ + F G+K + + V
Sbjct: 116 AYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVR------------GDDV 163
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
W ++ GLV+GG +++N GCQP PP N T C RC +N
Sbjct: 164 WEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYGNNT 213
Query: 149 GRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
++ D +++ Y++ +GP F + F VY
Sbjct: 214 IH-YYHDHVKVSH---YYNIKSNEDIQKEVQTYGPVSVKF------RVYDDFFLYKSGVY 263
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
+ + V K++GWG ENG YW +V+ +G ++G G KI RG NE +E V
Sbjct: 264 VKTEKSLYVRRHFAKLIGWGVENGVDYWLLVNFWGNEWGQNGLFKIKRGTNEVHVEDYVY 323
Query: 258 GALPK 262
P+
Sbjct: 324 AGEPE 328
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 79/185 (42%), Gaps = 14/185 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G+ W + ++G+ +GG ++S GC F C+ + P+C
Sbjct: 167 CQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTCHSPDEDDDAPKCSRKCQSSYSV 226
Query: 140 HTRCTNDNYGR---GFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
+ +GR D+++I ++ + GP AF F+T
Sbjct: 227 QDVSKDRRFGRVAYSVVADEHRIME-EIFVN---GPVQAAF-------QVYLDFKTYKSG 275
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+ + +KI+GWG ENG YW +++GE +GD G KI+RG N IE+ V
Sbjct: 276 VYRHVTGPLEGGHAIKILGWGVENGTKYWLCSNSWGEDWGDHGFFKIVRGENHLGIETDV 335
Query: 257 NGALP 261
+ LP
Sbjct: 336 HAGLP 340
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 83/190 (43%), Gaps = 31/190 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + + ++ G+ TGG + S +GC+P S P P + A P C
Sbjct: 163 CNGGFPLLAFKYWNEIGVPTGGPYGSKSGCKPFSIAP----------PTSSSTAAQTPLC 212
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH------------FGPFWPAF--WRSFCTKY 185
+C +D Y R +D+Y L + GP A + SF Y
Sbjct: 213 QLKCISD-YKRKLDKDRYYGESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLY-Y 270
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
++ N R S + VK++GWGE+ PYW +V+++ FG++G KI R
Sbjct: 271 KSGVYSANKRNDDPS-----LGLHAVKLIGWGEQKRIPYWLVVNSWNTTFGEQGLFKIRR 325
Query: 246 GRNEAIIESL 255
G NE IE+L
Sbjct: 326 GTNECGIENL 335
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 73/162 (45%), Gaps = 24/162 (14%)
Query: 105 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-----------NDNYGRGFF 153
+++GCQP FP C H ++ C PKC+ CT N Y
Sbjct: 176 ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPLVKYRGNATYLLLHG 235
Query: 154 QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKI 213
++ Y+ LYF+ GPF F+ YT LF VY + + + V+I
Sbjct: 236 EEDYKRE---LYFN---GPFVAVFF-----VYTD-LFAYKSGVYR-NVDGDFLGGQAVRI 282
Query: 214 VGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VGWG+ NG PYW + +++ +G G + IL G NE IE L
Sbjct: 283 VGWGKLNGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHL 324
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/186 (29%), Positives = 82/186 (44%), Gaps = 24/186 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNT------GCQPVSFPPCNHA-NYTTSEPECKTL 132
C+ G +++ G+VTG GC P F CNH T P+CK +
Sbjct: 108 CNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDV 167
Query: 133 AT-PQPKCHTRCTNDNYGRGFFQDKYQ-------INGLGLYFDPHF--GPFWPAFWRSFC 182
P P C T CTN Y + +D ++ +N F GP + AF
Sbjct: 168 VQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVFSAF----- 222
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
Y + +G VY V + E+ +KI+GWG ++ R YW ++ + E++GD G IK
Sbjct: 223 EMYKDFRYYKSG-VY-VPTTKEVDCLHVIKIIGWGADSVREYWLAMNAWNEEWGDHGLIK 280
Query: 243 ILRGRN 248
+ G+N
Sbjct: 281 MAFGKN 286
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 78/194 (40%), Gaps = 48/194 (24%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H ++ P C PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---FQTNGRV 196
+T C N + + G+ S+ K R L NG +
Sbjct: 219 NTTCDN------VEMELVKYKGV----------------SSYSIKGERELMVELMNNGPL 256
Query: 197 -YAVSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
A+ A+ VAY + VK+VGWG ++G PYW I +++ +GDKG
Sbjct: 257 EVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWNTDWGDKGY 316
Query: 241 IKILRGRNEAIIES 254
I RG +E IES
Sbjct: 317 FLIQRGNDECGIES 330
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 78/194 (40%), Gaps = 48/194 (24%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H ++ P C PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL---FQTNGRV 196
+T C N + + G+ S+ K R L NG +
Sbjct: 219 NTTCDN------VEMELVKYKGV----------------SSYSIKGERELDHELMNNGPL 256
Query: 197 -YAVSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
A+ A+ VAY + VK+VGWG ++G PYW I +++ +GDKG
Sbjct: 257 EVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGVKDGIPYWKIANSWNTDWGDKGY 316
Query: 241 IKILRGRNEAIIES 254
I RG +E IES
Sbjct: 317 FLIQRGNDECGIES 330
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 69/179 (38%), Gaps = 18/179 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H + P C PKC
Sbjct: 171 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 223
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
+T C ++ Y + G GP +
Sbjct: 224 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL------EVTMQVYSDFVGYKSG 277
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY S +++ VK+VGWG + G PYW I +++ +GDKG I RG NE IES
Sbjct: 278 VYK-HVSGDLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 335
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 75/163 (46%), Gaps = 23/163 (14%)
Query: 108 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFD 167
GC P FPPC H T P+C P P C +C N Y D++ + Y
Sbjct: 2 GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPY-- 59
Query: 168 PHF------------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVG 215
H+ GP +F T Y L +G VY ++ + + +A VKI+G
Sbjct: 60 -HYSVNDAKNAIRTDGPVSASF-----TVYEDFLAYRSG-VYKHTSGSYLGGHA-VKIIG 111
Query: 216 WGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
WGE++G+ YW V+++ E +GD G KI G N I + L+ G
Sbjct: 112 WGEKSGQAYWLAVNSWNEDWGDHGLFKIALG-NCGIDDDLLGG 153
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 69/179 (38%), Gaps = 18/179 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H + P C PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
+T C ++ Y + G GP +
Sbjct: 219 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL------EVTMQVYSDFVGYKSG 272
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY S +++ VK+VGWG + G PYW I +++ +GDKG I RG NE IES
Sbjct: 273 VYK-HVSGDLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 330
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 82/202 (40%), Gaps = 45/202 (22%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + GLV+GG ++++ GCQP S N P+C
Sbjct: 143 CKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYSKSNFNDG--------------VSPEC 188
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA- 198
C N Y + D+ HFG ++ T L + G V A
Sbjct: 189 SKTCQNTKYPTSYLNDR------------HFGDGTYYILKNVTTIQQEILLR-GGPVMAG 235
Query: 199 ---------------VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI-K 242
V S ++ VKI+GWG ENG YW + +++G+ +G G + K
Sbjct: 236 FDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGTENGWAYWLVANSWGKDWGALGGVFK 295
Query: 243 ILRGRNEAIIE-SLVNGALPKD 263
I RG NE IE S++ G + KD
Sbjct: 296 IRRGTNECKIEQSIITGHVRKD 317
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 69/179 (38%), Gaps = 18/179 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H + P C PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
+T C ++ Y + G GP +
Sbjct: 219 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL------EVTMQVYSDFVGYKSG 272
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY S +++ VK+VGWG + G PYW I +++ +GDKG I RG NE IES
Sbjct: 273 VYK-HVSGDLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 330
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 83/179 (46%), Gaps = 15/179 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
C G S W G+VTGG + + GC+P F CN A + PEC + Q K
Sbjct: 157 CDGGYSIQALRWWVFDGVVTGGDYQGD-GCKPYQF--CNSAGCPDAVTPECAL--SCQSK 211
Query: 139 CHTRCTND-NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
+T D N+G + +N + + GP +F K ++ VY
Sbjct: 212 YNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTN-GPVEASF------KVYEDFYKYKSGVY 264
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
A + +A +KI+GWG ENG YW I +++G ++G+ G KI RG NE IE+ V
Sbjct: 265 KYIAGKMLGGHA-IKIIGWGTENGTAYWLIANSWGTKWGENGFFKIRRGVNECGIENNV 322
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 80/193 (41%), Gaps = 14/193 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + ++G+ +GG ++S GC P C+ + P+C
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNV 215
Query: 140 HTRCTNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
+ YGR + D+ +I +Y + GP AF + L V
Sbjct: 216 TDVWQDRRYGRVAYSIPNDEQKIMEE-IYIN---GPVQAAF------MTYQDLHAYKSGV 265
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
Y + VK++GWG ENG YW + +++G+ +GD G KI+RG N IE V
Sbjct: 266 YR-HVWGHMAGGHAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDV 324
Query: 257 NGALPKDNYGVEF 269
+ LP N E
Sbjct: 325 HAGLPSFNKHKEL 337
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 96/247 (38%), Gaps = 41/247 (16%)
Query: 23 PYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
P +C AV++A ++ +C +SK + V C W C
Sbjct: 111 PDQANCGSCWAVSSAAAMSDRICIASKGAKQV--LISAQDVVSCCTWCGDG------CEG 162
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G S + + G+VTGG +++ C+P PC H T EC +A P+C R
Sbjct: 163 GWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGMAD-TPRCKRR 221
Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSAS 202
C Y + + D+Y + A+ K + NG V A
Sbjct: 222 CLL-GYPKSYPSDRY---------------YKKAYQLKNSVKAIQKDIMKNGPVVATYTV 265
Query: 203 AEIVAY----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
E A+ VK++GWGEE G PYW + +++ + +G+ G ++ RG
Sbjct: 266 YEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRG 325
Query: 247 RNEAIIE 253
N+ E
Sbjct: 326 SNDCGFE 332
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 80/193 (41%), Gaps = 14/193 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + ++G+ +GG ++S GC P C+ + P+C
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNV 215
Query: 140 HTRCTNDNYGRGFF---QDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
+ YGR + D+ +I +Y + GP AF + L V
Sbjct: 216 TDVWQDRRYGRVAYSIPNDEQKIMEE-IYIN---GPVQAAF------MTYQDLHAYKSGV 265
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
Y + VK++GWG ENG YW + +++G+ +GD G KI+RG N IE V
Sbjct: 266 YR-HVWGHMAGGHAVKLMGWGVENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDV 324
Query: 257 NGALPKDNYGVEF 269
+ LP N E
Sbjct: 325 HAGLPSFNKHKEL 337
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 76/169 (44%), Gaps = 14/169 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
C G + W + +G+VTGG + SN GCQP PC+H +S C +L Q
Sbjct: 98 CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 156
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
C +C N NY + D Y+ + + + + P +F Y +
Sbjct: 157 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMGY 214
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKG 239
G VY S + E++ Y VK++GWG +E G YW ++++ +G G
Sbjct: 215 KEG-VYK-STAGELIGYHHVKLIGWGVDEAGIEYWLAMNSWNSNWGTNG 261
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 76/185 (41%), Gaps = 7/185 (3%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C G + + + + GL TGG + CQP +F PC NHA+ P C P P
Sbjct: 164 CKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGP-CPDELWPTPT 222
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCT---KYTRPLFQTNGR 195
C C Y F +DK + F + R K R
Sbjct: 223 CRRTCQL-GYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRDFDYYKKG 281
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + E+ VKI+GWG+ N PYW + +++ +GD G +I+RG + IE
Sbjct: 282 VY-IHREGEVTGLHAVKIIGWGKGNDVPYWLVANSWNTDWGDNGYFRIVRGTDNCEIERQ 340
Query: 256 VNGAL 260
+ G +
Sbjct: 341 MVGGI 345
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 72/179 (40%), Gaps = 18/179 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI + W W G+ T CQP F PC+H + P C PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218
Query: 140 HTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
+T C ++ Y + G GP Y+ + +G
Sbjct: 219 NTTCEKSEMDLVKYKGGTSYSVKGEKELMIELMTNGPL-----EVTMQVYSDFVGYKSGG 273
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VS +++ VK+VGWG + G PYW I +++ +GDKG I RG NE IES
Sbjct: 274 YKHVSG--DLLGGHAVKLVGWGTQGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIES 330
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 77/197 (39%), Gaps = 26/197 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+ TGG + C+P +F PC H EC P P+C
Sbjct: 165 CRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPCGHHRNEIYYGECPKEIFPTPQC 224
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAF--WRSFCTKYT 186
C Y + DK I G Y P+ GP AF + F +
Sbjct: 225 TQSC-QAGYASDYEDDK--IYGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRS 281
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILR 245
T GR A VK++GWG +++G YW +++ +G+ G +I+R
Sbjct: 282 GIYVHTAGRREGGHA---------VKLIGWGVDDDGNKYWLAANSWNSDWGENGYFRIVR 332
Query: 246 GRNEAIIESLVNGALPK 262
G + IES V +P
Sbjct: 333 GVDHCGIESAVVAGMPD 349
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 19/187 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
C+ G WA+ GLV+ CQP FP C+H + + + P C PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214
Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
C+ C + ++ Y + G Y F GPF AF N
Sbjct: 215 CNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 268
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY S + + V++VGWG NG PYW I +++ ++G G I RG +E IE
Sbjct: 269 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 327
Query: 255 LVNGALP 261
+ +P
Sbjct: 328 GGSAGIP 334
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 78/183 (42%), Gaps = 24/183 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+ +++ CQP FP C H + C P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------ASSQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQC 211
Query: 140 HTRCTNDNYGRGFFQ--DKYQING-----LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
+ CT+ ++ Y++ G LYF+ GPF F ++ L
Sbjct: 212 NATCTDKTIPLIKYRGNHSYEVRGEEDYKRELYFN---GPFVVRF-----QVHSDFLAYK 263
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
NG V+ + + V+IVGWG+ NG PYW + +++ +G G ILRG NE I
Sbjct: 264 NGVYQHVAGN--FLGGKAVRIVGWGKLNGTPYWKVANSWDTDWGMNGYFLILRGDNECNI 321
Query: 253 ESL 255
E L
Sbjct: 322 EHL 324
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 19/187 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
C+ G WA+ GLV+ CQP FP C+H + + + P C PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214
Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
C+ C + ++ Y + G Y F GPF AF N
Sbjct: 215 CNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 268
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY S + + V++VGWG NG PYW I +++ ++G G I RG +E IE
Sbjct: 269 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 327
Query: 255 LVNGALP 261
+ +P
Sbjct: 328 GGSAGIP 334
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 19/187 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
C+ G WA+ GLV+ CQP FP C+H + + + P C PK
Sbjct: 139 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 191
Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
C+ C + ++ Y + G Y F GPF AF N
Sbjct: 192 CNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 245
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY S + + V++VGWG NG PYW I +++ ++G G I RG +E IE
Sbjct: 246 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 304
Query: 255 LVNGALP 261
+ +P
Sbjct: 305 GGSAGIP 311
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 93/233 (39%), Gaps = 45/233 (19%)
Query: 33 AVATATPLAFAVCRSSKMHVE--CTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
AV+ A ++ +C SK V+ + +A + C C+ G+ W
Sbjct: 125 AVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGR---------GCNGGMDHKAWE 175
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 149
+V + G+VTGG + C+P PC NH S P + TP C C YG
Sbjct: 176 YVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA--CKKYCQY-GYG 232
Query: 150 RGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAY- 208
+ + +DK + + + + K + NG V A S + E ++
Sbjct: 233 KRYEKDKSYVKSVYILDEDE--------------KAIQREMMKNGPVQAASITYEDFSFY 278
Query: 209 ---------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
VK+VGWG ENG YW + +++ +G+ G +ILRG
Sbjct: 279 RRGIYVHTRGRQRGAHAVKVVGWGVENGTKYWNVANSWSTDWGEDGYFRILRG 331
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 79/203 (38%), Gaps = 36/203 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
CS G W W G+VTGG + H+ C P P C H + P+C+
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368
Query: 137 PKCHTRCTNDNYGRGF--FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
PKC C Y F+D D HF A+ + R L +
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKD-----------DLHFAT--SAYSVEGRDQIKRELMENGT 415
Query: 195 RVYAVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
A + + Y VK++G+G E+GR YW V+++ E +GDKG
Sbjct: 416 LTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKG 475
Query: 240 TIKILRGRNEAIIESLVNGALPK 262
T KI G EA I+ G PK
Sbjct: 476 TFKIEMG--EAGIDKEFCGGEPK 496
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 85/197 (43%), Gaps = 33/197 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + GLV+GG ++++TGCQP S N+ T P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS--ELNYYRIT-------------PPC 186
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF------------GPFWPAFWRSFCTKYTR 187
+T C ND Y + DK+ G +Y+ P GP AF K R
Sbjct: 187 NTTCQNDKYPIPYVSDKHF--GDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYR 244
Query: 188 PLFQTNGRVYAVS--ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKIL 244
Q + + V S + VKI+GWG ENG YW +++G+ +G G KI
Sbjct: 245 DGEQHDTILEGVYIYTSGALFGRTAVKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIR 304
Query: 245 RGRNE-AIIESLVNGAL 260
RG NE ES++ G +
Sbjct: 305 RGTNECGFEESIIAGQV 321
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 79/203 (38%), Gaps = 36/203 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
CS G W W G+VTGG + H+ C P P C H + P+C+
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368
Query: 137 PKCHTRCTNDNYGRGF--FQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
PKC C Y F+D D HF A+ + R L +
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKD-----------DLHFAT--SAYSVEGRDQIKRELMENGT 415
Query: 195 RVYAVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
A + + Y VK++G+G E+GR YW V+++ E +GDKG
Sbjct: 416 LTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKG 475
Query: 240 TIKILRGRNEAIIESLVNGALPK 262
T KI G EA I+ G PK
Sbjct: 476 TFKIEMG--EAGIDKEFCGGEPK 496
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 76/187 (40%), Gaps = 19/187 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 138
C+ G WA+ GLV+ CQP FP C+H + + + P C PK
Sbjct: 140 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 192
Query: 139 CHTRCTNDNYGRGFFQD--KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNG 194
C C + ++ Y + G Y F GPF AF N
Sbjct: 193 CDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAF------DVYEDFIAYNS 246
Query: 195 RVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY S + + V++VGWG NG PYW I +++ ++G G I RG +E IE
Sbjct: 247 GVYH-HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIED 305
Query: 255 LVNGALP 261
+ +P
Sbjct: 306 GGSAGIP 312
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 64/237 (27%), Positives = 94/237 (39%), Gaps = 38/237 (16%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC AVA A+ ++ C + R AG C + + C+ G
Sbjct: 116 SCGSCWAVAAASAMSDRYCTLGGVR----DLRISAGDLMSCCDVCG-----YGCNGGFPE 166
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
W + GLV+ CQP FP C H ++ C PKC++ CT
Sbjct: 167 VAWVFYVVHGLVS-------EYCQPYPFPSCAHHVNSSDLAPCSG-DYKTPKCNSTCTEK 218
Query: 147 NYGRGFFQ--DKYQINGLGLYFDPHF-------GPFWPAFWRSFCTKYTRPLFQTNGRVY 197
++ Y ++G + HF GPF AF + G VY
Sbjct: 219 KIPLIRYRGNHSYVLSG-----EEHFKRELLLNGPFEVAF------EVYADFMAYTGGVY 267
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+ +++ V++VGWGE NG PYW I +++ ++G G I RG NE IES
Sbjct: 268 K-HVAGDLLGGHAVRLVGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIES 323
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 84/179 (46%), Gaps = 24/179 (13%)
Query: 95 RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC---------TN 145
+G+V+GG + SN GC P PC H T P CK P C +C +
Sbjct: 15 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKCEEGYKVPYAQD 72
Query: 146 DNYGRGFFQDKYQINGL--GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASA 203
++G+ + + ++ + +Y + GP AF T Y + G VY A
Sbjct: 73 LHHGKSAYSIRNDVDQIRQEIYTN---GPVEGAF-----TVYEDFIAYRAG-VYKHVAGK 123
Query: 204 EIVAYATVKIVGWGEENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+ +A ++I+GWG +NG PYW + +++ +G G KILRG +E IE +N LP
Sbjct: 124 ALGGHA-IRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 86/197 (43%), Gaps = 42/197 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + GLV+GG ++++TGCQP S N+ T P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS--ELNYYRIT-------------PPC 186
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF------------GPFWPAF--WRSFCTKY 185
+T C ND Y + DK+ G +Y+ P GP AF + F
Sbjct: 187 NTTCQNDKYPIPYVSDKHF--GDSIYYIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYR 244
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKIL 244
T+G ++ +A VKI+GWG ENG YW +++G+ +G G KI
Sbjct: 245 DGVYIYTSGALFGRTA---------VKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIR 295
Query: 245 RGRNE-AIIESLVNGAL 260
RG NE ES++ G +
Sbjct: 296 RGTNECGFEESIIAGQV 312
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 70/159 (44%), Gaps = 19/159 (11%)
Query: 105 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ--DKYQING- 161
+++ CQP FP C H +P C P C+ CT+ + ++ Y++ G
Sbjct: 177 TSSQCQPYPFPRCEHRGAQGKKPPCSKYNFDTPTCNATCTDKSVPLIKYRGNHSYEVRGE 236
Query: 162 ----LGLYFDPHFGPFWPAFW-RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGW 216
LYF+ GPF F S Y ++Q + + V+IVGW
Sbjct: 237 EDYKRELYFN---GPFVVRFQVHSDFLAYKSGVYQ--------HVAGNFLGGKAVRIVGW 285
Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
G+ NG PYW + +++ +G G ILRG NE IE L
Sbjct: 286 GKMNGTPYWKVANSWDTDWGMNGYFLILRGNNECNIEHL 324
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 68/159 (42%), Gaps = 25/159 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI + +W + + G+VTGG + TGC P FP C+H T P C P PKC
Sbjct: 132 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 191
Query: 140 HTRCTNDNYGRGFFQDKYQ-------------INGLGLYFDPHFGPFWPAFWRSFCTKYT 186
+C + Y + + QDK + I + P G F+ + F +
Sbjct: 192 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFY--MFEDFLVYKS 248
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYW 225
T GR +V ++++GWG ENG YW
Sbjct: 249 GIYHYTTGR---------LVGGHAIRVIGWGVENGVNYW 278
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 80/177 (45%), Gaps = 21/177 (11%)
Query: 106 NTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLY 165
++GC P +FP C+H T CK +P P C T C N ++ F D++ G
Sbjct: 219 DSGCWPYNFPECSHHVDTKGMEPCKG-NSPSPVCSTTCRNHHFKPSFESDRHFTEDEGYS 277
Query: 166 FDP---------HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGW 216
D GP AF T Y + +G VY +E+ +A VKI+GW
Sbjct: 278 LDEVDEIKREIIDNGPVAAAF-----TVYEDFPYYKSG-VYKHVNGSELGGHA-VKIIGW 330
Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK--DNYGVEFGE 271
G + YW +++++ +GD+G KI G E I+S V +PK GVE E
Sbjct: 331 GIDQNEQYWLVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIPKYEKTSGVEQSE 385
>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 234
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 71/149 (47%), Gaps = 18/149 (12%)
Query: 105 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
++ GC P FP CNH S+ P C + P C T C N YG +D ++ G
Sbjct: 38 NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 96
Query: 164 -LYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIV 214
L P GP + T Y F +G VY V + +++A T+K++
Sbjct: 97 RLPIGPEKIKQEIFDNGPV-----AAMMTLYEDFRFYKSG-VY-VHKTGQMLAAHTLKLI 149
Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
GWG E+G+ YW V+ + E++GD G IK+
Sbjct: 150 GWGVESGQEYWLAVNAWNEEWGDHGMIKL 178
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/240 (27%), Positives = 101/240 (42%), Gaps = 39/240 (16%)
Query: 33 AVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS----ST 88
AV+TA+ L+ +C +SK G KQ C G
Sbjct: 121 AVSTASALSDRICIASK------------GAKQVYVSATDILSCCHSCGDGCDGGYVIDA 168
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
+ + ++G VTGG + + C+P F PC H T EC + P+C +C + Y
Sbjct: 169 FKFFAEQGAVTGGDYGAKDCCRPYPFHPCGHHGNETYYGECPEDGS-TPECVRKC-QEGY 226
Query: 149 GRGFFQDKYQINGLGLYFDP------------HFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
+ +D+ + G Y P GP AF + F G +
Sbjct: 227 ETEYHEDR--VRGEDAYRLPIGSVKAIQKEIMRNGPVVAAF-----IVFDDFSFYRKG-I 278
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
YA A + +A VKI+GWG E+G PYW I +++ +G+ G +++RG N+ IE+ V
Sbjct: 279 YAHVAGSPRGGHA-VKIIGWGTEHGVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNV 337
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 86/200 (43%), Gaps = 32/200 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI SS +++ G+V GG + +GC PC H ++ P C PKC
Sbjct: 57 CNGGIPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPD-EVRAPKC 115
Query: 140 HTRCTNDNYGRGFFQD--KYQINGLGLYFDPHFGPFWPAFWRSFCT-KYTRPLFQTNGRV 196
+C +++ +D K ++ G Y G C K ++Q
Sbjct: 116 ARKCESED------KDWTKAKVKGEKGYSVCQQGEL-----EGTCAIKMAADIYQNGPIT 164
Query: 197 YAVSASAEIVAYAT----------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
+ +AY + +KI+G+G E+G+ YW + +++ E +GD G
Sbjct: 165 GMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGKDYWLVANSWNEDWGDDGY 224
Query: 241 IKILRGRNEAIIES-LVNGA 259
KI+RG+N IE ++NG
Sbjct: 225 FKIIRGKNACQIEDPVINGG 244
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 84/200 (42%), Gaps = 45/200 (22%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + + RG+ TGG + S GC+P S + SE E +T P C
Sbjct: 153 CDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIG-------SNSEDEAET-----PLC 200
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFG--PFWPAFWRSFCTKYTRPLFQTNGRVY 197
+C N+ Y QD+ HFG P+W S + + L++ V
Sbjct: 201 TRQCINE-YPYNLSQDR------------HFGEKPYWV---NSNEEQIMQELYKNGPVVV 244
Query: 198 AVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
A + + + Y VK++GWG EN + YW I +++ +G+ G K
Sbjct: 245 AFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGIENSKKYWLISNSWNTTWGENGFFK 304
Query: 243 ILRGRNEAIIESLVNGALPK 262
I+RG+N IES V + +
Sbjct: 305 IIRGKNCCAIESYVVAGMAR 324
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 79/189 (41%), Gaps = 23/189 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G ++ W W G+VTGGA+ C+P FP C A+ + C + P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
C YG+ + DK I Y+ P+ GP F
Sbjct: 225 KPYCQY-GYGKRYENDK--IKAKTWYWLPNDERTIQLEIMKKGPVHATF------NIYED 275
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG-DKGTIKILRGR 247
NG VY +A A + ++KI+GWG + G YW I +++ +G D G +++RG
Sbjct: 276 FEHYNGGVYIHTAGA-MEGGHSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGI 334
Query: 248 NEAIIESLV 256
N IE V
Sbjct: 335 NNCDIEGGV 343
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 81/182 (44%), Gaps = 18/182 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + GL T + P FPPC H T C + P PKC
Sbjct: 85 CNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHINKTHYKPCGP-SQPTPKC 136
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
++ R + Y ++ + + GP AF T Y L +G VY
Sbjct: 137 VR--ASEKKPRYHGKSVYSVSPAKIQAEIMTNGPVEAAF-----TVYQDFLAYQSG-VYR 188
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
+ E+ +A +KI+GWG E G YW + +++ E +GDKGT KI RG +E IES V
Sbjct: 189 HVSGPELGGHA-IKIMGWGVEAGNKYWLVANSWNEDWGDKGTFKIARGDDECGIESSVVA 247
Query: 259 AL 260
+
Sbjct: 248 GM 249
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 87/210 (41%), Gaps = 26/210 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
C+ G W W ++G+VTGG A T C P P C H + P+C P+
Sbjct: 350 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 408
Query: 137 --PKCHTRCTNDNYGRG---FFQDKYQI---------NGLGLYFDPHFGPFWPAFWRSFC 182
PKC C Y F QD ++ + + H GP AF
Sbjct: 409 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTH-GPVSGAF----- 462
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
Y L +G VY + + +A +KI+GWG ENG YW V+++ +GD G K
Sbjct: 463 MVYEDFLSYKSG-VYKHVSGLPVGGHA-IKIIGWGTENGEEYWHAVNSWNTYWGDGGQFK 520
Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
I G+ E + A ++ GV GEE
Sbjct: 521 IAMGQCGIDGEMVAGEAAWQETEGVVNGEE 550
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 75/185 (40%), Gaps = 13/185 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL--ATPQP 137
C G + + G+VTGG + GC P SF PC+ P CKT A+ +
Sbjct: 155 CQGGYTIEAMKYWMNSGVVTGGDYQG-AGCIPYSFRPCSTCKEPKDAPSCKTTCQASYKA 213
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
K R + Q+ +Y + GP A+ + + VY
Sbjct: 214 KSAYRLPTTTSSNAIVANAVQMIQTEIYNN---GPVEVAY------QVYDDFYHYKSGVY 264
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
+ +A VKI+GWG E YW + +++ FG+ G KI RG NE IE V
Sbjct: 265 YHVYGDKPSGHA-VKIIGWGTEKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIEENVV 323
Query: 258 GALPK 262
LPK
Sbjct: 324 AGLPK 328
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 87/210 (41%), Gaps = 26/210 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
C+ G W W ++G+VTGG A T C P P C H + P+C P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405
Query: 137 --PKCHTRCTNDNYGRG---FFQDKYQI---------NGLGLYFDPHFGPFWPAFWRSFC 182
PKC C Y F QD ++ + + H GP AF
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTH-GPVSGAF----- 459
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
Y L +G VY + + +A +KI+GWG ENG YW V+++ +GD G K
Sbjct: 460 MVYEDFLSYKSG-VYKHVSGLPVGGHA-IKIIGWGTENGEEYWHAVNSWNTYWGDGGQFK 517
Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
I G+ E + A ++ GV GEE
Sbjct: 518 IAMGQCGIDGEMVAGEAAWQETEGVVNGEE 547
>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 422
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 71/153 (46%), Gaps = 16/153 (10%)
Query: 105 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
++ GC P FP CNH S+ P C + P C T C N YG +D ++ G
Sbjct: 262 NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 320
Query: 164 -LYFDPH--------FGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIV 214
L P GP + T Y F VY V + +++A T+K++
Sbjct: 321 RLPIGPEKIKQEIFDNGPL--RXXAAMMTLYED--FDLQVCVY-VHKTGQMLAAHTLKLI 375
Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
GWG E+G+ YW V+ + E++GD G IK+ G+
Sbjct: 376 GWGVESGQEYWLAVNAWNEEWGDHGMIKLAVGK 408
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 87/210 (41%), Gaps = 26/210 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
C+ G W W ++G+VTGG A T C P P C H + P+C P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405
Query: 137 --PKCHTRCTNDNYGRG---FFQDKYQI---------NGLGLYFDPHFGPFWPAFWRSFC 182
PKC C Y F QD ++ + + H GP AF
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSRDDVKRDMMTH-GPVSGAF----- 459
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
Y L +G VY + + +A +KI+GWG ENG YW V+++ +GD G K
Sbjct: 460 MVYEDFLSYKSG-VYKHVSGLPVGGHA-IKIIGWGTENGEEYWHAVNSWNTYWGDGGQFK 517
Query: 243 ILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
I G+ E + A ++ GV GEE
Sbjct: 518 IAMGQCGIDGEMVAGEAAWQETEGVVNGEE 547
>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 156
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 69/153 (45%), Gaps = 31/153 (20%)
Query: 112 VSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF 170
+ F NHA+ S+ P+C + A QP C T C N++Y QD ++ G
Sbjct: 5 IQFIXXNHASSAASQYPKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRL----- 59
Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE----------------IVAYATVKIV 214
P P K + +F NG V V + E +V ++KI+
Sbjct: 60 -PTSP-------QKIKQEIFD-NGTVLGVISMYEDFRLYKSGVYVHTTGGLVGVHSLKII 110
Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
GWG E+G+ YW V+++ E++GD G IK+ G
Sbjct: 111 GWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGE 143
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 75/161 (46%), Gaps = 19/161 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 44 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 101
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
C Y + QDK Y ++ GP AF + Y+ L
Sbjct: 102 SKIC-EPGYSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAF-----SVYSDFLL 155
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
+G V + E++ ++I+GWG ENG PYW + +++
Sbjct: 156 YKSGVYQHV--TGEMMGGHAIRILGWGVENGTPYWLVANSW 194
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 78/192 (40%), Gaps = 22/192 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W+ + G+VTGG + C+P SF PC C P PKC
Sbjct: 158 CQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVPYYGPCPGGLWPTPKC 217
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH-----------FGPFWPAFWRSFCTKYTRP 188
+ + Y + + +DK+ Y P+ GP AF
Sbjct: 218 R-KSSQRKYNKTYQEDKH--FATRSYSLPNNERSIRQEIYKNGPVVAAF-------KVYE 267
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+ + G +Y + A+A K++GWG ENG YW I +++ +G+ G +I+R +
Sbjct: 268 DYSSTGGIYVHKWGIQTGAHAD-KVIGWGRENGTDYWLIANSWNTDWGEDGYYRIVRETD 326
Query: 249 EAIIESLVNGAL 260
IE + G
Sbjct: 327 NCEIERQMVGEF 338
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 88/207 (42%), Gaps = 41/207 (19%)
Query: 96 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP----ECKT-LATPQPKCHTRCTNDNYGR 150
G+VTGG ++ TGCQP +FPPC+ + S P +C+T K R N+
Sbjct: 162 GVVTGG-DYNGTGCQPYTFPPCSSCEASKSTPSCQKKCQTGYLEATYKNDKRFENEEQDS 220
Query: 151 GFFQDK-YQI-----NGLGLYFDP---------------------HFGPFWPAFWRSFCT 183
+ + YQ+ G Y + GP ++ R F
Sbjct: 221 SYMSENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIYNNGPVEVSY-RVF-- 277
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
+Q VY S ++ VKI+GWG EN YW + +++G FG+KG KI
Sbjct: 278 ---EDFYQYKSGVYHY-VSGKLTGAHAVKIIGWGTENKVDYWLVANSWGTDFGEKGFFKI 333
Query: 244 LRGRNEAIIESLVNGALPKDNYGVEFG 270
RG NE IE V L K N G +FG
Sbjct: 334 RRGTNECGIEENVVAGLAK-NGGTKFG 359
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 32/192 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + ++ G+ +GG + C+P F PC+ NY P K A PKC
Sbjct: 158 CEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCD-GNYG---PCPKEGAFDTPKC 213
Query: 140 HTRCT---------NDNYGRG---FFQDKYQINGLGLYFDPHFGPFWPAFW--RSFCTKY 185
C + +G+ QD ++ + GP F+ F Y
Sbjct: 214 RKICQFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFIN---GPVGANFYVFEDF-IHY 269
Query: 186 TRPLF-QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
++ QT G+ V A +K++GWG ENG YW + +++ +G+ GT +IL
Sbjct: 270 KEGIYKQTYGKWIGVHA---------IKLIGWGTENGTDYWLVANSYNYDWGENGTFRIL 320
Query: 245 RGRNEAIIESLV 256
RG N +IES V
Sbjct: 321 RGTNHCLIESQV 332
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 77/193 (39%), Gaps = 31/193 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C G + + + G+ TGG C+P +F PC H N P C P PK
Sbjct: 158 CDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYFGP-CPKELWPTPK 216
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
C C Y + DK I G Y P+ T+ + +F V +
Sbjct: 217 CRKMC-QLKYNVAYKDDK--IYGNDAYSLPNNE-----------TRIMQEIFTNGPVVGS 262
Query: 199 VSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
S A+ Y VKI+GWG ++G YW I +++ +GD+G ++
Sbjct: 263 FSVFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQDGLKYWLIANSWNNDWGDEGYVRF 322
Query: 244 LRGRNEAIIESLV 256
LRG N IES V
Sbjct: 323 LRGDNHCGIESRV 335
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 73/165 (44%), Gaps = 25/165 (15%)
Query: 108 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT---------NDNYGRGFFQDKYQ 158
GC P CN P CKTL P C C + +Y + ++ +
Sbjct: 111 GCMSYPLPRCN--------PSCKTLYD-APTCKKECDKGSPLKYEEDKHYAKQAYRIMSK 161
Query: 159 INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE 218
+ GP +F T Y + +G VY ++++ V+I+GWG
Sbjct: 162 VERQIQLEIIKNGPVVASF-----TVYADFIHYLSG-VYKFDGESKLLGGHAVRIIGWGI 215
Query: 219 ENGR-PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
ENG PYW + +++ E++GD+G KI RG+NE IE + LP+
Sbjct: 216 ENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 82/191 (42%), Gaps = 21/191 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C G + W + G+VTGG + + C+P PC NH N T C ++TP
Sbjct: 162 CDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPNETFYR-NCTGVSTPS-- 218
Query: 139 CHTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
C T C + GR + ++ + H GP F + Y +
Sbjct: 219 CKTSCQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKH-GPLVATF-----SVYEDFM 272
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+ G +Y + +A V+I+GWG EN YW I +++ +G+ G +++RG N+
Sbjct: 273 YYKKG-IYRYTHGGYEGGHA-VRILGWGVENNVKYWIIANSWNTDWGEDGFFRMVRGIND 330
Query: 250 AIIESLVNGAL 260
IE V+ L
Sbjct: 331 CGIEESVSAGL 341
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
pisum]
Length = 169
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 71/168 (42%), Gaps = 21/168 (12%)
Query: 107 TGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRG---------FFQDKY 157
GC+P PPC TS P K H RCT YG F +D Y
Sbjct: 13 VGCEPYRVPPCPRNEDGTSS----CAGQPIEKNH-RCTRMCYGNQDLDYNDDHRFTRDYY 67
Query: 158 QINGLGLYFD-PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGW 216
+ + D ++GP +F + VY + +A + VK++GW
Sbjct: 68 YLTYGSIQKDVMNYGPIEASF------DVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGW 121
Query: 217 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
G E G PYW +V+++ Q+GD G KI RG +E I+S +P N
Sbjct: 122 GVEEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVPVTN 169
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 73/169 (43%), Gaps = 14/169 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL-ATPQPK 138
C G + W +G+VTGG SN GCQP PCNH + C +L T
Sbjct: 96 CDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYG-NGNLKNCSSLRRTQMTV 154
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
C +C N NY + D ++ + + + + P +F Y +
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMGY 212
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKG 239
G +Y S + E++ Y VK++GWG + +G YW ++++ +G G
Sbjct: 213 KEG-IYK-STAGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGTNG 259
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/204 (30%), Positives = 84/204 (41%), Gaps = 40/204 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + ++G+VT + N GC S P C EP A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211
Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
KCH +C N + R + +N + DPH GP +F T Y
Sbjct: 212 KCHRKCVKQNLLWSR---SKHFGVNAYMISSDPHSIMTEVYKNGPVEVSF-----TVYED 263
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+G VY + +I+ VK++GWG E+G YW + + + +GD G KI RG
Sbjct: 264 FAHYKSG-VYK-HVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRG 321
Query: 247 RNEAIIESLVNGALPKD-NYGVEF 269
NE IE V LP N VE
Sbjct: 322 TNECEIEDEVVAGLPSARNLNVEL 345
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 82/193 (42%), Gaps = 31/193 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W+WVH +G+ TG + + P + + P P C
Sbjct: 210 CGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDIY-----------PTPNC 258
Query: 140 HTRCTNDNYGRGFFQDK----------YQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C N Y D+ Y +N GP +F T Y L
Sbjct: 259 VEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTD-GPVSASF-----TVYEDFL 312
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY ++ + + +A VKI+GWGE++G+ YW V+++ E +GDKG KI G N
Sbjct: 313 AYKSG-VYKHTSGSYLGGHA-VKIIGWGEKSGQAYWLAVNSWNEDWGDKGLFKIALG-NC 369
Query: 250 AIIESLVNGALPK 262
I + L+ G PK
Sbjct: 370 GIDDDLLGGT-PK 381
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 85/184 (46%), Gaps = 15/184 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC-KTLATPQPK 138
CS G + + + K+G+V+GG +SN GC+P + A+ P C K+ P
Sbjct: 149 CSGGYMMAAFDFYIKQGVVSGGDLNSNEGCRPYT----ADAHDKGVTPSCTKSCRKGYPT 204
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
++ ++ +YG + ++ + + GP +F K + + VY
Sbjct: 205 SYS--SDKHYGSKDYIVDAGVSNIQYEIMTN-GPIIVSF------KVYQDFYNYGSGVYH 255
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
S VKIVGWG E + YW I +++G +G+ G KILRG+NE IE+
Sbjct: 256 -HVSGNYTGNHIVKIVGWGTEKEQDYWLIANSWGSSWGEHGFFKILRGKNECGIENNPYA 314
Query: 259 ALPK 262
LPK
Sbjct: 315 VLPK 318
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 78/189 (41%), Gaps = 23/189 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G ++ W W G+VTGGA+ C+P FP C A+ + C + P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRP 188
C YG+ + DK I Y+ P+ GP F
Sbjct: 225 KPYCQY-GYGKRYENDK--IKARTWYWLPNDERTIQLEIMQKGPVHATF------NIYED 275
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG-DKGTIKILRGR 247
G VY +A A + ++KI+GWG + G YW I +++ +G D G +++RG
Sbjct: 276 FEHYEGGVYIHTAGA-MEGGHSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVVRGI 334
Query: 248 NEAIIESLV 256
N IE V
Sbjct: 335 NNCDIEGGV 343
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 96/253 (37%), Gaps = 42/253 (16%)
Query: 27 SCIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
+C AV+TA ++ +C ++K V + + RC C G
Sbjct: 113 NCGSCWAVSTAAAISDRICIATKGKKQVYASDTDILTCCGARCGL---------GCRGGW 163
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
W + G+V+GG + C P PC T C +A P P C +C
Sbjct: 164 PIEAWKFFEYDGVVSGGPYLGKGCCSPYPLHPCGRHGNDTFYGNCVGMA-PTPPCKRKC- 221
Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
+ F+ Y++ D +G + R + G V AV A E
Sbjct: 222 -----QPGFRGMYRV-------DKRYGEPGRTYTLPRSEVKIRRDIKERGSVVAVFAVYE 269
Query: 205 IVA-----------------YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+ Y VK++GWG++NG YW I +++ + +G+ G +++RG
Sbjct: 270 DFSHYQSGIYKHTAGRFTGGYHAVKMIGWGKDNGTDYWLIANSWHDDWGENGFFRMIRGI 329
Query: 248 NEAIIESLVNGAL 260
N IE V+ +
Sbjct: 330 NNCGIEEQVDAGI 342
>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
Length = 255
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 70/165 (42%), Gaps = 14/165 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL-ATPQPK 138
C G + W +G+VTGG + SN GCQP PC+H +S C +L T
Sbjct: 96 CDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMTV 154
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
C +C N NY + D ++ + + + + P Y F
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV----TALMYVYENFM 210
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQF 235
+ S + E++ Y VK++GWG +E+G YW ++++ +
Sbjct: 211 GYKKGIYKSTAGELIGYHHVKLIGWGVDEDGTEYWLAMNSWNSNW 255
>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
P15]
Length = 627
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y+ + ++ V IVGWGEENG PYW +T+G +GD+G KI RG NE IE+
Sbjct: 220 IYSSGPNTKLRGGHAVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETW 279
Query: 256 VNGALPKD 263
ALP D
Sbjct: 280 PGSALPID 287
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 79/192 (41%), Gaps = 29/192 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 136
C G ++ W W G+VTGGA+ C+P FP C H + ATP +
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPARK 225
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKY 185
P C YG+ + DK I Y+ P+ GP F
Sbjct: 226 PYCQY-----GYGKRYENDK--IKARTWYWLPNDERTIQLEIMQKGPVHATF------NI 272
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG-DKGTIKIL 244
NG VY +A A + ++KI+GWG + G YW I +++ +G D G +++
Sbjct: 273 YEDFEHYNGGVYIHTAGA-MEGGHSIKIIGWGVDKGVKYWLIANSWSTDWGEDGGYFRVV 331
Query: 245 RGRNEAIIESLV 256
RG N IE V
Sbjct: 332 RGINNCDIEGGV 343
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 40/62 (64%)
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
S ++ V+++GWGEEN PYW I +++ +GD G KI+RG+NE IES VN +
Sbjct: 57 VSGALLGGHAVRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGI 116
Query: 261 PK 262
PK
Sbjct: 117 PK 118
>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
Length = 804
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/53 (56%), Positives = 35/53 (66%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 263
V IVGWGEENG PYW +T+G +GD+G KI RG NE IE+ ALP D
Sbjct: 235 VMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALPID 287
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 80/199 (40%), Gaps = 43/199 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G SS W + G+V+GG +++ GC P S A ++ P C +
Sbjct: 148 CGGGYSSRAWQYWVTDGIVSGGDFNTSQGCHPYSV----QAFRDSTTPNCSSF------- 196
Query: 140 HTRCTNDNYGRGFFQDKY-------------QINGLGLYFDPHFGPF--WPAFWRSFCTK 184
CTN Y + + +DK QI + P + + F+
Sbjct: 197 ---CTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNGV 253
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKI 243
Y L +GR +VKI+GWG ENG YW + +++G +G G K
Sbjct: 254 YQHVLGNVSGR-------------HSVKILGWGRENGTDYWLVANSWGRDWGRLGGFFKF 300
Query: 244 LRGRNEAIIESLVNGALPK 262
LRG N IES + G PK
Sbjct: 301 LRGENHCDIESNILGGDPK 319
>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
Length = 804
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y+ + ++ V IVGWGEENG PYW +T+G +GD+G KI RG NE IE+
Sbjct: 220 IYSSGPNTKLRGGHAVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETW 279
Query: 256 VNGALPKD 263
ALP D
Sbjct: 280 PGSALPID 287
>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
Length = 804
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y+ + ++ V IVGWGEENG PYW +T+G +GD+G KI RG NE IE+
Sbjct: 220 IYSSGPNTKLGGGHAVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETW 279
Query: 256 VNGALPKD 263
ALP D
Sbjct: 280 PGSALPID 287
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 75/170 (44%), Gaps = 16/170 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL-ATPQP 137
C G + W +G+VTGG SN GCQP PC+H Y S C +L T
Sbjct: 96 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 153
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLF 190
C +C N NY + D ++ + + + + P +F Y +
Sbjct: 154 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPV--TAFMYVYENFMG 211
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKG 239
G +Y S + E++ Y VK++GWG + +G YW ++++ +G+ G
Sbjct: 212 YKEG-IYK-STTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDG 259
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 81/184 (44%), Gaps = 25/184 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + + G+VTG + +N GC+P F P Y+T P+C
Sbjct: 178 CNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLPHTTVEYST------------PEC 225
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLY-------FDPHFGPFWPAFWRSFCTKYTRPLFQT 192
+C N Y + + QDK+ G+ +Y D + + Y +F
Sbjct: 226 SKKCENYQYKKAYKQDKH--FGMSVYNVQFSDPVDIQYEIMNNGPVEANMIVYYDFMFYK 283
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
+G VY + +A V+IVGWG + PYW + +++ +G+ G +I RG +E+
Sbjct: 284 SG-VYQTVFPWPLGGHA-VRIVGWGVDGPTKVPYWLVANSWNTDWGEDGYFRIRRGTDES 341
Query: 251 IIES 254
IES
Sbjct: 342 YIES 345
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 78/189 (41%), Gaps = 27/189 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + + + G+ +GG + S GC+P YT + ++ P+C
Sbjct: 153 CRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKP----------YTAA------VSGETPQC 196
Query: 140 HTRCTNDNYGRGFFQD------KYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
C + Y + + +D YQ+NG L P ++ Y F +
Sbjct: 197 QKACVS-GYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPV--TAYMEVYED--FYSY 251
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G S V VKI+GWG EN PYW +++G FG+ G +ILRG N A IE
Sbjct: 252 GTGIYQHTSGSFVGGHAVKIIGWGSENDVPYWIAANSWGTGFGEDGFFRILRGSNCAGIE 311
Query: 254 SLVNGALPK 262
S + P
Sbjct: 312 SYIVAGYPN 320
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 79/189 (41%), Gaps = 24/189 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G TW + GL + G + S GC F +Y ++P L T C
Sbjct: 149 CDGGYVGKTWQYWVDSGLTSEGPYKSGQGCNSYPF-----GSYCVNDP----LPTCSRTC 199
Query: 140 H-----TRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
T + YG ++ + N + + GP F + +Q
Sbjct: 200 QAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQN-GPVVVQF------EVFADFYQYKS 252
Query: 195 RVY-AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
VY V+ + E + V+++GWG ENG YW + +++G ++GDKG K +RG N IE
Sbjct: 253 GVYRHVTGATE--GWHAVRVIGWGVENGVKYWLVANSWGVRWGDKGFFKFVRGENHLGIE 310
Query: 254 SLVNGALPK 262
V LPK
Sbjct: 311 DFVYAGLPK 319
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 72/168 (42%), Gaps = 20/168 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
C+ G+ + + G TG + GCQP F C H +T P C ++ P+ K
Sbjct: 140 CNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCDSV--PEYKA 197
Query: 139 --CHTRCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRP 188
C C D Y R + +D Y + D GP +F T Y
Sbjct: 198 DTCSHECQKD-YDRKYEEDLYYGKEQYGFSDEAPIQREIMTNGPVAVSF-----TVYESF 251
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFG 236
L+ + G +Y + I Y V++VGWG ENG YW I +++ EQ+G
Sbjct: 252 LYYSGG-IYRSTPGERIKGYHAVRVVGWGVENGTKYWKIANSWNEQWG 298
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 81/197 (41%), Gaps = 43/197 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + ++G+VT + N GC S P C EP A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
KCH +C N + F + Y I+ DPH GP +F T Y
Sbjct: 212 KCHRKCVKQNLLWSKSKHFGVNAYMISS-----DPHSIMTELYKNGPVEVSF-----TVY 261
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
+G VY + +++ VK++GWG E+G YW + + + +GD G KI
Sbjct: 262 EDFAHYKSG-VYK-HVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIR 319
Query: 245 RGRNEAIIESLVNGALP 261
RG +E IE V LP
Sbjct: 320 RGTDECEIEDEVVAGLP 336
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 62/150 (41%), Gaps = 7/150 (4%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + G+VTGG + TGCQP F C+H + C P+P C
Sbjct: 132 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPKPPC 191
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
C Y + + QDK+ N H ++ + T +FQ G VY
Sbjct: 192 ARACQT-GYNKTYEQDKFYGNS-SYNVGEHESYIMQEIMKNGPVEVTFAIFQDFG-VYRS 248
Query: 200 S----ASAEIVAYATVKIVGWGEENGRPYW 225
+ + + V+++GWG ENG YW
Sbjct: 249 GIYHHVAGKFIGRHAVRMIGWGVENGVNYW 278
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 85/194 (43%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S W + + G+VT + TGCQ P C+ A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 212
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KCH +C +N + + ++K + +N ++ +PH GP AF T Y
Sbjct: 213 KCHRKCKVEN--QVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAF-----TVYEDF 265
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G ++ ++ VK++GWG + G YW + + + +GD G KI+RG+
Sbjct: 266 AHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGK 323
Query: 248 NEAIIESLVNGALP 261
NE IE V +P
Sbjct: 324 NECGIEEDVTAGMP 337
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 86/213 (40%), Gaps = 43/213 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G + W + + G+VT + TGC S P C EP A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
C +C N + F + Y++N D H GP +F T Y
Sbjct: 208 ACEKKCVKKNLLWSESKHFSVNAYRVNS-----DQHSIMTEVYTNGPAEVSF-----TVY 257
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
+G VY +E+ +A VK++GWG E+G YW + + + +GD G KI+
Sbjct: 258 EDFAHYKSG-VYKHVTGSEMGGHA-VKLIGWGTSEDGEDYWLLANQWNRSWGDDGYFKII 315
Query: 245 RGRNEAIIESLVNGALPKDNYGVEFGEESGERL 277
RG NE IE + G N +E G + L
Sbjct: 316 RGTNECGIEDVTAGMPSTKNLDIESGVRDDDSL 348
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 26/172 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 136
C G W WV + G+VTGG + C+P +F PC H P + +TP +
Sbjct: 164 CEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACK 223
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTR 187
P C YG+ + +DK+ + + + GP AF T
Sbjct: 224 PYCQF-----GYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAF-------ITY 271
Query: 188 PLFQT-NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
F G +Y E A+A VK++GWG ENG YWT+ +++ + +G K
Sbjct: 272 EDFSPYKGGIYVHVKGRERGAHA-VKLIGWGVENGTKYWTVANSWHDDWGGK 322
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 100/244 (40%), Gaps = 25/244 (10%)
Query: 27 SCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
+C AV+TA ++ +C ++ + V +S + +C + C G
Sbjct: 108 NCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQCGF---------GCGGGW 158
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
S W + G+V+GG + + C+P PC H T EC A P C +C
Sbjct: 159 SIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECPREAA-TPPCKKKC- 216
Query: 145 NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR------SFCTKYTRPLFQTNGRVYA 198
Y + F DK Q + +P R SF L++T VY
Sbjct: 217 QPGYKKIFRMDKRQ-GKVAYGVEPKEEAIQREILRHGPVVASFAVYEDFSLYKTG--VYK 273
Query: 199 VSASAEIVAYATVKIVGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+A A + Y VK++GWG ++ YW I +++ +G+ G + +RG N+ IE V
Sbjct: 274 HTAGA-LRGYHAVKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIRGINDCEIEDTV 332
Query: 257 NGAL 260
+
Sbjct: 333 AAGI 336
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 82/204 (40%), Gaps = 36/204 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G + W + + G+VT + TGC S P C+ L P P
Sbjct: 168 CDGGYPIAAWRYFKRSGVVTEECDPYFDTTGC---------------SHPGCEPL-YPTP 211
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPL 189
KCH +C N Y +N + DP GP +F T Y
Sbjct: 212 KCHRKCVKGNV-LWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEVSF-----TVYEDFA 265
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY + +A VK++GWG E G YW IV+++ +G+ G KI RG N
Sbjct: 266 HYKSG-VYKHVTGGNMGGHA-VKLIGWGTSEQGEDYWLIVNSWNRGWGEDGYFKIRRGTN 323
Query: 249 EAIIESLVNGALPKD-NYGVEFGE 271
E IE V LP N VE G+
Sbjct: 324 ECGIEHSVVAGLPSARNLNVELGD 347
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 91/220 (41%), Gaps = 38/220 (17%)
Query: 27 SCIEARAVATATPLAFAVCRSS----KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
+C AV+TA L+ +C S+ ++++ T L + + C
Sbjct: 118 NCGSCWAVSTAAALSDRICISTNGTKQVNISATDI------------LTCCYKCGYGCQG 165
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G W +V + G VTGG + + C+ FPPC H T EC A PKC T
Sbjct: 166 GWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRAR-TPKCRTS 224
Query: 143 CTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAFWRSFCTKYTRPLFQ 191
CT Y + DK I G Y P+ GP AF T Y +
Sbjct: 225 CT-PGYKNSYSDDK--IRGKDAYELPNSVKAIQREIMKNGPVVAAF-----TVYADFSYY 276
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
G +Y +A ++A VK++GWGEE PYW + +++
Sbjct: 277 KKG-IYKHTAGRARGSHA-VKVIGWGEEGDVPYWIVKNSW 314
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 76/189 (40%), Gaps = 20/189 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+C + +QD++ G Y P+ F + F T ++A
Sbjct: 244 SNKCRSGYNVTDVWQDRHY--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299
Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
+ + VK++GWG ENG YW + +++G ++G+ G KI+RG N IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKIVRGENHCGIE 359
Query: 254 SLVNGALPK 262
++ LP
Sbjct: 360 ENIHAGLPN 368
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 67/152 (44%), Gaps = 22/152 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG + S GC+P + PPC H + S P C PKC
Sbjct: 83 CNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEH-HVNGSRPSCSGEGGDTPKC 141
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAFWRSFCTKYTRP 188
+C + Y + +DK I G Y P GP AF T Y
Sbjct: 142 VQKC-DSGYTPAYEKDK--IYGQSAYSVPSSPESIMEEIYKDGPVEGAF-----TVYEDF 193
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN 220
L +G VY + E V +KI+GWG EN
Sbjct: 194 LLYKSG-VYQ-HHTGEAVGGHAIKILGWGIEN 223
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 84/201 (41%), Gaps = 38/201 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPV-------------SFPPCNHANYTTSE 126
C+ G W G TGG GC+P + PC + Y
Sbjct: 153 CNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDTYYG-- 210
Query: 127 PECKTLATPQPKCHTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAF 177
EC +A P+C RC ++ YG+ + K + + + GP +F
Sbjct: 211 -ECVGMAD-TPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKN-GPVVASF 267
Query: 178 --WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
+ F Y +++ + E+ Y VKI+GWG+EN +W I +++ + +
Sbjct: 268 AVYEDF-RHYKSGIYK--------HTAGELRGYHAVKIIGWGKENNTDFWLIANSWHQDW 318
Query: 236 GDKGTIKILRGRNEAIIESLV 256
G+KG +I+RG+NE IE+ V
Sbjct: 319 GEKGYFRIVRGKNECGIETDV 339
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 86/194 (44%), Gaps = 35/194 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S W + + G+VT + TGCQ P C+ A P P
Sbjct: 165 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 208
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KC +C +N + + ++K + +N ++ +PH GP AF ++C
Sbjct: 209 KCQRKCKVEN--QAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAF--TYCQILDFA 264
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+++ VY + ++ VK++GWG + G YW + + + +GD G KI+RG
Sbjct: 265 HYKSG--VYK-HITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGE 321
Query: 248 NEAIIESLVNGALP 261
NE IE V +P
Sbjct: 322 NECGIEGDVTAGMP 335
>gi|48762483|dbj|BAD23811.1| cathepsin B-S [Tuberaphis takenouchii]
Length = 155
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 67/162 (41%), Gaps = 20/162 (12%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + +G+ TGG + S GC P PPC + P +
Sbjct: 5 CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 59
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPH--------FGPFWPAFWRSFCTKYTRPLFQ 191
H +C YG Q +Y++ + P+ +GP +F L
Sbjct: 60 H-QCPKTCYGSTTVQKRYKVKNEYVLNSPNTMEQDLIKYGPIEASF------NLFDDLSA 112
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGE 233
+Y + A+ ++ ++KI+GWG+ENG PYW V+++ +
Sbjct: 113 YKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWSK 154
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 95/241 (39%), Gaps = 48/241 (19%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC AVA A+ ++ C + R AG C + + C+ G
Sbjct: 116 SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDLMSCCDVCG-----YGCNGGYPE 166
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-- 144
W + G+V+ CQP FP C H ++ C P C++ CT
Sbjct: 167 VAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTCNSTCTDK 218
Query: 145 ---------NDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
N +Y G F+ + +NG PF +F + Y L T
Sbjct: 219 KVPLIKYRGNTSYLLSGEESFKRELLLNG----------PFEVSF-----SVYADFLAYT 263
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
G VY A + +A V+IVGWGE NG PYW I +++ ++G G I RG +E I
Sbjct: 264 GG-VYKHVAGTFLGGHA-VRIVGWGELNGEPYWKIANSWNREWGMNGYFLIARGVDECGI 321
Query: 253 E 253
E
Sbjct: 322 E 322
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/238 (27%), Positives = 96/238 (40%), Gaps = 45/238 (18%)
Query: 52 VECTSFRFIAGVKQRCAWLVSR------WMTIWVCSSGISSSTWAWVHKRGLVTG--GAH 103
VEC RF + + V+ +M C G S W ++ + G+VT +
Sbjct: 132 VECLQDRFCIHLNMNISLSVNDLVACCGFMCGDGCDGGYPISAWQYLVENGVVTDECDPY 191
Query: 104 HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDK--YQING 161
GC+ P C EP A P P C +C N +Q+K + IN
Sbjct: 192 FDQVGCK---HPGC--------EP-----AYPTPACEKKCKVQNQ---VWQEKKHFSINA 232
Query: 162 LGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKI 213
+ DPH GP AF T Y +G VY + E++ VK+
Sbjct: 233 YRVNSDPHDIMAEVYKNGPVEVAF-----TVYEDFAHYKSG-VYE-HITGEMMGGHAVKL 285
Query: 214 VGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 270
+GWG +G+ YW + + + +GD G KI+RG+NE IE V +P V G
Sbjct: 286 IGWGTSADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNTVRTG 343
>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
Length = 807
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 38/66 (57%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y + ++ V IVGWGEENG PYW +T+G +GD G +I RG NE IE+
Sbjct: 220 IYVSGPNTKLSGGHAVMIVGWGEENGVPYWDCANTYGTNWGDHGYFRIKRGSNELKIETW 279
Query: 256 VNGALP 261
ALP
Sbjct: 280 PGAALP 285
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 76/189 (40%), Gaps = 20/189 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+C + +QD++ G Y P+ F + F T ++A
Sbjct: 244 SNKCRSGYNVTDVWQDRHY--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299
Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
+ + VK++GWG ENG YW + +++G ++G+ G K++RG N IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359
Query: 254 SLVNGALPK 262
++ LP
Sbjct: 360 ENIHAGLPN 368
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 76/189 (40%), Gaps = 20/189 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+C + +QD++ G Y P+ F + F T ++A
Sbjct: 244 SNKCRSGYNVTDVWQDRHY--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299
Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
+ + VK++GWG ENG YW + +++G ++G+ G K++RG N IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359
Query: 254 SLVNGALPK 262
++ LP
Sbjct: 360 ENIHAGLPN 368
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 78/196 (39%), Gaps = 41/196 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPK 138
C G W + + G+VT C P P C H P C+ A P PK
Sbjct: 22 CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 66
Query: 139 CHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYT 186
C +C N + F D Y+IN DPH GP AF T Y
Sbjct: 67 CEKKCKEQNQVWQEKKHFSIDAYRINS-----DPHDIMAEVYKNGPVEVAF-----TVYE 116
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILR 245
+G ++ I+ VK++GWG + G YW + + + +GD G KI+R
Sbjct: 117 DFAHYKSGVYKHITGG--IMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIR 174
Query: 246 GRNEAIIESLVNGALP 261
G+NE IE V +P
Sbjct: 175 GKNECGIEEGVVAGMP 190
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 88/230 (38%), Gaps = 47/230 (20%)
Query: 52 VECTSFRFIAGVKQRCAWLVSR------WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHS 105
VEC RF + V+ +M C G W + + G+VT
Sbjct: 127 VECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVT------ 180
Query: 106 NTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTNDNY----GRGFFQDKYQIN 160
C P P C H P C+ A P PKC +C N + F D Y+IN
Sbjct: 181 -DECDPYFDPVGCKH-------PGCEP-AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231
Query: 161 GLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVK 212
DPH GP AF T Y +G ++ I+ VK
Sbjct: 232 S-----DPHDIMAEVYKNGPVEVAF-----TVYEDFAHYKSGVYKHITGG--IMGGHAVK 279
Query: 213 IVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
++GWG + G YW + + + +GD G KI+RG+NE IE V +P
Sbjct: 280 LIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMP 329
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 76/188 (40%), Gaps = 20/188 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+C + +QD++ G Y P+ F + F T ++A
Sbjct: 244 SNKCRSGYNVTDVWQDRHI--GRVAYSLPN--DERKIMEEIFINGPVQAAFHTYLDLHAY 299
Query: 200 SAS------AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
+ + VK++GWG ENG YW + +++G ++G+ G K++RG N IE
Sbjct: 300 KSGIYRHVWGPLSGGHAVKLLGWGVENGVKYWLVANSWGREWGENGFFKMVRGENHCGIE 359
Query: 254 SLVNGALP 261
++ LP
Sbjct: 360 ENIHAGLP 367
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 67/159 (42%), Gaps = 23/159 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--------ANYTTSEPECKT 131
C+ G S W + +G+VTG +++ GCQP FPPC H + P CK
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHHTLGPLPVCDGDVETPPCKR 223
Query: 132 LATPQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAF--WRSFCTKYTRP 188
T Q + ND YG+ ++ K + H GP F + F Y
Sbjct: 224 --TCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQH-GPVEVDFEVYADF-PNYKSG 279
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 227
++Q S ++ V+++GWGEEN PYW I
Sbjct: 280 VYQ--------HVSGALLGGHAVRLLGWGEENNVPYWLI 310
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 88/230 (38%), Gaps = 47/230 (20%)
Query: 52 VECTSFRFIAGVKQRCAWLVSR------WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHS 105
VEC RF + V+ +M C G W + + G+VT
Sbjct: 127 VECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVT------ 180
Query: 106 NTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTNDNY----GRGFFQDKYQIN 160
C P P C H P C+ A P PKC +C N + F D Y+IN
Sbjct: 181 -DECDPYFDPVGCKH-------PGCEP-AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231
Query: 161 GLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVK 212
DPH GP AF T Y +G ++ I+ VK
Sbjct: 232 S-----DPHDIMAEVYKNGPVEVAF-----TVYEDFAHYKSGVYKHITGG--IMGGHAVK 279
Query: 213 IVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
++GWG + G YW + + + +GD G KI+RG+NE IE V +P
Sbjct: 280 LIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMP 329
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 76/195 (38%), Gaps = 34/195 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G T+ + K GL +GG +HS GC+P F KC
Sbjct: 115 CDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPYPFGGATQD------------VNIVLKC 162
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP------------HFGPFWPAFWRSFCTKYTR 187
+C Y + QD +G Y P GP +F
Sbjct: 163 DRQC-QAGYPLTYSQD--LKHGASSYILPWGDENAMKAEIYQNGPIVTSF------DVYG 213
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
FQ VY A ++A V+++GWG ENG YW +++ E++G+ G KI+RG
Sbjct: 214 DFFQYRSGVYRHVTGAYKGSHA-VRVIGWGVENGVKYWLCANSWNERWGENGFFKIVRGE 272
Query: 248 NEAIIESLVNGALPK 262
N +E + LPK
Sbjct: 273 NHVGVEDISYAGLPK 287
>gi|321446975|gb|EFX60976.1| hypothetical protein DAPPUDRAFT_274869 [Daphnia pulex]
Length = 71
Score = 59.3 bits (142), Expect = 2e-06, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 35/52 (67%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
++I+GWG E G PYW I + + +GD G IK+LRG++ IES + G LPK
Sbjct: 19 IRILGWGVEEGVPYWLIANNWNTDWGDNGYIKLLRGKDHCGIESQITGGLPK 70
>gi|123483120|ref|XP_001323959.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121906833|gb|EAY11736.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 255
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 6/77 (7%)
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
YT LF+ R Y + TV+I+GWG+E G PYW I++ +G +G+ G ++I
Sbjct: 181 YTGGLFEDPPRDYIADRTH------TVEIIGWGQEKGIPYWIILNQYGRLWGENGMMRIR 234
Query: 245 RGRNEAIIESLVNGALP 261
GR++A +ES V A P
Sbjct: 235 MGRDDARVESYVLAAEP 251
>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
Length = 125
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 36/57 (63%)
Query: 206 VAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+ V+++GWG EN PYW + +++ + +GD GT KILRG NEA IE N P+
Sbjct: 65 IGLHAVRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNVGYPQ 121
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 38/52 (73%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
VKI+GWG E+G YW + +++ E++G+ G +I+RGR+E IES ++ ALP
Sbjct: 273 VKILGWGVEDGVKYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALPD 324
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 76/199 (38%), Gaps = 39/199 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + +G+VTGG + GC P S+ C+ + + P+CK +C
Sbjct: 148 CQGGFVLEAMKFWKSKGVVTGGDFQGD-GCIPYSYGSCSDCHTAQTTPKCKN------EC 200
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+ T + Y +DKY +G S + + NG V A
Sbjct: 201 QVKYTKNEYK----EDKY------------YGSSAYRLSTSNAVRTIQSEILRNGPVEAT 244
Query: 200 SASAEIVAY----------------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
E Y VKI+GWG E YW I +++G FG+ G K+
Sbjct: 245 YQVYEDFYYYKSGVYEYISGRHMGGHAVKIIGWGVEENVNYWLIANSWGTGFGENGFFKM 304
Query: 244 LRGRNEAIIESLVNGALPK 262
RG NE IE+ V + K
Sbjct: 305 RRGNNECGIENYVVAGMAK 323
>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
Length = 228
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++ K G TGG++ + GC+P S PC T+ P C T P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209
Query: 140 HTRCTNDNYGRGFFQDKY 157
+CTN NY + DK+
Sbjct: 210 VNKCTNSNYNVAYKDDKH 227
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 38/57 (66%)
Query: 204 EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
E+ + +V+I+GWGE+ G YW + +++G Q+G+ G +I RG NEA IES V L
Sbjct: 364 ELKGFHSVRIIGWGEDRGDRYWVVANSWGRQWGENGYFRIARGSNEADIESFVVTGL 420
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/191 (27%), Positives = 75/191 (39%), Gaps = 32/191 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 137
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295
Query: 138 --KCHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
C T D Y +N H GP + R F +
Sbjct: 296 ANGCQTPVNVDRDTLYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYS 349
Query: 194 GRVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
G VY +A+ + + +VK+VGWGEE NG YW +++G +G+ G +ILRG NE
Sbjct: 350 GGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNEC 409
Query: 251 IIESLVNGALP 261
IE V + P
Sbjct: 410 GIEDYVLASWP 420
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/250 (26%), Positives = 97/250 (38%), Gaps = 48/250 (19%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC AVA A+ ++ C + R AG C + + C+ G
Sbjct: 116 SCGSCWAVAAASAMSDRYCTLGGVR----DLRISAGDLMSCCDVCG-----YGCNGGYPE 166
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT-- 144
W + G+V+ CQP FP C H ++ C P C++ CT
Sbjct: 167 VAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTCNSTCTDK 218
Query: 145 ---------NDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
N +Y G F+ + +NG PF +F + Y + T
Sbjct: 219 KIPLIKYRGNTSYILSGEESFKRELLLNG----------PFEVSF-----SVYADFVAYT 263
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
G VY + +A V+IVGWGE NG PYW I +++ ++G G I RG +E I
Sbjct: 264 GG-VYKHVTGVFLGGHA-VRIVGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVDECGI 321
Query: 253 ESLVNGALPK 262
E +P+
Sbjct: 322 EGSGVAGIPR 331
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 77/191 (40%), Gaps = 32/191 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQQRDTCKIRHNSRSLR 295
Query: 140 HTRC-TNDNYGRGFFQD---KYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
C T N R F Y +N H GP + R F
Sbjct: 296 ANGCQTPYNVDRDTFYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYA 349
Query: 194 GRVYAVSASAEI--VAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
G VY +A+ + + +VK+VGWGEE NG YW +++G +G++G +ILRG NE
Sbjct: 350 GGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKYWIAANSWGPWWGERGYFRILRGSNEC 409
Query: 251 IIESLVNGALP 261
IE V + P
Sbjct: 410 GIEEYVLASWP 420
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 85/213 (39%), Gaps = 43/213 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G + W + + G+VT + TGC S P C EP A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
C +C N + F + Y++N D H GP +F T Y
Sbjct: 208 ACEKKCVKKNLLWSESKHFSVNAYRVNS-----DQHSIMTEVYTNGPAEVSF-----TVY 257
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
+G VY +E+ +A VK++GWG E+G YW + + + +G G KI+
Sbjct: 258 EDFAHYKSG-VYKHVTGSEMGGHA-VKLIGWGTSEDGEDYWLLANQWNRSWGGDGYFKII 315
Query: 245 RGRNEAIIESLVNGALPKDNYGVEFGEESGERL 277
RG NE IE + G N +E G + L
Sbjct: 316 RGTNECGIEDVTAGTPSTKNLDIESGVRDDDSL 348
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES V +
Sbjct: 204 VTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 263
Query: 261 PK-DNY 265
P+ D Y
Sbjct: 264 PRTDQY 269
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 85/194 (43%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S W + + G+VT + TGCQ P C EP A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---HPGC--------EP-----AYPTP 212
Query: 138 KCHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KCH +C +N + + ++K+ +N ++ +PH GP AF T Y
Sbjct: 213 KCHRKCKVEN--QVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAF-----TVYEDF 265
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G ++ ++ VK++GWG + G YW + + + +G G KI+RG+
Sbjct: 266 AHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGGDGYFKIIRGK 323
Query: 248 NEAIIESLVNGALP 261
NE IE V +P
Sbjct: 324 NECGIEEDVTAGMP 337
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 69/161 (42%), Gaps = 19/161 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G+ W +V + G+VTGG + C+P PC S P + TP C
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCEITGKFWSCPRDHSFRTPA--C 222
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF---------GPFWPAFWRSFCTKYTRPLF 190
C YG+ + +DK + + + + GP AF T Y F
Sbjct: 223 KKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAF-----TTYEDFSF 276
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
G +Y S + A+A VK+VGWG ENG YW + +++
Sbjct: 277 YRKG-IYVHSYGRQRGAHA-VKVVGWGVENGTKYWNVANSW 315
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 101/236 (42%), Gaps = 33/236 (13%)
Query: 33 AVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISSSTWA 90
AV+TA+ L+ +C +SK + +S ++ K + + C G +
Sbjct: 124 AVSTASALSDRICIASKGETQLHISSIDIVSCCK----------LCGYGCDGGWPIEAFD 173
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEP---ECKTLATP------QPKCH 140
+ ++G VTG S GC+P F P + N T CK T + H
Sbjct: 174 YFSRQGAVTGETT-SKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKRVTRNH 232
Query: 141 TRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
TR T R + Q + G D GP F T Y + G +Y
Sbjct: 233 TRRTGLTARRLRITEFCQSHSEG---DHGNGPVVAVF-----TVYEDFSYYKKG-IYVHI 283
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
A A+A +KI+GWG ENG PYW I +++ + +G++G +I+RG NE IE V
Sbjct: 284 AGKARGAHA-IKIIGWGVENGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEV 338
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
+ E++ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES V +
Sbjct: 84 VTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 143
Query: 261 PK-DNY 265
P+ D Y
Sbjct: 144 PRTDQY 149
>gi|294877495|ref|XP_002768009.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239870149|gb|EER00727.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 180
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G S W+WVH +G+ TGG + + + GC P FPPC H T P+C
Sbjct: 100 CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKCPEGL 159
Query: 134 TPQPKCHTRCTNDNY 148
P P C +C N Y
Sbjct: 160 YPTPNCVEQCHNPKY 174
>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 96
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 39/55 (70%)
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+ ++ ++ ++I+GWGEEN PYW I +++ E +G+ G +ILRGR+E IES V
Sbjct: 35 TGKLFSWHAIRIIGWGEENNTPYWLIPNSWNEDWGENGNFRILRGRHECSIESEV 89
>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 328
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 67/165 (40%), Gaps = 24/165 (14%)
Query: 88 TWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH-------ANYTTSEPECKTLATPQPKCH 140
W ++ GLV+GG ++++ GCQP PP NYT ++ H
Sbjct: 161 VWEYLKSHGLVSGGKYNTSDGCQPSKIPPIEEYMEYSEIKNYTCNDHCYGNKTINYNDDH 220
Query: 141 TRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVS 200
+ +N ++Q +Y+ + ++GP F+ P N R
Sbjct: 221 VKVSN------YYQVQYEDIQEEV---QNYGPVSVEFY--IRDDIFTPFLSINPRFQRRK 269
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
VK++GWG ENG YW +V ++G + G G K+ R
Sbjct: 270 YKG------YVKLIGWGVENGEDYWLLVDSWGYERGQNGVFKVER 308
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 54/202 (26%), Positives = 86/202 (42%), Gaps = 22/202 (10%)
Query: 27 SCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
SC A ++ +C S K++VE ++ ++ K C C+ G
Sbjct: 19 SCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLECGN---------GCNGGY 69
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
S W + GLV+GG ++S+ GC+P S PC H + S P+C + P+C RC
Sbjct: 70 PSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEH-HVNGSRPKC-SGEIETPRCSRRC- 126
Query: 145 NDNYGRGFFQDKYQINGLGLY-FDPHFGPFWPAFWRSFCTKYTRPLFQT----NGRVYAV 199
Y + +DK+ GL Y +++ + +F+ VY
Sbjct: 127 EAGYSPKYSEDKHY--GLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKSGVYQH 184
Query: 200 SASAEIVAYATVKIVGWGEENG 221
I +A +KI+GWGEENG
Sbjct: 185 KTGGSIGGHA-IKILGWGEENG 205
>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
[Acyrthosiphon pisum]
Length = 129
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 40/69 (57%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + +A + VK++GWG E G PYW +V+++ Q+GD G KI RG +E I+S
Sbjct: 61 VYQRTPNATKLGGHAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSA 120
Query: 256 VNGALPKDN 264
+P N
Sbjct: 121 TTAGVPVTN 129
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 79/198 (39%), Gaps = 33/198 (16%)
Query: 74 WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
WM C G W + + G+VT C P + S P C+
Sbjct: 145 WMCGAGCDGGSPIDAWRYFVQSGVVT-------EECDPY------FDDIGCSHPGCEP-G 190
Query: 134 TPQPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTK 184
P PKC +C + N + + + K + +N + DPH GP AF T
Sbjct: 191 FPTPKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAF-----TV 243
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKI 243
Y +G ++ A + VK++GWG E+G YW + + + +GD G KI
Sbjct: 244 YEDFAHYKSGVYKHITGDA--MGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKI 301
Query: 244 LRGRNEAIIESLVNGALP 261
RG NE IE V LP
Sbjct: 302 KRGTNECGIEGAVVAGLP 319
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 87/213 (40%), Gaps = 35/213 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G + W + G+VT + NTGC S P C EP A P P
Sbjct: 174 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 217
Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
KC +C + N YG ++ + + + + GP AF T Y
Sbjct: 218 KCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKN-GPVEVAF-----TVYEDFA 271
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY I +A VK++GWG ++G YW + + + +GD G KI RG N
Sbjct: 272 HYKSG-VYKHITGTNIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 329
Query: 249 EAIIESLVNGALPKDNYGVEFGEESGERLSEEF 281
E IE V LP D V+ S + L F
Sbjct: 330 ECGIEHGVVAGLPSDRNVVKGITTSDDLLVSSF 362
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
S EI+ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES + +
Sbjct: 57 VSGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGM 116
Query: 261 P 261
P
Sbjct: 117 P 117
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 98/248 (39%), Gaps = 53/248 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSEPECKTLA 133
C G + ++ G+VTGG + ++ GC P FP CNH +
Sbjct: 114 CRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFPKCNHV---------PGMK 164
Query: 134 TPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYF--DPHFGPFWPAFWRSFCTKYTRPLFQ 191
P+C ++ G +GL D H W S K + +F
Sbjct: 165 VKYPRCGSKV-------GRLAAPSHCDGLHCRRAGDVHRAKSWGRLPISP-EKIKQEIFD 216
Query: 192 TNGRVYAVSASAE----------------IVAYATVKIVGWGEENGRPYWTIVSTFGEQF 235
NG V A+ E +V T+K++GWG E G+ YW V+++ E++
Sbjct: 217 -NGPVAAIMTIHEDFRLYKSGVYEYKTGAMVGAHTLKLIGWGVEAGQEYWLAVNSWNEEW 275
Query: 236 GDKGTIKILRGRNEAIIES-------LVNGALPKDNYGVEFG---EESGERLSEEFGVRA 285
GD+G IK+ G+N ES VN L +D E G +++ +L E+ V
Sbjct: 276 GDQGKIKLAVGKNALDEESRQQVPRRAVN-ELDEDAMMAESGAKTQKAMAQLKEDVFVEK 334
Query: 286 ESSEEFRE 293
+ F E
Sbjct: 335 QVHSHFEE 342
>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 254
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 1/86 (1%)
Query: 74 WMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 133
++ + C G +W + + G V+GG ++SN GCQP + PPC N C T
Sbjct: 126 YLCGYGCDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINEKPPGHSCTTFN 185
Query: 134 TPQ-PKCHTRCTNDNYGRGFFQDKYQ 158
+ P C +C N NY F D Y+
Sbjct: 186 REETPTCEKKCNNPNYYTSFRADIYR 211
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 87/213 (40%), Gaps = 35/213 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G + W + G+VT + NTGC S P C EP A P P
Sbjct: 105 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 148
Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
KC +C + N YG ++ + + + + GP AF T Y
Sbjct: 149 KCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKN-GPVEVAF-----TVYEDFA 202
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY I +A VK++GWG ++G YW + + + +GD G KI RG N
Sbjct: 203 HYKSG-VYKHITGTNIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 260
Query: 249 EAIIESLVNGALPKDNYGVEFGEESGERLSEEF 281
E IE V LP D V+ S + L F
Sbjct: 261 ECGIEHGVVAGLPSDRNVVKGITTSDDLLVSSF 293
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 100/253 (39%), Gaps = 41/253 (16%)
Query: 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
T + IRD S SC AVA A+ ++ C + R AG
Sbjct: 107 TVTEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 149
Query: 65 QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
C + + C+ G W + G+V+ CQP FP C H ++
Sbjct: 150 MSCCDVCG-----FGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 197
Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK--YQINGLGLYFDPHF--GPFWPAFWRS 180
C P C++ CT+ ++ Y ++G + GPF +F
Sbjct: 198 DLSPCSG-EYDTPTCNSTCTDKKIPLIKYRGNTSYVLSGEEPFKRELILNGPFEVSF--- 253
Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
+ Y + T G VY A + +A V+IVGWGE NG PYW I +++ ++G G
Sbjct: 254 --SVYADFVAYTGG-VYKHVAGIFLGGHA-VRIVGWGELNGEPYWKIANSWNREWGMNGY 309
Query: 241 IKILRGRNEAIIE 253
I RG +E IE
Sbjct: 310 FLIARGVDECGIE 322
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 73/182 (40%), Gaps = 36/182 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G ++ W + G+V+GG ++S+ GCQP S +A + C+
Sbjct: 146 CVGGYTAKAWDYYINEGIVSGGDYNSSEGCQPYSKASFQYAVASKCVKACQNDKYDV--- 202
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+ +YG F+ + + + + TNG V A
Sbjct: 203 -KYDDDKHYGDSFYTLETNVTQI------------------------QTEILTNGPVMAT 237
Query: 200 SASAEIVAY-------ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEAI 251
E + Y + V I+ WG E G PYW I +++G +GD G IKI RG NE
Sbjct: 238 FNVFEDIIYYKSGIQLSNVSILRWGTEEGVPYWLIANSWGTWWGDLGGFIKIKRGTNECA 297
Query: 252 IE 253
IE
Sbjct: 298 IE 299
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 74/190 (38%), Gaps = 31/190 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
C G + W ++HK+G+V + C P YT CK + K
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHSRSLKA 295
Query: 139 --CHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTNG 194
C D Y +N H GP + R F +G
Sbjct: 296 NGCQKPVNVDRDSLYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYSG 349
Query: 195 RVYAVSASAEI--VAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY +A+ + +VK+VGWGEE NG YW +++G +G+ G +ILRG NE
Sbjct: 350 GVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECG 409
Query: 252 IESLVNGALP 261
IE V + P
Sbjct: 410 IEEYVLASWP 419
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 69/162 (42%), Gaps = 33/162 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG + S+ GC P + PPC H + P T P+C
Sbjct: 46 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDT--PRC 103
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-A 198
+ C Y + +DK HFG + ++ S K NG V A
Sbjct: 104 NKSC-EAGYSPSYKEDK------------HFG--YTSYSVSNSVKEIMAEIYKNGPVEGA 148
Query: 199 VSASAEIVAYAT---------------VKIVGWGEENGRPYW 225
+ ++ + Y + ++I+GWG ENG PYW
Sbjct: 149 FTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYW 190
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 86/206 (41%), Gaps = 29/206 (14%)
Query: 27 SCIEARAVATATPLAFAVCRSSK--MHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGI 84
SC A ++ VC SK ++VE ++ ++ C + C+ G
Sbjct: 37 SCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLLSCCGMECGF---------GCNGGY 87
Query: 85 SSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 144
S W + + GLV+GG S+ GC+P + PPC H + S P C PKC +C
Sbjct: 88 PSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEH-HVNGSRPSCTGEEGDTPKCVMQC- 145
Query: 145 NDNYGRGFFQDKY--------QINGLGLYFDPH-FGPFWPAFWRSFCTKYTRPLFQTNGR 195
Y +F+DK+ N + + + GP AF T Y L +G
Sbjct: 146 EAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAF-----TVYEDFLQYKSGV 200
Query: 196 VYAVSASAEIVAYATVKIVGWGEENG 221
V+ A V ++I+GWG E+G
Sbjct: 201 YKHVTGDA--VGGHAIRILGWGVESG 224
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 78/191 (40%), Gaps = 32/191 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++HK+G+V + C P YT CK +
Sbjct: 252 CDGGHLDAAWRFLHKKGVV-------DDSCYP----------YTQQRDTCKIRHNSRSLK 294
Query: 140 HTRCT-NDNYGRGFFQD---KYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
C + N R F Y +N G H GP + R F +
Sbjct: 295 ANGCRPSPNVDRDSFYTVGPAYTLNREGDIMAEIYHSGPVQATM------RVYRDFFSYS 348
Query: 194 GRVYAVSASAEIV--AYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
G +Y +A+ + +VK+VGWGEE NG YW +++G +G++G +ILRG NE
Sbjct: 349 GGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNEC 408
Query: 251 IIESLVNGALP 261
IE V + P
Sbjct: 409 GIEEYVLASWP 419
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 85/222 (38%), Gaps = 42/222 (18%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A ++ VC SK +F F A C W + C+ G
Sbjct: 52 SCGSCWAFGAVEAMSDRVCIHSK---GTKNFHFSAENLVSCCWTCG-----FGCNGGFPG 103
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+ W + +G+V+GG + SN GC P PC H T P CK PKC +C D
Sbjct: 104 AAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPKCVKKC-ED 160
Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-AVSASAEI 205
Y + QD ++ A+ S R TNG V A + +
Sbjct: 161 GYKVPYEQDLHRGKS--------------AYSLSNDVDQIRQEIYTNGPVEGAFTVYEDF 206
Query: 206 VAY---------------ATVKIVGWGEENGR-PYWTIVSTF 231
+AY ++I+GWG +NG PYW + +++
Sbjct: 207 IAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSW 248
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 1/78 (1%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ GI W + G+VTGG++ ++TGCQP FP C H + + + C+ P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220
Query: 140 HTRCTNDNYGRGFFQDKY 157
+ C D Y + DKY
Sbjct: 221 YQTCQPD-YAIQYENDKY 237
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 79/194 (40%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + G+VT + N GC S P C EP P P
Sbjct: 186 CDGGYPMYAWRYFVHHGVVTEECDPYFDNIGC---SHPGC--------EP-----GFPTP 229
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KC +C + N + + Q K Y +N + DPH GP +F T Y
Sbjct: 230 KCVRKCIDKN--QLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSF-----TVYEDF 282
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY + E++ VK++GWG +NG YW + + + +GD G KI RG
Sbjct: 283 AHYKSG-VYK-HITGEVMGGHAVKLIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGT 340
Query: 248 NEAIIESLVNGALP 261
NE IE LP
Sbjct: 341 NECGIEDDAVAGLP 354
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 77/196 (39%), Gaps = 41/196 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 138
C G S W + + G+VT C P C H P C+ A P P
Sbjct: 164 CDGGYPISAWQYFVQNGVVT-------EECDPYFDQVGCKH-------PGCEP-AYPTPV 208
Query: 139 CHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYT 186
C +C N + F D YQ+N DPH GP AF T Y
Sbjct: 209 CEKKCKVQNQVWQEKKHFSIDAYQVNS-----DPHDIMAEVYKNGPVEVAF-----TVYE 258
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILR 245
+G ++ ++ VK++GWG + G YW + + + +GD G KI+R
Sbjct: 259 DFAHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIR 316
Query: 246 GRNEAIIESLVNGALP 261
G+NE IE V +P
Sbjct: 317 GKNECGIEEDVTAGMP 332
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 34/53 (64%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V + GWG ENG PYW + +++G +G+KG KILRG N IES V +PK
Sbjct: 228 AVLLCGWGVENGLPYWLVQNSWGPAWGEKGFFKILRGSNHCEIESYVTLGVPK 280
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 86/210 (40%), Gaps = 38/210 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECK-----TLAT 134
CS G + W +V K G V N C P Y +++ CK TL T
Sbjct: 252 CSGGHLDTAWNYVRKVGTV-------NDECYP----------YISAQNACKIRPSDTLIT 294
Query: 135 PQPKCHTRCTNDN-YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTN 193
T+ N Y G + + H GP + R F
Sbjct: 295 ANCDLPTKVDRTNMYKMGPAFSLNNETDIMIEIKKH-GPVQAIL------RVHRDFFSYK 347
Query: 194 GRVY----AVSASAEIVAYATVKIVGWGEE-NG---RPYWTIVSTFGEQFGDKGTIKILR 245
+Y A SA E Y +V+++GWGEE NG YW V+++G +G+ G +I+R
Sbjct: 348 SGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVR 407
Query: 246 GRNEAIIESLVNGALPKDNYGVEFGEESGE 275
G+NE IES V +LP + V+ + GE
Sbjct: 408 GQNECEIESYVLASLPYVHQQVKPMRQVGE 437
>gi|123469339|ref|XP_001317882.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121900627|gb|EAY05659.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 241
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 35/51 (68%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V+++GWG+ENG YW +++ G+ +G GT+ I G NE +IES + GA P
Sbjct: 188 VELIGWGKENGVEYWILLNQHGKNWGINGTMHIKMGSNEGLIESFIYGATP 238
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 81/203 (39%), Gaps = 51/203 (25%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G W + G+VT + NTGC S P C EP P P
Sbjct: 169 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 212
Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSF 181
KC +C + N YG G Y+IN DP GP AF
Sbjct: 213 KCERKCVSRNQLWGESKHYGVG----AYRINP-----DPQDIMAEVYKNGPVEVAF---- 259
Query: 182 CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGT 240
T Y +G VY +I +A VK++GWG ++G YW + + + +GD G
Sbjct: 260 -TVYEDFAHYKSG-VYKYITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGY 316
Query: 241 IKILRGRNEAIIESLVNGALPKD 263
KI RG NE IE V LP +
Sbjct: 317 FKIRRGTNECGIEQSVVAGLPSE 339
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 83/200 (41%), Gaps = 37/200 (18%)
Query: 74 WMTIWVCSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKT 131
WM C G W + + G+VT + + GC S P C EP
Sbjct: 131 WMCGDGCDGGYPIDAWRYFVQSGVVTEECDPYFDDIGC---SHPGC--------EP---- 175
Query: 132 LATPQPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFC 182
P PKC +C + N + + + K + +N + DPH GP AF
Sbjct: 176 -GFPTPKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAF----- 227
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTI 241
T Y +G VY + +++ VK++GWG ++G YW + + + +GD G
Sbjct: 228 TVYEDFAHYKSG-VYK-HITGDVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYF 285
Query: 242 KILRGRNEAIIESLVNGALP 261
KI RG NE IE V LP
Sbjct: 286 KIRRGTNECGIEEDVVAGLP 305
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 74/200 (37%), Gaps = 63/200 (31%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W W+ K+G+ T C P Y + P C
Sbjct: 121 CEGGYADRVWNWIQKKGITT-------EQCLP----------YVSGSGRV-------PTC 156
Query: 140 HTRCTN-DNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
++C N N R F W SF +K NG VYA
Sbjct: 157 PSKCKNGSNIVRSFVSS----------------------WGSFNSKTVMDEVANNGPVYA 194
Query: 199 --------VSASAEIVAYAT--------VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
++ + I + T V ++GWG ENG PYW + +++G +G+KG +
Sbjct: 195 CFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGTENGVPYWLLQNSWGSGWGEKGFFR 254
Query: 243 ILRGRNEAIIESLVNGALPK 262
I RG N+ I+ + LPK
Sbjct: 255 IRRGTNDCHIDEIFYSGLPK 274
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 77/168 (45%), Gaps = 18/168 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP--QP 137
C+ G + W++ ++G+V+GG + S GC+P PC H + + P C +TP Q
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPSCQH 213
Query: 138 KCHTR-----CTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQT 192
KC + N+G + + + + + GP AF T Y +
Sbjct: 214 KCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTN-GPVEGAF-----TVYEDLILYK 267
Query: 193 NGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDK 238
+G VY E+ +A ++I+GWG E+ PYW I +++ +GD
Sbjct: 268 SG-VYQHEHGKELGGHA-IRILGWGVWGESKVPYWLIGNSWNTDWGDN 313
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 81/203 (39%), Gaps = 51/203 (25%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G W + G+VT + NTGC S P C EP P P
Sbjct: 191 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 234
Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSF 181
KC +C + N YG G Y+IN DP GP AF
Sbjct: 235 KCERKCVSRNQLWGESKHYGVG----AYRINP-----DPQDIMAEVYKNGPVEVAF---- 281
Query: 182 CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGT 240
T Y +G VY +I +A VK++GWG ++G YW + + + +GD G
Sbjct: 282 -TVYEDFAHYKSG-VYKYITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGY 338
Query: 241 IKILRGRNEAIIESLVNGALPKD 263
KI RG NE IE V LP +
Sbjct: 339 FKIRRGTNECGIEQSVVAGLPSE 361
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats.
Identities = 25/61 (40%), Positives = 38/61 (62%)
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+ E + +KI+GWG E+G YW + +++ E +GD+G KILRG +E IES + P
Sbjct: 32 TGEALGGHAIKILGWGNEDGHDYWLVANSWNEDWGDQGFFKILRGVDECGIESQITAGSP 91
Query: 262 K 262
K
Sbjct: 92 K 92
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 82/194 (42%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 168 CDGGYPLYAWQYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 211
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KC +C + N + + + K Y +N + DPH GP AF T Y
Sbjct: 212 KCVKKCVSGN--QVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAF-----TVYEDF 264
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY E+ +A VK++GWG E+G YW + + + ++GD G KI RG
Sbjct: 265 AHYKSG-VYKHITGYELGGHA-VKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGT 322
Query: 248 NEAIIESLVNGALP 261
NE IE V LP
Sbjct: 323 NECGIEEDVTAGLP 336
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/191 (27%), Positives = 74/191 (38%), Gaps = 32/191 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295
Query: 139 ---CHTRCTNDNYGRGFFQDKYQINGLGLYFDP--HFGPFWPAFWRSFCTKYTRPLFQTN 193
C T D Y +N H GP + R F +
Sbjct: 296 ANGCQTPVNVDRDTLYTVGPAYSLNREADIMAEIFHSGPVQATM------RVNRDFFAYS 349
Query: 194 GRVYAVSASAEIV--AYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
G VY +A+ + +VK+VGWGEE NG YW +++G +G+ G +ILRG NE
Sbjct: 350 GGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNEC 409
Query: 251 IIESLVNGALP 261
IE V + P
Sbjct: 410 GIEEYVLASWP 420
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 3/79 (3%)
Query: 187 RPLFQTNGRVYAVSASAE--IVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
R F +G +Y +A++ V + +VK++GWGEE +G YW +++G +G+ G +I
Sbjct: 298 RDFFSYSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWIATNSWGTWWGEHGNFRI 357
Query: 244 LRGRNEAIIESLVNGALPK 262
LRG NE IE V A P
Sbjct: 358 LRGSNECGIEEYVLAAWPN 376
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 63/153 (41%), Gaps = 17/153 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + K G+ TGG++ S +GC+P PPC H T C T P C
Sbjct: 44 CEGGYPIEAWKYWVKTGICTGGSYESQSGCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVC 103
Query: 140 HTRCT---------NDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
+C + +YG + + G+ + GP A+ T Y +
Sbjct: 104 TNKCIAAYKTPYSDDKHYGTSAYNVAKTVAGIQKEIMTN-GPVEAAY-----TVY-EDFY 156
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRP 223
Q G VY + AE+ +A V+I+GWG P
Sbjct: 157 QYTGGVYTHTGGAEVGGHA-VRILGWGVRQQDP 188
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 75/194 (38%), Gaps = 38/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G S W + G+VT + GCQ P C+ L P P
Sbjct: 164 CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQ---------------HPGCEPL-YPTP 207
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
+C +C ++N G + ++ + P + YT+ + + VY
Sbjct: 208 QCVKQCKDENQNWGNSK-RFSATAYRITSKP---------YDIMAEVYTKGPVEVDFLVY 257
Query: 198 AVSA----------SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
A + + + VK++GWG ENG YW + +++ +G+ G KI RG
Sbjct: 258 EDFAHYKSGVYKYITGDFLGGHAVKLIGWGTENGTDYWLVANSWNTAWGEDGYFKIARGS 317
Query: 248 NEAIIESLVNGALP 261
NE IE V +P
Sbjct: 318 NECSIEEDVVAGMP 331
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/213 (23%), Positives = 77/213 (36%), Gaps = 32/213 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHS---NTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
C+ G W W ++G+VTGG + T C P P C H + P C T P+
Sbjct: 241 CNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPYEIPFCAH-HAKAPFPNCDTDVRPR 299
Query: 137 --PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG 194
PKC C Y L FD ++ R +
Sbjct: 300 KTPKCRKDCEEAAYSEHV-----------LPFDKDVHKASSSYSLRSRDAVKRDMMAHGT 348
Query: 195 RVYAVSASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKG 239
A + + Y + +KI+GWG E+G YW V+++ +GD G
Sbjct: 349 VTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDGEEYWHAVNSWNTYWGDSG 408
Query: 240 TIKILRGRNEAIIESLVNGALPKDNYGVEFGEE 272
KI G+ E + A ++ GV G++
Sbjct: 409 HFKIEMGQCGVDNEMVAGEAAWQETEGVVNGDK 441
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDKY 157
+C Y + QDK+
Sbjct: 218 KQKCQK-GYKTPYEQDKH 234
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 38/61 (62%)
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+ +++ ++I+GWG ENG PYW + +++ +GD G KILRG N IES + +P
Sbjct: 62 AGDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIP 121
Query: 262 K 262
+
Sbjct: 122 R 122
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 81/195 (41%), Gaps = 35/195 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G + W + G+VT + NTGC S P C EP A P P
Sbjct: 172 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 215
Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
KC +C + N YG ++ + + + + GP AF T Y
Sbjct: 216 KCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKN-GPVEVAF-----TVYEDFA 269
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY I +A VK++GWG ++G YW + + + +GD G KI RG N
Sbjct: 270 HYKSG-VYKHITGTNIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 327
Query: 249 EAIIESLVNGALPKD 263
E IE V LP D
Sbjct: 328 ECGIEHGVVAGLPSD 342
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 75/199 (37%), Gaps = 47/199 (23%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + + G+VT GCQ P C+ A P P
Sbjct: 165 CDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQ---------------HPGCEP-AYPTP 208
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAF--WRSFCT 183
C +C N + F D YQ+N DPH GP +F + F
Sbjct: 209 VCEKKCKVQNQVWEEKKHFSIDAYQVNS-----DPHDIMAEVYKNGPVEVSFIIYEDFAH 263
Query: 184 KYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIK 242
+ Q GR+ A+ K++GWG + G YW + + + +GD G K
Sbjct: 264 YKSGVYKQITGRMVGGHAA---------KLIGWGTSDAGEDYWLLANQWNRGWGDDGYFK 314
Query: 243 ILRGRNEAIIESLVNGALP 261
I+RG NE IE VN +P
Sbjct: 315 IIRGTNECGIEGDVNAGMP 333
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 75/190 (39%), Gaps = 31/190 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++HK+G++ + C P YT S CK + K
Sbjct: 255 CEGGHLDAAWRYLHKKGVL-------DESCYP----------YTQSRGTCKVRHSGSLKA 297
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----HFGPFWPAFWRSFCTKYTRPLFQTNG 194
H R L D H GP + R F +G
Sbjct: 298 HGCRPAPGVDRDSLYTVGPAYSLSREADIKAEIFHSGPVQATM------RVYRDFFSYSG 351
Query: 195 RVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+Y +A+ + +VK+VGWGEE NG YW +++G +G++G +ILRG NE
Sbjct: 352 GIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECG 411
Query: 252 IESLVNGALP 261
IE V + P
Sbjct: 412 IEDYVLASWP 421
>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
Length = 215
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 68/154 (44%), Gaps = 25/154 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W K GLVTGG + S GC+P PPC + Y + T + +
Sbjct: 75 CYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGNN-----TCSGQPMES 129
Query: 140 HTRCTNDNYGRG---------FFQDKYQINGLGLYFDP-HFGPFWPAF--WRSFCTKYTR 187
+ RCT YG + +D Y + G+ D ++GP +F + F
Sbjct: 130 NHRCTRMCYGNQDLDFDQDHRYTRDHYYLTYRGIQKDVINYGPIEASFDVYDDF------ 183
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENG 221
P +++ +Y S +A + +VK++GWGEE G
Sbjct: 184 PSYKSG--IYVKSENASYLGGHSVKLIGWGEEYG 215
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CDGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLR 295
Query: 140 HTRC-TNDNYGRGFFQD---KYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTN 193
C T N R F Y +N F GP + R F +
Sbjct: 296 ANGCETPVNVDRDTFYTVGPAYSLNREADIMAEIFNSGPVQATM------RVNRDFFSYS 349
Query: 194 GRVYAVSASAE--IVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
VY +A+ + +VK+VGWGEE NG YW +++G +G+KG +ILRG NE
Sbjct: 350 RGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEKGYFRILRGSNEC 409
Query: 251 IIESLVNGALP 261
IE V + P
Sbjct: 410 GIEEYVLASWP 420
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 71/200 (35%), Gaps = 63/200 (31%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W W+ K+G+ T C P Y + P C
Sbjct: 119 CNGGYADRVWNWIQKKGITT-------EQCIP----------YVSGSGRV-------PTC 154
Query: 140 HTRCTN-DNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
++C N N R F W SF +K NG VYA
Sbjct: 155 PSKCKNGSNIVRSFVSS----------------------WGSFNSKTVMDEVANNGPVYA 192
Query: 199 V----------------SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
+ + V ++GWG ENG PYW + +++G +G+KG +
Sbjct: 193 CFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGTENGVPYWLLQNSWGSGWGEKGFFR 252
Query: 243 ILRGRNEAIIESLVNGALPK 262
I RG N+ I+ + LPK
Sbjct: 253 IRRGTNDCHIDEIFYSGLPK 272
>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
Length = 487
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 7/89 (7%)
Query: 184 KYTRPLFQTNGRVYAVSAS--AEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD 237
K ++ F VY S Y TV+IVGWGEE NGR YW + +++G +G+
Sbjct: 375 KVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGE 434
Query: 238 KGTIKILRGRNEAIIESLVNGALPK-DNY 265
G +IL+G NE IE V A+P DN+
Sbjct: 435 SGYFRILKGTNECQIEDFVVAAMPDIDNF 463
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 3/81 (3%)
Query: 184 KYTRPLFQTNGRVYAVSASAEI--VAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGT 240
+ R F +G VY +A+ + +VK+VGWGEE NG YW +++G +G+ G
Sbjct: 340 RVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGY 399
Query: 241 IKILRGRNEAIIESLVNGALP 261
+ILRG NE IE V + P
Sbjct: 400 FRILRGSNECGIEEYVLASWP 420
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 38/61 (62%)
Query: 201 ASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
+ +++ V+I+GWG ENG PYW + +++ +GD G KILRG++ IES + +
Sbjct: 65 VAGDMMGGHAVRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGI 124
Query: 261 P 261
P
Sbjct: 125 P 125
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 78/197 (39%), Gaps = 43/197 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G S W + + G+VT + GC+ P C EP A P P
Sbjct: 125 CDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 168
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
C +C N + F + YQ+N DPH GP AF T Y
Sbjct: 169 VCEKKCKVQNQVWEEKKHFSINAYQVNS-----DPHDIMAEVYNNGPVEVAF-----TVY 218
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKIL 244
+G ++ ++ VK++GWG + G YW + + + +GD G KI+
Sbjct: 219 EDFAHYKSGVYKHITGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKII 276
Query: 245 RGRNEAIIESLVNGALP 261
RG+NE IE V +P
Sbjct: 277 RGKNECGIEEDVTAGMP 293
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 97/241 (40%), Gaps = 46/241 (19%)
Query: 52 VECTSFRFIAGVKQRCAWLVSRWMTI--WVCSSGISS----STWAWVHKRGLVTG--GAH 103
VEC RF + + V+ + ++C SG + S W + ++G+VT +
Sbjct: 131 VECLQDRFCIHLNMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRKGVVTDECDPY 190
Query: 104 HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
GC+ P C EP +T PKC +C N Q + ++
Sbjct: 191 FDQVGCK---HPGC--------EPAYRT-----PKCEKKCKVQNEVWKE-QKHFSVDAYR 233
Query: 164 LYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVG 215
++ +PH GP AF T Y +G ++ ++ VK++G
Sbjct: 234 VHSNPHDIMAEVYTNGPVEVAF-----TVYEDFAHYKSGVYKHITGG--VMGGHAVKLIG 286
Query: 216 WGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD-----NYGVEF 269
WG + G YW + + + +GD G KI+RG+NE IE V +P NY F
Sbjct: 287 WGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNMARNYDDAF 346
Query: 270 G 270
G
Sbjct: 347 G 347
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/58 (46%), Positives = 37/58 (63%), Gaps = 1/58 (1%)
Query: 204 EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
E+V +A V +VGWG ENG PYW I +++ +GD G KILRG +E +ES +P
Sbjct: 372 EVVNHA-VLMVGWGVENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVESDAEAGIP 428
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 81/197 (41%), Gaps = 43/197 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
KC +C + N + + Y++N DPH GP AF T Y
Sbjct: 213 KCVKKCVSGNQVWKKSKHYSVSAYRVNS-----DPHDIMAEVYKNGPVEVAF-----TVY 262
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKIL 244
+ +G VY E+ +A VK++GWG ++G YW + + + ++GD G KI
Sbjct: 263 EDFAYYKSG-VYKHITGYELGGHA-VKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIR 320
Query: 245 RGRNEAIIESLVNGALP 261
RG NE IE V LP
Sbjct: 321 RGTNECGIEEDVTAGLP 337
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 62/168 (36%), Gaps = 32/168 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG + C+P FPPC EC A PKC
Sbjct: 43 CEGGWPMKAWQYFXLEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDSAK-TPKC 101
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
C RG+ + + D HFG A+ K + NG V A
Sbjct: 102 QKTCQ-----RGYLKPYKE--------DKHFGK--SAYRLPNNVKAIQRDIMKNGPVVAG 146
Query: 200 SASAEIVAY----------------ATVKIVGWGEENGRPYWTIVSTF 231
E A+ VKI+GWG+E G PYW I +++
Sbjct: 147 FIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEXGTPYWLIANSW 194
>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 134
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 63/144 (43%), Gaps = 31/144 (21%)
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
P C + C N YG F +D++ L F FG T + TNG
Sbjct: 6 PSCSSSCPNAKYGTAFDKDRHYTESL---FPSRFG----------STSSIKKEIMTNGPT 52
Query: 197 YAV-SASAEIVAYAT---------------VKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
A S + ++Y + V+I+GWG E G YW +++++ E++GD GT
Sbjct: 53 SAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVDYWLVMNSWNEEWGDHGT 112
Query: 241 IKILRGRNEAIIESLVNGALPKDN 264
KI++G + I+ ++ P N
Sbjct: 113 FKIVQG--DCGIDDMILAGTPAIN 134
>gi|146163742|ref|XP_001012227.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145940|gb|EAR91982.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 581
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 55/109 (50%), Gaps = 1/109 (0%)
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+ G +Y + + +A + +VGWG ENG YW + +++G +G+KG +++RG N
Sbjct: 209 YNYTGGIYVNTTEVDYHNHA-ISVVGWGVENGTKYWIVRNSWGSYWGEKGYFRLVRGINS 267
Query: 250 AIIESLVNGALPKDNYGVEFGEESGERLSEEFGVRAESSEEFRENGEEE 298
IES A+PKD + + + + + R +EN +++
Sbjct: 268 LNIESDCAWAVPKDTWTNDVRNTTASNTNSQSNFRQLHDCVRQENNQKD 316
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 71/162 (43%), Gaps = 24/162 (14%)
Query: 103 HHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ---PKCHTRCTNDNYGRGFFQDKYQI 159
H N G S+ H+ TT E C + P C +CTN G + K +
Sbjct: 125 HGCNGGSPLFSWEWVKHSGITTEE--CIPYVSGGGRVPSCPKKCTN---GSAIVRTKAKS 179
Query: 160 NGL--GLYFDPHF---GPFWPAF--WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVK 212
GL G GPF AF + F + + G++ A V
Sbjct: 180 VGLVKGDKMQNELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHA---------VM 230
Query: 213 IVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+VGWG E+G PYW I +++G +G++G KILRG+NE IE+
Sbjct: 231 VVGWGVEDGTPYWLIQNSWGTTWGEQGFFKILRGKNECGIET 272
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 8/97 (8%)
Query: 187 RPLFQTNGRVYAVSASA----EIVAYATVKIVGWGEE----NGRPYWTIVSTFGEQFGDK 238
R F +Y SA+A E AY +V+++GWGEE + YW ++++G+ +G+
Sbjct: 342 RDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINSWGQWWGEN 401
Query: 239 GTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGE 275
G +ILRG NE IES V + P + V+ + GE
Sbjct: 402 GRFRILRGSNECDIESYVLASNPYVHEHVQAIRKVGE 438
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 78/186 (41%), Gaps = 35/186 (18%)
Query: 89 WAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
W + G+VT + NTGC S P C EP P PKC +C ++
Sbjct: 180 WLYFKYHGVVTEECDPYFDNTGC---SHPGC--------EP-----GYPTPKCVRKCVSE 223
Query: 147 NYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
N G + Y ++ + DP GP AF T Y +G VY
Sbjct: 224 NQLWGESK-HYGVSAYRINHDPQDIMAEVYKNGPVEVAF-----TVYEDFAHYKSG-VYK 276
Query: 199 VSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
+I +A VK++GWG ++G YW + + + +GD G KI RG NE IE V
Sbjct: 277 HITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 335
Query: 258 GALPKD 263
LP D
Sbjct: 336 AGLPSD 341
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 77/199 (38%), Gaps = 48/199 (24%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G S W + G+VT + + GCQ P C+ L P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207
Query: 138 KCHTRCTNDNYGRG----FFQDKYQINGLGLYFDPHF---GPFWPAF--------WRSFC 182
+C +C ++N G F Y+I+ GP +F ++S
Sbjct: 208 QCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGV 267
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
KYT+ + + VK+VGWG E+G YW + +++ +G+ G K
Sbjct: 268 YKYTK---------------GDYMGGHAVKLVGWGTEDGTDYWLVANSWNTAWGEDGYFK 312
Query: 243 ILRGRNEAIIESLVNGALP 261
I RG NE IE V +P
Sbjct: 313 IARGSNECGIEGDVVAGMP 331
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 77/199 (38%), Gaps = 48/199 (24%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G S W + G+VT + + GCQ P C+ L P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207
Query: 138 KCHTRCTNDNYGRG----FFQDKYQINGLGLYFDPHF---GPFWPAF--------WRSFC 182
+C +C ++N G F Y+I+ GP +F ++S
Sbjct: 208 QCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGV 267
Query: 183 TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
KYT+ + + VK+VGWG E+G YW + +++ +G+ G K
Sbjct: 268 YKYTK---------------GDYMGGHAVKLVGWGTEDGTDYWLVANSWNTAWGEDGYFK 312
Query: 243 ILRGRNEAIIESLVNGALP 261
I RG NE IE V +P
Sbjct: 313 IARGSNECGIEGDVVAGMP 331
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 79/188 (42%), Gaps = 26/188 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++HK+G+V + C P + H + ++L +
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYPYT----QHRDTCKIRHNSRSLKANGCRP 303
Query: 140 HTRCTNDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
D++ G + +K +Y H GP + R F + V
Sbjct: 304 SANVDRDSFYTVGPAYTLNKESDIMAEIY---HSGPVQATM------RVYRDFFSYSSGV 354
Query: 197 YAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
Y +A+ + +VK+VGWGEE NG YW +++G +G++G +ILRG NE IE
Sbjct: 355 YRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIE 414
Query: 254 SLVNGALP 261
V + P
Sbjct: 415 DYVLASWP 422
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 38/60 (63%)
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+ +V ++++GWG ENG YW I +++ E +G+KG ++ RG NE IE+ +N LP
Sbjct: 53 TGRLVGGHAIRVIGWGVENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 2/78 (2%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 140 HTRCTNDNYGRGFFQDKY 157
C Y + QDK+
Sbjct: 218 KQTCQK-GYKTPYEQDKH 234
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 79/188 (42%), Gaps = 26/188 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W ++HK+G+V + C P + H + ++L +
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYPYT----QHRDTCKIRHNSRSLKANGCRP 303
Query: 140 HTRCTNDNY---GRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
D++ G + +K +Y H GP + R F + V
Sbjct: 304 SANVDRDSFYTVGPAYTLNKESDIMAEIY---HSGPVQATM------RVYRDFFSYSSGV 354
Query: 197 YAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
Y +A+ + +VK+VGWGEE NG YW +++G +G++G +ILRG NE IE
Sbjct: 355 YRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIE 414
Query: 254 SLVNGALP 261
V + P
Sbjct: 415 DYVLASWP 422
>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
Length = 91
Score = 54.3 bits (129), Expect = 6e-05, Method: Composition-based stats.
Identities = 22/52 (42%), Positives = 33/52 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
++I+GWG E PYW + +++ ++GD G KILRG NE IE + +PK
Sbjct: 39 IRIIGWGVEEDVPYWLVANSWNREWGDNGYFKILRGSNECGIEDDIVAGIPK 90
>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 217
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 38/80 (47%), Gaps = 1/80 (1%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 138
C G +W + + G V+GG ++SN GCQP + PPC N C T + P
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189
Query: 139 CHTRCTNDNYGRGFFQDKYQ 158
C +C N NY F D Y+
Sbjct: 190 CEKKCYNPNYYTSFRTDIYK 209
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 67/163 (41%), Gaps = 19/163 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W K+G VTGG++ TGC+P +PPC H T C + P +
Sbjct: 44 CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQN 103
Query: 140 HTRCTNDNYGRGFFQDKY-----------QINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
+ + +D + + G+ H R T +
Sbjct: 104 ANALGKLDIALTYHKDLHFRTILHTPASKEAAGIPKGIKTH------GQLRGGITVF-ED 156
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
+G VY +A A + +A VK++GWG +NG PYW I +++
Sbjct: 157 FEHYSGGVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLIANSW 198
>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 355
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 31/45 (68%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+VKI+GWG ENG YW I STFG +G++GT LRG N ++ S
Sbjct: 202 SVKIIGWGVENGTEYWLITSTFGIGWGNQGTAMFLRGVNHLVLPS 246
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
Query: 187 RPLFQTNGRVYAVSASAEIVA--YATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
R F + VY +A+ A + +VK+VGWGEE NG YW +++G +G++G +I
Sbjct: 344 RDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWIAANSWGPWWGERGYFRI 403
Query: 244 LRGRNEAIIESLVNGALP 261
LRG NE IE V + P
Sbjct: 404 LRGSNECGIEEYVLASWP 421
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 1/74 (1%)
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L + G +Y + A+ + +A + +VGWGEE+G+ YW +++G +G+KG +I+RG N
Sbjct: 204 LMEYKGGIYRDTTGAKTLDHA-ISVVGWGEEDGQKYWIARNSWGTFWGEKGWFRIVRGEN 262
Query: 249 EAIIESLVNGALPK 262
IE+ A+P+
Sbjct: 263 NLGIEADCQWAVPR 276
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 199 VSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
V IV Y V++ GWGE E+G YW +++G +G+ G +++ G ++ +I N
Sbjct: 496 VDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSWGPYWGEHGWFRMIVGVSKGLITGYCN 555
Query: 258 GALP 261
+P
Sbjct: 556 WGVP 559
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 187 RPLFQTNGRVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
R F +G VY +A+ + +VKIVGWGEE +G YW +++G +G+ G +I
Sbjct: 343 RDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGEHGYFRI 402
Query: 244 LRGRNEAIIESLVNGALPK 262
LRG NE IE V + P
Sbjct: 403 LRGSNECGIEEYVLASWPN 421
>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
Length = 69
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 33/53 (62%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+KI+GWGEE+G PYW +++ +GD G K LRG + IES + +PK
Sbjct: 17 AIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIESEIVAGIPK 69
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
Y ++Q + AS+ Y +V+++GWG ++ GRP YW +++G Q+G+ G
Sbjct: 368 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 427
Query: 241 IKILRGRNEAIIESLVNGALPK 262
KILRG N IES V GA K
Sbjct: 428 FKILRGENHCEIESFVIGAWGK 449
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 61/157 (38%), Gaps = 21/157 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + + G+VTGG + TGC P FP C H + C P P C
Sbjct: 132 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPGYIYPTPSC 191
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLY-FDPH----------FGPFWPAFWRSFCTKYTRP 188
+ C Y + + +DK + G Y D H GP F YT
Sbjct: 192 YPYC-QAGYDKTYEEDK--VYGKTSYNVDRHEYTIMQEIMKNGPVEAGF-----IVYTDF 243
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYW 225
+G + V S ++I+GWG ENG YW
Sbjct: 244 AVYKSGIYHHV--SGRYAGKHAIRIIGWGVENGVNYW 278
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 78/202 (38%), Gaps = 30/202 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTG------CQPVSFPPCNHANYTTSEP------ 127
C G + W +V K G VTGG ++ TG C P C+H +P
Sbjct: 92 CDGGQIITPWTYVAKAGAVTGG-QYNGTGPFGAGLCADWFAPHCHHHGPRGDDPYPAEGD 150
Query: 128 -ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQING---------LGLYFDPHFGPFWPAF 177
C + +P+ T F DK+ G + GP AF
Sbjct: 151 AGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVETAF 210
Query: 178 WRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGD 237
T Y G +Y E +A VK VGWG ENG YW + +++ +G+
Sbjct: 211 -----TVY-EDFENYAGGIYHHVTGEEAGGHA-VKFVGWGVENGTKYWKVANSWNPYWGE 263
Query: 238 KGTIKILRGRNEAIIESLVNGA 259
G +ILRG NE IE V G+
Sbjct: 264 AGYFRILRGSNEGGIEDQVTGS 285
>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 159
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 68/160 (42%), Gaps = 34/160 (21%)
Query: 105 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
S GC P FP CNH S P C ++ H T+ + + +D ++ G
Sbjct: 4 SADGCWPYPFPKCNHVRSAASRYPACPAVSPSAVGAHQMETSYSL---YIRDLHRAKSFG 60
Query: 164 LYFDPHFGPFWPAFWRSFCTKYTRPLFQTNG------------RVYA----VSASAEIVA 207
PA ++ + +F TNG RVY V +
Sbjct: 61 RL---------PAIPQNI----KQEIF-TNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQG 106
Query: 208 YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
T+KI+GWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 107 IHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 146
>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
Length = 349
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
Y ++Q + AS+ Y +V+++GWG ++ GRP YW +++G Q+G+ G
Sbjct: 247 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 306
Query: 241 IKILRGRNEAIIESLVNGALPK 262
KILRG N IES V GA K
Sbjct: 307 FKILRGDNHCEIESFVVGAWGK 328
>gi|66270083|gb|AAY43371.1| cathepsin-like cysteine protease [Phytophthora infestans]
Length = 635
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
T + +G ++ +A V +A + IVGWGEENG P+W + +++G +G+ G ++++R
Sbjct: 227 TDGFLKYSGGIFDDKTNATDVDHA-ISIVGWGEENGVPFWVLRNSWGSFWGESGWMRLVR 285
Query: 246 GRNEAIIESLVNGALPKDN 264
G N +E +P+D+
Sbjct: 286 GVNNVGVEGECAFGVPRDD 304
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
Y ++Q + AS+ Y +V+++GWG ++ GRP YW +++G Q+G+ G
Sbjct: 364 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 423
Query: 241 IKILRGRNEAIIESLVNGALPK 262
KILRG N IES V GA K
Sbjct: 424 FKILRGDNHCEIESFVIGAWGK 445
>gi|428169747|gb|EKX38678.1| hypothetical protein GUITHDRAFT_76993, partial [Guillardia theta
CCMP2712]
Length = 85
Score = 53.1 bits (126), Expect = 2e-04, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 36/58 (62%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
VY SA A+ V V +VGWG ENG YW + +++G+ GD+G K+ +G NE IE
Sbjct: 28 VYTKSAKAQKVGGHAVVLVGWGRENGVDYWLVQNSWGKSSGDEGMWKVRKGSNECGIE 85
>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 487
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 45/83 (54%), Gaps = 6/83 (7%)
Query: 184 KYTRPLFQTNGRVYAVS--ASAEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD 237
K ++ F VY S A Y TV+IVGWGEE NGR YW + +++G +G+
Sbjct: 375 KVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGE 434
Query: 238 KGTIKILRGRNEAIIESLVNGAL 260
G +IL+G NE IE V A+
Sbjct: 435 SGYFRILKGTNECQIEDFVVAAM 457
>gi|301119245|ref|XP_002907350.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262105862|gb|EEY63914.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 710
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
T + +G ++ +A V +A + IVGWGEENG P+W + +++G +G+ G ++++R
Sbjct: 227 TDGFLKYSGGIFDDKTNATDVDHA-ISIVGWGEENGVPFWVLRNSWGSFWGESGWMRLVR 285
Query: 246 GRNEAIIESLVNGALPKDN 264
G N +E +P+D+
Sbjct: 286 GVNNVGVEGECAFGVPRDD 304
>gi|145541902|ref|XP_001456639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424451|emb|CAK89242.1| unnamed protein product [Paramecium tetraurelia]
Length = 487
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 36/52 (69%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+V GWGEENG +W + +++GEQ+G++G ++ RG +E+ IES+ A P
Sbjct: 415 SVLCYGWGEENGVKFWLLQNSWGEQWGEQGNFRMKRGTDESAIESMAEAADP 466
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 8/89 (8%)
Query: 184 KYTRPLFQTNGRVYAVSASAEIV--AYATVKIVGWGEE----NGRP--YWTIVSTFGEQF 235
K + F +Y + AE+ Y +V+I+GWGE+ +G P YW +V+++G+++
Sbjct: 350 KVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWLVVNSWGQEW 409
Query: 236 GDKGTIKILRGRNEAIIESLVNGALPKDN 264
G+ G +I RG NE IES V K N
Sbjct: 410 GENGLFRIRRGINECDIESFVVAVWAKTN 438
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
Y ++Q + AS+ Y +V+++GWG ++ GRP YW +++G Q+G+ G
Sbjct: 424 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 483
Query: 241 IKILRGRNEAIIESLVNGA 259
KILRG N IES V GA
Sbjct: 484 FKILRGENHCEIESFVIGA 502
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 82/208 (39%), Gaps = 42/208 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 138
C G W + + G+VT C P C H P C+ A PK
Sbjct: 162 CDGGYPIKAWQYFVQSGVVT-------EECDPYFDQVGCKH-------PGCEP-AYDTPK 206
Query: 139 CHTRCTNDNYGRGFFQDK--YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
C +C N +++K + IN + DPH GP AF T Y
Sbjct: 207 CEKKCKVQNQ---VWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAF-----TVYEDF 258
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G V+ ++ VK++GWG + G YW + + + +GD G KI+RG+
Sbjct: 259 AHYKSGVYKHVTGG--VMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGK 316
Query: 248 NEAIIESLVNGALPK-----DNYGVEFG 270
NE IE V +P N+G FG
Sbjct: 317 NECGIEEEVVAGMPSTKNMAGNHGSAFG 344
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 187 RPLFQTNGRVYAVSAS--AEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 243
R F +G +Y +A+ + +VK+VGWGEE +G YW +++G +G+ G +I
Sbjct: 344 RDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGEHGYFRI 403
Query: 244 LRGRNEAIIESLVNGALP 261
LRG NE IE V + P
Sbjct: 404 LRGSNECGIEEYVLASWP 421
>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
Length = 224
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 52/92 (56%), Gaps = 15/92 (16%)
Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYA-----TVKIVGWG--EENGRP 223
GP + AFW Y+ + T G VY SAS E +A V +VGWG +E G+
Sbjct: 140 GPVFAAFWV-----YSDFMAYTGG-VY--SASKEALAQGKTGGHAVMMVGWGTDKETGQD 191
Query: 224 YWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
YW + +++ E++GDKG KI RG +E IESL
Sbjct: 192 YWLLQNSWSEKWGDKGRFKIKRGVDECGIESL 223
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 86/215 (40%), Gaps = 28/215 (13%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A ++ VC SK +F F A C W + C+ G
Sbjct: 48 SCGSCWAFGAVEAMSDRVCIHSKG---AKNFHFSAENLVSCCWTCG-----FGCNGGFPG 99
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+ W + +G+V+GG + S GC P PC H T P CK P C +C D
Sbjct: 100 AAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPACVKKC-ED 156
Query: 147 NYGRGFFQDKYQ---INGLGLYFDP------HFGPFWPAFWRSFCTKYTRPLFQTNGRVY 197
Y + QD ++ LG D GP AF T Y + G VY
Sbjct: 157 GYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAF-----TVYEDFIAYRAG-VY 210
Query: 198 AVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTF 231
A + +A ++I+GWG +NG PYW + +++
Sbjct: 211 KHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSW 244
>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
Length = 244
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 14/154 (9%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 138
C G + W + G+VTGG +SN GCQP PC+H +S C + Q
Sbjct: 96 CHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRPCDHYG-DSSMTNCSSFRRTQMSI 154
Query: 139 CHTRCTNDNYGRGFFQDKYQINGLGL-------YFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
C +C N NY + D ++ + + + + P Y F
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSVVYMTSWTNVTQIQQEIMTYGPV----TALMYVYENFM 210
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWG-EENGRPY 224
S ++V Y VK++GWG +++G Y
Sbjct: 211 GYKEGIYKSTVGDLVGYHHVKLIGWGVDDDGNEY 244
>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
Length = 561
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 15/80 (18%)
Query: 198 AVSASAEIVAYA---------------TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIK 242
A+ A+ E+VAY + +VGWGEE+G+ YW + +++G +G+ G +
Sbjct: 197 ALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRNSWGTYWGENGWFR 256
Query: 243 ILRGRNEAIIESLVNGALPK 262
I+RG N IES A+P+
Sbjct: 257 IVRGTNNLGIESECTWAVPR 276
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 34/49 (69%)
Query: 208 YATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+ +VK++GWG ENG YW +++G ++G+ G KILRG NE IES V
Sbjct: 364 WHSVKLLGWGVENGIKYWLGANSWGTKWGEDGYFKILRGENECNIESYV 412
>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 185
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 18/136 (13%)
Query: 131 TLATPQPKCHTRCTNDNYGRGFFQDKYQ-------INGLGLYFDPHF--GPFWPAFWRSF 181
+ P P C T CTN Y + +D ++ +N F GP +F
Sbjct: 53 VVQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVLSSF---- 108
Query: 182 CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
Y + +G VY V + E ++KI+GWG +GR YW V+++ E++GD G I
Sbjct: 109 -KMYEDFRYYKSG-VY-VPTTKESSTSHSIKIIGWGGASGREYWLAVNSWNEEWGDHGLI 165
Query: 242 KILRGRN--EAIIESL 255
K+ G+N E I+ S+
Sbjct: 166 KMAFGKNRLEKIVLSI 181
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 46/83 (55%), Gaps = 8/83 (9%)
Query: 187 RPLFQTNGRVY---AVSASAEIVA-YATVKIVGWGEENG----RPYWTIVSTFGEQFGDK 238
R F +Y A S SA+ A Y +V+++GWGEE YW V+++G +G+
Sbjct: 340 RDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWIAVNSWGTWWGEN 399
Query: 239 GTIKILRGRNEAIIESLVNGALP 261
G +ILRG NE IES V +LP
Sbjct: 400 GRFRILRGSNECEIESYVLASLP 422
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 79/187 (42%), Gaps = 28/187 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W+++ K GLV + C P S N P L T C
Sbjct: 146 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTAN--C 191
Query: 140 HTRCTNDNYGRGFFQDKYQINGLG--LYFDPHFGPFWPAF--WRSFCTKYTRPLFQTNGR 195
D + Y++ +Y H GP + F T Y R +++
Sbjct: 192 QLPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFT-YKRGIYR---- 246
Query: 196 VYAVSASAEIVAYATVKIVGWGEENG----RPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
++ ++ + Y +V+IVGWGEE + YW + +++G ++G+ G +ILRG NE
Sbjct: 247 -HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECE 305
Query: 252 IESLVNG 258
IES V G
Sbjct: 306 IESFVLG 312
>gi|67613207|ref|XP_667285.1| preprocathepsin c precursor [Cryptosporidium hominis TU502]
gi|54658406|gb|EAL37056.1| preprocathepsin c precursor [Cryptosporidium hominis]
Length = 635
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 21/123 (17%)
Query: 139 CHTRCTNDNYGRGFFQDKYQING---LGLYFDPHFGPFWPAFWRSFCTKYTR----PLFQ 191
C+ C D F+ NG + ++ D + + S +T+ P Q
Sbjct: 472 CYGCCDEDRMKEEIFK-----NGPIAVAMHIDTSLLVYENGVYDSIPNDHTKYCDLPNKQ 526
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
NG Y A + IVGWGEENG PYW I +++G +G+KG KI RG+N
Sbjct: 527 LNGWEYTNHA---------IAIVGWGEENGIPYWIIRNSWGANWGNKGYAKIRRGKNIGG 577
Query: 252 IES 254
IE+
Sbjct: 578 IEN 580
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 9/83 (10%)
Query: 189 LFQTNGRVYAVSASAEIVA--YATVKIVGWGEE-------NGRPYWTIVSTFGEQFGDKG 239
F VY S +AE+ Y +V+I+GWGEE YW + +++G+Q+G+ G
Sbjct: 358 FFSYESGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLVANSWGQQWGENG 417
Query: 240 TIKILRGRNEAIIESLVNGALPK 262
+I +G NE IES V G K
Sbjct: 418 LFRIQKGTNECEIESFVLGVWAK 440
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 184 KYTRPLFQTNGRVYAVSASAEIVA--YATVKIVGWGEENGR--------PYWTIVSTFGE 233
K + F +Y +A+ E A Y +V+I+GWGE+ YW +V+++G+
Sbjct: 355 KVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHHNLPIKYWLVVNSWGQ 414
Query: 234 QFGDKGTIKILRGRNEAIIESLV 256
Q+G+ G +I RG NE IES V
Sbjct: 415 QWGESGLFRIQRGTNECDIESFV 437
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Query: 187 RPLFQTNGRVYAVSASA----EIVAYATVKIVGWGEE----NGRPYWTIVSTFGEQFGDK 238
R F +Y SA+A E AY +V+++GWGEE + YW V+++G +G+
Sbjct: 451 RDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWWGEN 510
Query: 239 GTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGE 275
G +ILRG NE IES V + P + V+ G+
Sbjct: 511 GRFRILRGTNECEIESYVLASNPYVHQHVQTVRNVGD 547
>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
Length = 562
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 46/74 (62%), Gaps = 1/74 (1%)
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
L + G +Y + A+ + + ++ +VGWGEE+G+ YW +++G +G+KG +I+RG N
Sbjct: 204 LMEYKGGIYRDTTGAKSLDH-SISVVGWGEEDGQKYWIARNSWGTFWGEKGWFRIVRGEN 262
Query: 249 EAIIESLVNGALPK 262
IE+ A+P+
Sbjct: 263 NLGIEADCQWAVPR 276
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 42/68 (61%), Gaps = 7/68 (10%)
Query: 196 VYAVSASAEI--VAYATVKIVGWGEEN---GRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
+Y S SAE+ Y +V+I+GWGEE G P YW +V+++G +G+ G KI RG N
Sbjct: 426 IYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVVNSWGYNWGENGLFKIQRGTN 485
Query: 249 EAIIESLV 256
E IES V
Sbjct: 486 ECEIESYV 493
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 17/161 (10%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + G+ +GG + C+P +F PC + T EC P C
Sbjct: 44 CNGGYSARAWLYARNSGVCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPAC 103
Query: 140 HTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLF 190
C YG+ + +DK Y + + D GP +F Y
Sbjct: 104 KKYCQY-GYGKRYEKDKIYAXDAYRVSSDEAAIRAEIFARGPVQASF-----ATYEDFAH 157
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF 231
+G +Y +A +A VKI+GWG ENG W + +++
Sbjct: 158 YKSG-IYVHTAGKRRGGHA-VKIIGWGVENGTKXWIVANSW 196
>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
Length = 118
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 3/78 (3%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC
Sbjct: 1 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 58
Query: 140 HTRCTNDNYGRGFFQDKY 157
C Y + +DK+
Sbjct: 59 SKTC-EPGYSPSYKEDKH 75
>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 203 AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
++ + + V IVGWG E+ PYW + +++G FG G KI RG NE IES + +L
Sbjct: 285 SDSIGWHAVIIVGWGVEDNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSL-V 343
Query: 263 DNYGVEFGEESGERLSE 279
+ GV F SG +++
Sbjct: 344 NTEGVVFASTSGAAVAK 360
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 184 KYTRPLFQTNGRVYAVSASAEIVA--YATVKIVGWGEENGR--------PYWTIVSTFGE 233
K + F +Y +A+ E A Y +V+I+GWGE+ YW +V+++G+
Sbjct: 355 KVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYRNLPIKYWLVVNSWGQ 414
Query: 234 QFGDKGTIKILRGRNEAIIESLV 256
Q+G+ G +I RG NE IES V
Sbjct: 415 QWGESGLFRIQRGTNECDIESFV 437
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 21/36 (58%), Positives = 30/36 (83%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
V +VG+GEENGR YW I +++GE++G+KG IKI +G
Sbjct: 319 VLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKG 354
>gi|449670327|ref|XP_002160467.2| PREDICTED: dipeptidyl peptidase 1-like [Hydra magnipapillata]
Length = 458
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 36/47 (76%)
Query: 215 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
G+GEE+G+ YW + +++GE++G+KG +I RG +E IESLV A+P
Sbjct: 405 GYGEEDGQKYWIVKNSWGEEWGEKGYFRIRRGTDEIAIESLVVYAVP 451
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 83/196 (42%), Gaps = 41/196 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + G+VT + TGC S P C EP P P
Sbjct: 169 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---SHPGC--------EP-----GYPTP 212
Query: 138 KCHTRCTNDN--------YGRGFFQ---DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
KC +CT++N YG+ ++ D YQI +Y + GP AF T Y
Sbjct: 213 KCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMA-EVYKN---GPVEVAF-----TVYE 263
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILR 245
+G VY + + +++ VK++GWG ++G YW + + + +GD G I R
Sbjct: 264 DFAHYESG-VYRYT-TGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRR 321
Query: 246 GRNEAIIESLVNGALP 261
G NE IE V LP
Sbjct: 322 GVNECGIEEGVVAGLP 337
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 76/186 (40%), Gaps = 35/186 (18%)
Query: 89 WAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
W + G+VT + NTGC S P C EP P PKC +C +
Sbjct: 4 WLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTPKCERKCVSR 47
Query: 147 NYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRPLFQTNGRVYA 198
N G + Y + + DP GP AF T Y +G VY
Sbjct: 48 NQLWGESK-HYGVGAYRINPDPQDIMAEVYKNGPVEVAF-----TVYEDFAHYKSG-VYK 100
Query: 199 VSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 257
+I +A VK++GWG ++G YW + + + +GD G KI RG NE IE V
Sbjct: 101 YITGTKIGGHA-VKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVV 159
Query: 258 GALPKD 263
LP +
Sbjct: 160 AGLPSE 165
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 38/59 (64%), Gaps = 6/59 (10%)
Query: 207 AYATVKIVGWGEE----NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
Y +V+IVGWGEE NG+P +W + +++G +G+ G +I+RG NE IES V G
Sbjct: 414 GYHSVRIVGWGEEPSPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECEIESFVLGV 472
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 82/196 (41%), Gaps = 41/196 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + G+VT + TGC S P C+ P P
Sbjct: 203 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---------------SHPGCEP-GYPTP 246
Query: 138 KCHTRCTNDN--------YGRGFFQ---DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYT 186
KC +CT++N YG+ ++ D YQI +Y + GP AF T Y
Sbjct: 247 KCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMA-EVYKN---GPVEVAF-----TVYE 297
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILR 245
+G VY + + +++ VK++GWG ++G YW + + + +GD G I R
Sbjct: 298 DFAHYESG-VYRYT-TGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRR 355
Query: 246 GRNEAIIESLVNGALP 261
G NE IE V LP
Sbjct: 356 GVNECGIEEGVVAGLP 371
>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 163
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 67/171 (39%), Gaps = 25/171 (14%)
Query: 108 GCQPVSFPPCNHANYTTSEPE--------CKTLATPQPKCHTRCTNDNYGRGFFQDKYQI 159
G QP PCN A+ T ++P C PKC C N + + D +
Sbjct: 1 GRQPWLVQPCN-ASTTAADPSSVLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKA 59
Query: 160 NGLGLYFDP--------HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATV 211
+ FD GP+ + VY + + + +V
Sbjct: 60 KKV-FTFDGCSARKNLRKHGPY------VVTMRVYEDFLAYKSGVYH-HVTGDYLGLLSV 111
Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+++GWG E G+ +W +++G +GDKG KI R NE IE+ +PK
Sbjct: 112 RMIGWGLEGGQAFWLFANSWGTSWGDKGFFKIRRFVNERWIENFRYAGVPK 162
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 79/187 (42%), Gaps = 28/187 (14%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W+++ K GLV + C P S N P L T C
Sbjct: 272 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTAN--C 317
Query: 140 HTRCTNDNYGRGFFQDKYQINGLG--LYFDPHFGPFWPAF--WRSFCTKYTRPLFQTNGR 195
D + Y++ +Y H GP + F T Y R +++
Sbjct: 318 QLPTNVDRRSKYKVAPAYRVGNETDIMYEILHSGPVQATMKVYHDFFT-YKRGIYR---- 372
Query: 196 VYAVSASAEIVAYATVKIVGWGEENG----RPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
++ ++ + Y +V+IVGWGEE + YW + +++G ++G+ G +ILRG NE
Sbjct: 373 -HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECE 431
Query: 252 IESLVNG 258
IES V G
Sbjct: 432 IESFVLG 438
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 79/193 (40%), Gaps = 35/193 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G S W + + G+VT + TGC S P C EP A P P
Sbjct: 169 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 212
Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C C + N YG ++ K N + + GP +F T Y
Sbjct: 213 RCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKN-GPVEVSF-----TVYEDFA 266
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY + +++ VK++GWG ++G YW + + + +GD G KI RG N
Sbjct: 267 HYKSG-VYK-HITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTN 324
Query: 249 EAIIESLVNGALP 261
E IE V LP
Sbjct: 325 ECGIEEDVVAGLP 337
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 79/193 (40%), Gaps = 35/193 (18%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G S W + + G+VT + TGC S P C EP A P P
Sbjct: 170 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 213
Query: 138 KCHTRCTNDN--------YGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPL 189
+C C + N YG ++ K N + + GP +F T Y
Sbjct: 214 RCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKN-GPVEVSF-----TVYEDFA 267
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+G VY + +++ VK++GWG ++G YW + + + +GD G KI RG N
Sbjct: 268 HYKSG-VYK-HITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTN 325
Query: 249 EAIIESLVNGALP 261
E IE V LP
Sbjct: 326 ECGIEEDVVAGLP 338
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 34/51 (66%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
+V+IVGWGE+ G YW + +++G +G+ G +I RG NE+ IES V L
Sbjct: 366 SVRIVGWGEDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVL 416
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 4/63 (6%)
Query: 204 EIVAYATVKIVGWGEE--NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
EI Y +V+++GWGE+ G P YW +++G +G+ GT +ILRG N IES V GA
Sbjct: 256 EIEGYHSVRLLGWGEDYSTGIPVKYWIAANSWGTNWGENGTFRILRGENHCEIESFVIGA 315
Query: 260 LPK 262
K
Sbjct: 316 WGK 318
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 33/46 (71%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+KI+GWG +NG PYW I +++G ++G+ G KI RG NE IE+ V
Sbjct: 165 IKIIGWGTQNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNV 210
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGT 240
Y ++Q + AS+ Y +V+++GWG ++ G+P YW +++G Q+G+ G
Sbjct: 350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGY 409
Query: 241 IKILRGRNEAIIESLVNGALPK 262
K+LRG N IES V GA K
Sbjct: 410 FKVLRGENHCEIESFVIGAWGK 431
>gi|126647906|ref|XP_001388062.1| preprocathepsin c precursor [Cryptosporidium parvum Iowa II]
gi|126117150|gb|EAZ51250.1| preprocathepsin c precursor, putative [Cryptosporidium parvum Iowa
II]
Length = 635
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 40/73 (54%), Gaps = 10/73 (13%)
Query: 183 TKY-TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
TKY P Q NG Y A + IVGWGEENG PYW I +++G +G KG
Sbjct: 517 TKYCDLPNKQLNGWEYTNHA---------IAIVGWGEENGIPYWIIRNSWGANWGKKGYA 567
Query: 242 KILRGRNEAIIES 254
KI RG+N IE+
Sbjct: 568 KIRRGKNIGGIEN 580
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 64/125 (51%), Gaps = 13/125 (10%)
Query: 137 PKCHTRCTNDNYGRGFFQDKYQI-----NGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQ 191
P+C ++CT + + F Y N + + + GP AF T Y+ +
Sbjct: 146 PECMSKCTGEGHAYQKFYGLYLYTVSGENQIKVEIMTN-GPVEAAF-----TVYSDIVHY 199
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+G VY ++ ++ +A VK++GWG E+ YW + +++G +GD+G KI RG +E
Sbjct: 200 KSG-VYHHTSGGKLGGHA-VKVLGWGVEDEEEYWLVANSWGPDWGDQGFFKIKRGSDECG 257
Query: 252 IESLV 256
IES V
Sbjct: 258 IESRV 262
>gi|260821944|ref|XP_002606363.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
gi|229291704|gb|EEN62373.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
Length = 113
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 39/64 (60%), Gaps = 6/64 (9%)
Query: 207 AYATVKIVGWGEENGRPY------WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
+ +V+I+GWG E PY WT+ +++G Q+G++G +I+RG NE IES V G
Sbjct: 32 GWHSVRIIGWGVEMSDPYQAPIKYWTVANSWGTQWGEEGYFRIVRGENECQIESFVLGVW 91
Query: 261 PKDN 264
K N
Sbjct: 92 GKVN 95
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 77/197 (39%), Gaps = 43/197 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S W ++ G+VT + GC S P C EP +T P
Sbjct: 163 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 206
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
KC +C N N + + Y +N DP GP AF T Y
Sbjct: 207 KCVKKCVNGNQLWETSKHYSVKAYTVNS-----DPQDIMAEVYKNGPVEVAF-----TVY 256
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKIL 244
+G ++ A + VK+VGWG + G YW + + + +GD G KI
Sbjct: 257 EDFAHYKSGVYKHITGFA--LGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIK 314
Query: 245 RGRNEAIIESLVNGALP 261
RG NE IE+ V LP
Sbjct: 315 RGTNECGIENAVTAGLP 331
>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 109
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 41/67 (61%), Gaps = 3/67 (4%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY ++ E+ +A VKI+GWGEE G+ YW +V+++ E +GD G KI G E I+
Sbjct: 45 VYKHTSGKELGGHA-VKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE--IDDD 101
Query: 256 VNGALPK 262
+ G PK
Sbjct: 102 LLGGTPK 108
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 77/197 (39%), Gaps = 43/197 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S W ++ G+VT + GC S P C EP +T P
Sbjct: 168 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 211
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
KC +C N N + + Y +N DP GP AF T Y
Sbjct: 212 KCVKKCVNGNQLWETSKHYSVKAYTVNS-----DPQDIMAEVYKNGPVEVAF-----TVY 261
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKIL 244
+G ++ A + VK+VGWG + G YW + + + +GD G KI
Sbjct: 262 EDFAHYKSGVYKHITGFA--LGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIK 319
Query: 245 RGRNEAIIESLVNGALP 261
RG NE IE+ V LP
Sbjct: 320 RGTNECGIENAVTAGLP 336
>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
Length = 476
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 35/52 (67%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+ +VGWGEENG PYW + +++G +G++G +I+RG+N IE +P+
Sbjct: 137 ISVVGWGEENGIPYWIVRNSWGTYWGEEGFFRIVRGKNNLGIEEGCTYGIPR 188
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 38/78 (48%), Gaps = 3/78 (3%)
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKI 243
T+ G V+ S + + V++ GWG EE PYW + +++G +G+ G +I
Sbjct: 396 TQTFLDYTGGVFT-SREGKWLGKHAVEVTGWGVDEETRTPYWIVRNSWGTYWGENGWFRI 454
Query: 244 LRGRNEAIIESLVNGALP 261
G+N IE + +P
Sbjct: 455 AMGQNLLNIEQMCTWGVP 472
>gi|325184271|emb|CCA18763.1| cathepsin B putative [Albugo laibachii Nc14]
gi|325190706|emb|CCA25201.1| cathepsin B putative [Albugo laibachii Nc14]
Length = 436
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 42/84 (50%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 270
V+IVGWGEENG YW +++G +G G KI+RG N IES + +P
Sbjct: 253 VEIVGWGEENGVKYWHARNSWGSFWGMNGFFKIVRGTNNLAIESDCHYVVPDIREEEVVF 312
Query: 271 EESGERLSEEFGVRAESSEEFREN 294
EE +G+R EE EN
Sbjct: 313 EEHPIYGGSHYGIRPFRPEEALEN 336
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 51.6 bits (122), Expect = 4e-04, Method: Composition-based stats.
Identities = 41/176 (23%), Positives = 69/176 (39%), Gaps = 21/176 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + + +V K G+VT + CQP + P C A + C P C
Sbjct: 136 CEGGDPYTAYKYVQKNGVVT-------SNCQPYTIPTCPPA-----QQPCMNFVN-TPPC 182
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRS----FCTKYTRPLFQTNGR 195
+C N + F QD + + + P+ + C +
Sbjct: 183 SAKCANSSVN--FQQDLHHLKTV-YAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSG 239
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
VY + ++ + +KIVG+G NG PYW +++ +G+ G I G+NE +
Sbjct: 240 VYTHKSGKDLGGHC-IKIVGFGVSNGTPYWICNNSWTTSWGNNGIFWIEAGKNECV 294
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 196 VYAVSASAEI--VAYATVKIVGWGEEN---GRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
+Y S SAE+ Y +V+I+GWGEE G P YW + +++G +GD G KI +G N
Sbjct: 368 IYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVANSWGYNWGDNGLFKIQKGTN 427
Query: 249 EAIIESLV 256
E IES V
Sbjct: 428 ECEIESYV 435
>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 233
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + KRG+VTGG+ ++TGCQP FP C H P C T P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217
Query: 140 HTRC 143
C
Sbjct: 218 KQTC 221
>gi|340508280|gb|EGR34021.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 620
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 34/56 (60%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNY 265
V IVGWG ENG YW + +++G +G+KG + LRG N IE A+PKD +
Sbjct: 225 VVSIVGWGVENGVKYWIVRNSWGSYWGEKGFYRQLRGVNMINIEQFCYWAVPKDTW 280
>gi|348690656|gb|EGZ30470.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 647
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
T + +G ++ +A +A + IVGWGEE+G P+W + +++G +G+ G ++++R
Sbjct: 228 TDGFLKYSGGIFDDKTNATETDHA-ISIVGWGEEDGVPFWVLRNSWGSFWGEDGWMRLVR 286
Query: 246 GRNEAIIESLVNGALPKDN 264
G N +E +PKD+
Sbjct: 287 GVNNVGVEGECAFGVPKDD 305
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 78/197 (39%), Gaps = 43/197 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S W + G+VT + + GC S P C EP P P
Sbjct: 169 CNGGYPISAWRYFVHHGVVTEECDPYFDDIGC---SHPGC--------EP-----GYPTP 212
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
KC +C N N + + Y+I+ DP GP AF T Y
Sbjct: 213 KCARKCVNKNQLWKKSKHYGVKPYRIDS-----DPESIMAEIYKNGPVEVAF-----TVY 262
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKIL 244
+G VY + +A VK++GWG E+G YW + + + +GD G KI
Sbjct: 263 EDFAHYKSG-VYKHITGGMMGGHA-VKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIR 320
Query: 245 RGRNEAIIESLVNGALP 261
RG NE IE V LP
Sbjct: 321 RGTNECGIEGDVVAGLP 337
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 89/217 (41%), Gaps = 32/217 (14%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A ++ VC S +F F A C W + C+ G
Sbjct: 50 SCGSCWAFGAVEAMSDRVCIHSNG---TKNFHFSAENLVSCCWTCG-----FGCNGGFPG 101
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC--- 143
+ W + +G+V+GG + SN GC P PC H T P CK P C +C
Sbjct: 102 AAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKCEEG 159
Query: 144 ------TNDNYGRGFFQDKYQINGL--GLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGR 195
+ ++G+ + + ++ + +Y + GP AF T Y + G
Sbjct: 160 YKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN---GPVEGAF-----TVYEDFIAYRAG- 210
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVSTF 231
VY A + +A ++I+GWG +NG PYW + +++
Sbjct: 211 VYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSW 246
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 76/196 (38%), Gaps = 42/196 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + G+VT + GC + P C Y T E P
Sbjct: 171 CEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC---AHPGC----YPTYE---------TP 214
Query: 138 KCHTRCTNDNYGRGFFQDKYQ-INGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRV 196
KC +C +D + + Q K+ +N + +P YT + V
Sbjct: 215 KCEKQCVDDEF---WVQSKHLGVNAYEMSMEPE---------DLMAELYTNGPVEVAFEV 262
Query: 197 YAVSASAEIVAYA----------TVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILR 245
Y A + Y VK++GWG ++G YWTIV+++ +G+ G +I+R
Sbjct: 263 YEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTIVNSWNTNWGEDGLFRIVR 322
Query: 246 GRNEAIIESLVNGALP 261
G +E IES LP
Sbjct: 323 GNDECGIESNAVAGLP 338
>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
Length = 61
Score = 50.8 bits (120), Expect = 8e-04, Method: Composition-based stats.
Identities = 21/53 (39%), Positives = 35/53 (66%)
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
+ +++ ++I+GWG ENG PYW + +++ +GD G KILRG++ IES
Sbjct: 7 TGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIES 59
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 73/175 (41%), Gaps = 29/175 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G ++W WV G+ T +G + P C H S + T+ +
Sbjct: 128 CNGGYQENSWTWVLTTGITTESCWPYRSGSGRI--PSCPHRCVNGSVLQRNTINNYRRLD 185
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
+ ++ Y G Q Y + Y D + Y++ +++
Sbjct: 186 SSELQDELYNNGPIQVTYVV-----YEDFFY--------------YSKGIYK-------- 218
Query: 200 SASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
S V V ++GWG E+G YW + +++G ++G++G +ILRG NE IES
Sbjct: 219 HLSGNKVGGHAVVLMGWGIEDGVKYWLVQNSWGYEWGEQGYFRILRGSNECGIES 273
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 38/63 (60%), Gaps = 5/63 (7%)
Query: 207 AYATVKIVGWGEEN---GRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
Y +VKI+GWGEE G+P YW +++G+Q+G+ G KI RG NE IE V A
Sbjct: 371 GYHSVKILGWGEETNIYGQPIKYWLAANSWGQQWGENGFFKIRRGTNECEIEEFVLAAWA 430
Query: 262 KDN 264
+ N
Sbjct: 431 ETN 433
>gi|145546673|ref|XP_001459019.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426842|emb|CAK91622.1| unnamed protein product [Paramecium tetraurelia]
Length = 476
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 64/141 (45%), Gaps = 21/141 (14%)
Query: 128 ECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLG---------LYFDPHFGPFWPAFW 178
ECK + + K H R N + G + ++N + L F+P F F+
Sbjct: 340 ECKAV---EKKKHYRVINYRFIGGAYGKSNELNIMEEIHKNGPVVLNFEPSFDFM---FY 393
Query: 179 RSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDK 238
T P + NG ++ Y GWGEENG YW + +++G+Q+G+
Sbjct: 394 VGGVFHSTIPDWIINGLAKPEWVDHSVLCY------GWGEENGVKYWLLQNSWGKQWGEN 447
Query: 239 GTIKILRGRNEAIIESLVNGA 259
G ++ RG++E+ IES+ A
Sbjct: 448 GRFRMKRGQDESSIESMAEAA 468
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +ILRG NE IES V G
Sbjct: 402 SVKITGWGEETLPDGRTLKYWTAANSWGPSWGERGHFRILRGSNECDIESFVLGV 456
>gi|300176576|emb|CBK24241.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V I+GWG EN PYW + +++G +G+ G +ILRG N IES + A+P
Sbjct: 200 VNIIGWGSENETPYWIVRNSWGSSWGEDGYFRILRGVNLLGIESSCSYAVP 250
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 211 VKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V++VGWG E G YW + +GE +G+KG +I+ G N +IES + +P
Sbjct: 509 VEVVGWGRTEEGVEYWIGRNNWGENWGEKGWFRIMMGGNNLLIESSCSWGVP 560
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +ILRG NE IES V G
Sbjct: 401 SVKITGWGEEMLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLGV 455
>gi|146163744|ref|XP_001471259.1| cathepsin z [Tetrahymena thermophila]
gi|146145941|gb|EDK31861.1| cathepsin z [Tetrahymena thermophila SB210]
Length = 585
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 49/86 (56%), Gaps = 4/86 (4%)
Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
+ T+Y R + G +Y ++S + +++VGWGEEN YW I +++G +G+KG
Sbjct: 203 YATEYLR--YNYTGGIYNDTSSYPGTNHV-IEVVGWGEENNEKYWIIRNSWGSYWGEKGF 259
Query: 241 IKILRGRNEAIIESL-VNGALPKDNY 265
+ LRG N IES N A+P D +
Sbjct: 260 YRQLRGVNMLNIESSNCNWAVPLDTW 285
>gi|294952601|ref|XP_002787371.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239902343|gb|EER19167.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 744
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 57/140 (40%), Gaps = 22/140 (15%)
Query: 108 GCQPVSFPPCNHANYTTSE-PECKTLA-TPQPKCHTRCTNDNYGRGFFQDKYQING---- 161
GC P F CNH +E P+CK A P P C T CTN Y R +D ++ G
Sbjct: 494 GCWPYPFQKCNHVPTEKTEYPKCKDAAHPPLPPCRTTCTNKAYKRSLKKDVHRAKGWRKV 553
Query: 162 -------LGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIV 214
FD GP + AF +Y + VY V + E ++ +KI+
Sbjct: 554 LNNAQSVKQEIFD--NGPVFSAFKMYEDFRYYK------SGVY-VPTTEEFHSFHLIKII 604
Query: 215 GWGEENGRPYWTIVSTFGEQ 234
GWG +VS E+
Sbjct: 605 GWGVHPDAQDLGVVSLLNEE 624
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 82/208 (39%), Gaps = 42/208 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + + G+VT + GC+ P C EP A P P
Sbjct: 46 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 89
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
C +C N + + + K + +N + DPH GP AF T Y
Sbjct: 90 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAF-----TVYEDF 142
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY + +A VK++GWG + G YW + + + +GD G KI+RG
Sbjct: 143 AHYKSG-VYKHITGGMMGGHA-VKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGT 200
Query: 248 NEAIIESLVNGALPKD-----NYGVEFG 270
NE IE V +P NY FG
Sbjct: 201 NECGIEEDVVAGMPSTKNMVRNYDSAFG 228
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG+KG +I RG I S+V A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEKGYFRIYRGDGTCGINSIVTTAIIK 326
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 78/211 (36%), Gaps = 41/211 (19%)
Query: 27 SCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSGISS 86
SC A ++ VC SK +F F A C W + C+ G
Sbjct: 52 SCGSCWAFGAVEAMSDRVCIHSK---GTKNFHFSAENLVSCCWTCG-----FGCNGGFPG 103
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+ W + +G+V+GG + SN GC P PC H T P CK PKC +C D
Sbjct: 104 AAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGP-CKE-GGKTPKCVKKC-ED 160
Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVY-AVSASAEI 205
Y + QD H G A+ S R TNG V A + +
Sbjct: 161 GYKVPYAQDL------------HHGK--SAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDF 206
Query: 206 VAY---------------ATVKIVGWGEENG 221
+AY ++I+GWG +NG
Sbjct: 207 IAYRAGVYKHVAGKALGGHAIRILGWGVQNG 237
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 79/194 (40%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212
Query: 138 KCHTRCTNDNYGRGFFQDKY-QINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KC +C N + + + KY +N + DP+ GP AF T Y
Sbjct: 213 KCVRKCVKGN--QIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAF-----TVYEDF 265
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY +++ +A VK++GWG + G YW I + + +GD G I RG
Sbjct: 266 AHYKSG-VYKHITGSQLGGHA-VKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGT 323
Query: 248 NEAIIESLVNGALP 261
NE IE V LP
Sbjct: 324 NECGIEEDVTAGLP 337
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 81/208 (38%), Gaps = 42/208 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + + G+VT + GC+ P C+ A P P
Sbjct: 170 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 213
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
C +C N + + + K + +N + DPH GP AF T Y
Sbjct: 214 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAF-----TVYEDF 266
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY + +A VK++GWG + G YW + + + +GD G KI+RG
Sbjct: 267 AHYKSG-VYKHITGGMMGGHA-VKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGT 324
Query: 248 NEAIIESLVNGALPKD-----NYGVEFG 270
NE IE V +P NY FG
Sbjct: 325 NECGIEEDVVAGMPSTKNMVRNYDSAFG 352
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 80/209 (38%), Gaps = 32/209 (15%)
Query: 26 LSCIEARAVATATPLAFAVCRSS--KMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSSG 83
+S I AV+ ++ +C S K VE ++ I+ + C C G
Sbjct: 35 ISFINKHAVSAVGAMSDRICIQSGGKQSVELSAIDLISCC-ENCGS---------GCDGG 84
Query: 84 ISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC 143
W + G+VTGG+ ++TGCQP FP C H + P C P+C +C
Sbjct: 85 FPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQCKRKC 143
Query: 144 TNDNYGRGFFQDKYQINGLGLYFDPH----------FGPFWPAFWRSFCTKYTRPLFQTN 193
Y + DK+ G+ + + +GP A+ F
Sbjct: 144 -QKGYTTPYEHDKHY-GGISINVIKNESAIQKEIMMYGPV-EAYLLIF-----EDFLNYK 195
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGR 222
+Y + + V V+I+GWG EN R
Sbjct: 196 SGIYRYT-TGSFVGEHYVRIIGWGIENER 223
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 71/168 (42%), Gaps = 33/168 (19%)
Query: 87 STWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 146
+ W + G+ +GG ++S+ GCQP S +A + EC T +
Sbjct: 153 NAWDYYINEGIASGGDYNSSEGCQPYSESSFQYAEAS----ECVKFYTLETNVAQ----- 203
Query: 147 NYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIV 206
Q + NG + + F F C K +G Y S + V
Sbjct: 204 ------IQMEILTNGPVMAYYNVFEDF-------ACHK--------SGVYYY--KSGKFV 240
Query: 207 AYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEAIIE 253
+VK++GWG E G PYW I +++G ++G+ G K+ RG NE IE
Sbjct: 241 GRHSVKVIGWGTEEGIPYWLIANSWGSEWGELGGFFKMRRGTNECWIE 288
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 77/209 (36%), Gaps = 53/209 (25%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W ++ +RG+VT C P P A + + +++ + +
Sbjct: 75 CAGGRLDGAWWYLRRRGVVT-------EDCYPYRPPQQTPAELSRCMMQSRSVGRGKRQA 127
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
RC N N + D YQ P + S K Q NG V A+
Sbjct: 128 TQRCPNTN---NYQNDIYQST--------------PPYRLSTSEKEIMKEIQDNGPVQAI 170
Query: 200 SASAEI-------------VAYA-----------TVKIVGWGEENG-----RPYWTIVST 230
E V++ +VKI GWGEE R YW ++
Sbjct: 171 MEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVKITGWGEERNFDGTTRKYWIAANS 230
Query: 231 FGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+G+ +G+ G +I RG NE IE+ V G
Sbjct: 231 WGKNWGENGYFRIARGENECEIEAFVIGV 259
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 77/191 (40%), Gaps = 22/191 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPV-SFPPCNHANY------TTSEPECKTL 132
C+ G + W + +GLV+GG + S+ GC+ S PC H + T P+C
Sbjct: 164 CNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGXPYVXTGDSPKCSMT 223
Query: 133 ATPQPKCHTRCTNDNYGRGFFQ--DKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
P T + +YG + D + +Y + + + K+ +
Sbjct: 224 CEPG---QTYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKE--Y 278
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 250
Q + E+ + I+G EN YW + + + +GD G KILRG++
Sbjct: 279 Q--------GVTGEMXGGHAICILGCKVENSTSYWLVANXWNRDWGDNGFFKILRGQDHY 330
Query: 251 IIESLVNGALP 261
IES V +P
Sbjct: 331 GIESEVVAEIP 341
>gi|124487938|gb|ABN12052.1| cathepsin B endopeptidase-like protein [Maconellicoccus hirsutus]
Length = 66
Score = 50.4 bits (119), Expect = 0.001, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 2/61 (3%)
Query: 211 VKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVE 268
++I+GWG ++ PYW + +++ +GD G KI RG NE IE +N +PK N +
Sbjct: 6 IRILGWGVCKKTNAPYWLVANSWNTDWGDHGYFKIKRGSNECGIEDSINAGIPKLNKDLR 65
Query: 269 F 269
F
Sbjct: 66 F 66
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 35/50 (70%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
V I G+G EN PYWTI +++GEQ+G+ G +++RG+N + LV+ A+
Sbjct: 410 VLITGYGIENNLPYWTIKNSWGEQWGENGYFQLMRGKNICGVSDLVSSAI 459
>gi|294891623|ref|XP_002773656.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
gi|239878860|gb|EER05472.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
Length = 815
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 40/66 (60%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+Y +A + + V+I+G+G E P+W +++++G+ +G+ G ++LRGRN IE L
Sbjct: 572 LYTTTAGSPEIGNHAVRIIGFGVEGNVPFWLLMNSWGDDWGEHGCFRMLRGRNLCGIEEL 631
Query: 256 VNGALP 261
G P
Sbjct: 632 PVGMDP 637
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 196 VYAVSASAEI--VAYATVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
VY S SAE+ Y +++I+GWGEE G P YW + +++G +G+ G +I RG N
Sbjct: 368 VYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPLKYWLVANSWGRHWGENGLFRIQRGTN 427
Query: 249 EAIIESLV 256
E IES V
Sbjct: 428 ECEIESYV 435
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +ILRG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLGV 455
>gi|48762481|dbj|BAD23810.1| cathepsin B-S [Tuberaphis taiwana]
Length = 182
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 67/150 (44%), Gaps = 24/150 (16%)
Query: 89 WAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 148
W + +G+ TGG + + GC P PPC + + C P + H +C Y
Sbjct: 47 WKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 100
Query: 149 GRGFFQDKYQ------INGLGLYFD--PHFGPFWPAF--WRSFCTKYTRPLFQTNGRVYA 198
G+ Q++Y+ +N + +GP +F + F ++++ +Y
Sbjct: 101 GKTTVQNRYKTKSEYVMNSIKTIEQDLKTYGPVEASFDVYDDFS------VYKSG--IYR 152
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIV 228
+ A+ ++KI+GWG++NG PYW V
Sbjct: 153 KTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 182
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 36/54 (66%), Gaps = 5/54 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
+VKI GWGEE +G+ YWT +++G +G+KG +ILRG NE IES V G
Sbjct: 399 SVKITGWGEERQPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIESFVVG 452
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 75/195 (38%), Gaps = 40/195 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G S W + +RG+VT + N GC N+ EP + P P
Sbjct: 163 CNGGFPLSAWRYFSRRGVVTDECDPYFDNDGC-----------NHPGCEP-----SYPTP 206
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--------GPFWPAF--WRSFCTKYTR 187
+C C ++ R Y N + DP+ GP +F + F T
Sbjct: 207 RCVKNCKDNQ--RWSHSKHYSANAYRIKSDPYNIMAEVFNNGPVEVSFSVYEDFAHYETG 264
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRG 246
GR A VK++GWG ++G YW I +++ +G+ G KI RG
Sbjct: 265 VYKHVQGRYLGGHA---------VKLIGWGTTDDGIDYWLIANSWNTAWGEGGYFKIARG 315
Query: 247 RNEAIIESLVNGALP 261
NE IE +P
Sbjct: 316 VNECGIERDPVAGMP 330
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 37/54 (68%), Gaps = 1/54 (1%)
Query: 211 VKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 263
VK+VGWG ++G YW++V+++ +G+ GT +ILRG++E IES LP +
Sbjct: 285 VKLVGWGTTDDGVDYWSMVNSWNTNWGEDGTFRILRGKDECGIESNAVAGLPSN 338
>gi|145525479|ref|XP_001448556.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416111|emb|CAK81159.1| unnamed protein product [Paramecium tetraurelia]
Length = 490
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 35/50 (70%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+V GWGEENG YW + +++G+Q+G+ G ++ RG++E+ IES+ A
Sbjct: 433 SVLCYGWGEENGVKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAEAA 482
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 81/208 (38%), Gaps = 42/208 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W + + G+VT + GC+ P C+ A P P
Sbjct: 215 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 258
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
C +C N + + + K + +N + DPH GP AF T Y
Sbjct: 259 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAF-----TVYEDF 311
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY + +A VK++GWG + G YW + + + +GD G KI+RG
Sbjct: 312 AHYKSG-VYKHITGGMMGGHA-VKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGT 369
Query: 248 NEAIIESLVNGALPKD-----NYGVEFG 270
NE IE V +P NY FG
Sbjct: 370 NECGIEEDVVAGMPSTKNMVRNYDSAFG 397
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 62/148 (41%), Gaps = 9/148 (6%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 38 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 96
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWR----SFCTKYTRPLFQTNGR 195
+C D G+ +DK + N + R + T Y L ++G
Sbjct: 97 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSG- 153
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRP 223
VY + A + +A V+I+GWGE P
Sbjct: 154 VYFHALGAPMSGHA-VRILGWGELGNVP 180
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 52/97 (53%), Gaps = 10/97 (10%)
Query: 196 VYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE- 253
VY S++ ++ ++A V++VGWG +G YW +++G +GDKG KI RG +EA E
Sbjct: 211 VYIKSSNTQVESHA-VRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEE 269
Query: 254 -----SLVNGALPKDNYGVE--FGEESGERLSEEFGV 283
+ ++P YG+E FG S L F +
Sbjct: 270 GFITVTADTASVPTSQYGLEYQFGGNSSTFLKPSFLI 306
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 78/184 (42%), Gaps = 21/184 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G + + ++ K+G+V+ C P + P C A +P + TPQ C
Sbjct: 136 CQGGDAYTAMKFIQKKGIVS-------NDCLPYTIPTCAPA----QQPCLNFVDTPQ--C 182
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRS----FCTKYTRPLFQTNGR 195
+C+N +Y + QD + I+G+ +P + C +
Sbjct: 183 VEKCSNASYT--YAQDLHFIDGV-YSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSG 239
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + ++ + VK++GWG +N YW +++ +G++G I G NE IES
Sbjct: 240 VYQHTTGKDLGGHC-VKMIGWGTQNNELYWICNNSWTTYWGNQGVFWIKAGVNECGIESD 298
Query: 256 VNGA 259
V A
Sbjct: 299 VVAA 302
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 41/68 (60%), Gaps = 7/68 (10%)
Query: 196 VYAVSASAEI--VAYATVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRN 248
VY S SAE+ Y +V+I+GWGEE G P YW + +++G +G+ G +I +G N
Sbjct: 368 VYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYWLVANSWGHNWGENGLFRIQKGTN 427
Query: 249 EAIIESLV 256
E IES V
Sbjct: 428 ECEIESYV 435
>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
Length = 559
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 33/53 (62%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+ +VGWGEENG YW +++G +G++G +I RG N IES A+PK
Sbjct: 225 AISVVGWGEENGEKYWIGRNSWGNYWGEEGWFRIARGINNLAIESECQWAVPK 277
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
Query: 182 CTKYTRPLF-QTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGT 240
C+ R F +G VY S S+ +VA V+I GWG ENGRPYW +++GE +G++G
Sbjct: 477 CSMTVRESFLDYHGGVYE-SDSSPMVAGHIVEIAGWGVENGRPYWIGRNSWGEYWGEEGW 535
Query: 241 IKILRGRNEAI 251
+I ++ I
Sbjct: 536 FRIDMEKDSGI 546
>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 236
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +RGLV+GG + S+ GC+P S PPC H + + P C P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEH-HVNGTRPPCTGEGGSTPRC 208
Query: 140 HTRC 143
C
Sbjct: 209 SRHC 212
>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
Length = 322
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%)
Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+++GWGEENG PYW ++++G +FG G K+ G N A ES + P
Sbjct: 213 RVIGWGEENGTPYWLALNSWGTEFGMDGAFKVPMGENIAGFESQLLSVKPN 263
>gi|403223101|dbj|BAM41232.1| cysteine proteinase [Theileria orientalis strain Shintoku]
Length = 489
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 31/44 (70%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V I+GWGE +G YW + +++G+ +GDKG K+ RGRN +ES
Sbjct: 417 VAIIGWGESDGFKYWLVRNSWGKDWGDKGFFKLTRGRNAFGVES 460
>gi|123377855|ref|XP_001298125.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121878571|gb|EAX85195.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 135
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 40/71 (56%)
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+ +G +V + E V I GWG+E P+W I++++G +G G++K LRG N
Sbjct: 64 YYKSGVYQSVLSEEESSFQHAVVIYGWGKEKETPFWWILNSYGPNWGINGSMKFLRGSNH 123
Query: 250 AIIESLVNGAL 260
IE+ V+ AL
Sbjct: 124 CNIETHVSSAL 134
>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 238
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 3/86 (3%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHH---SNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 136
CS G ++W ++H G+V+G + GC P +FP C H + C
Sbjct: 133 CSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPKCAHHQKESDYKPCAKELYDT 192
Query: 137 PKCHTRCTNDNYGRGFFQDKYQINGL 162
P C + C N YG F +D++ L
Sbjct: 193 PSCSSSCPNAKYGTAFDKDRHYTESL 218
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 33/52 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+K+VGWG +G YWTIV+++ E +G G + I RG +E IES V PK
Sbjct: 259 IKVVGWGILDGVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPK 310
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 61/155 (39%), Gaps = 27/155 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + + GLVTGG+ + +GC+ FP CNH P C P P C
Sbjct: 38 CHGGFPPRAWDFWMENGLVTGGSKENPSGCRSYPFPKCNHHGKGPDAP-CPEKIFPTPAC 96
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHF-----------GPFWPAF--WRSFCTKYT 186
+ C D + DK + Y P+ GP AF + F +
Sbjct: 97 NKTC--DTPEVNYILDKTKAK--SSYNVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYES 152
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENG 221
F + GR+ A ++++GWGEENG
Sbjct: 153 GVYFHSFGRMIGGHA---------IRMLGWGEENG 178
>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
Length = 242
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 36/50 (72%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
V I G+G ENG PYWTI +++GE++G+ G +++RG++ + LV+ A+
Sbjct: 191 VLITGYGIENGLPYWTIKNSWGEEWGENGYFRLMRGKDICGVSDLVSSAI 240
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 79/190 (41%), Gaps = 29/190 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W ++ K+G+VT C+P + P C A +P + TP C
Sbjct: 144 CEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTCPPA----QQPCLNFVNTPN--C 190
Query: 140 HTRCTNDNYGRGFFQDK------YQINGLGLYFD--PHFGPFWPAFWRSFCTKYTRPLFQ 191
+C + N + QDK Y IN + GP F + Y L
Sbjct: 191 VKQCES-NSTLIYSQDKHKMAKIYSINSVEAIMQEISTNGPVEACF-----SVYEDFLGY 244
Query: 192 TNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 251
+G VY + + + VKI G+G NG YW++ +++ +GD G I RG +E
Sbjct: 245 KSG-VYQ-HTTGKFLGGHCVKIFGYGTLNGVNYWSVANSWTTSWGDNGIFLIKRGSDECG 302
Query: 252 IESLVNGALP 261
IE V +P
Sbjct: 303 IEDEVVAGIP 312
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 4/63 (6%)
Query: 204 EIVAYATVKIVGWGEENGR----PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
++ Y +V+I+GWGE+ YW +++G ++G+ G +ILRG N IES V GA
Sbjct: 369 KVQGYHSVRIIGWGEDYSTGPQVKYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGA 428
Query: 260 LPK 262
K
Sbjct: 429 WGK 431
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 10/76 (13%)
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN--GRP--YWTIVSTFGEQFGDKGTIKI 243
P G YA S Y +V+I+GWG ++ G P YW +++GE++G+ G +I
Sbjct: 479 PYANDKGPAYARSG------YHSVRILGWGVDHSTGVPIKYWLCANSWGEEWGENGLFRI 532
Query: 244 LRGRNEAIIESLVNGA 259
LRG N IES + GA
Sbjct: 533 LRGENHCDIESFIIGA 548
>gi|294898471|ref|XP_002776250.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239883121|gb|EER08066.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 219
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 7/91 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTL 132
C+ G ++ G+VTG S GC P P CNHA+ S+ P+C +
Sbjct: 85 CNRGNLIEGLNFMKNHGIVTGNEFKPADQLASADGCWPYPLPKCNHASSAASQYPKCPSE 144
Query: 133 ATPQPKCHTRCTNDNYGRGFFQDKYQINGLG 163
A QP C T C N++Y QD ++ G
Sbjct: 145 ALSQPACQTECINESYKTSLQQDLHRAKSWG 175
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 30/40 (75%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V ++GWG E+G PYW + +++G +G+KG KI+RG+NE
Sbjct: 228 AVLLIGWGVEDGVPYWLLQNSWGPAWGEKGHFKIIRGKNE 267
>gi|66814230|ref|XP_641294.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|60469326|gb|EAL67320.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 291
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 64/150 (42%), Gaps = 21/150 (14%)
Query: 117 CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--GPFW 174
C + N+ S P A P Y F ++ Q+NG F GP
Sbjct: 158 CKNCNFDLSNPTADCFAQP-----------TYTTYFVEEHGQVNGSVAMMQEIFARGPIA 206
Query: 175 PAFWRSFC-TKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGE 233
+ YT +F + +V ++ EI + I+GWG ENG YW +++G
Sbjct: 207 CGMEVTDAFESYTSGVFTS-----SVGSTGEI--NHEISIIGWGTENGVDYWIGRNSWGT 259
Query: 234 QFGDKGTIKILRGRNEAIIESLVNGALPKD 263
FG+ G +I RG + IES + A+PK+
Sbjct: 260 YFGELGFFRIQRGIDLLSIESACDWAVPKN 289
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 29/46 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+KI+GWG E G YW + +++ +G GT KILRG NE IE V
Sbjct: 264 IKIIGWGVEGGVDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDV 309
>gi|402588459|gb|EJW82392.1| papain family cysteine protease containing protein [Wuchereria
bancrofti]
Length = 323
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 55/115 (47%), Gaps = 8/115 (6%)
Query: 148 YGRGFFQDKYQIN---GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
YG F YQIN +F + P A +F +Y F +G + +
Sbjct: 212 YGEIFIDKLYQINPDPNAMAWFVANVAPI--ALNLAFPKRYK---FYKSGILPDTDECST 266
Query: 205 IVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+ +++G+G ENG+ YW + +++GE +GD+G KI RG N +E+ V A
Sbjct: 267 MEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKIERGINACKVETYVASA 321
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 34/55 (61%), Gaps = 1/55 (1%)
Query: 211 VKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
VK++GWG ++G YWTIV+++ +G+ G +I RG NE IES LP D
Sbjct: 279 VKLIGWGTTDDGVDYWTIVNSWNTNWGEHGLFRIARGGNECGIESYAVAGLPFDK 333
>gi|145540170|ref|XP_001455775.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423583|emb|CAK88378.1| unnamed protein product [Paramecium tetraurelia]
Length = 500
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 35/52 (67%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+V GWGEE+G +W + +++G Q+G+ G+ ++ RG +E+ IES+ A P
Sbjct: 427 SVLCYGWGEEDGVKFWLLQNSWGSQWGENGSFRMKRGVDESAIESMAEAADP 478
>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Equus caballus]
Length = 436
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 424
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 468 SVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 522
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 30/36 (83%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
V +VG+GEENGR YW I +++GE++G+KG IKI +G
Sbjct: 319 VLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKG 354
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 67/166 (40%), Gaps = 22/166 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + ++G+VTGG +++ C+P PC + EC LA P+C
Sbjct: 43 CQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDLAD-TPRC 101
Query: 140 HTRC---------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLF 190
RC ++ +YGR +Q + + + GP F T Y
Sbjct: 102 KRRCQLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRN-GPVVAGF-----TVY-EDFA 154
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEENGR----PYWTIVSTFG 232
G +Y ++ + +A VK++GWG E PYW G
Sbjct: 155 HYKGGIYKHTSGKKTGGHA-VKVIGWGSEQKGSEKIPYWXHCXLHG 199
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 400 SVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 454
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 388 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 442
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326
>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
[Equus caballus]
Length = 467
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 455
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 424
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A+ K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 326
>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
Length = 259
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A+ K
Sbjct: 208 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 259
>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 204
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 38/67 (56%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + + + + ++I+GWG E PYW +++ +++G+ G +K+ RG IES
Sbjct: 137 VYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCANSWSKEWGENGYVKVRRGVQAGYIESY 196
Query: 256 VNGALPK 262
V +PK
Sbjct: 197 VRAPIPK 203
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLGV 455
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 455
>gi|290980376|ref|XP_002672908.1| predicted protein [Naegleria gruberi]
gi|284086488|gb|EFC40164.1| predicted protein [Naegleria gruberi]
Length = 261
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 2/70 (2%)
Query: 185 YTRPLFQTNGRVYAVSASA-EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
Y L+ ++G VY SA+ + +A V+I+GWG ENG YW + + +G+ +G +G I I
Sbjct: 184 YQDFLYYSSG-VYQHSANLRQPIAKFVVRIIGWGVENGVKYWIVPNIWGKTWGMQGYIWI 242
Query: 244 LRGRNEAIIE 253
RG NE+ IE
Sbjct: 243 RRGNNESNIE 252
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 455
>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
[Loxodonta africana]
Length = 468
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 402 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 456
>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 229
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 38/67 (56%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
VY + + + + ++I+GWG E PYW +++ +++G+ G +K+ RG IES
Sbjct: 162 VYFPTPKSSNLGWINLRIIGWGYEGKTPYWLCANSWSKEWGENGYVKVRRGVQAGYIESY 221
Query: 256 VNGALPK 262
V +PK
Sbjct: 222 VRAPIPK 228
>gi|290998718|ref|XP_002681927.1| predicted protein [Naegleria gruberi]
gi|284095553|gb|EFC49183.1| predicted protein [Naegleria gruberi]
Length = 303
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 29/36 (80%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+ +VGWGEENG+ YW + +++GE +G++G +I+RG
Sbjct: 248 ISVVGWGEENGKKYWIVRNSWGEPYGEQGFFRIIRG 283
>gi|449663703|ref|XP_002169139.2| PREDICTED: uncharacterized protein LOC100198320 [Hydra
magnipapillata]
Length = 1092
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 28/79 (35%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 182 CTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTI 241
C + T + + VY E+ +A V I+G+G EN +PYW I +++G+ +GD G +
Sbjct: 960 CARKTFKFYSSG--VYDDPKCTEVTDHAVV-IIGYGVENNKPYWLIKNSWGKLWGDNGYM 1016
Query: 242 KILRGRNEAIIESLVNGAL 260
KI N + L NGAL
Sbjct: 1017 KI--DMNNNLCGVLTNGAL 1033
>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
[Loxodonta africana]
Length = 437
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 371 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 425
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 296 SVKITGWGEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 350
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 280 SVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 334
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 75/192 (39%), Gaps = 33/192 (17%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G + W + G+VT + NTGC S P C EP A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 214
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF---GPFWPAFWRSFCTKYTRPLF 190
+C +C +DN + + Y +N GP +F T Y
Sbjct: 215 RCLRKCVSDNKLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSF-----TVYEDFAH 269
Query: 191 QTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+G VY + I +A VK++GWG N G YW + + + +GD G I RG NE
Sbjct: 270 YKSG-VYKHITGSNIGGHA-VKLIGWGTSNEGEDYWLMANQWNRGWGDDGYFMIRRGTNE 327
Query: 250 AIIESLVNGALP 261
IE LP
Sbjct: 328 CGIEDEPVAGLP 339
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 362 SVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLGV 416
>gi|358255491|dbj|GAA57187.1| cathepsin L [Clonorchis sinensis]
Length = 368
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 34/46 (73%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
++ +VG+GEENG PYW I +++GE +G+KG +++ RG N + S+
Sbjct: 317 SMVVVGYGEENGTPYWIIKNSWGEHWGEKGYLRLRRGVNMCGVASV 362
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 69/158 (43%), Gaps = 18/158 (11%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W + + G+V+ CQP FPPC H +T C ++ P C
Sbjct: 65 CNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPC-SVEYDTPFC 116
Query: 140 HTRCTNDNYGRGFF-QDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGRV 196
+ CTN + + Y ++G Y F GPF AF T Y + ++G
Sbjct: 117 NITCTNTIPPIKYKGRISYSLSGEEDYKRELFLYGPFEVAF-----TVYEDFVAYSDGVY 171
Query: 197 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQ 234
S +A + V++VGWG NG PYW I +++ +
Sbjct: 172 KHFSGNA--LGGHAVRLVGWGNLNGTPYWKIANSWNHE 207
>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Nomascus leucogenys]
Length = 436
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424
>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Nomascus leucogenys]
Length = 467
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455
>gi|159488843|ref|XP_001702410.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
gi|158271078|gb|EDO96905.1| papain-type cysteine protease [Chlamydomonas reinhardtii]
Length = 382
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
+ NG +Y S + V++VGWGEE+G YW + +++G +G++G ++ RG N
Sbjct: 242 WHYNGGIYK-DTSGDTELDHDVEVVGWGEEDGEKYWIVRNSWGTYWGERGFFRVRRGDNS 300
Query: 250 AIIES 254
+ES
Sbjct: 301 LQLES 305
>gi|5881566|dbj|BAA84280.1| Cysteine proteinase [Clonorchis sinensis]
Length = 232
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 34/52 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A+ K
Sbjct: 181 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAIIK 232
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 75/196 (38%), Gaps = 30/196 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAH-HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 138
CS G W ++ +RG+VT + ++ QP + P H+ T T P P+
Sbjct: 269 CSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQATARCPNPQ 328
Query: 139 CHTRCTNDNYG-----------RGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTR 187
H ND Y + ++ + + + H F Y
Sbjct: 329 THA---NDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFL----------YKS 375
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-----GRPYWTIVSTFGEQFGDKGTIK 242
+++ + +VKI GWGEE + YWT +++G +G+ G +
Sbjct: 376 GIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWTAANSWGRAWGEDGHFR 435
Query: 243 ILRGRNEAIIESLVNG 258
I RG NE +ES V G
Sbjct: 436 IARGVNECEVESFVVG 451
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 80/194 (41%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G + W + G+VT + NTGC S P C EP A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 214
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KC +C +DN + + + K Y ++ + +P GP +F T Y
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSF-----TVYEDF 267
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY + I +A VK++GWG + G YW + + + +GD G I RG
Sbjct: 268 AHYKSG-VYKHITGSNIGGHA-VKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGT 325
Query: 248 NEAIIESLVNGALP 261
NE IE LP
Sbjct: 326 NECGIEDEPVAGLP 339
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 396 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 450
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 406 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 460
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455
>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
[Pongo abelii]
Length = 436
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424
>gi|290974021|ref|XP_002669745.1| predicted protein [Naegleria gruberi]
gi|284083296|gb|EFC37001.1| predicted protein [Naegleria gruberi]
Length = 335
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 1/59 (1%)
Query: 204 EIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+I ++++ KI+GWG E+ PYW V FG +G+ G +LRG +E IES ALP
Sbjct: 272 DIGSFSSTKIIGWGVAEDQTPYWICVFEFGTDWGNNGMFWMLRGADECGIESSAWSALP 330
>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
paniscus]
Length = 436
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455
>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens]
gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
Length = 436
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 370 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 370 SVKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 424
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455
>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Pongo abelii]
Length = 467
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 80/194 (41%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G + W + G+VT + NTGC S P C EP A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 214
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KC +C +DN + + + K Y ++ + +P GP +F T Y
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSF-----TVYEDF 267
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY + I +A VK++GWG + G YW + + + +GD G I RG
Sbjct: 268 AHYKSG-VYKHITGSNIGGHA-VKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGT 325
Query: 248 NEAIIESLVNGALP 261
NE IE LP
Sbjct: 326 NECGIEDEPVAGLP 339
>gi|301609080|ref|XP_002934105.1| PREDICTED: cathepsin S-like [Xenopus (Silurana) tropicalis]
Length = 334
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 29/38 (76%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V IVG+ +ENG+ YW + +++GE FGDKG IK+ R RN
Sbjct: 283 VLIVGYSKENGQYYWLVKNSWGEYFGDKGYIKMARKRN 320
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 401 SVKITGWGEETWPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 455
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 6/77 (7%)
Query: 189 LFQTNGRVYAVSA--SAEIVAYATVKIVGWG----EENGRPYWTIVSTFGEQFGDKGTIK 242
F G VY S S + Y +V+IVGWG + N YW + +++G +G+ G +
Sbjct: 358 FFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGRLWGEDGYFR 417
Query: 243 ILRGRNEAIIESLVNGA 259
I+RG NE+ IE V A
Sbjct: 418 IVRGENESDIEKFVLAA 434
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 35/50 (70%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
V I G+G ENG PYWTI +++G+Q+G+ G +++ G++ + LV+ A+
Sbjct: 421 VLITGYGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAI 470
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 28/38 (73%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G ENG PYW I +++GE +GDKG K+ RG+N
Sbjct: 311 VLAVGYGVENGTPYWLIKNSWGESWGDKGYFKMERGKN 348
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 296 SVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLGV 350
>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
[Nomascus leucogenys]
Length = 362
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 76/185 (41%), Gaps = 43/185 (23%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212
Query: 138 KCHTRCTNDNY----GRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKY 185
KC +C + N + + Y++N DPH GP AF T Y
Sbjct: 213 KCVKKCVSGNQVWKKSKHYSVSAYRVNS-----DPHDIMAEVYKNGPVEVAF-----TVY 262
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKIL 244
+ +G VY E+ +A VK++GWG ++G YW + + + ++GD G KI
Sbjct: 263 EDFAYYKSG-VYKHITGYELGGHA-VKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIR 320
Query: 245 RGRNE 249
RG NE
Sbjct: 321 RGTNE 325
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 35/50 (70%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
V I G+G ENG PYWTI +++G+Q+G+ G +++ G++ + LV+ A+
Sbjct: 386 VLITGYGVENGLPYWTIKNSWGDQWGEDGYFRLMLGKDVCGVSDLVSSAI 435
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLV+GG + S+ GC+P + PPC H + + P C P+C
Sbjct: 148 CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206
Query: 140 HTRCTNDNYGRGFFQDKY 157
+C + Y + DK+
Sbjct: 207 ILQCES-GYTPSYKADKH 223
>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
Length = 163
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 32/70 (45%), Gaps = 5/70 (7%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 38 CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKN 93
Query: 140 HTRCTNDNYG 149
H RCT YG
Sbjct: 94 H-RCTRMCYG 102
>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
griseus]
gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
Length = 465
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 400 SVKITGWGEEKLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIESFVLGV 454
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEENGRP-----YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE GR YW +++G +G+ G +I RG NE IE+ + G
Sbjct: 417 SVKITGWGEERGRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVGV 471
>gi|145486176|ref|XP_001429095.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124396185|emb|CAK61697.1| unnamed protein product [Paramecium tetraurelia]
Length = 464
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 34/52 (65%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+V GWGEE G +W + +++G+Q+G+ G ++ RG +E+ IES+ + P
Sbjct: 373 SVLCYGWGEEEGVKFWMLQNSWGDQWGESGNFRMKRGVDESAIESMAEASDP 424
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 29/35 (82%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
V +VG+G E+G+PYW I +++GE +GDKG +KIL+
Sbjct: 321 VLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILK 355
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350
>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 29/35 (82%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
V +VG+G E+G+PYW I +++GE +GDKG +KIL+
Sbjct: 321 VLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILK 355
>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
Length = 372
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 29/35 (82%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
V +VG+G E+G+PYW I +++GE +GDKG +KIL+
Sbjct: 321 VLLVGYGIEDGKPYWLIKNSWGEDWGDKGYVKILK 355
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 296 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 350
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S+ W + GLV+GG + S+ GC+P + PPC H + + P C P+C
Sbjct: 44 CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 102
Query: 140 HTRCTNDNYGRGFFQDKY 157
+C + Y + DK+
Sbjct: 103 ILQCES-GYTPSYKADKH 119
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 77/209 (36%), Gaps = 53/209 (25%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G W ++ +RG+VT C P P A + +++ + +
Sbjct: 272 CTGGRIDGAWWFLRRRGVVT-------EDCYPYRPPQQTPAELGRCMMQSRSVGRGKRQA 324
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAV 199
RC N N + D YQ P + S K Q NG V A+
Sbjct: 325 TQRCPNTN---NYQNDIYQST--------------PPYRLSTNEKEIMKEIQDNGPVQAI 367
Query: 200 SASAEI-------------VAYA-----------TVKIVGWGEENG-----RPYWTIVST 230
E V++ +VKI GWGEE R YW ++
Sbjct: 368 MEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVKITGWGEERNVDGAKRKYWIAANS 427
Query: 231 FGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+G+ +G++G +I RG NE IE+ V G
Sbjct: 428 WGKNWGEEGYFRIARGENECEIEAFVIGV 456
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 43/82 (52%), Gaps = 6/82 (7%)
Query: 184 KYTRPLFQTNGRVYAVS--ASAEIVAYATVKIVGWGEE--NGR--PYWTIVSTFGEQFGD 237
K +R F VY S AS Y +V+IVGWGEE G+ YW +++G +G+
Sbjct: 353 KVSRDFFMYKSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSWWGE 412
Query: 238 KGTIKILRGRNEAIIESLVNGA 259
G +IL+G +E IE V A
Sbjct: 413 NGYFRILKGVDECEIEDFVIAA 434
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 256 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 310
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+V+IVGWGE+ YW + +++G +G+KG +I RG + IES V LP
Sbjct: 350 SVRIVGWGEDAEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIESSVLTVLP 401
>gi|312266|emb|CAA51531.1| cathepsin B-like enzyme [Gallus gallus]
Length = 156
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 2/78 (2%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + +RGLV+GG + S+ GC + PPC H + S P C P+C
Sbjct: 63 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCAGYTIPPCEH-HVNGSRPPCTGEGGETPRC 121
Query: 140 HTRCTNDNYGRGFFQDKY 157
C Y + +DK+
Sbjct: 122 SRHC-EPGYSPSYKEDKH 138
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 91/244 (37%), Gaps = 61/244 (25%)
Query: 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
T + IRD S SC AVA A+ ++ C + R AG
Sbjct: 12 TITEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 54
Query: 65 QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
C + + C+ G W + G+V+ CQP FP C H ++
Sbjct: 55 MSCCDVCG-----YGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 102
Query: 125 SEPECKTLATPQPKCHTRCT-----------NDNY---GRGFFQDKYQINGLGLYFDPHF 170
C P C++ CT N +Y G F+ + +NG
Sbjct: 103 DLSPCSG-EYDTPTCNSTCTDKKVPLIKYRGNTSYLLSGEESFKRELLLNG--------- 152
Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVST 230
PF +F + Y L T G VY A + +A V+IVGWGE NG PYW I ++
Sbjct: 153 -PFEVSF-----SVYADFLAYTGG-VYKHVAGTFLGGHA-VRIVGWGELNGEPYWKIANS 204
Query: 231 FGEQ 234
+ +
Sbjct: 205 WNHE 208
>gi|14042811|dbj|BAB55403.1| unnamed protein product [Homo sapiens]
Length = 218
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 152 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 206
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 55/153 (35%), Gaps = 23/153 (15%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G W + G+VTGG+ GC+P FP C H + P C P PKC
Sbjct: 38 CDGGFPPMAWDFWKTHGIVTGGSKEEPAGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 96
Query: 140 HTRC-----------TNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRP 188
C T N Q + I L P +F P
Sbjct: 97 VKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGP--------VEATFEVHEDFP 148
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGEENG 221
+++ +A S V ++I+GWGEENG
Sbjct: 149 EYKSGIYFHAWGGS---VGGHAIRILGWGEENG 178
>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
Length = 218
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IES V G
Sbjct: 152 SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLGV 206
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V +GWG E+ PYW +++G +G+KG KILRG N IE+ V G K
Sbjct: 229 VLCIGWGVEDNTPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQVYGPQMK 280
>gi|340509339|gb|EGR34889.1| nucleotide binding protein, putative [Ichthyophthirius multifiliis]
Length = 732
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 35/48 (72%), Gaps = 3/48 (6%)
Query: 210 TVKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 255
+V VGWGE+ NG+ YW + +++GE +G+KG KI RG NEA IES+
Sbjct: 665 SVLCVGWGEDDINGK-YWIVQNSWGESWGEKGYFKIARGNNEASIESM 711
>gi|58617822|gb|AAW80530.1| cathepsin L-like cysteine protease [Leishmania infantum]
Length = 234
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 38/56 (67%)
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
AV AS+ + + V +VG+ + G PYW I +++GE +G+KG ++++ GRN +++
Sbjct: 127 AVDASSFMSYQSGVLLVGYNKTGGVPYWVIKNSWGEDWGEKGYVRVVMGRNACLLK 182
>gi|358255476|dbj|GAA57175.1| cathepsin L [Clonorchis sinensis]
Length = 385
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 28/39 (71%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V +VG+GEENG PYW I +++G +G+ G +KILR N
Sbjct: 334 VLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNN 372
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 28/39 (71%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V +VG+GEENG PYW I +++G +G+ G +KILR N
Sbjct: 322 VLLVGYGEENGIPYWLIKNSWGPHWGENGYVKILRDHNN 360
>gi|301122543|ref|XP_002908998.1| cathepsin B, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
gi|262099760|gb|EEY57812.1| cathepsin B, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
Length = 384
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V+IVGWGEE+G YW I +++G +G G KI+RG+N IE+
Sbjct: 228 VEIVGWGEEDGVKYWHIRNSWGTYWGMNGFFKIVRGKNNLGIEA 271
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 91/244 (37%), Gaps = 61/244 (25%)
Query: 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
T + IRD S SC AVA A+ ++ C + R AG
Sbjct: 12 TITEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 54
Query: 65 QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
C + + C+ G W + G+V+ CQP FP C H ++
Sbjct: 55 MSCCDVCG-----YGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 102
Query: 125 SEPECKTLATPQPKCHTRCT-----------NDNY---GRGFFQDKYQINGLGLYFDPHF 170
C P C++ CT N +Y G F+ + +NG
Sbjct: 103 DLSPCSG-EYDTPTCNSTCTDKKVPLIKYRGNTSYLLSGEESFKRELLLNG--------- 152
Query: 171 GPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVST 230
PF +F + Y L T G VY A + +A V+IVGWGE NG PYW I ++
Sbjct: 153 -PFEVSF-----SVYADFLAYTGG-VYKHVAGIFLGGHA-VRIVGWGELNGEPYWKIANS 204
Query: 231 FGEQ 234
+ +
Sbjct: 205 WNHE 208
>gi|440801087|gb|ELR22112.1| papain family cysteine protease subfamily protein, partial
[Acanthamoeba castellanii str. Neff]
Length = 557
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 39/61 (63%), Gaps = 1/61 (1%)
Query: 200 SASAEIVAYATVKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
S++++ V + +VGWG + NG YW I +++ Q+GDKG ++ RG N+A IE V+
Sbjct: 275 SSASDYVGGHAIAVVGWGTDVNGVDYWLIENSWSTQWGDKGYYRMKRGVNQAGIEGYVSA 334
Query: 259 A 259
A
Sbjct: 335 A 335
>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
Length = 488
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 29/44 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V +VGWGEENG PYW + +++G +G G KI RG +E ES
Sbjct: 435 VLLVGWGEENGVPYWLVKNSWGTSWGINGFFKIKRGTDECDCES 478
>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
Length = 467
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IE+ V G
Sbjct: 401 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGV 455
>gi|320162754|gb|EFW39653.1| papain family cysteine protease [Capsaspora owczarzaki ATCC 30864]
Length = 589
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 77/180 (42%), Gaps = 29/180 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGC--QPVSFPPCNHANYTTSEPECKTLATPQP 137
C+ G + +AW+ G+ H +TG + + P ++ T EP K A P
Sbjct: 424 CNGGDPLAAYAWIAVNGI------HDDTGTWYEAKNLPCTDYYKCHTCEPSGKCNAVPN- 476
Query: 138 KCHTRCTNDNYGRGFFQDKYQINGLGLYFDPHF--GPFWPAFWRSFCTKYTRPLFQTNGR 195
C N +G F +I G F GP + T L G
Sbjct: 477 -----CLN--FGVAQFG---EIVGEAAMKAEIFARGPV------AVTIAVTTDLINYTGG 520
Query: 196 VYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V+ + A I +V + GWG +N G PYWTIV+++G +G+ G +I+RG N IES
Sbjct: 521 VFHDTTGA-IGDDHSVMLTGWGVDNSGTPYWTIVNSWGTYWGETGAARIVRGVNNLGIES 579
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 74/195 (37%), Gaps = 39/195 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212
Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
KC +C N + R Y + + DP GP AF T +
Sbjct: 213 KCVRKCVKGNQIWKR---SKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAF-----TVFED 264
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+G ++ SA + VK++GWG + G YW + + + +GD G KI RG
Sbjct: 265 FAHYKSGVYKHITGSA--LGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG 322
Query: 247 RNEAIIESLVNGALP 261
NE IE V LP
Sbjct: 323 TNECGIEDDVTAGLP 337
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 74/195 (37%), Gaps = 39/195 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214
Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
KC +C N + R Y + + DP GP AF T +
Sbjct: 215 KCVRKCVKGNQIWKR---SKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAF-----TVFED 266
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+G ++ SA + VK++GWG + G YW + + + +GD G KI RG
Sbjct: 267 FAHYKSGVYKHITGSA--LGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG 324
Query: 247 RNEAIIESLVNGALP 261
NE IE V LP
Sbjct: 325 TNECGIEDDVTAGLP 339
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 29/46 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
+KIVGWG EN YW + +++G +G G KI RG NE IE+ V
Sbjct: 180 IKIVGWGVENNVKYWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 74/195 (37%), Gaps = 39/195 (20%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214
Query: 138 KCHTRCTNDN--YGRGFFQDKYQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTR 187
KC +C N + R Y + + DP GP AF T +
Sbjct: 215 KCVRKCVKGNQIWKR---SKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAF-----TVFED 266
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG 246
+G ++ SA + VK++GWG + G YW + + + +GD G KI RG
Sbjct: 267 FAHYKSGVYKHITGSA--LGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRG 324
Query: 247 RNEAIIESLVNGALP 261
NE IE V LP
Sbjct: 325 TNECGIEDDVTAGLP 339
>gi|170579559|ref|XP_001894882.1| cathepsin F-like cysteine proteinase [Brugia malayi]
gi|158598358|gb|EDP36268.1| cathepsin F-like cysteine proteinase, putative [Brugia malayi]
Length = 137
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 35/50 (70%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
V I G+G E+ PYWTI +++GEQ+G+ G +++RG++ + LV+ A+
Sbjct: 86 VLITGYGIEDNLPYWTIKNSWGEQWGENGYFRLMRGKDICGVSDLVSSAI 135
>gi|294935201|ref|XP_002781340.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239891890|gb|EER13135.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 77
Score = 47.4 bits (111), Expect = 0.009, Method: Composition-based stats.
Identities = 23/55 (41%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 264
T I+GWG E G YW +++++ E +GD GT KI +G + I+ V G+LP N
Sbjct: 25 TSLIIGWGTEKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPAMN 77
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 45/81 (55%), Gaps = 6/81 (7%)
Query: 185 YTRPLFQTNGRVYAVSAS-AEIVAYATVKIVGWGEE-----NGRPYWTIVSTFGEQFGDK 238
Y R +++ + + S S ++ + +VKIVGWG + N YW +++G +G++
Sbjct: 361 YNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDRSDWYNPIKYWLCTNSWGRNWGEQ 420
Query: 239 GTIKILRGRNEAIIESLVNGA 259
G +I+RG NE IES V G
Sbjct: 421 GMFRIVRGVNECEIESFVLGV 441
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IE+ V G
Sbjct: 400 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGV 454
>gi|188501543|gb|ACD54672.1| cysteine protease [Adineta vaga]
Length = 333
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%)
Query: 198 AVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 256
AV +I Y ++IVG+G E G+PYW ++ G+ +G++G +I R +N I LV
Sbjct: 241 AVDYVVKINKYYELQIVGYGVERGKPYWICKNSLGQNWGEEGYFRIARDKNMCRIAELV 299
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326
>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IE+ V G
Sbjct: 349 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGV 403
>gi|145490612|ref|XP_001431306.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398410|emb|CAK63908.1| unnamed protein product [Paramecium tetraurelia]
Length = 490
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 34/52 (65%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
+V GWGEE+G +W + +++G Q+G+ G ++ RG +E+ IES+ + P
Sbjct: 399 SVLCYGWGEEDGVKFWMLQNSWGNQWGEGGNFRMKRGVDESAIESMAEASDP 450
>gi|312068028|ref|XP_003137021.1| papain family cysteine protease containing protein [Loa loa]
gi|307767820|gb|EFO27054.1| papain family cysteine protease containing protein [Loa loa]
Length = 332
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 32/48 (66%)
Query: 212 KIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+++G+G ENG+ YW I +++GE +GD G KI RG N +E+ V A
Sbjct: 283 EVIGYGTENGKKYWLIKNSWGEWWGDHGFFKIERGINACQVETYVASA 330
>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
Length = 541
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 11/78 (14%)
Query: 196 VYAVSASAEIVA-------YATVKIVGWGEE----NGRPYWTIVSTFGEQFGDKGTIKIL 244
VY+ +A IV Y +VKI+GWGE+ N YW + +++G +G+ G +I
Sbjct: 456 VYSSTAIDNIVVEQVKDNTYHSVKIIGWGEKKSKTNSGKYWIVQNSWGANWGEGGYFRIR 515
Query: 245 RGRNEAIIESLVNGALPK 262
+G NE IE ++ A P+
Sbjct: 516 KGVNECGIEEMILAAWPQ 533
>gi|340503546|gb|EGR30116.1| hypothetical protein IMG5_141560 [Ichthyophthirius multifiliis]
Length = 599
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 33/53 (62%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+V VGWGE YW + +++GE +G+KG KI RG +E+ IES+ A K
Sbjct: 532 SVLCVGWGENEDGKYWLVQNSWGEDWGEKGYFKIRRGTDESNIESMGERAFIK 584
>gi|324105223|gb|ADY18374.1| cathepsin B [Glycera tridactyla]
Length = 117
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 83 GISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 142
G S W + G+VTGG ++++ GC+P + P C H + + P C + P P+C +
Sbjct: 1 GFPRSAWEYFKVTGIVTGGQYNTHEGCRPYTIPKCEH-HVNGTLPPCSSTIKPTPRCERK 59
Query: 143 C 143
C
Sbjct: 60 C 60
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V VG+G +NG+PYW + +++GE FG++G +I RG I S+V A K
Sbjct: 275 VLTVGYGVQNGKPYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTARIK 326
>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
Length = 237
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 2/63 (3%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W++ K+GLV+GG ++S+ GC P + PPC H + S P C T P+C
Sbjct: 144 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 201
Query: 140 HTR 142
+ +
Sbjct: 202 NKK 204
>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 29/38 (76%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
T+KI+GWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 80 TLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 117
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 29/45 (64%)
Query: 218 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+E G PYW IV+++GE FG G + I RG NE IES V +PK
Sbjct: 320 KEEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGIPK 364
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 31/52 (59%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
V IVGWG E+ PYW + +++G +G+ G KILRG + ES V P+
Sbjct: 229 VLIVGWGVEDEVPYWLVQNSWGTDWGENGFFKILRGSDHCECESNVTAGYPE 280
>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 29/38 (76%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
T+KI+GWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 80 TLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 117
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 26/36 (72%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
VKI+GWG ENG YW I +++G FG +G KI+RG
Sbjct: 184 VKIIGWGTENGVDYWLIANSWGTTFGLQGFFKIVRG 219
>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
Length = 228
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 3/72 (4%)
Query: 86 SSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 145
S W + K+GLV+GG + S+ GC+P S PPC H + S P C T P+C C
Sbjct: 135 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 191
Query: 146 DNYGRGFFQDKY 157
Y + +DK+
Sbjct: 192 PGYSPSYKEDKH 203
>gi|348513412|ref|XP_003444236.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 328
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 40/64 (62%), Gaps = 2/64 (3%)
Query: 186 TRPLFQTNGR-VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
+RP F R VY + + V + ++ +VG+G E G+ YW + +++G QFG++G IK+
Sbjct: 252 SRPQFHFYHRGVYMDNTCTQKVNHGSL-VVGYGREKGQDYWLVKNSWGVQFGEEGYIKMA 310
Query: 245 RGRN 248
R RN
Sbjct: 311 RNRN 314
>gi|348676075|gb|EGZ15893.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 383
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 31/44 (70%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V+IVGWGEE+G YW + +++G +G G KI+RG+N IE+
Sbjct: 224 VEIVGWGEEDGVKYWHVRNSWGTYWGMNGFFKIVRGKNNLGIEA 267
>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
norvegicus]
gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; Flags:
Precursor
gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
Length = 467
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +GR YWT +++G +G++G +I+RG NE IE+ V G
Sbjct: 401 SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLGV 455
>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 309
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 36/54 (66%), Gaps = 1/54 (1%)
Query: 208 YATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
Y +V+IVG+G + G+ YW + + +G +G+ G +I+RG+NE IE V GA+
Sbjct: 245 YLSVEIVGYGTSDEGQDYWIVKNYWGSNWGEDGYFRIVRGQNECQIEEAVYGAI 298
>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 118
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 29/38 (76%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
T+KI+GWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 68 TLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 105
>gi|308163070|gb|EFO65432.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 97
Score = 46.6 bits (109), Expect = 0.013, Method: Composition-based stats.
Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 4/77 (5%)
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVSTFGEQFGDKGTIKI 243
R G VY +I ++A V+I+G+G +E+ PYW + ++ G +G+ G I
Sbjct: 18 RDFLYYRGGVYRHVYGVQISSHA-VEIIGYGTTDDEDRVPYWIVKNSLGPNWGEDGYFNI 76
Query: 244 LRGRNEAIIESLVNGAL 260
+RG NE IES V+ L
Sbjct: 77 VRGSNECDIESAVHSGL 93
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 65/147 (44%), Gaps = 20/147 (13%)
Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWP 175
S P C+ A PKC +C N + + + K + +N + DP+ GP
Sbjct: 53 SHPGCEP-AYQTPKCVRKCVKGN--QIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEV 109
Query: 176 AFWRSFCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQ 234
AF T Y +G VY +++ +A VK++GWG + G YW I + +
Sbjct: 110 AF-----TVYEDFAHYKSG-VYKHITGSQLGGHA-VKLIGWGTTDEGEDYWLIANQWNRS 162
Query: 235 FGDKGTIKILRGRNEAIIESLVNGALP 261
+GD G I RG NE IE V LP
Sbjct: 163 WGDDGYFMIRRGTNECGIEEDVTAGLP 189
>gi|195455847|ref|XP_002074892.1| GK22908 [Drosophila willistoni]
gi|194170977|gb|EDW85878.1| GK22908 [Drosophila willistoni]
Length = 381
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 30/38 (78%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G ENG+ YWTI +++GE +G+ G +++RG+N
Sbjct: 331 VLVVGYGSENGQDYWTIKNSWGENWGESGYFRLIRGQN 368
>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 105
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 2/52 (3%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
VKI+GWGE++G+ YW V+++ E +GD G KI G N I + L+ G PK
Sbjct: 55 VKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFKIALG-NCGIDDDLLGGT-PK 104
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE +G+ YWT +++G +G+ G +I+RG NE IES V G
Sbjct: 400 SVKITGWGEEIQPDGQKVKYWTAANSWGPTWGENGYFRIVRGANECDIESFVVGV 454
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 73/190 (38%), Gaps = 39/190 (20%)
Query: 70 LVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC 129
LVS T C+ G WAW G+ T +G V
Sbjct: 117 LVSCDTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSGRV----------------- 159
Query: 130 KTLATPQPKCHTRCTNDNYGRGFFQDK----YQINGLGLYFDPH-FGPFWPAFWRSFCTK 184
P C +C N G ++K ++N + + + GP AF T
Sbjct: 160 -------PACPAKCVN---GSAIVRNKSVSYKKLNAQQMMEELYENGPISVAF-----TV 204
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
Y + +G VY V + I V VGWG E+ PYW +++G +G+KG KIL
Sbjct: 205 YYDFMNYKSG-VY-VHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNSWGPAWGEKGHFKIL 262
Query: 245 RGRNEAIIES 254
RG N IE+
Sbjct: 263 RGSNHCGIEN 272
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 196 VYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
VY E+ +A VK++GWG E+G YW + + + +GD G KI RG NE IE
Sbjct: 37 VYKHVTGDELGGHA-VKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECDIED 95
Query: 255 LVNGALP 261
V +P
Sbjct: 96 EVVAGMP 102
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 73/190 (38%), Gaps = 39/190 (20%)
Query: 70 LVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC 129
LVS T C+ G WAW G+ T +G V
Sbjct: 117 LVSCDTTDMGCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSGRV----------------- 159
Query: 130 KTLATPQPKCHTRCTNDNYGRGFFQDK----YQINGLGLYFDPH-FGPFWPAFWRSFCTK 184
P C +C N G ++K ++N + + + GP AF T
Sbjct: 160 -------PACPAKCVN---GSAIVRNKSVSYKKLNAQQMMEELYENGPISVAF-----TV 204
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 244
Y + +G VY V + I V VGWG E+ PYW +++G +G+KG KIL
Sbjct: 205 YYDFMNYKSG-VY-VHKTGGIAGGHAVLCVGWGVEDNTPYWLCQNSWGPAWGEKGHFKIL 262
Query: 245 RGRNEAIIES 254
RG N IE+
Sbjct: 263 RGSNHCGIEN 272
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 35/55 (63%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE NGR +W +++G +G+ G+ +ILRG NE IES V G
Sbjct: 255 SVKITGWGEELQPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDIESFVVGV 309
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 55/194 (28%), Positives = 79/194 (40%), Gaps = 37/194 (19%)
Query: 80 CSSGISSSTWAWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 137
C G + W + G+VT + +TGC S P C EP A P P
Sbjct: 174 CDGGYPIAAWQYFSYSGVVTEECDPYFDDTGC---SHPGC--------EP-----AYPTP 217
Query: 138 KCHTRCTNDNYGRGFFQDK-YQINGLGLYFDPHF--------GPFWPAFWRSFCTKYTRP 188
KC +C + N + + Q K Y ++ + +P GP +F T Y
Sbjct: 218 KCMRKCVSGN--QLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSF-----TVYEDF 270
Query: 189 LFQTNGRVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+G VY + I +A VK++GWG + G YW + + + +GD G I RG
Sbjct: 271 AHYKSG-VYKHITGSNIGGHA-VKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGT 328
Query: 248 NEAIIESLVNGALP 261
NE IE LP
Sbjct: 329 NECGIEDEPVAGLP 342
>gi|294876288|ref|XP_002767632.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239869318|gb|EER00350.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 97
Score = 46.2 bits (108), Expect = 0.016, Method: Composition-based stats.
Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 2/60 (3%)
Query: 202 SAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
S + +V+I+GWG E G YW +++++ E +GD GT KI +G + I +V GA P
Sbjct: 37 SGTFMGVHSVEIIGWGIEKGVDYWLVMNSWNEDWGDNGTFKIAQG--DCGINDMVLGAPP 94
>gi|170579333|ref|XP_001894785.1| Papain family cysteine protease containing protein [Brugia malayi]
gi|158598509|gb|EDP36387.1| Papain family cysteine protease containing protein [Brugia malayi]
Length = 324
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 8/115 (6%)
Query: 148 YGRGFFQDKYQIN---GLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASAE 204
YG F YQI+ +F + P A +F +Y F +G + +
Sbjct: 213 YGEIFINKLYQIDPDPNAMAWFVANVAPI--ALNLAFPKRYK---FYKSGVLPDTDECST 267
Query: 205 IVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+ +++G+G ENG+ YW + +++GE +GD+G K+ RG N +E+ V A
Sbjct: 268 MEPNHAAEVIGYGTENGKKYWLLKNSWGEWWGDQGFFKMERGVNACKVETYVASA 322
>gi|403343435|gb|EJY71046.1| Papain family cysteine protease containing protein [Oxytricha
trifallax]
Length = 619
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 24/102 (23%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 194 GRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 253
G +Y + + + + V +VG+G ENG +W + +++G +G+ G ++++RG N IE
Sbjct: 220 GGIYQDTTGDQNIVH-DVSVVGFGVENGTKFWVVRNSWGSHYGENGFVRVIRGVNNIAIE 278
Query: 254 SLVNGALPKDNYGVEFGEESGERLSEEFGVRAESSEEFRENG 295
+ A P D + ++ + + ++++R+NG
Sbjct: 279 TDCAWATPVDTWTNRVPHKTTDAEKND-----PKNDKYRKNG 315
>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
Length = 462
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 34/53 (64%), Gaps = 2/53 (3%)
Query: 211 VKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V +VG+G + NG YW + +++G +G+KG +ILRG +E IES+ A P
Sbjct: 406 VLLVGYGTDSANGMDYWIVKNSWGTSWGEKGYFRILRGTDECAIESIAMAATP 458
>gi|303277733|ref|XP_003058160.1| cathepsin [Micromonas pusilla CCMP1545]
gi|226460817|gb|EEH58111.1| cathepsin [Micromonas pusilla CCMP1545]
Length = 583
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 24/33 (72%)
Query: 213 IVGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 245
+VGWG ENG YW + +T+GE FG+KG K+ R
Sbjct: 419 VVGWGVENGMKYWLVRNTYGEDFGEKGYFKLER 451
>gi|384249023|gb|EIE22506.1| cysteine proteinase [Coccomyxa subellipsoidea C-169]
Length = 404
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 45/83 (54%), Gaps = 1/83 (1%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 270
V++ GWGEE+G P+W + +++G +G+ G +I RG N +E + + + +E
Sbjct: 258 VEVTGWGEEHGVPFWIVRNSWGTFWGEMGFFRIERGINSLFLED-SDCWYAEPEHEMEDE 316
Query: 271 EESGERLSEEFGVRAESSEEFRE 293
E GE + +GV SE+ E
Sbjct: 317 VEDGELVGSMYGVLDAKSEQGSE 339
>gi|281209002|gb|EFA83177.1| cathepsin Z precursor [Polysphondylium pallidum PN500]
Length = 309
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 33/59 (55%), Gaps = 13/59 (22%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEF 269
V IVGWGEENG YW + +++G +G++G +I+RG P +N G+E
Sbjct: 249 VSIVGWGEENGESYWIVRNSWGMYYGEQGFFRIVRGS-------------PFENLGIEL 294
>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
Length = 210
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 2/61 (3%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 140 H 140
+
Sbjct: 208 N 208
>gi|221505681|gb|EEE31326.1| cathepsin L, putative [Toxoplasma gondii VEG]
Length = 733
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 5/49 (10%)
Query: 211 VKIVGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V IVGWGE ENG+P YW + +T+G +G G +KI RG+N IES
Sbjct: 656 VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704
>gi|237838179|ref|XP_002368387.1| cathepsin C [Toxoplasma gondii ME49]
gi|211966051|gb|EEB01247.1| cathepsin C [Toxoplasma gondii ME49]
gi|221484340|gb|EEE22636.1| cathepsin C, putative [Toxoplasma gondii GT1]
Length = 733
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 5/49 (10%)
Query: 211 VKIVGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V IVGWGE ENG+P YW + +T+G +G G +KI RG+N IES
Sbjct: 656 VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704
>gi|70919569|gb|AAZ15654.1| cathepsin C1 [Toxoplasma gondii]
Length = 730
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 32/49 (65%), Gaps = 5/49 (10%)
Query: 211 VKIVGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V IVGWGE ENG+P YW + +T+G +G G +KI RG+N IES
Sbjct: 653 VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 701
>gi|294950069|ref|XP_002786445.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239900737|gb|EER18241.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 149
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 32/45 (71%)
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKI 243
V + ++V T+KI+GWG E+G+ YW ++++ E++GD G IK+
Sbjct: 79 VHTTGDLVGSHTLKIIGWGVESGQEYWLAMNSWNEEWGDHGLIKM 123
>gi|407399825|gb|EKF28451.1| cysteine peptidase, putative, partial [Trypanosoma cruzi
marinkellei]
Length = 257
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 36/50 (72%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
V +VG+ + PYWTI +++G+Q+G++G I+I +G N+ +++ V+ A+
Sbjct: 119 VLLVGYNDSAPVPYWTIKNSWGKQWGEEGYIRIAKGSNQCLVKDRVSSAV 168
>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
Length = 426
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 79/200 (39%), Gaps = 60/200 (30%)
Query: 91 WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 145
WV++ GLVTGG GC+P SF PC+ A + +E E +T C RC N
Sbjct: 223 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAE-EKRT-------CMRRCQN 269
Query: 146 DNYGRGFFQDK------YQINGLGLYFDPHFGPFW--PAFWRSFCTKYTRPLFQTNGR-- 195
Y + + +DK Y + + P P F K T L T R
Sbjct: 270 IYYQQKYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTIIGHFNDKNTEKLNVTEYRNV 329
Query: 196 ------VYAVSASA-------------------------EIVAYATVKIVGWGE-ENGRP 223
+Y + A IV + V+++GWGE ++G+
Sbjct: 330 IKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHVVRLIGWGESDDGQH 389
Query: 224 YWTIVSTFGEQFGDKGTIKI 243
YW V++FG +GD G KI
Sbjct: 390 YWLAVNSFGNHWGDNGIFKI 409
>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 105
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 28/40 (70%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
VK++GWGEE G PYW + +++ + +G+ G ++ RG N+
Sbjct: 53 AVKVIGWGEEKGTPYWIVANSWHDDWGENGFFRMHRGSND 92
>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
Length = 446
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 37/58 (63%), Gaps = 1/58 (1%)
Query: 204 EIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
EI +A V +VG+G + G YW + +++G+ +G++G +ILRG +E IES+ P
Sbjct: 388 EITNHA-VLLVGYGADEGTKYWIVKNSWGKGWGEEGYFRILRGADECAIESIAVETFP 444
>gi|37903264|gb|AAO64475.1| cathepsin K [Fundulus heteroclitus]
Length = 191
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G E G YW I +++GE FGD+G IK+ R RN
Sbjct: 140 VLAVGYGTEKGEDYWLIKNSWGEHFGDEGYIKMARNRN 177
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
F + G Y S ++ + + V +VG+G +NG+ YW + +++ E +GD+G IKI R R
Sbjct: 263 FYSKGVYYEPSCDSDDLDHG-VLVVGYGSDNGKDYWLVKNSWSEHWGDQGYIKIARNR 319
>gi|58332124|ref|NP_001011214.1| cathepsin S, gene 2 precursor [Xenopus (Silurana) tropicalis]
gi|56556518|gb|AAH87770.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 332
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 28/39 (71%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+V +VG+G +NG YW + +++G +GDKG IK+ R RN
Sbjct: 280 SVLVVGYGTDNGNDYWLVKNSWGAGYGDKGYIKMARNRN 318
>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 261
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
C+ G S W + +RGLV+GG + S+ GC+P S PPC H T P
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYSIPPCEHHVNGTRPP 197
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 28/38 (73%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G +NG PYW I +++GE +G+ G ++ILR N
Sbjct: 320 VLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILRNHN 357
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 45.4 bits (106), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 5/55 (9%)
Query: 210 TVKIVGWGEE-----NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
+VKI GWGEE + + YW +++G+ +G+ G +I RG NE IE+ V G
Sbjct: 452 SVKITGWGEEQMPDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIETFVVGV 506
>gi|308462797|ref|XP_003093679.1| hypothetical protein CRE_29188 [Caenorhabditis remanei]
gi|308249543|gb|EFO93495.1| hypothetical protein CRE_29188 [Caenorhabditis remanei]
Length = 353
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 47/83 (56%), Gaps = 8/83 (9%)
Query: 168 PHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASA--EIVAYATVKIVGWGEENGRPYW 225
P FGP +F + + +Y+ S + + +A T+ I+G+G +NG+P+W
Sbjct: 263 PVFGPV------AFGMPVPKSIMYYKSGIYSPSPADCNQPIAAHTMSIIGYGIDNGKPFW 316
Query: 226 TIVSTFGEQFGDKGTIKILRGRN 248
T+ +++G ++G+ G +++ RG N
Sbjct: 317 TVKNSWGPRWGENGYMRMARGSN 339
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 185 YTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENG-----RPYWTIVSTFGEQFGDKG 239
Y +F+ Y + A +V+I GWGEE R YW +++G+ +G+ G
Sbjct: 372 YKSGIFRHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYWIGANSWGKNWGEDG 431
Query: 240 TIKILRGRNEAIIESLVNGA 259
+I RG NE IE+ V G
Sbjct: 432 YFRIARGVNECDIETFVIGV 451
>gi|344238391|gb|EGV94494.1| Ras-specific guanine nucleotide-releasing factor 1 [Cricetulus
griseus]
Length = 1632
Score = 45.4 bits (106), Expect = 0.032, Method: Composition-based stats.
Identities = 18/49 (36%), Positives = 32/49 (65%)
Query: 214 VGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
VG+GE++G PYW + +++G +GDKG I RG+N + + + +P+
Sbjct: 1583 VGYGEKDGIPYWIVKNSWGTNWGDKGYFLIERGKNMCGLAACASYPIPQ 1631
>gi|294936554|ref|XP_002781799.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239892784|gb|EER13594.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 88
Score = 45.4 bits (106), Expect = 0.033, Method: Composition-based stats.
Identities = 17/36 (47%), Positives = 28/36 (77%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
V+I+GWG E G YW +++++ E++GD GT KI++G
Sbjct: 37 VEIIGWGTEKGVDYWLVMNSWNEEWGDHGTFKIVQG 72
>gi|28932706|gb|AAO60047.1| midgut cysteine proteinase 4 [Rhipicephalus appendiculatus]
Length = 345
Score = 45.4 bits (106), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 28/38 (73%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+GEE G PYW + +++G +G+ G IKILR RN
Sbjct: 295 VLLVGYGEERGVPYWIVKNSWGPGWGEGGYIKILRNRN 332
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
F + G Y S ++ + + V +VG+G +NG+ YW + +++ E +GD+G IKI R R
Sbjct: 267 FYSKGVYYEPSCDSDDLDHG-VLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNR 323
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 259
V VG+G ENG PYWT+ +++G FG+ G +I RG I LV+ A
Sbjct: 552 VLTVGYGTENGLPYWTVKNSWGTAFGEDGYFRIYRGGGTCGINRLVSTA 600
>gi|37903252|gb|AAO64474.1| cathepsin F [Fundulus heteroclitus]
Length = 166
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 32/51 (62%)
Query: 210 TVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
V +VG+GE NG P+W I +++GE +G++G + RG N I + + A+
Sbjct: 114 AVLLVGYGERNGTPFWAIKNSWGEDYGEQGYYYLYRGSNACGINKMCSSAV 164
>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
Length = 320
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 29/39 (74%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V +VG+G ENG YW + +++ E +G+ G +K+LRG+NE
Sbjct: 270 VLVVGYGSENGVNYWLVKNSWAEDWGESGYLKLLRGQNE 308
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 4/68 (5%)
Query: 196 VYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 252
VY ++I ++A V+I+G+G +E+ PYW + ++ G +G++G I+RG NE I
Sbjct: 235 VYRHVYGSQISSHA-VEIIGYGAADDEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDI 293
Query: 253 ESLVNGAL 260
ES V L
Sbjct: 294 ESAVYSGL 301
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 36/58 (62%), Gaps = 1/58 (1%)
Query: 190 FQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
F + G Y S ++ + + V +VG+G +NG+ YW + +++ E +GD+G IKI R R
Sbjct: 267 FYSKGVYYEPSCDSDDLDHG-VLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNR 323
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 90/234 (38%), Gaps = 41/234 (17%)
Query: 5 TSSRIRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVECTSFRFIAGVK 64
T + IRD S SC AVA A+ ++ C + R AG
Sbjct: 12 TVTEIRDQS-------------SCGSCWAVAAASAISDRYCTLGGVR----DLRISAGDL 54
Query: 65 QRCAWLVSRWMTIWVCSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT 124
C + + C+ G W + G+V+ CQP FP C H ++
Sbjct: 55 MSCCDVCG-----FGCNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSS 102
Query: 125 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDK--YQINGLGLYFDPHF--GPFWPAFWRS 180
C P C++ CT+ ++ Y ++G + GPF +F
Sbjct: 103 DLSPCSG-EYDTPTCNSTCTDKKIPLIKYRGNTSYVLSGEEPFKRELILNGPFEVSF--- 158
Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQ 234
+ Y + T G VY A + +A V+IVGWGE NG PYW I +++ +
Sbjct: 159 --SVYADFVAYTGG-VYKHVAGIFLGGHA-VRIVGWGELNGEPYWKIANSWNHE 208
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 28/39 (71%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V +VG+G NG+ YW + +++G FG+ G ++LRG+NE
Sbjct: 272 VLVVGYGTSNGKKYWIVKNSWGGSFGESGYFRLLRGKNE 310
>gi|410904753|ref|XP_003965856.1| PREDICTED: cathepsin S-like [Takifugu rubripes]
Length = 334
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 37/54 (68%), Gaps = 1/54 (1%)
Query: 196 VYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
VY + ++ + +A V VG+G NG+ YW + +++G +FGDKG I+++R +N+
Sbjct: 269 VYDDPSCSQTINHA-VLAVGYGTLNGQDYWLVKNSWGVKFGDKGYIRMVRNKND 321
>gi|403355703|gb|EJY77438.1| Papain family cysteine protease containing protein [Oxytricha
trifallax]
Length = 617
Score = 45.1 bits (105), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 34/55 (61%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNY 265
+ IVG+G ENG YW + +++G +G+ G +++RG N +ES A+P D +
Sbjct: 232 ISIVGYGVENGTKYWVVRNSWGTSWGESGFARVIRGINNLNLESDCAYAVPVDTW 286
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 3/78 (3%)
Query: 186 TRPLFQTNGRVYAVSASAEIVAYATVKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKI 243
T + G +++ + I+ + + + GWG E YW + +++G +G+KG +KI
Sbjct: 528 TDKMHDYMGGIFSEKKAVPIINH-IISVAGWGLDEATNTEYWIVRNSWGTYWGEKGWMKI 586
Query: 244 LRGRNEAIIESLVNGALP 261
+ IE+ NGA+P
Sbjct: 587 KMHSDNLAIETDCNGAIP 604
>gi|218139209|gb|ACK57788.1| cathepsin C [Litopenaeus vannamei]
Length = 451
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 36/53 (67%), Gaps = 2/53 (3%)
Query: 211 VKIVGWGEE--NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V +VG+GE+ G YW++ +++GE++G+ G +I RG +E IES+ A+P
Sbjct: 397 VLLVGYGEDEATGEKYWSVKNSWGEEWGEDGYFRIRRGVDECAIESMAVEAVP 449
>gi|294893015|ref|XP_002774310.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239879603|gb|EER06126.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 81
Score = 45.1 bits (105), Expect = 0.042, Method: Composition-based stats.
Identities = 19/48 (39%), Positives = 31/48 (64%)
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
V + +V ++KI+GWG E+G+ YW V+++ E+ GD G IK+ G
Sbjct: 20 VHTTGGLVGVHSLKIIGWGVESGQDYWLAVNSWNEESGDHGMIKLAVG 67
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 37/56 (66%), Gaps = 5/56 (8%)
Query: 210 TVKIVGWG---EENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 260
+VK++GWG ++NG+ +W +++G+ +G+ G +ILRG+NE IE L+ L
Sbjct: 411 SVKLLGWGALPDKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466
>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
Length = 205
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
C+ G S+ W + GLVTGG + S+ GC+P S PPC H T P
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPP 195
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVSTFGEQFGDKGTIKI 243
R G VY +I ++A V+I+G+G +E PYW + ++ G +G++G I
Sbjct: 252 RDFLYYRGGVYKHVYGIQISSHA-VEIIGYGTTDDEERIPYWIVKNSLGPNWGEEGYFNI 310
Query: 244 LRGRNEAIIESLVNGAL 260
+RG NE IES V L
Sbjct: 311 VRGSNECDIESAVYSGL 327
>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
Length = 207
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP 127
C+ G S+ W + GLVTGG + S+ GC+P S PPC H T P
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPP 195
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 35/63 (55%)
Query: 199 VSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 258
V A+ + + +KI+GWG E+ YW +++G +G +G KI RG +E IE +
Sbjct: 247 VHATGKQLGGHAIKILGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAA 306
Query: 259 ALP 261
LP
Sbjct: 307 GLP 309
>gi|77744608|gb|ABB02268.1| cathepsin B [Ovis aries]
Length = 76
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 26/40 (65%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNH 119
C+ G S W + K+GLV+GG + S+ GC+P S PPC H
Sbjct: 34 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH 73
>gi|431920312|gb|ELK18347.1| Cathepsin H [Pteropus alecto]
Length = 232
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 32/51 (62%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V VG+GEENG+PYW + +++G Q+G G I RG+N + + + +P
Sbjct: 180 VLAVGYGEENGKPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP 230
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.134 0.438
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,266,296,325
Number of Sequences: 23463169
Number of extensions: 230236207
Number of successful extensions: 453287
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2343
Number of HSP's successfully gapped in prelim test: 439
Number of HSP's that attempted gapping in prelim test: 449832
Number of HSP's gapped (non-prelim): 3097
length of query: 298
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 157
effective length of database: 9,050,888,538
effective search space: 1420989500466
effective search space used: 1420989500466
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)
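
The statistics footer above can be related to the per-hit Score and Expect lines with the standard Karlin-Altschul formulas. The sketch below is illustrative only and is not part of the BLAST output. It assumes that the second Lambda/K pair (0.267, 0.0410) is the gapped pair, and that the reported "effective search space" is the m*n term. The function and constant names are hypothetical. The footer values are rounded, and compositional matrix adjustment rescales scores per subject sequence, so the results should only approximate the Expect values printed for individual hits.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267                    # assumed: second "Lambda K H" row is the gapped one
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 1_420_989_500_466


def bits_from_raw(raw_score: float, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to a bit score: S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_from_bits(bit_score: float, search_space: float = EFFECTIVE_SEARCH_SPACE) -> float:
    """Approximate E-value from a bit score: E = search_space * 2^(-S')."""
    return search_space * 2.0 ** (-bit_score)


if __name__ == "__main__":
    # Raw scores 105-107 appear in parentheses on the Score lines above.
    for raw in (107, 106, 105):
        bits = bits_from_raw(raw)
        print(f"raw={raw}  bits={bits:.1f}  E~{expect_from_bits(bits):.3f}")
```

For a raw score of 107, this gives a bit score close to the 45.8 bits printed in the report and an E-value of the same order as the 0.022 to 0.026 range shown for those hits. The same conversion approximately reproduces the bit values printed beside X2 and S2 in the footer.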