BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10824
(185 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 74/103 (71%), Gaps = 3/103 (2%)
Query: 73 QSNTELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+SN LPE FD R+++P C+++ G ++ QSNCGSCWA++ + SDR+CIAT G + L
Sbjct: 85 ESNEALPENFDARERWPECSSLLGSIKDQSNCGSCWAVSAASVFSDRLCIATGGAVARNL 144
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ L TCC C G+ C+GG+P AWY+ + +G+ TGGDYGS
Sbjct: 145 SAEQLNTCCYRC--GNGCDGGSPESAWYFFMRHGIVTGGDYGS 185
>gi|157058755|gb|ABV03135.1| cathepsin B-84 [Aulacorthum solani]
Length = 218
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 76/125 (60%), Gaps = 13/125 (10%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N+E+PE FD R ++ C IGHV+ Q NCGSCWA TT A +DR+C+AT G ++ +
Sbjct: 38 YMDNSEVPEFFDSRLEWKYCKTIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEVNQLI 97
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDR 180
S++ + CC C G C GGNP+RAW Y +GV TGGDY + C + D+
Sbjct: 98 SAEEVTFCCHRCGFG--CNGGNPLRAWQYFKRHGVVTGGDYNTTDGCQPYRVPPCVKDDK 155
Query: 181 GNCNC 185
G+ +C
Sbjct: 156 GHNSC 160
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 54/116 (46%), Positives = 76/116 (65%), Gaps = 5/116 (4%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D + LP+EFD RKQ+PNC+ IG ++ Q +CGSCWA A+SDR+CI + G+L
Sbjct: 75 DVSLDITLPDEFDARKQWPNCSTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNGKLQVH 134
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LS+++LL+CC +C GD C GG+P AW Y + G+ +GG+YGS CQ + C
Sbjct: 135 LSAENLLSCCDSC--GDGCLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPYSIAPC 188
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 112 bits (280), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 56/125 (44%), Positives = 73/125 (58%), Gaps = 13/125 (10%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N E+PE FD R ++ NC IG V+ Q NCGSCWA TT A +DR+CIAT G + +
Sbjct: 78 YVDNGEVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELI 137
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDR 180
S++ L CC C G C GGNP++AW Y +GV TGG+Y + C R D
Sbjct: 138 SAEELTFCCHTCGFG--CNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPCVRDDE 195
Query: 181 GNCNC 185
G+ +C
Sbjct: 196 GHNSC 200
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N + + FD R+ + C IGHV+ Q NCGSCWA TT A +DR+C+AT G + L
Sbjct: 80 YTKNNDTIKHFDAREDWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQL 139
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ L CC C G C+GGNP++AW Y +G+ TGGDYGS
Sbjct: 140 SAEKLTFCCWTCGLG--CQGGNPIKAWKYFKRHGITTGGDYGS 180
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 88/145 (60%), Gaps = 14/145 (9%)
Query: 34 ILKSSPSF-LSSLKFGL---SLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYP 89
+ K+ P+ + L+ + +S++P D ++G E F ++P+ FD R Q+P
Sbjct: 54 LFKAEPAAAIEELRMKIMKSKFISRSKKPRVD-EIGEEGF-------KIPDSFDARVQWP 105
Query: 90 NCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVC 149
+C +I +++ QS CGSCWA + A+SDR+CIA+ G LS+D +L+CC C GD C
Sbjct: 106 HCPSISYIRDQSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDILSCCYDC--GDGC 163
Query: 150 EGGNPMRAWYYMLENGVPTGGDYGS 174
+GG P+ AW Y +E GV TGG YG+
Sbjct: 164 DGGYPISAWEYFVETGVVTGGLYGT 188
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 53/101 (52%), Positives = 67/101 (66%), Gaps = 2/101 (1%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q+N +PE FD R+Q+PNC +I ++ QS CGSCWA A T SDR+CIA+ L ++S
Sbjct: 82 QTNDPIPESFDSREQWPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSIS 141
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
S+ LL CCA C G+ C+GG P AW YM GV TGG YG
Sbjct: 142 SEDLLECCATC--GNGCQGGYPSAAWKYMKATGVSTGGLYG 180
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 2/101 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N+E+PE FD R ++ C IGHV+ Q NCGSCWA TT A +DR+C+AT G + +
Sbjct: 78 YMDNSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELI 137
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
S++ L CC C G C GG P++AW Y +GV TGGDY
Sbjct: 138 SAEELTFCCHRCVFG--CNGGYPLKAWQYFKRHGVVTGGDY 176
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 71/105 (67%), Gaps = 2/105 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y +P EFD RK++ NCT IG ++ Q NCGSCWA +T+ A +DR+CIA+ G + L
Sbjct: 78 YSPTGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLL 137
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
S++H+ +CC C G C+GG P+RAW Y ++G+ TGG++ S +
Sbjct: 138 SAEHVTSCCYRCGLG--CQGGYPIRAWRYYSKHGLVTGGNFNSFE 180
>gi|157058761|gb|ABV03138.1| cathepsin B-84 [Myzus persicae]
Length = 220
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/125 (44%), Positives = 73/125 (58%), Gaps = 13/125 (10%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N E+PE FD R ++ NC IG V+ Q NCGSCWA TT A +DR+CIAT G + +
Sbjct: 41 YVDNGEVPEFFDSRLEWKNCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIATDGEFNELI 100
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDR 180
S++ L CC C G C GGNP++AW Y +GV TGG+Y + C R D
Sbjct: 101 SAEELTFCCHTCGFG--CNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPSRVPPCVRDDE 158
Query: 181 GNCNC 185
G+ +C
Sbjct: 159 GHNSC 163
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 64/103 (62%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N + FD R+ + C IGHV+ Q NCGSCWA TT A +DR+C+AT G + L
Sbjct: 80 YTKNNNKIKHFDARENWKICKQIGHVRDQGNCGSCWAFGTTGAFADRLCVATGGGFNEQL 139
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ L CC C G C+GGNP++AW Y G+ TGGDYGS
Sbjct: 140 SAEKLTFCCWTCGLG--CQGGNPIKAWKYFKRRGITTGGDYGS 180
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 71/105 (67%), Gaps = 2/105 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y +P EFD RK++ NCT IG ++ Q NCGSCWA +T+ A +DR+CIA+ G + L
Sbjct: 78 YSPAGNIPNEFDARKRWKNCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLL 137
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
S++H+ +CC C G C+GG P+RAW Y ++G+ TGG++ S +
Sbjct: 138 SAEHVTSCCYRCGLG--CQGGYPIRAWRYYSKHGLVTGGNFNSFE 180
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 2/101 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N+E+PE FD R ++ C IGHV+ Q NCGSCWA TT A +DR+C+AT G + +
Sbjct: 78 YMDNSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELI 137
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
S++ L CC C G C GG P++AW Y +GV TGGDY
Sbjct: 138 SAEELTFCCHRCGFG--CNGGYPLKAWQYFKRHGVVTGGDY 176
>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
Length = 230
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/101 (49%), Positives = 65/101 (64%), Gaps = 2/101 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y N+E+PE FD R ++ C IGHV+ Q NCGSCWA TT A +DR+C+AT G + +
Sbjct: 38 YMDNSEVPEFFDSRLEWDYCETIGHVRNQGNCGSCWAHGTTGAFADRLCVATNGEFNELI 97
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
S++ L CC C G C GG P++AW Y +GV TGGDY
Sbjct: 98 SAEELTFCCHTCGFG--CNGGYPLKAWQYFKRHGVVTGGDY 136
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 67 EHFGDYQSNT--ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
E GD N+ ELPEEFD RKQ+PNC IG ++ Q +CGSCWA A+SDR+CI +
Sbjct: 74 EVLGDLYMNSLDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSG 133
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
G+++ S+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 134 GKVNFHFSADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSNQ 183
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 73/121 (60%), Gaps = 14/121 (11%)
Query: 64 LGSEHFGDYQSNTEL------------PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT 111
LGS + +Y + E+ P++FD R+ + +C IGH++ Q NCGSCW+ +T
Sbjct: 59 LGSRGYKNYTNEAEIKKYDPLYVENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFST 118
Query: 112 TAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
T A +DR+C++T G+ + LS + L CC C G+ CEGG P++AW Y GV TGGD
Sbjct: 119 TGAFADRLCVSTGGKFNELLSPEELAFCCKDC--GNGCEGGYPIKAWRYFRTQGVTTGGD 176
Query: 172 Y 172
Y
Sbjct: 177 Y 177
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 67 EHFGDYQSNT--ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
E GD N+ ELPEEFD RKQ+PNC IG ++ Q +CGSCWA A+SDR+CI +
Sbjct: 74 EVLGDLYMNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSG 133
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
G+++ S+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 134 GKVNFHFSADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSNQ 183
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 67 EHFGDYQSNT--ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
E GD N+ ELPEEFD RKQ+PNC IG ++ Q +CGSCWA A+SDR+CI +
Sbjct: 74 EVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSG 133
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
G+++ S+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 134 GKVNFHFSADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSNQ 183
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 73/113 (64%), Gaps = 5/113 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ E+P EFD RKQ+P C IG ++ QSNCGSCWA A+SDR+CIAT GR +SS
Sbjct: 99 DMEIPVEFDSRKQWPYCPTIGEIRDQSNCGSCWAFGAVEAISDRICIATDGRQKPHISST 158
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
LL+CC C G C+GG+P +AW + ++ G+ TGG+Y + C+ + CN
Sbjct: 159 DLLSCCKICGFG--CQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCN 209
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 109 bits (273), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 67 EHFGDYQSNT--ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
E GD N+ ELPEEFD RKQ+PNC IG ++ Q +CGSCWA A+SDR+CI +
Sbjct: 64 EVLGDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSG 123
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
G+++ S+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 124 GKVNFHFSADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSNQ 173
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 52/123 (42%), Positives = 71/123 (57%), Gaps = 14/123 (11%)
Query: 64 LGSEHFGDYQSNTELP------------EEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT 111
LGS + +Y S E+ E+FD R+ + +C IG ++ Q NCGSCWA T
Sbjct: 59 LGSRGYTNYSSEVEIKTYDPLYEENASVEQFDSRENWKSCKQIGRIRDQGNCGSCWAFGT 118
Query: 112 TAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
T A +DR+C++T G+ + LS + + CC C G CEGG P++AW Y GVPTGGD
Sbjct: 119 TGAFADRLCVSTGGKFNELLSPEDVAFCCQNC--GKGCEGGYPIKAWQYFRTQGVPTGGD 176
Query: 172 YGS 174
Y S
Sbjct: 177 YDS 179
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 51/123 (41%), Positives = 72/123 (58%), Gaps = 14/123 (11%)
Query: 64 LGSEHFGDYQSNTEL------------PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT 111
LGS + +Y + E+ P++FD R + +C IGH++ Q NCGSCW+ +T
Sbjct: 59 LGSRGYKNYTNEFEIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFST 118
Query: 112 TAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
T A +DR+C++T G+ + LS + L CC C G C GGNPM+AW Y GV TGGD
Sbjct: 119 TGAFADRLCVSTGGKFNQLLSPEELTFCCKDC--GQGCGGGNPMKAWEYFRTQGVTTGGD 176
Query: 172 YGS 174
Y +
Sbjct: 177 YNT 179
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 108 bits (271), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 71/111 (63%), Gaps = 2/111 (1%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L E+ +Y +++E+P FD R Q+ +C IG V+ Q NCGSCWA TT A +DR+CIAT
Sbjct: 70 LIKENDSEYINDSEIPNFFDARIQWSHCKTIGEVRNQGNCGSCWAHGTTGAFADRLCIAT 129
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
G + +S++ L CC C G C GGNP++AW Y +GV TGG+Y +
Sbjct: 130 NGDFNELISAEELTFCCHRCGFG--CNGGNPLKAWQYFKRHGVVTGGNYNT 178
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 108 bits (271), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 70/112 (62%), Gaps = 4/112 (3%)
Query: 67 EHFGDYQSNT--ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
E GD N+ E+PEEFD RKQ+PNC IG ++ Q +CGSCWA A+SDR+CI +
Sbjct: 74 EVLGDLYMNSVDEIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSG 133
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
G+++ S+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 134 GKVNFHFSADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSNQ 183
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 108 bits (270), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 69/112 (61%), Gaps = 4/112 (3%)
Query: 67 EHFGDYQSNT--ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
E GD NT ++PEEFD RKQ+PNC IG ++ Q CGSCWA A+SDR+CI +
Sbjct: 74 EVLGDLYMNTVDQIPEEFDSRKQWPNCPTIGEIRDQGECGSCWAFGAVEAMSDRVCIHSG 133
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
G+++ S+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 134 GKVNFHFSADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSNQ 183
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 108 bits (270), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 72/119 (60%), Gaps = 5/119 (4%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
P + LG + D ++PEEFD RK +PNC IG ++ Q +CGSCWA A+SD
Sbjct: 69 PEKRIVLGDLYADD---GVDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSD 125
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
R+CI ++G+++ LS+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 126 RVCIHSEGKVNFHLSADDLVSCCHICGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSTQ 182
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 67/100 (67%), Gaps = 2/100 (2%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
SN +PE FD RK +P C ++ +++ QS+CGSCWA+A A+SDR+CI ++G+ LS
Sbjct: 116 HSNIFIPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILS 175
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+D LL+CC C G C GG PM AW Y + +G+ TG DY
Sbjct: 176 ADDLLSCCKTCGFG--CFGGEPMAAWKYWVLSGIVTGSDY 213
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 72/119 (60%), Gaps = 5/119 (4%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
P + LG + D ++PEEFD RK +PNC IG ++ Q +CGSCWA A+SD
Sbjct: 69 PEKRIVLGDLYADD---GIDIPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSD 125
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
R+CI ++G+++ LS+D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 126 RVCIHSEGKVNFHLSADDLVSCCHICGFG--CNGGFPGAAWSYWTRKGIVSGGPYGSTQ 182
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 70/98 (71%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R ++PNCT+I H++ Q+NCGSCWA++T + LSDR+CIA++ + +SS
Sbjct: 93 DIPESFDARTKWPNCTSIKHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHISSIDF 152
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++CC +C G CEGG P+ A+ Y GV TGGDYGS
Sbjct: 153 VSCCDSCGFG--CEGGWPIDAFEYYSYQGVVTGGDYGS 188
>gi|157058759|gb|ABV03137.1| cathepsin B-84 [Rhopalosiphum padi]
Length = 219
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 68/109 (62%), Gaps = 2/109 (1%)
Query: 66 SEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQG 125
E+ Y ++ E+P+ FD R ++ C IG V+ Q NCGSCWA TT A +DR+C+AT G
Sbjct: 34 KEYDSKYTNDVEVPDFFDARIEWKYCKTIGEVRNQGNCGSCWAHGTTGAFADRLCVATNG 93
Query: 126 RLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ +S++ L CC C G C GGNP+RAW Y +GV TGG+Y +
Sbjct: 94 DFNELISAEELTFCCHTCGFG--CNGGNPIRAWLYFKRHGVVTGGNYNT 140
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 67/100 (67%), Gaps = 2/100 (2%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
SN +PE FD RK +P C ++ +++ QS+CGSCWA+A A+SDR+CI ++G+ LS
Sbjct: 72 HSNIFIPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVEAMSDRICITSKGKKQVILS 131
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+D LL+CC C G C GG PM AW Y + +G+ TG DY
Sbjct: 132 ADDLLSCCKTCGFG--CFGGEPMAAWKYWVLSGIVTGSDY 169
>gi|161343827|tpg|DAA06094.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 207
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 91/160 (56%), Gaps = 18/160 (11%)
Query: 31 DPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSE--------HFGDYQSNTE----- 77
+ D + ++ K G++ P++ + + LGS+ ++ Y+S E
Sbjct: 25 EKDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKGVQIPSKVNYKMYKSEDENYDNL 84
Query: 78 ---LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+P +FD RK++ NC IG ++ Q NCGSCWA+AT++A +DR+C+A+ G + LS++
Sbjct: 85 LGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCVASNGNFNQLLSAE 144
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L CC C G C GG P++AW +++G+ TGGDY S
Sbjct: 145 ELTFCCHKCGFG--CNGGYPIKAWERFMKHGLVTGGDYKS 182
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 88/147 (59%), Gaps = 10/147 (6%)
Query: 30 ADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYP 89
A P+ ++ P +S ++ + + P S+ P ++ H D E+P++FD RKQ+P
Sbjct: 40 AGPNFAENVP--MSYIRRLMGVPPNSKYHMPSVK---RHLLD---AMEIPDDFDARKQWP 91
Query: 90 NCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVC 149
NC I ++ Q +CGSCWA A+SDR+CI ++G ++ LS+D L++CC +C G C
Sbjct: 92 NCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGAVNVRLSADDLVSCCYSCGMG--C 149
Query: 150 EGGNPMRAWYYMLENGVPTGGDYGSCQ 176
GG P AW+Y + G+ +GG +GS Q
Sbjct: 150 NGGFPGAAWHYWVNKGIVSGGSFGSNQ 176
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 69/98 (70%), Gaps = 1/98 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R+Q+PNC +I +++ QS+CGSCWA+A +SDR CIA+ G ++ +S++ LL
Sbjct: 75 IPDHFDAREQWPNCVSIDNIRDQSDCGSCWAVAAAETISDRTCIASNGEVNVLISAEDLL 134
Query: 138 TCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+CC GD CEGG P++AW Y + NG+ TGG Y S
Sbjct: 135 SCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTGGSYES 172
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 87/160 (54%), Gaps = 18/160 (11%)
Query: 31 DPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSE--------HFGDYQSNTE----- 77
+ D + S + K G++ P + + LGS+ + Y+++ E
Sbjct: 27 EEDFIDSINEKAKTWKAGINFDPNTPKEYIVKLLGSKGVQVPHKLNLKMYKTDDEAYVNL 86
Query: 78 ---LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+P++FD RK++ C IG V+ Q NCGSCWA+AT++A +DR+CIAT + LS++
Sbjct: 87 FGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAE 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L CC C G C GG P++AW Y +G+ TGGDY S
Sbjct: 147 ELTFCCHLC--GFACHGGYPIKAWSYFRRHGIVTGGDYQS 184
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 66/100 (66%), Gaps = 2/100 (2%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
S +PE FD RK +P C ++ +V+ QS+CGSCWA+A A+SDR+CI ++G+ TLS
Sbjct: 118 HSTIFIPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLS 177
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+D LL+CC C G C GG PM AW Y + G+ TG +Y
Sbjct: 178 ADDLLSCCKTCGFG--CFGGEPMAAWKYWVLRGIVTGSEY 215
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 69/102 (67%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ LP+EFD RK +PNCT+I ++ Q +CGSCWA A+SDR+CI + G+L LS++
Sbjct: 78 DVTLPKEFDARKHWPNCTSIAEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAE 137
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+L++CC +C G C+GG P AW Y G+ +GG+YGS Q
Sbjct: 138 NLVSCCDSCGFG--CDGGYPASAWDYWQNVGIVSGGNYGSKQ 177
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 88/150 (58%), Gaps = 13/150 (8%)
Query: 32 PDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNC 91
PD + + + +KF ++ P+ EPN L + + ++P+ FD R ++PNC
Sbjct: 57 PDAEEFVRNRIMDVKF--AVDPEKTEPNYVL-------ANTEMKVDIPDTFDARDRWPNC 107
Query: 92 TNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEG 151
T++ H++ QS+CGSCWA+A +A+SDR+C T GR++ LS +L+CC G C+G
Sbjct: 108 TSMKHIRDQSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVLSCCFGSCGFG-CKG 166
Query: 152 GNPMRAWYYMLENGVPTGGDYG---SCQRF 178
G P RA+ Y G+ TGG YG +CQ +
Sbjct: 167 GYPARAFGYAWRYGLSTGGPYGEKDACQPY 196
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 71/121 (58%), Gaps = 14/121 (11%)
Query: 64 LGSEHFGDYQSNTEL------------PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT 111
LGS + +Y + E+ P++FD R+ + +C IGH++ Q NCGSCW+ +T
Sbjct: 59 LGSRGYKNYTNEVEIKKYDPLYVENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCWSFST 118
Query: 112 TAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
T A +DR+C++T G+ + LS + L CC C G C GG P++AW Y GV TGGD
Sbjct: 119 TGAFADRLCVSTGGKFNQLLSPEELAFCCMDC--GKGCGGGYPIKAWKYFRTQGVTTGGD 176
Query: 172 Y 172
Y
Sbjct: 177 Y 177
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 71/102 (69%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ ++PE FD R+ + NC +I +++ QS+CGSCWA A+SDR+CIA+ ++ TLS+D
Sbjct: 118 DIDIPETFDARQHWSNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQVTLSAD 177
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
LL+CC C G CEGG+PM AW Y +++G+ TG ++ + Q
Sbjct: 178 DLLSCCRTCGFG--CEGGDPMFAWQYWVDHGIVTGSNFTANQ 217
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 71/109 (65%), Gaps = 3/109 (2%)
Query: 65 GSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
G +H + + ++PE FD R +P C +I ++ QS+CGSCWA A+SDR+CIA+
Sbjct: 90 GKQHLSKTKDLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIAS 149
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G L TLS+D LL+CC +C G C GG+P+ AW Y +++G+ TG +Y
Sbjct: 150 HGELQVTLSADDLLSCCKSCGFG--CNGGDPLAAWRYWVKDGIVTGSNY 196
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 71/109 (65%), Gaps = 3/109 (2%)
Query: 65 GSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
G +H + + ++PE FD R +P C +I ++ QS+CGSCWA A+SDR+CIA+
Sbjct: 91 GKQHLSKTKDLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIAS 150
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G L TLS+D LL+CC +C G C GG+P+ AW Y +++G+ TG +Y
Sbjct: 151 HGELQVTLSADDLLSCCKSCGFG--CNGGDPLAAWRYWVKDGIVTGSNY 197
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 105 bits (262), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 47/109 (43%), Positives = 73/109 (66%), Gaps = 3/109 (2%)
Query: 65 GSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
G +H + + ++PE FD R+ +P C +I +++ QS+CGSCWA A+SDR+CIA+
Sbjct: 91 GKQHLSKTKDLDMDIPENFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIAS 150
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G L +LS+D LL+CC +C G C GG+P+ AW Y +++G+ TG +Y
Sbjct: 151 HGELQVSLSADDLLSCCRSCGFG--CNGGDPLAAWRYWVKDGIVTGSNY 197
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 105 bits (262), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 71/109 (65%), Gaps = 3/109 (2%)
Query: 65 GSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
G +H + + ++PE FD R +P C +I ++ QS+CGSCWA A+SDR+CIA+
Sbjct: 81 GKQHLSKTKDLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIAS 140
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G L TLS+D LL+CC +C G C GG+P+ AW Y +++G+ TG +Y
Sbjct: 141 HGELQVTLSADDLLSCCKSCGFG--CNGGDPLAAWRYWVKDGIVTGSNY 187
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 105 bits (262), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 52/122 (42%), Positives = 76/122 (62%), Gaps = 10/122 (8%)
Query: 49 LSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWA 108
LS +++EP +G E+ ++PE FD R +PNC+++ H++ Q+NCGSCWA
Sbjct: 73 LSFIGENREP----IVGDEN----DEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWA 124
Query: 109 IATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++T AALSDR+CI+T G +S+ +LTCC C G C+GG P+ AW Y+ G T
Sbjct: 125 VSTAAALSDRICISTNGTKQVNISATDILTCCYKCGYG--CQGGWPIEAWEYVAREGAVT 182
Query: 169 GG 170
GG
Sbjct: 183 GG 184
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 70/121 (57%), Gaps = 14/121 (11%)
Query: 64 LGSEHFGDYQSNTEL------------PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT 111
LGS + +Y + E+ P++FD R + +C IGH++ Q NCGSCW+ +T
Sbjct: 59 LGSRGYKNYTNEFEIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFST 118
Query: 112 TAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
T A +DR+C++T G+ + LS + L CC C G C GG P++AW Y GV TGGD
Sbjct: 119 TGAFADRLCVSTGGKFNQLLSPEELAFCCKDC--GQGCGGGYPIKAWKYFRTQGVTTGGD 176
Query: 172 Y 172
Y
Sbjct: 177 Y 177
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/109 (43%), Positives = 73/109 (66%), Gaps = 3/109 (2%)
Query: 65 GSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
G +H + + ++PE FD R+ +P C +I +++ QS+CGSCWA A+SDR+CIA+
Sbjct: 92 GKQHLSKTKDLDLDIPESFDSRENWPKCQSIRNIRDQSSCGSCWAFGAVEAMSDRICIAS 151
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G L +LS+D LL+CC +C G C GG+P+ AW Y +++G+ TG +Y
Sbjct: 152 HGELQVSLSADDLLSCCRSCGFG--CNGGDPLAAWRYWVKDGIVTGSNY 198
>gi|339242631|ref|XP_003377241.1| cathepsin B [Trichinella spiralis]
gi|316973973|gb|EFV57514.1| cathepsin B [Trichinella spiralis]
Length = 199
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 72/114 (63%), Gaps = 5/114 (4%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN LP+E+D+RK YP+C I ++ QSNCGSCWA+++ + +SDR CIAT G LS
Sbjct: 50 SNIILPKEYDVRKAYPHCKYINFIKDQSNCGSCWAVSSASVMSDRHCIATNGTEQPFLSE 109
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG---SCQRFDRGNCN 184
+ L++CC C G C+GG A+ Y +E G+P+GG YG C+ + CN
Sbjct: 110 EELISCCKTCGLG--CDGGYVSHAFEYWVEKGLPSGGAYGWKTGCKPYSIAPCN 161
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 73/121 (60%), Gaps = 2/121 (1%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAAL 115
+ P PD Q D S +LP +FD R ++ +C I ++ Q +CGSCWAIATT+ +
Sbjct: 67 KYPLPDKQEVLGESDDEISLADLPVDFDARLRWTSCPTISEIREQGSCGSCWAIATTSVM 126
Query: 116 SDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSC 175
SDR+CI + G ++ LS +L+CCA C G C+GG P AW Y G+ +GGDYGS
Sbjct: 127 SDRLCIGSNGVMNFRLSGLDMLSCCAIC--GFACQGGYPGAAWAYWARKGLVSGGDYGSQ 184
Query: 176 Q 176
Q
Sbjct: 185 Q 185
>gi|157058775|gb|ABV03145.1| cathepsin B-16D [Myzus persicae]
Length = 236
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 51/120 (42%), Positives = 75/120 (62%), Gaps = 4/120 (3%)
Query: 56 QEPNPD-LQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAA 114
Q PN + + L DY +NT +P FD R+++ +C+ IG V+ Q NCGSCWA+AT++A
Sbjct: 59 QIPNKNNMNLYKSEDADY-NNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSA 117
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+DR+C+AT + LS++ + CC C G C GG P++AW + G+ TGGDY S
Sbjct: 118 FADRLCVATNADFNELLSAEEITFCCHTCGFG--CNGGYPIKAWKRFSKKGLVTGGDYKS 175
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S++++P EFD R+++ NC IG ++ Q +CGSCWA A+SDR+CI +QG+++ LS+
Sbjct: 84 SDSDIPTEFDAREKWSNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSQGKVNFHLSA 143
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
D L++CC C G C GG P AW Y G+ +GG++GS Q
Sbjct: 144 DDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGNFGSQQ 184
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 68/98 (69%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R +PNCT+I H++ Q+NCGSCWA++T +ALSDR+CI + G +SS
Sbjct: 93 DIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDF 152
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++CC +C+ G C+GG P+ A+ + G TGGDYGS
Sbjct: 153 VSCCESCSYG--CDGGWPILAFDFYTYEGAVTGGDYGS 188
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 71/98 (72%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R +PNC+++ H++ Q++CGSCWA++T +ALSDR+CIA++G +S+ +
Sbjct: 90 DIPESFDARTHWPNCSSLTHIRDQADCGSCWAVSTASALSDRICIASKGAKQVYVSATDI 149
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L+CC +C GD C+GG + A+ + E G TGGDYG+
Sbjct: 150 LSCCHSC--GDGCDGGYVIDAFKFFAEQGAVTGGDYGA 185
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 67/100 (67%), Gaps = 2/100 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
++ +P FD R Q+PNC +I ++ QS+CGSCWA A++DR+CIA++G + T+S+D
Sbjct: 90 DSAIPSSFDSRTQWPNCPSIKSIRDQSSCGSCWAFGAAEAMTDRICIASKGAIQFTVSAD 149
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LL+CC C G C+GG P AW Y +E G+ +GG Y S
Sbjct: 150 DLLSCCDECGFG--CDGGFPYAAWNYWVEKGIVSGGSYTS 187
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 75/115 (65%), Gaps = 3/115 (2%)
Query: 63 QLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCI 121
+LG + + ++P FD R+ + C++ I V QS+CGSCWA+A +A+SDR CI
Sbjct: 64 KLGVAKEFTHSEDIQVPNSFDARENWKECSDVISTVVDQSDCGSCWAVAAASAMSDRRCI 123
Query: 122 ATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
A+QG+L +S+++LL+CC +C G CEGG P AW Y ++ G+ TGG YGS Q
Sbjct: 124 ASQGKLKVPVSAENLLSCCDSCGYG--CEGGYPTMAWSYWIDTGITTGGLYGSKQ 176
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 95/180 (52%), Gaps = 27/180 (15%)
Query: 31 DPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSE--------HFGDYQS------NT 76
+ D + + + ++ K G++ P++ + + LGS + Y+S NT
Sbjct: 25 EKDFIDNINAQATTWKAGVNFDPKTSKEHIMKLLGSRGVQIPNKNNMNLYKSEDAEYDNT 84
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R+++ +C+ IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 85 YIPRFFDARRKWRHCSTIGRVRDQGNCGSCWAVATSSAFADRLCVATNADFNELLSAEEI 144
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDRGNCNC 185
CC C G C GG P++AW + G+ TGGDY S C D+GN C
Sbjct: 145 TFCCHTCGFG--CNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNNTC 202
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 81/147 (55%), Gaps = 17/147 (11%)
Query: 43 SSLKFGLSLTPQSQEPNPDLQLGSE---------------HFGDYQSNTELPEEFDLRKQ 87
++ K G++ P + E + LGS+ H Y + +P FD RK+
Sbjct: 37 TTWKAGVNFDPSTPETDFIKMLGSKGVEAAKNASAHMFKTHDVAYNKFSYIPRTFDARKR 96
Query: 88 YPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGD 147
+ +C IG V+ Q +CGSCWA T++A +DR+C+AT G + LS++ L CC AC G
Sbjct: 97 WRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELTFCCHACGHG- 155
Query: 148 VCEGGNPMRAWYYMLENGVPTGGDYGS 174
C GG P++AW Y +G+ TGG+Y S
Sbjct: 156 -CNGGYPIKAWKYFSTHGLVTGGNYKS 181
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 69/102 (67%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +P+EFD RKQ+PNC +I ++ Q +CGSCWA A+SDR+CI + G+L LS++
Sbjct: 78 DVTIPDEFDARKQWPNCPSITDIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAE 137
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+L++CC +C G C+GG P AW Y G+ +GG+YGS Q
Sbjct: 138 NLVSCCDSCGYG--CDGGFPASAWDYWQNEGIVSGGNYGSKQ 177
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 86/160 (53%), Gaps = 18/160 (11%)
Query: 31 DPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSE--------HFGDYQSNTE----- 77
+ D + S + K G++ P + + LGS+ + Y+++ E
Sbjct: 27 EEDFIDSINEKAKTWKAGINFDPNTPKEYIVKLLGSKGVQVPHKLNLKMYKTDDEAYVNL 86
Query: 78 ---LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+P++FD RK++ C IG V+ Q NCGSCWA+AT++A +DR+CIAT + LS++
Sbjct: 87 FGRIPKKFDARKEWRRCITIGQVRDQGNCGSCWALATSSAFADRLCIATNYEFNELLSAE 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L CC C G C GG P++AW Y +G+ TGG Y S
Sbjct: 147 ELTFCCHLC--GFACHGGYPIKAWSYFRRHGIVTGGGYQS 184
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 72/109 (66%), Gaps = 3/109 (2%)
Query: 65 GSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
G +H + + ++PE FD R+ +P C +I ++ QS+CGSCWA A+SDR+CIA+
Sbjct: 106 GKQHLSKTKDLDMDIPESFDSRENWPKCESIKAIRDQSSCGSCWAFGAVEAMSDRICIAS 165
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G L +LS+D LL+CC +C G C GG+P+ AW Y +++G+ TG ++
Sbjct: 166 HGELQVSLSADDLLSCCRSCGFG--CNGGDPLAAWRYWVKDGIVTGSNF 212
>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
Length = 239
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 68/103 (66%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y S+ ++P+ FD RK++ C IG V+ Q CGSCWA++T++A +DR+CIAT G + L
Sbjct: 41 YISSGKIPKTFDARKKWVQCDTIGRVRDQGQCGSCWAVSTSSAFADRLCIATDGDFNELL 100
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S+D + CC C G C+GG P++AW +G+ TGGD+ S
Sbjct: 101 SADEITFCCYTCGFG--CDGGYPIKAWKQFSRHGLVTGGDFDS 141
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/129 (40%), Positives = 74/129 (57%), Gaps = 19/129 (14%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
HFG +P FD RK++ +C IG V+ Q +CGSCWA T++A +DR+C+AT G
Sbjct: 84 HFG------HIPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDF 137
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQ 176
+ LS++ + CC C G C GG P++AW Y ++G+ TGG+Y S C
Sbjct: 138 NELLSAEEITFCCHTCGFG--CHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCP 195
Query: 177 RFDRGNCNC 185
R D+GN C
Sbjct: 196 RDDKGNNTC 204
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 96/173 (55%), Gaps = 18/173 (10%)
Query: 21 TRDSNPGLWADPDILKSSPSFLSSLKFGL-SLTPQSQ------EPNPDLQLGSEHFGDYQ 73
++ S P L P +++ S +S K G SL +S+ P+PD ++ ++H ++
Sbjct: 17 SKPSTPSL--QPQLIQEINSRQTSWKAGTNSLDIKSRLGFLGLHPDPDYKIQTKH---HK 71
Query: 74 SNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
+PE FD R+++P C + IG ++ Q CGSCWA A+T ++DR+CI T+G S
Sbjct: 72 IAKSIPESFDAREKWPECKDVIGKIRDQGTCGSCWAFASTEVMTDRLCIGTKGETKFVFS 131
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGN 182
++LLTCC C C GG +AW Y + G+ +GGDY S CQ + + +
Sbjct: 132 PENLLTCCEDCR--LECVGGYTAKAWDYYINEGIVSGGDYNSSEGCQPYSKAS 182
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 67/98 (68%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R +PNCT+I H++ Q+NCGSCWA++T +ALSDR+CI + G +SS
Sbjct: 93 DIPESFDARTHWPNCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDF 152
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++CC +C G C+GG P+ A+ + G TGGDYGS
Sbjct: 153 VSCCESCGYG--CDGGWPILAFDFYTYEGAVTGGDYGS 188
>gi|161343825|tpg|DAA06093.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 199
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 68/98 (69%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P++FD RK++ +CT IG V+ Q NCGSCWA++T++A +DR+C+AT G + LS++ L
Sbjct: 87 RIPKKFDARKKWRHCTTIGKVRDQGNCGSCWALSTSSAFADRLCVATNGDFNQLLSAEEL 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW ++G+ TGG+Y S
Sbjct: 147 TFCCHKCGYG--CNGGYPIKAWERFKKHGLVTGGEYKS 182
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 65/100 (65%), Gaps = 2/100 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP FD R Q+PNC + V+ Q +CGSCWA A+SDR+CIA+ G+++ +S++ L
Sbjct: 94 DLPTNFDARTQWPNCPTVKEVRDQGDCGSCWAFGAVEAMSDRICIASNGKVNAEISAEDL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
L CC++C G+ C+GG P AW Y G+ TGG Y S Q
Sbjct: 154 LACCSSC--GEGCQGGFPAEAWRYYEREGLVTGGLYNSSQ 191
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 68/96 (70%), Gaps = 1/96 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R Q+PNC +I +++ QS+CGSCWA A A+SDR CIA+ G ++ LSS+ LL
Sbjct: 82 IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141
Query: 138 TCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC G+ CEGG P++AW + +++G+ TGG Y
Sbjct: 142 SCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSY 177
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 98/176 (55%), Gaps = 11/176 (6%)
Query: 2 KLPTPQYVNHSHHLLLRHVTRDSNPGLW-ADPDILKSSPSFLSSLKFGLSLTPQSQEPNP 60
KL + +Y N + H+ +S W A + K+ P + +L + + P S P
Sbjct: 20 KLKSNKYFNPLSDEFINHI--NSMKSTWKAGRNFGKNFP--MGALTQMMGVHPDSNLYMP 75
Query: 61 DLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMC 120
L+ S+ + SN +PE FD R+Q+P+C I ++ Q +CGSCWA A+SDR+C
Sbjct: 76 PLKNVSQMY----SNQAIPEAFDAREQWPDCPTIQEIRDQGSCGSCWAFGAVEAMSDRIC 131
Query: 121 IATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
I ++G ++ LS+++L++CC C G C GG P AW + ++ G+ TGG++ S Q
Sbjct: 132 IHSKGEVNAHLSAENLVSCCYTCGFG--CNGGFPGAAWSHWVKKGIVTGGNFNSSQ 185
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 2/103 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+++++PEEFD RK +PNC IG ++ Q +CGSCWA A+SDR+CI + + S+
Sbjct: 88 ADSDVPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFSA 147
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
D L++CC C G C GG P AW Y G+ +GG YGS Q
Sbjct: 148 DDLVSCCHTCGFG--CNGGFPGAAWAYWTRKGIVSGGPYGSSQ 188
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 68/95 (71%), Gaps = 2/95 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD R+++ C ++ +++ QS+CGSCWA A+SDR+CIA+ G++ +LS+D LL
Sbjct: 121 IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 180
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC +C G C+GG+PM AW Y ++ G+ TG ++
Sbjct: 181 SCCKSCGFG--CDGGDPMAAWKYWVKEGIVTGSNF 213
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 77/141 (54%), Gaps = 2/141 (1%)
Query: 36 KSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIG 95
K+ P+F + D + + + +++ + PE FD R Q+PNC IG
Sbjct: 46 KAGPNFSPETSMSFIRGLMGVHKDADKFMPPVYLHEMEADDDFPENFDSRTQWPNCPTIG 105
Query: 96 HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPM 155
++ Q +CGSCWA A+SDR+CI ++G++ +SS+ L++CC C G C GG P
Sbjct: 106 EIRDQGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSEDLVSCCHTCGFG--CNGGFPG 163
Query: 156 RAWYYMLENGVPTGGDYGSCQ 176
AW Y + G+ +GG +GS Q
Sbjct: 164 AAWSYWVRKGLVSGGPFGSDQ 184
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 68/95 (71%), Gaps = 2/95 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD R+++ C ++ +++ QS+CGSCWA A+SDR+CIA+ G++ +LS+D LL
Sbjct: 80 IPEAFDAREKWDQCASLKNIRDQSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLL 139
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC +C G C+GG+PM AW Y ++ G+ TG ++
Sbjct: 140 SCCKSCGFG--CDGGDPMAAWKYWVKEGIVTGSNF 172
>gi|157058757|gb|ABV03136.1| cathepsin B-84 [Pterocomma populeum]
Length = 218
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +P+ FD R ++ C IG V+ Q NCGSCWA T+ A +DR+CIAT+G + +
Sbjct: 38 YVEDGGIPKAFDARLEWKYCKTIGQVRDQGNCGSCWAHGTSGAFADRLCIATKGDFNELI 97
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ L CC C G C GGNP+RAW Y +GV TGG+Y +
Sbjct: 98 SAEELTFCCHLCGIG--CNGGNPLRAWQYFKRHGVVTGGNYNT 138
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 73/109 (66%), Gaps = 2/109 (1%)
Query: 66 SEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQG 125
SE D + +PE +D+R + C ++ +++ QS+CGSCWA+A +SDR+CIA+ G
Sbjct: 66 SEDILDRKVLETIPESYDVRDHWSKCISVDNIRDQSDCGSCWAVAAAETISDRLCIASNG 125
Query: 126 RLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++ +S++ LL+CC +C GD C+GG P++AW Y ++ G+ +GG Y S
Sbjct: 126 SINTFVSAEDLLSCCTSC--GDGCDGGYPLQAWRYWVKQGLVSGGSYES 172
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 71/115 (61%), Gaps = 2/115 (1%)
Query: 60 PD-LQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
PD ++L ++ F + +PE FD R+Q+PNC +I ++ QS CGSCWA A T SDR
Sbjct: 69 PDWVKLPTKEFDPNANADPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWAFAATETFSDR 128
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
+CIA+ L ++SS+ LL CCA G C+GG P AW YM GV TGG YG
Sbjct: 129 ICIASNQTLQTSISSEDLLECCADYCGMG-CKGGYPSAAWGYMKRQGVSTGGLYG 182
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 71/103 (68%), Gaps = 1/103 (0%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
++ +P+ +D+R +P C ++ +++ QS+CGSCWA+A A+SDR CIA+ G ++ LS
Sbjct: 68 ETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLS 127
Query: 133 SDHLLTCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++ +LTCC GD CEGG P++AW Y ++NG+ TGG + S
Sbjct: 128 AEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFES 170
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 59 NPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
N + H Y +N +P FD R+++ +C IG V+ Q CGSCWA T++A +DR
Sbjct: 68 NASAHMFKTHDVAYNNNGYIPRTFDARRRWRHCKTIGEVRDQGYCGSCWAFGTSSAFADR 127
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+C+AT G + LS++ L CC C G+ C GG P++AW Y +G+ TGG+Y S
Sbjct: 128 LCVATDGDFNELLSAEELTFCCHTC--GNGCNGGYPIKAWKYFSSHGLVTGGNYKS 181
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 67/98 (68%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P++FD RK++ +CT IG V+ Q NCGSCWAIAT++A +DR+C+AT + LS++ +
Sbjct: 87 RIPKKFDARKKWRHCTTIGAVRDQGNCGSCWAIATSSAFADRLCVATNADFNQLLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW ++G+ TGG+Y S
Sbjct: 147 TFCCHKCGYG--CNGGYPIKAWERFKKHGLVTGGEYKS 182
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 75/109 (68%), Gaps = 5/109 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE +D R Q+ NC+++ H+ Q+NCGSCWA+++ AA+SDR+CIA++G +S+ ++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY---GSCQRFDRGNC 183
+CC C GD CEGG P+ A+ + + GV TGGDY GSC+ ++ C
Sbjct: 151 SCCTWC--GDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPC 197
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD RK++ C+ IG V+ Q +CGSCWA T++A +DR+CIAT G + LS++ L
Sbjct: 85 IPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELA 144
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW + ++G+ TGGDY S
Sbjct: 145 FCCHKCGFG--CHGGYPIKAWEWFKKHGLVTGGDYDS 179
>gi|552159|gb|AAA29434.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 240
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 75/109 (68%), Gaps = 5/109 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE +D R Q+ NC+++ H+ Q+NCGSCWA+++ AA+SDR+CIA++G +S+ ++
Sbjct: 95 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 154
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY---GSCQRFDRGNC 183
+CC C GD CEGG P+ A+ + + GV TGGDY GSC+ ++ C
Sbjct: 155 SCCTWC--GDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPC 201
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD RK++ C+ +G V+ Q NCG+CWA T++A +DR+CIAT G + LS++ L
Sbjct: 84 RIPSNFDARKKWRKCSTVGKVRDQGNCGTCWAFGTSSAFADRLCIATNGEFNELLSAEEL 143
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW ++G+ TGGDY S
Sbjct: 144 AFCCHKCGSG--CHGGYPIKAWERFRKHGLVTGGDYNS 179
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 86/150 (57%), Gaps = 8/150 (5%)
Query: 27 GLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK 86
G+ DP + S SF++ L G +++ +PD+ + Y +P FD RK
Sbjct: 39 GVNFDPKL--SIDSFVNLL--GSKGVQAAKKASPDMFKTGDK--AYNLAQRIPSNFDARK 92
Query: 87 QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGG 146
++ C +IG V+ Q +CGSCWA T++A +DR+CIAT+G + LS++ L CC C G
Sbjct: 93 KWKKCLSIGEVRDQGHCGSCWAFGTSSAFADRLCIATEGEFNELLSAEELTFCCHKCGFG 152
Query: 147 DVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
C GG P+RAW ++G+ TGG+Y S +
Sbjct: 153 --CNGGYPIRAWERFRKHGLVTGGNYDSYE 180
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 84/148 (56%), Gaps = 7/148 (4%)
Query: 27 GLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK 86
G+ DP + S SF+ L G +++ +PD+ + + SN +P FD RK
Sbjct: 39 GVNFDPKL--SIDSFVKLL--GSKGVQAAKQASPDMFKTHDEAYNSWSN-RIPSSFDARK 93
Query: 87 QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGG 146
++ C+ IG V+ Q CGSCWA T++A +DR+CIAT G + LS++ L CC C G
Sbjct: 94 KWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELAFCCHKCGFG 153
Query: 147 DVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C GG P+RAW ++G+ TGG+Y S
Sbjct: 154 --CSGGYPIRAWERFKKHGLVTGGNYDS 179
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 64/101 (63%), Gaps = 2/101 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
N ++P+ FD RK++ C IG V+ Q CGSCWA T++A +DR+CIAT G + LS+
Sbjct: 79 ENQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWAFGTSSAFADRLCIATNGEFNELLSA 138
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ L CC C G C GG P++AW ++G+ TGGDY S
Sbjct: 139 EELTFCCHKCGFG--CHGGYPIKAWERFQKHGLVTGGDYDS 177
>gi|552158|gb|AAA29433.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 236
Score = 102 bits (253), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 75/109 (68%), Gaps = 5/109 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE +D R Q+ NC+++ H+ Q+NCGSCWA+++ AA+SDR+CIA++G +S+ ++
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVV 150
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY---GSCQRFDRGNC 183
+CC C GD CEGG P+ A+ + + GV TGGDY GSC+ ++ C
Sbjct: 151 SCCTWC--GDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPC 197
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 102 bits (253), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 72/129 (55%), Gaps = 5/129 (3%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
P+ D E D ELPE FD R+Q+PNC I ++ Q +CGSCWA A+SD
Sbjct: 65 PDADKFREPEILHDLSDGDELPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSD 124
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS--- 174
R+C+A+ G++ S++ L++CC C G C GG P AW Y + G+ +GG +GS
Sbjct: 125 RVCVASGGKIHFRFSAEDLVSCCHTCGFG--CNGGFPGAAWSYWVRKGLVSGGPFGSNLG 182
Query: 175 CQRFDRGNC 183
CQ + C
Sbjct: 183 CQPYAIAPC 191
>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
Length = 274
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 67/101 (66%), Gaps = 2/101 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+N +P FD R+++ +C IG V+ Q +CGSCWA+AT++A +DR+C+AT G + LS+
Sbjct: 80 NNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSA 139
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ + CC C G C GG P++AW Y +G+ TGG+Y S
Sbjct: 140 EEITFCCHTCGFG--CNGGYPIKAWKYFSSHGIVTGGNYKS 178
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 85/148 (57%), Gaps = 7/148 (4%)
Query: 27 GLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK 86
G+ DP + S SF+ L G +++ +PD+ + + SN +P FD RK
Sbjct: 39 GVNFDPKL--SIDSFVKLL--GSKGVQAAKQASPDMFKTHDEAYNNWSN-RIPSNFDARK 93
Query: 87 QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGG 146
++ C+ IG V+ Q +CGSCWA T++A +DR+CIAT G + LS + L CC C G
Sbjct: 94 KWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCCHKCGFG 153
Query: 147 DVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C GGNP++AW ++G+ TGG+Y S
Sbjct: 154 --CSGGNPIKAWERFQKHGLVTGGNYDS 179
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 69/111 (62%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E PE FD R + NCT+I H+ Q NC + WAI+ T+A++DR+CIA+QG + S L
Sbjct: 95 ETPESFDARYHWFNCTSISHIWNQGNCAADWAISVTSAMNDRICIASQGNITALYSPQKL 154
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
++CC C G+ C GG AW Y+L+ G+ TGGDYGS CQ + CN
Sbjct: 155 VSCCEDC--GNGCSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCN 203
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 75/126 (59%), Gaps = 10/126 (7%)
Query: 50 SLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWA 108
++ +++ P P+ L E+F P EFD RK +P C I ++ Q+NCGSCWA
Sbjct: 268 TVKERNEMPMPEDLLNLENFN-------YPVEFDSRKHWPQCEKVISFIKDQANCGSCWA 320
Query: 109 IATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+++ + +SDR CIAT G+ LS LL+CC +C G C GG P R + Y + +G+PT
Sbjct: 321 VSSASVMSDRTCIATDGQFTTLLSDAELLSCCTSCGYG--CNGGYPQRTFKYWVYSGMPT 378
Query: 169 GGDYGS 174
GG YGS
Sbjct: 379 GGPYGS 384
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 67/102 (65%), Gaps = 2/102 (1%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D N LPE FD R+++PNCT+I ++ QSNCGSCWA++ + +SDR+CI + G +
Sbjct: 82 DVDLNINLPETFDAREKWPNCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSW 141
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
S +L+CC C G C+GG P A+++ ++NGV TGG +
Sbjct: 142 ASDTDILSCCWNCGMG--CDGGRPFAAFFFAIDNGVCTGGPF 181
>gi|157058773|gb|ABV03144.1| cathepsin B-16D [Sitobion avenae]
Length = 215
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 65/98 (66%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R+++ +C IG V+ Q NCGSCWA+AT++A +DR+C+AT G + LS++ +
Sbjct: 69 RIPRHFDARRKWRHCQTIGEVRDQGNCGSCWAVATSSAFADRLCVATDGDFNQLLSAEEI 128
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW ++G+ TGGDY S
Sbjct: 129 TFCCHTCGFG--CNGGYPIKAWERFKKHGLVTGGDYKS 164
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 68/98 (69%), Gaps = 1/98 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R Q+P+C +I +++ QS+CGSCWA A A+SDR CIA+ G ++ LSS LL
Sbjct: 83 IPDHFDARDQWPSCVSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSQDLL 142
Query: 138 TCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+CC G+ CEGG P++AW + +++G+ TGG Y S
Sbjct: 143 SCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYES 180
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 41 FLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTE-LPEEFDLRKQYPNCTNIGHVQL 99
F +S L + +P+ D ++G + + +T +PE FD R Q+P+C +I ++
Sbjct: 55 FEASYTDANELRKKLMKPHYDRRIGKPQLQENEEDTAGIPESFDARTQWPHCPSISLIRD 114
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q++CGSCWA A ++SDR+CIAT S + +LTCC C G C+GG P AW
Sbjct: 115 QADCGSCWAFAVGESISDRVCIATDANKTAEFSVEDILTCCDECGFG--CDGGFPDAAWE 172
Query: 160 YMLENGVPTGGDYGS 174
Y + GV TGG YG+
Sbjct: 173 YFVSTGVVTGGLYGT 187
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 67/101 (66%), Gaps = 2/101 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+N +P FD R+++ +C IG V+ Q +CGSCWA+AT++A +DR+C+AT G + LS+
Sbjct: 84 NNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSA 143
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ + CC C G C GG P++AW Y +G+ TGG+Y S
Sbjct: 144 EEITFCCHTCGFG--CNGGYPIKAWKYFSSHGIVTGGNYKS 182
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 66/109 (60%), Gaps = 5/109 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD RK++ C+ IG V+ Q NCGSCWA T++A +DR+CIAT G + LS + L
Sbjct: 85 IPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELA 144
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
CC C G C GG P+RAW ++G+ TGG+Y S CQ + C
Sbjct: 145 FCCHKCGFG--CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPC 191
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 66/106 (62%), Gaps = 2/106 (1%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI ++G++
Sbjct: 76 DLDEGDDLPENFDAREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFR 135
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+S++ L++CC C G C GG P AW Y + G+ +GG YGS Q
Sbjct: 136 VSAEDLVSCCHTCGFG--CNGGFPGAAWSYWVRKGLVSGGPYGSDQ 179
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 73/116 (62%), Gaps = 5/116 (4%)
Query: 59 NPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
N ++ +H +YQ E+P +FD RK++ C IG V+ Q NCGS WA++T++A +DR
Sbjct: 70 NYNMYKNDDHADNYQ---EIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADR 126
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+C+AT G + LS++ + CC C G+ C GG P+RAW +G+ TGG+Y S
Sbjct: 127 LCVATNGDFNQLLSAEEITFCCHKC--GNGCNGGYPIRAWKRFKNHGLVTGGNYKS 180
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 69/98 (70%), Gaps = 1/98 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R+Q+P+C +I +++ QS+CGSCWA A A+SDR CIA+ G ++ LSS+ LL
Sbjct: 82 IPDRFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141
Query: 138 TCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+CC G+ CEGG P++AW + ++G+ TGG Y S
Sbjct: 142 SCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYES 179
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+PE FD R ++ C IG V+ Q NCGSCWA+AT++A +DR+C+AT G + LS++ +
Sbjct: 87 RIPEHFDARNKWVYCDTIGRVRDQGNCGSCWAVATSSAFADRLCVATTGDFNELLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW +G+ TGGDY S
Sbjct: 147 TFCCHTCGFG--CHGGYPIKAWKRFSTHGLVTGGDYNS 182
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 67/106 (63%), Gaps = 2/106 (1%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI ++G++
Sbjct: 76 DLDEGDDLPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKVLFR 135
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+S++ LLTCC C G C+GG P W + +E G+ +GG +GS Q
Sbjct: 136 VSAEDLLTCCTNCGHG--CDGGAPGAGWKHWIEKGLVSGGPFGSDQ 179
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD RK++ C+ IG V+ Q NCGSCWA T++A +DR+CIAT G + LS + L
Sbjct: 84 RIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEEL 143
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P+RAW ++G+ TGG+Y S
Sbjct: 144 AFCCHKCGFG--CSGGYPIRAWERFKKHGLVTGGNYDS 179
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 67/100 (67%), Gaps = 2/100 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+T +P+ FD R +P C ++ ++ QS+CGSCWA+ A++DR+CIA++G T+S+D
Sbjct: 92 DTTIPKSFDSRTNWPECPSLYSIRDQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISAD 151
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LL+CC C G C+GG+P AW Y + NG+ TG +Y S
Sbjct: 152 DLLSCCDECGFG--CDGGDPYAAWSYWVSNGIVTGSNYTS 189
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 83/148 (56%), Gaps = 7/148 (4%)
Query: 27 GLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK 86
G+ DP + S SF+ L G +++ +PD+ + + SN +P FD RK
Sbjct: 42 GVNFDPKL--SIDSFVKLL--GSKGVQAAKQASPDMFKTHDEAYNSWSN-RIPSSFDARK 96
Query: 87 QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGG 146
++ C+ IG V+ Q CGSCWA T++A +DR+CIAT G + LS + L CC C G
Sbjct: 97 KWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCCHKCGFG 156
Query: 147 DVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C GG P+RAW ++G+ TGG+Y S
Sbjct: 157 --CSGGYPIRAWERFKKHGLVTGGNYDS 182
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 73/110 (66%), Gaps = 4/110 (3%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R+Q+P+C +I +++ QS+CGSCWA A A+SDR CIA+ G ++ LSS+ LL
Sbjct: 82 IPDHFDAREQWPSCVSIDNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141
Query: 138 TCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
+CC G+ CEGG P++AW + ++G+ TGG Y S C+ + C
Sbjct: 142 SCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPC 191
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 66/98 (67%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+++P+C +I ++ QS CGSCWA A+SDR+CI ++G++ +S++ L
Sbjct: 84 DLPESFDAREKWPHCNSIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVNISAEDL 143
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L CC +C G C GG P AW Y E+G+ TGG YG+
Sbjct: 144 LDCCDSCGAG--CNGGTPAAAWEYWKESGLVTGGLYGT 179
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 69/110 (62%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR CI + G+++ +S++ L
Sbjct: 82 DLPESFDSREQWPNCPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAEDL 141
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LTCC +C G C GG P AW Y ++ G+ TGG Y S CQ + +C
Sbjct: 142 LTCCDSCGMG--CNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASC 189
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 83/148 (56%), Gaps = 7/148 (4%)
Query: 27 GLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK 86
G+ DP + S SF+ L G +++ +PD+ + + SN +P FD RK
Sbjct: 42 GVNFDPKL--SIDSFVKLL--GSKGVQAAKQASPDMFKTHDEAYNSWSN-RIPSSFDARK 96
Query: 87 QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGG 146
++ C+ IG V+ Q CGSCWA T++A +DR+CIAT G + LS + L CC C G
Sbjct: 97 KWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCCHKCGFG 156
Query: 147 DVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C GG P+RAW ++G+ TGG+Y S
Sbjct: 157 --CSGGYPIRAWERFKKHGLVTGGNYDS 182
>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
Length = 246
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 69/119 (57%), Gaps = 13/119 (10%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD RK++ +C IG V+ Q NCGSCWA T++A +DR+C+AT G + LS + +
Sbjct: 68 IPRTFDARKRWRHCKTIGEVRDQGNCGSCWAFGTSSAFADRLCVATDGDFNELLSPEEIA 127
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDRGNCNC 185
CC C G C GG P++AW Y +G+ TGG+Y S CQ +GN +C
Sbjct: 128 FCCHTCGFG--CHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHHHQGNNSC 184
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/148 (38%), Positives = 83/148 (56%), Gaps = 7/148 (4%)
Query: 27 GLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK 86
G+ DP + S SF+ L G +++ +PD+ + + SN +P FD RK
Sbjct: 39 GVNFDPKL--SIDSFVKLL--GSKGVQAAKQASPDMFKTHDEAYNSWSN-RIPSSFDARK 93
Query: 87 QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGG 146
++ C+ IG V+ Q CGSCWA T++A +DR+CIAT G + LS + L CC C G
Sbjct: 94 KWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCCHKCGFG 153
Query: 147 DVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C GG P+RAW ++G+ TGG+Y S
Sbjct: 154 --CSGGYPIRAWERFKKHGLVTGGNYDS 179
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 65/98 (66%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELPE FD R+++P C++I ++ QSNCGSCWA A+SDR+CIA+ G+ +S + L
Sbjct: 87 ELPESFDAREKWPYCSSIAEIRDQSNCGSCWAFGAAGAISDRICIASGGKHQPRISPEDL 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ CCA C G C+GG P +AW Y + NG+ TG Y +
Sbjct: 147 VDCCADCGMG--CQGGYPAQAWEYWVRNGLVTGDLYNT 182
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 72/112 (64%), Gaps = 5/112 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +P+EFD RK +PNC++I ++ Q +CGSCWA A+SDR+CI + G+L LS++
Sbjct: 78 DVTVPDEFDARKHWPNCSSITEIRDQGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAE 137
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
+LL+CC +C G C GG+ AW Y + G+ +GG+YGS CQ + C
Sbjct: 138 NLLSCCDSCGYG--CLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPC 187
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 65/102 (63%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ E+PEEFD R+Q+P C + ++ Q +CGSCWA A+SDR+CI ++G+ S++
Sbjct: 90 DIEIPEEFDPREQWPECPTLKEIRDQGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAE 149
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
LLTCC++C G C GG P AW Y + G+ +GG Y S Q
Sbjct: 150 DLLTCCSSCGFG--CNGGEPGAAWDYWVSTGIVSGGSYNSHQ 189
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 67/99 (67%), Gaps = 2/99 (2%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+++P+ FD R +P+C +I +++ QS CGSCWA ++ +SDR+CIA+ G LS+D
Sbjct: 64 SKIPDSFDARVTWPHCPSISYIRDQSQCGSCWAFSSAEVMSDRVCIASHGHKKVELSADD 123
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+L+CC GG C+GG P+ AW Y +E GV TGG YG+
Sbjct: 124 ILSCCT--DGGYGCDGGWPVSAWQYFVETGVVTGGLYGT 160
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
N +P+ FD RK++ C+ IG V+ Q CGSCWA T++A +DR+CIAT G + LS+
Sbjct: 79 ENQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATDGDFNELLSA 138
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+ L CC C G C GG P++AW ++G+ TGG+Y S +
Sbjct: 139 EELTFCCHTCGYG--CHGGYPIKAWERFKKHGLVTGGNYDSSE 179
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 73/116 (62%), Gaps = 5/116 (4%)
Query: 59 NPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
N ++ +H +YQ E+P +FD RK++ C IG V+ Q NCGS WA++T++A +DR
Sbjct: 12 NYNMYKNDDHADNYQ---EIPMKFDARKKWIRCKTIGEVRDQGNCGSDWALSTSSAFADR 68
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+C+AT G + LS++ + CC C G+ C GG P+RAW +G+ TGG+Y S
Sbjct: 69 LCVATNGDFNQLLSAEEITFCCHKC--GNGCNGGYPIRAWKRFKNHGLVTGGNYKS 122
>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 260
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 69/118 (58%), Gaps = 13/118 (11%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
P FD RK++ +C IG V+ Q +CGSCWA T++A +DR+C+AT G + LS++ +
Sbjct: 89 PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITF 148
Query: 139 CCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDRGNCNC 185
CC C G C GG+P++AW Y +G+ TGG+Y S C R D+G C
Sbjct: 149 CCHTCGFG--CNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKNTC 204
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 68/103 (66%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+++ + +P+ FD R+Q+P C +I ++ + CGSCWA A +SDR+C+A++GR
Sbjct: 78 HEAISGIPDSFDAREQWPFCESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIF 137
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +++CC AC GG C GG + Y + NG+P+GGDYGS
Sbjct: 138 SAEEVVSCCTACGGG--CRGGFLNEPYKYWVTNGIPSGGDYGS 178
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 65/97 (67%), Gaps = 2/97 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P +FD RK++ NC IG ++ Q NCGSCWA+AT++A +DR+C+ + + LS++ L
Sbjct: 88 IPRKFDARKKWRNCKTIGAIRDQGNCGSCWALATSSAFADRLCVVSNEDFNQLLSAEELT 147
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW + ++G+ TGGDY S
Sbjct: 148 FCCHKCGFG--CNGGYPIKAWEHFKKHGLVTGGDYKS 182
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 64/102 (62%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N +P++FD RKQ+P+C I ++ Q +CGSCWA A+SDR+CI + G ++ S+D
Sbjct: 85 NNMIPKDFDSRKQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSAD 144
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
L++CC C G C GG P AW Y + G+ +GG YGS Q
Sbjct: 145 DLVSCCHTCGFG--CNGGFPGAAWSYWVRKGIVSGGPYGSSQ 184
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 68/98 (69%), Gaps = 1/98 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ +D+R + C ++ +++ QS+CGSCWA+A A+SDR CIA+ G ++ LS++ +L
Sbjct: 81 IPDHYDVRDDFSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDIL 140
Query: 138 TCCAA-CTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
TCC GD CEGG P++AW Y ++NG+ TGG Y S
Sbjct: 141 TCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGSYES 178
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P+ FD RK++ C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 87 RIPKHFDARKKWKRCHTIGKVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC++C G C GG P++AW G+ TGGDY S
Sbjct: 147 TFCCSSCGYG--CNGGYPIKAWESFNNRGLVTGGDYQS 182
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 65/98 (66%), Gaps = 1/98 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N ELPE FD R+++P+C +IG ++ S CGSCWA++ + +SDR+CI T G LSS
Sbjct: 85 NVELPERFDAREKWPHCASIGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSA 144
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L CC G CEGG P++A++Y+ GV +GG+Y
Sbjct: 145 DILACCGEDCGSG-CEGGYPIQAYFYLENTGVCSGGEY 181
>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
Length = 238
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 69/118 (58%), Gaps = 13/118 (11%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
P FD RK++ +C IG V+ Q +CGSCWA T++A +DR+C+AT G + LS++ +
Sbjct: 67 PRTFDARKKWRHCKTIGEVRDQGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITF 126
Query: 139 CCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDRGNCNC 185
CC C G C GG+P++AW Y +G+ TGG+Y S C R D+G C
Sbjct: 127 CCHTCGFG--CNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKNTC 182
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 65/104 (62%), Gaps = 2/104 (1%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D + E+P FD R ++P CT+IG ++ QS+CGSCWA+++ +SDR+C+ + G +
Sbjct: 83 DMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSNGTIKVL 142
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS +L CC C G C GG+ +RAW Y GV TGG YG+
Sbjct: 143 LSDTDILACCPNCGAG--CGGGHTIRAWEYFKNTGVCTGGLYGT 184
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P++FD RK++ C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS + L
Sbjct: 87 RIPKKFDARKKWRKCKTIGAVRDQGNCGSCWALATSSAFADRLCVATDADFNEFLSPEEL 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW +G+ TGGDY S
Sbjct: 147 TFCCHTCGYG--CNGGYPIKAWERFKSHGLVTGGDYKS 182
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 82/131 (62%), Gaps = 6/131 (4%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
+ +K L T PD+++ EH D Q +T +P FD R Q+PNC +I +++ QS
Sbjct: 49 IEQVKKRLMRTEFVAPHTPDVEV-VEH--DIQEDT-IPATFDARTQWPNCVSINNIRDQS 104
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
+CGSCWA A A SDR CIA+ G ++ LS++ +L+CC+ C G C+GG P+ AW Y+
Sbjct: 105 DCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYG--CDGGYPINAWKYL 162
Query: 162 LENGVPTGGDY 172
+++G TGG Y
Sbjct: 163 VKSGFCTGGSY 173
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 99.8 bits (247), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 63/102 (61%), Gaps = 2/102 (1%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q ++P+EFD R+++PNC I ++ Q +CGSCWA A+SDR+CI + G ++ S
Sbjct: 83 QKVDDIPKEFDAREKWPNCPTINEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGNVNFRFS 142
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+D L++CC C G C GG P AW Y G+ +GG YGS
Sbjct: 143 ADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGIVSGGRYGS 182
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 45/109 (41%), Positives = 70/109 (64%), Gaps = 1/109 (0%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
++FG Q+N++L FD R+++P C +I + S C + WA A ++SDR+CI + G
Sbjct: 65 QNFGVSQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGF 124
Query: 127 LDHTLSSDHLLTCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ LS++ LL+CC G+ CEGGNP +AW Y+ ++G+PTGG Y S
Sbjct: 125 KNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYES 173
>gi|38048307|gb|AAR10056.1| similar to Drosophila melanogaster CG10992, partial [Drosophila
yakuba]
Length = 174
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 63/102 (61%), Gaps = 4/102 (3%)
Query: 67 EHFGDYQSNT--ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
E GD N+ E+PEEFD RKQ+PNC IG ++ Q +CGSCWA A+SDR+CI +
Sbjct: 74 EVLGDLYMNSVDEIPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSG 133
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
G+++ S+D L++CC C G C GG P AW Y G+
Sbjct: 134 GKVNFHFSADDLVSCCHTCGFG--CNGGFPGAAWSYWTRKGI 173
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G ++ +
Sbjct: 74 FAEDLDLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LL+CC G+ C GG P AW Y G+ +GG YGS
Sbjct: 134 SAEDLLSCCGPLC-GEGCNGGYPTEAWKYWTRKGLVSGGLYGS 175
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 68/111 (61%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+ FD R Q+ NC I V+ Q +CGSCWA A A+SDR C+A+ G++ LSS++L
Sbjct: 79 DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSSENL 138
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
+ CC C G C GG P AW Y ++G+ TGG YGS CQ ++ C
Sbjct: 139 MACCETCGMG--CHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCE 187
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 45/107 (42%), Positives = 66/107 (61%), Gaps = 2/107 (1%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
H + +N +LP+ FDLR Q+PNC + ++ Q +CGSCWA ++SDR+CI ++G+
Sbjct: 65 HTVKHSTNVKLPDSFDLRDQWPNCKTLSQIRDQGSCGSCWAFGAVESISDRICIHSKGKQ 124
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LL+CC C G C GG P AW Y +G+ TGG Y S
Sbjct: 125 SPEISAEDLLSCCDQCGFG--CSGGFPAEAWDYWRRSGLVTGGLYNS 169
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 66/105 (62%), Gaps = 5/105 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD R + C ++ +++ Q NCGSCWA + ++DR+CIA++G+ S+D LL
Sbjct: 82 IPETFDGRTHWSQCPSLKNIRNQGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADDLL 141
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFD 179
CC AC G C+GG P RA+ Y + G+ +GGDY S CQ ++
Sbjct: 142 ACCTACGKG--CDGGAPYRAFEYWVAKGIVSGGDYNSNEGCQPYE 184
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 89/167 (53%), Gaps = 12/167 (7%)
Query: 8 YVNHSHHLLLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSE 67
Y+NH + + G+ DP + S SF+ L G +++ +PD+ +
Sbjct: 25 YINH-----INANAKTWKAGVNFDPKL--SIDSFVKLL--GSKGVQAAKQASPDMFKTHD 75
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
+ SN +P FD RK++ C IG V+ Q +CGSCWA T++A +DR+CIAT G
Sbjct: 76 EAYNNWSN-RIPSYFDARKKWRKCLTIGEVRDQGHCGSCWAFGTSSAFADRLCIATDGEF 134
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ LS + L CC C G C GG P++AW ++G+ TGG+Y S
Sbjct: 135 NELLSPEELAFCCHKCGFG--CSGGYPIKAWERFKKHGLVTGGNYES 179
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 68/97 (70%), Gaps = 3/97 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R ++P C+++ H++ Q+NCGSCWA++T +ALSDR+CIA+ GR +S+ +
Sbjct: 89 DIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDI 148
Query: 137 LTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L+CC C G C GG P++A+ Y + G TGGDY
Sbjct: 149 LSCCGNQCGYG--CNGGWPIQAFNYFSKQGAVTGGDY 183
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 67/110 (60%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CIA+ G++ S++ L
Sbjct: 82 DLPENFDSREQWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIASGGKIHFRFSAEDL 141
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
++CC C G C GG P AW Y + G+ +GG +GS CQ + C
Sbjct: 142 VSCCHTCGFG--CNGGFPGAAWSYWVHKGLVSGGPFGSNLGCQPYAIAPC 189
>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 255
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 65/98 (66%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P+ FD R+++ +C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 87 RIPKHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC +C G C GG P++AW + G+ TGGDY S
Sbjct: 147 TFCCHSCGFG--CNGGYPIKAWERFKKRGLVTGGDYQS 182
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 44/102 (43%), Positives = 66/102 (64%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LP E D RK++P C IG V+ Q+NCGSCWA+++ + ++DR+CI + LS +
Sbjct: 81 SVDLPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLLSEE 140
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
L++CC C G C+GG P +A+ Y G+PTGG YGS +
Sbjct: 141 ELVSCCKICGYG--CDGGYPDKAFIYWATRGIPTGGPYGSTK 180
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 72/131 (54%), Gaps = 8/131 (6%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALS 116
P+P+ QL E + T +P FD R+ +P C + IG+++ Q CGSCWA A +S
Sbjct: 55 PDPNFQL--EVLEWEEPRTVIPATFDAREYWPQCKDVIGNIRNQGKCGSCWAFAAAEVMS 112
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-- 174
DR+C+AT G + S + L+ CC C G C+GG AW Y G+ +GGDY +
Sbjct: 113 DRLCVATNGSVKFEFSPEDLINCCETC--GKKCKGGYSYYAWKYYTSTGLVSGGDYNTSR 170
Query: 175 -CQRFDRGNCN 184
CQ + + N N
Sbjct: 171 GCQPYSKSNFN 181
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 70/113 (61%), Gaps = 5/113 (4%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
++P+ FD RK++ C ++ ++ Q NCGSCWA++ AA +DR+CIA+ + + +SS
Sbjct: 88 KKIKVPKYFDARKKWKKCKSLREIRDQGNCGSCWAVSVAAAFADRLCIASNAKWNGHISS 147
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
L++CC+ C G CEGG P AW ++ +G+ TGGDY S CQ + C
Sbjct: 148 RELMSCCSYCGFG--CEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPC 198
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 63/100 (63%), Gaps = 2/100 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P +FD R+Q+P+C I ++ Q +CGSCWA A+SDR+CI + G + SSD L
Sbjct: 76 EIPADFDARQQWPHCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSDDL 135
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
++CC C G C GG P AW+Y + G+ +GG YG+ Q
Sbjct: 136 VSCCWTCGMG--CNGGYPGAAWHYWVRKGLVSGGQYGTKQ 173
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 64/96 (66%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C GD C+GG P +AW Y ++ G+ TGG
Sbjct: 147 DLISCCEDC--GDGCKGGFPGQAWDYWVKRGIVTGG 180
>gi|157058731|gb|ABV03123.1| cathepsin B-16D1 [Acyrthosiphon pisum]
Length = 243
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R+++ +C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 85 RIPRHFDARRKWRSCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 144
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC +C G C GG P++AW + G+ TGGDY S
Sbjct: 145 TFCCYSCGFG--CNGGYPIKAWERFKKRGLVTGGDYQS 180
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/133 (40%), Positives = 82/133 (61%), Gaps = 6/133 (4%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
+ +K L T PD+++ D Q +T +P+ FD R Q+P+C +I +++ QS
Sbjct: 49 IEQVKKRLMRTEFVAPHTPDVEVIKH---DIQEDT-IPDTFDARTQWPSCVSINNIRDQS 104
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
+CGSCWA A A SDR CIA+ G ++ LS++ +L+CC+ C G CEGG P+ AW Y+
Sbjct: 105 DCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYG--CEGGYPINAWKYL 162
Query: 162 LENGVPTGGDYGS 174
+++G TGG Y S
Sbjct: 163 VKSGFCTGGSYVS 175
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +LP+EFD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + G+++ +
Sbjct: 73 YAGDVKLPKEFDARQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNGKVNVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LLTCC +C G C GG P AW + G+ +GG Y S
Sbjct: 133 SSEDLLTCCDSCGMG--CNGGYPSAAWDFWASEGLVSGGLYES 173
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N +P+EFD R Q+P+C I ++ Q +CGSCWA A+SDR+CI + G ++ S+D
Sbjct: 85 NDMIPKEFDSRNQWPHCPTIWEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSAD 144
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
L++CC C G C GG P AW Y + G+ +GG YGS Q
Sbjct: 145 DLVSCCHTCGFG--CNGGFPGAAWGYWVRKGIVSGGPYGSSQ 184
>gi|157058771|gb|ABV03143.1| cathepsin B-16D [Aulacorthum solani]
Length = 201
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 65/97 (67%), Gaps = 2/97 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R+++ +C IG V+ Q NCGSCWA+AT++A +DR+C+AT G + LS++ +
Sbjct: 72 IPRHFDARRKWRHCQTIGKVRDQGNCGSCWAMATSSAFADRLCVATNGDFNELLSAEEIT 131
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G C GG P++AW ++G+ TGG+Y S
Sbjct: 132 FCCHTCGFG--CHGGYPIKAWKRFNKHGLVTGGNYNS 166
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +LP EFD R Q+PNC + ++ Q +CGSCWA A+SDR+CI + R+ +
Sbjct: 73 YAGDVKLPTEFDARAQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNARVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LLTCC +C G C GG P AW + + G+ TGG Y S
Sbjct: 133 SSEDLLTCCESCGMG--CNGGYPTAAWDFWTKEGLVTGGLYDS 173
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 98.6 bits (244), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 66/98 (67%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+++ +C +I ++ QS CGSCWA T A+SDR+CI ++G++ +S++ L
Sbjct: 83 DLPESFDAREKWSHCNSIHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAEDL 142
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LTCC +C G C GG P AW + +G+ TGG YG+
Sbjct: 143 LTCCDSCGAG--CNGGYPAAAWEFYKTDGIVTGGLYGT 178
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 65/99 (65%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP++FD RK++ C I V+ Q +CGSCWA A+SDR+CIA++G + +SS+ LL
Sbjct: 122 LPDDFDARKEWTGCPTIKEVRDQGSCGSCWAFGAVEAMSDRICIASKGNVHAHISSEDLL 181
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC++C G C GG P AW Y + G+ +GG YG+ Q
Sbjct: 182 SCCSSCGMG--CNGGFPPAAWEYFRDTGLVSGGQYGTHQ 218
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 72/120 (60%), Gaps = 13/120 (10%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P +FD RK++ C IG V+ Q +CGS WA++T++A SDR+C+AT G + LS++ +
Sbjct: 24 EIPIKFDARKKWLRCKTIGEVRDQGHCGSDWAMSTSSAFSDRLCVATNGDFNQLLSAEEI 83
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS-----------CQRFDRGNCNC 185
CC C GD C GG P+RAW ++G+ TGG+Y S C D+GN C
Sbjct: 84 TFCCHTC--GDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNNTC 141
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/100 (47%), Positives = 63/100 (63%), Gaps = 2/100 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ LP+ FD R Q+PNC I ++ Q +CGSCWA A+SDR CI + G++ +S++
Sbjct: 76 DMSLPDSFDARLQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKVSVEISAE 135
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LL+CC AC G C GG P AW Y E+G+ TGG YGS
Sbjct: 136 DLLSCCDACGMG--CMGGFPSAAWDYWAESGLVTGGLYGS 173
>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 261
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R+++ C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC +C G C GG P++AW + G+ TGGDY S
Sbjct: 147 TFCCHSCGFG--CNGGYPIKAWERFKKRGLVTGGDYQS 182
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 63/99 (63%), Gaps = 2/99 (2%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
T+LPE FD R+ +PNC I V+ Q +CGSCWA A+SDR+CI ++G + S+++
Sbjct: 87 TDLPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAEN 146
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L++CC C G C GG P AW+Y G+ +GG YGS
Sbjct: 147 LVSCCRTCGFG--CNGGFPGAAWHYWKTKGIVSGGPYGS 183
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +LPEEFD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YTGDLKLPEEFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LLTCC +C G C GG P AW + + G+ +GG Y S
Sbjct: 133 SSEDLLTCCMSCGMG--CNGGYPSAAWDFWTKEGLVSGGLYDS 173
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 68/97 (70%), Gaps = 3/97 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R ++P C+++ H++ Q+NCGSCWA++T +ALSDR+CIA+ GR +S+ +
Sbjct: 1 DIPESFDARTKWPKCSSLKHIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDI 60
Query: 137 LTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L+CC C G C GG P++A+ Y + G TGGDY
Sbjct: 61 LSCCGNQCGYG--CNGGWPIQAFNYFSKQGAVTGGDY 95
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+ NC IG ++ Q +CGSCWA A+SDR CI T GR++ +
Sbjct: 74 FGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIQC-GDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+ NC IG ++ Q +CGSCWA A+SDR CI T GR++ +
Sbjct: 74 FGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIQC-GDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+ NC IG ++ Q +CGSCWA A+SDR CI T GR++ +
Sbjct: 74 FGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIQC-GDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R+++ C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC +C G C GG P++AW + G+ TGGDY S
Sbjct: 147 TFCCHSCGFG--CNGGYPIKAWERFKKRGLVTGGDYQS 182
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R+++ C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC +C G C GG P++AW + G+ TGGDY S
Sbjct: 147 TFCCHSCGFG--CNGGYPIKAWERFKKRGLVTGGDYQS 182
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 63/100 (63%), Gaps = 2/100 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP FD R Q+PNC + V+ Q CGSCWA A+SDR+CI +QG+ + +S++ L
Sbjct: 88 DLPATFDSRTQWPNCPTLKEVRDQGACGSCWAFGAVEAMSDRICIKSQGKENTHISAEDL 147
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G+ CEGG P AW Y ++G+ TGG Y S Q
Sbjct: 148 TSCCRTC--GNGCEGGFPSAAWSYYKKDGLVTGGQYNSHQ 185
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 66/111 (59%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP FD R Q+PNC + V+ Q CGSCWA A+SDR+CI +QG+ + +S++ L
Sbjct: 88 DLPASFDSRTQWPNCPTLKEVRDQGACGSCWAFGAVEAMSDRICIKSQGKENVHISAEDL 147
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
+CC C G+ CEGG P AW Y +G+ TGG Y S CQ + C+
Sbjct: 148 TSCCRTC--GNGCEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPYTIKACD 196
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 65/103 (63%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + ELPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T G ++ +
Sbjct: 74 FAEDMELPENFDAREQWSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + ++ G+ +GG Y S
Sbjct: 134 SAEDLLTCCGS-QCGDGCNGGYPSGAWNFWIKKGLVSGGLYNS 175
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 66/96 (68%), Gaps = 1/96 (1%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R ++P C+++ H+ Q+NCGSCWA++T +ALSDR+CIA+ GR +S+ +
Sbjct: 1 DIPESFDARTKWPKCSSLKHIHDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDI 60
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L+CC G C GG P++A+ Y + G TGGDY
Sbjct: 61 LSCCGNQCGYG-CNGGWPIQAFNYFSKQGAVTGGDY 95
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 66/97 (68%), Gaps = 2/97 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S+ +LPE FD R+++P C +I + QS+CGSCWA+A A+SDR+CI + G + LS+
Sbjct: 72 SDNDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSA 131
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC+ C G+ C+GG+P AW Y NG+ TGG
Sbjct: 132 IDLVSCCSYC--GNGCQGGSPPAAWDYWWRNGIVTGG 166
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 63/99 (63%), Gaps = 2/99 (2%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
T+LPE FD R+ +PNC I V+ Q +CGSCWA A+SDR+CI ++G + S+++
Sbjct: 22 TDLPENFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAEN 81
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L++CC C G C GG P AW+Y G+ +GG YGS
Sbjct: 82 LVSCCWTCGFG--CNGGFPGAAWHYWKTKGIVSGGPYGS 118
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP++FD R+Q+P C + ++ Q +CGSCWA A+SDR+CI T+G++ +SS L
Sbjct: 78 KLPKQFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRICIHTKGKVSVEISSQDL 137
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LTCC +C G C GG P AW + E G+ TGG Y S
Sbjct: 138 LTCCDSCGMG--CNGGYPANAWEFWTEQGLVTGGLYNS 173
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C GD C+GG P AW Y ++ G+ TGG
Sbjct: 147 DLISCCEDC--GDGCQGGFPGVAWDYWVKRGIVTGG 180
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+ NC IG ++ Q +CGSCWA A+SDR CI T GR++ +
Sbjct: 74 FGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGI-QCGDGCNGGYPSGAWNFWTKKGLVSGGVYDS 175
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C GD C+GG P AW Y ++ G+ TGG
Sbjct: 147 DLISCCEDC--GDGCQGGFPGVAWDYWVKRGIVTGG 180
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 67/111 (60%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD +++P C ++ ++ QS CGSCWA A +DR+CIA++G++ LS L
Sbjct: 68 DLPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDL 127
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
LTCC +C G C GG P AW + GV TGG+YGS C ++ C+
Sbjct: 128 LTCCESCGFG--CNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCD 176
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C GD C+GG P AW Y ++ G+ TGG
Sbjct: 147 DLISCCEDC--GDGCQGGFPGVAWDYWVKRGIVTGG 180
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/103 (44%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+ NC IG ++ Q +CGSCWA A+SDR CI T GR++ +
Sbjct: 74 FGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGI-QCGDGCNGGYPSGAWNFWTKKGLVSGGYYDS 175
>gi|239793652|dbj|BAH72931.1| ACYPI000018 [Acyrthosiphon pisum]
Length = 239
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R+++ C IG V+ Q NCGSCWA+AT++A +DR+C+AT + LS++ +
Sbjct: 87 RIPRHFDARRKWRRCHTIGAVRDQGNCGSCWAMATSSAFADRLCVATNTDFNELLSAEEI 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC +C G C GG P++AW + G+ TGGDY S
Sbjct: 147 TFCCHSCGFG--CNGGYPIKAWERFKKRGLVTGGDYQS 182
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 80/157 (50%), Gaps = 8/157 (5%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
L+R V +S A S F + F L L S+ P L D N
Sbjct: 7 LIRFVNEESGASWKA-----ARSTRFSNVDHFKLDLGALSETPEERNALRPTIKHDISKN 61
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LPE FD R Q+P C I ++ Q++CGSCWA A +A+SDR+CI + G++ L++
Sbjct: 62 -DLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAAD 120
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L+CC C G C GG P +AW Y + G+ TGG +
Sbjct: 121 PLSCCTYC--GQGCRGGYPPKAWDYWMREGIVTGGTW 155
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 60/98 (61%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPEEFD R +P+C IG ++ Q +CGSCWA A+SDR+CI + ++ S+D L
Sbjct: 85 DLPEEFDARTAWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDL 144
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++CC C G C GG P AW Y G+ +GG YGS
Sbjct: 145 VSCCHTCGFG--CNGGFPGAAWSYWTHKGIVSGGSYGS 180
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 84/167 (50%), Gaps = 9/167 (5%)
Query: 6 PQYVNHSHHLLLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLG 65
PQ+ S L+ R V +S A S F + F L L S+ P L
Sbjct: 21 PQFEAFSDELI-RFVNEESGASWKA-----ARSTRFSNVDHFKLHLGALSETPEERNALR 74
Query: 66 SEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQG 125
D N +LPE FD R Q+P C I ++ Q++CGSCWA A +A+SDR+CI + G
Sbjct: 75 PTIKHDISKN-DLPESFDARSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNG 133
Query: 126 RLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
++ L++ L+CC C G C GG P +AW Y + G+ TGG +
Sbjct: 134 QMRPRLAAADPLSCCTYC--GQGCRGGYPPKAWDYWMREGIVTGGTW 178
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 84/137 (61%), Gaps = 11/137 (8%)
Query: 38 SPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHV 97
+P F + + + ++Q PN ++ D + ++PEE+D RK + NCT+ ++
Sbjct: 55 TPGFKQKI---MDIKFRNQNPNLIVK------DDPEPEDDIPEEYDPRKIWSNCTSF-YI 104
Query: 98 QLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRA 157
+ Q+NCGSCWA++T AA+SDR+CIAT+ R +S+ L+TCC T G C+GG ++A
Sbjct: 105 RDQANCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTP-TCGFGCDGGWSIKA 163
Query: 158 WYYMLENGVPTGGDYGS 174
W Y G+ +GG+Y S
Sbjct: 164 WEYFTYAGLVSGGEYRS 180
>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
Length = 228
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 70/102 (68%), Gaps = 3/102 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D Q +T +P FD R Q+P+C +I +++ QS+CGSCWA A A SDR CIA+ G ++
Sbjct: 75 DIQEDT-IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTL 133
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
LS++ +L+CC+ C G CEGG P+ AW Y++++G TGG Y
Sbjct: 134 LSAEDVLSCCSNCGYG--CEGGYPINAWKYLVKSGFCTGGSY 173
>gi|189308104|gb|ACD86936.1| cysteine protease [Caenorhabditis brenneri]
Length = 210
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 70/102 (68%), Gaps = 3/102 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D Q +T +P FD R Q+P+C +I +++ QS+CGSCWA A A SDR CIA+ G ++
Sbjct: 75 DIQEDT-IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTL 133
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
LS++ +L+CC+ C G CEGG P+ AW Y++++G TGG Y
Sbjct: 134 LSAEDVLSCCSNCGYG--CEGGYPINAWKYLVKSGFCTGGSY 173
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G LS+
Sbjct: 49 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 108
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C GD C+GG P +AW Y ++ G+ TGG
Sbjct: 109 DLISCCKDC--GDGCKGGFPGQAWDYWVKRGIVTGG 142
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 65/98 (66%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+++ +C +I ++ QS CGSCWA A+SDR+CI ++G++ +S++ L
Sbjct: 84 DLPESFDAREKWSHCASIHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVDISAEDL 143
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L CC +C G C GG P AW Y E+G+ TGG YG+
Sbjct: 144 LDCCDSCGAG--CNGGYPAAAWEYWKESGLVTGGLYGT 179
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 67/99 (67%), Gaps = 3/99 (3%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +P+ FD R Q+PNC +I ++ QS+CGSCWA++ +SDR+CIA++G+ ++S+D
Sbjct: 94 DAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASKGQTQVSISAD 153
Query: 135 HLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ CC AC G+ C GG P+ AW + ++NG TGG Y
Sbjct: 154 DINACCGMAC--GNGCNGGYPIEAWRHYVKNGYVTGGSY 190
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 66/103 (64%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +LP+EFD R+Q+PNC + ++ Q +CGSCWA + A+SDR+CI + ++ L
Sbjct: 73 YAGDIKLPKEFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAMSDRVCIHSNAKVSVEL 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S+ LLTCC +C G C GG P AW + + +G+ +GG Y S
Sbjct: 133 SAQDLLTCCNSCGMG--CNGGYPSSAWNFWVSDGLVSGGLYDS 173
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 65/97 (67%), Gaps = 2/97 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S +LPE FD R+++P C +I + QS+CGSCWA+A A+SDR+CI + G + LS+
Sbjct: 59 SENDLPESFDAREKWPLCRSIRQIPDQSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSA 118
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC+ C G+ C+GG+P AW Y NG+ TGG
Sbjct: 119 IDLVSCCSYC--GNGCQGGSPPAAWDYWWRNGIVTGG 153
>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 233
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C GD C+GG P +AW Y ++ G+ TGG
Sbjct: 147 DLISCCKDC--GDGCKGGFPGQAWDYWVKRGIVTGG 180
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 87/166 (52%), Gaps = 22/166 (13%)
Query: 7 QYVNHSHHLLLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGS 66
+YVN L + +D + +K L T PD+++
Sbjct: 30 EYVNSKQSLWKAEIPKDIT----------------IEQVKKRLMRTEFVAPHTPDVEVVK 73
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
D +T +P FD R Q+PNC +I +++ QS+CGSCWA A A SDR CIA+ G
Sbjct: 74 H---DINEDT-IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGA 129
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
++ LS++ +L+CC+ C G CEGG P+ AW Y++++G TGG Y
Sbjct: 130 VNTLLSAEDVLSCCSNCGYG--CEGGYPINAWKYLVKSGFCTGGSY 173
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/106 (46%), Positives = 66/106 (62%), Gaps = 4/106 (3%)
Query: 72 YQSNTE---LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
Y S+T+ LP EFD RK +P+C IG ++ Q CGSCWA T A+SDR+CI ++G+
Sbjct: 75 YLSSTQKAALPSEFDARKAWPDCPTIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGKEV 134
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S+D LL+CC G C GG P AW Y +G+ +GG YGS
Sbjct: 135 VRISADDLLSCCGLFCGFG-CNGGLPENAWRYWAIDGIVSGGLYGS 179
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 70/102 (68%), Gaps = 3/102 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D Q +T +P FD R Q+P+C +I +++ QS+CGSCWA A A SDR CIA+ G ++
Sbjct: 75 DIQEDT-IPATFDARTQWPSCVSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTL 133
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
LS++ +L+CC+ C G CEGG P+ AW Y++++G TGG Y
Sbjct: 134 LSAEDVLSCCSNCGYG--CEGGYPINAWKYLVKSGFCTGGSY 173
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T GR++ +
Sbjct: 74 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCC-GIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>gi|148704124|gb|EDL36071.1| cathepsin B, isoform CRA_b [Mus musculus]
Length = 237
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/100 (46%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LPE FD R+Q+ NC IG ++ Q +CGSCWA A+SDR CI T GR++ +S++
Sbjct: 71 SIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAE 130
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LLTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 131 DLLTCCGI-QCGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 169
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 75/127 (59%), Gaps = 10/127 (7%)
Query: 48 GLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCW 107
GL T Q++ P+L ++ + +LP+ FD R+Q+PNC I ++ Q +CGSCW
Sbjct: 56 GLCGTLQNKPTLPEL--------EHPAGVKLPDTFDARQQWPNCPTIQDIRDQGSCGSCW 107
Query: 108 AIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
A A+SDR+CI + ++ +S++ LL+CC C G C GG P AW Y ++G+
Sbjct: 108 AFGAAEAISDRLCIHSNAKITVEISAEDLLSCCEECGMG--CFGGYPSAAWEYWAKSGLV 165
Query: 168 TGGDYGS 174
TGG YGS
Sbjct: 166 TGGLYGS 172
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 80/152 (52%), Gaps = 14/152 (9%)
Query: 40 SFLSSLKFGLSLTPQSQEPNPDLQLGSEHF---------GDYQSNTELPEEFDLRKQYPN 90
FL +L+ + + P L +HF Q + E PE+FD R +P
Sbjct: 37 EFLRTLQSLFEVKKSEEVPVRMKYLLPKHFMVKPKEEDRTKIQLDKEPPEKFDARDAWPY 96
Query: 91 CTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVC 149
C I GHV+ QS CGSCWA++ + +SDR+C+ + G++ +S +L CC GD C
Sbjct: 97 CREIIGHVRDQSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDILACCGEFC-GDGC 155
Query: 150 EGGNPMRAWYYMLENGVPTGGDY---GSCQRF 178
GG P +AW ++ + GV TGGDY G C+ +
Sbjct: 156 SGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPY 187
>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
Length = 215
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 65/98 (66%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P +FD RK++ C IG V+ Q NC S WA++T++A +DR+C+AT G + LS++ +
Sbjct: 5 EIPRKFDARKKWLRCKTIGEVRDQGNCASGWALSTSSAFADRLCVATNGDFNQLLSAEEI 64
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
CC C G+ C GG P+RAW ++G+ TGG+Y S
Sbjct: 65 TFCCHTC--GNGCYGGYPIRAWKSFKKHGLVTGGNYKS 100
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 64/101 (63%), Gaps = 2/101 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
++T+LPE FD R+++PNC I V+ Q +CGSCWA A+SDR+CI + G + S+
Sbjct: 87 ASTDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSA 146
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++L++CC C G C GG P AW Y G+ +GG YGS
Sbjct: 147 ENLVSCCWTCGFG--CNGGFPGAAWNYWKTKGIVSGGPYGS 185
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 3/100 (3%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN ++PE FD R+ + NC++I +++ QSNCGSCWA++ +SDR+C+ ++GR+ +S
Sbjct: 91 SNDDIPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISD 150
Query: 134 DHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L CC C G C GG +AW Y+ E GV TGG Y
Sbjct: 151 VDILACCGRECGRG--CNGGMDHKAWEYVKEFGVVTGGRY 188
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ N LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G ++ +
Sbjct: 74 FAENMVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGD-QCGDGCNGGFPAEAWNFWTKQGLVSGGLYES 175
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI + GR++ +
Sbjct: 74 FAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCDG-ECGDGCNGGFPSGAWNFWTKKGLVSGGLYNS 175
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ N LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G ++ +
Sbjct: 74 FAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGD-QCGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 3/100 (3%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN ++PE FD R+ + NC++I +++ QSNCGSCWA++ +SDR+C+ ++GR+ +S
Sbjct: 91 SNDDIPESFDSREVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISD 150
Query: 134 DHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L CC C G C GG +AW Y+ E GV TGG Y
Sbjct: 151 VDILACCGRECGRG--CNGGMDHKAWEYVKEFGVVTGGRY 188
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 62/96 (64%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C GD C+GG P AW Y ++ G+ TGG
Sbjct: 147 DLISCCKDC--GDGCQGGFPGVAWDYWVKRGIVTGG 180
>gi|149030259|gb|EDL85315.1| rCG52258, isoform CRA_b [Rattus norvegicus]
Length = 210
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T GR++ +
Sbjct: 74 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGI-QCGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ LP+ FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI ++GR++ +
Sbjct: 74 FAADMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC + GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGS-ECGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 67/103 (65%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ LP+ FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI ++GR++ +
Sbjct: 74 FAADMVLPDSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC + GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGS-ECGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T GR++ +
Sbjct: 74 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGI-QCGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T GR++ +
Sbjct: 74 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGI-QCGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G ++ +
Sbjct: 74 FAEIVDLPESFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHTNGHVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LL+CC GD C GG P AW Y + G+ +GG Y S
Sbjct: 134 SAEDLLSCCGL-ECGDGCNGGYPSAAWKYWTKKGLVSGGLYDS 175
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 69/110 (62%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+ FD R Q+ NC I V+ Q +CGSCWA+A A+SDR+C+A++G +S++ L
Sbjct: 79 DIPDTFDSRTQWANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISAEDL 138
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
+CC +C G+ C GG P AW Y +G+ TGG YGS CQ ++ C
Sbjct: 139 NSCCKSC--GNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPC 186
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T GR++ +
Sbjct: 6 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEV 65
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 66 SAEDLLTCCGI-QCGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 107
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 64/101 (63%), Gaps = 2/101 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
++T+LPE FD R+++PNC I V+ Q +CGSCWA A+SDR+CI + G + S+
Sbjct: 22 ASTDLPETFDARERWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSA 81
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++L++CC C G C GG P AW Y G+ +GG YGS
Sbjct: 82 ENLVSCCWTCGFG--CNGGFPGAAWNYWKTKGIVSGGPYGS 120
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T GR++ +
Sbjct: 1 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEV 60
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 61 SAEDLLTCCGI-QCGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 102
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 69/110 (62%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+EFD RK +P+C +I ++ QS+CGSCWA A+SDR+CI ++G LS+++L
Sbjct: 91 ELPKEFDARKHWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENL 150
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
+ CC++C G C GG P AW Y +G+ TG Y + CQ ++ C
Sbjct: 151 VACCSSCGMG--CNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPYEFPPC 198
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 65/99 (65%), Gaps = 3/99 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P +FD RKQ+P+C +I +++ Q +CGSCWA A+SDR CI + G++ +S++ L
Sbjct: 111 KIPNQFDSRKQWPHCPSISYIRDQGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDL 170
Query: 137 LTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L+CC C GD C GG P AW Y +G+ TGG YGS
Sbjct: 171 LSCCGFEC--GDGCNGGFPGSAWKYWNSDGLVTGGLYGS 207
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 61/96 (63%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N LP+ FD RK +P+C++I ++ QS+CGSCWA A+SDR+CI + G + +LS+
Sbjct: 83 NMRLPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAV 142
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
LL+CC C G C GG P AW Y +G+ TGG
Sbjct: 143 DLLSCCKDCGFG--CRGGYPAVAWDYWKTHGIVTGG 176
>gi|350540002|ref|NP_001232104.1| putative cathepsin B variant 2 precursor [Taeniopygia guttata]
gi|197129221|gb|ACH45719.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 261
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D+ ++ ELP+ FD R Q+PNC I ++ Q +CGSCWA A+SDR+C+ T ++
Sbjct: 73 DFAADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVE 132
Query: 131 LSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LL+CC C G C GG P AW Y E G+ +GG Y S
Sbjct: 133 VSAEDLLSCCGFECGMG--CNGGYPSGAWRYWTERGLVSGGLYDS 175
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 64/98 (65%), Gaps = 1/98 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+T +P+ FD R Q+PNC +I ++ QS+CGSCWA++ +SDR+CIA+ G+ ++S+D
Sbjct: 94 DTAVPDSFDSRTQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQISISAD 153
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ CC G+ C GG P+ AW + ++ G TGG Y
Sbjct: 154 DINACCGMVC-GNGCNGGYPIEAWRHYVKKGYVTGGSY 190
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 62/98 (63%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+ +PNC I V+ Q +CGSCWA A+SDR+CI ++G + S+++L
Sbjct: 27 DLPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENL 86
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++CC C G C GG P AW+Y G+ +GG YGS
Sbjct: 87 VSCCWTCGFG--CNGGFPGAAWHYWKTKGIVSGGPYGS 122
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 65/104 (62%), Gaps = 3/104 (2%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + ELP+ FD RKQ+P+C I ++ Q +CGSCWA A+SDR+C+ T G+++ +
Sbjct: 64 FADDVELPDSFDSRKQWPSCPTINEIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEI 123
Query: 132 SSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LL+CC C G C GG P AW Y E G+ +GG Y S
Sbjct: 124 SAEDLLSCCGFECGMG--CNGGYPSGAWKYWTEKGLVSGGLYDS 165
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
T+LPE FD R+ +PNC I V+ Q +CGSCWA A+SDR+CI ++G + S+++
Sbjct: 26 TDLPETFDAREHWPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAEN 85
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L++CC C G C GG P AW Y G+ +GG YGS
Sbjct: 86 LVSCCWTCGFG--CNGGFPGAAWNYWKTKGIVSGGPYGS 122
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 62/110 (56%), Gaps = 2/110 (1%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
E G + +LPEEFD K +PNC I ++ Q +CGSCWA A+SDR+CI +
Sbjct: 77 ELLGADGEDKDLPEEFDSSKNWPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSNAT 136
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
++ S+D L+TCC C G C GG P AW Y G+ +GG Y S +
Sbjct: 137 VNFHFSADDLVTCCHTCGFG--CNGGFPGAAWSYWTTRGIVSGGSYNSTE 184
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R+ + C +I ++ QS+CG+CWA A+SDR+CI T+G + +S+ LL
Sbjct: 83 LPESFDARQHWRKCNSIHVIRDQSSCGACWAFGAVEAISDRICIHTKGSVQVNISAQDLL 142
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
TCC C G C+GG P AW + E G+ TGG YG+
Sbjct: 143 TCCDYCRTG--CKGGVPSYAWMFYKEKGIVTGGLYGT 177
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 64/100 (64%), Gaps = 3/100 (3%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN ++PE FD R + NC++I +++ QSNCGSCWA++ +SDR+C+ ++GR+ +S
Sbjct: 91 SNDDIPESFDSRVVWKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISD 150
Query: 134 DHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L CC C G C GG +AW Y+ E GV TGG Y
Sbjct: 151 VDILACCGRECGRG--CNGGMDHKAWEYVKEFGVVTGGRY 188
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +LP+ FD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + G++ +
Sbjct: 73 YSGDMKLPKNFDSREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRLCIHSNGKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LLTCC +C G C GG P AW + + G+ +GG Y S
Sbjct: 133 SSEDLLTCCDSCGMG--CNGGYPSAAWDFWTDVGLVSGGLYDS 173
>gi|219565128|dbj|BAH04068.1| cathepsin B [Equus caballus]
Length = 162
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G + +
Sbjct: 50 FAEDVVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEV 109
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 110 SAEDMLTCCGD-QCGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 151
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 65/105 (61%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D+ ++ +LP+ FD RKQ+PNC I ++ Q +CGSCWA A+SDR+C+ T ++
Sbjct: 73 DFAADIDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVE 132
Query: 131 LSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LL+CC C G C GG P AW Y E G+ +GG Y S
Sbjct: 133 VSAEDLLSCCGFECGMG--CNGGYPSGAWRYWTERGLVSGGLYDS 175
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 43/98 (43%), Positives = 59/98 (60%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R +PNC IG ++ Q +CGSCWA A+SDR+CI + G ++ S++ L
Sbjct: 89 DLPESFDARTAWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSAEDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
++CC C G C GG P AW Y G+ +GG Y S
Sbjct: 149 VSCCHTCGFG--CNGGFPGAAWSYWTHKGIVSGGSYNS 184
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 61/96 (63%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N LP+ FD RK +P+C++I ++ QS+CGSCWA A+SDR+CI + G + +LS+
Sbjct: 83 NMRLPKNFDARKTWPHCSSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAV 142
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
LL+CC C G C GG P AW Y +G+ TGG
Sbjct: 143 DLLSCCKDCGFG--CRGGYPAVAWDYWKTHGIVTGG 176
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 70/126 (55%), Gaps = 7/126 (5%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCG 104
L FG P+ Q + + E F S+ +P+ FD RKQ+P+C IG ++ QS+CG
Sbjct: 52 LMFGALREPEEQR-SKRPTVSHESF----SDEHIPKAFDARKQWPHCPTIGEIRDQSSCG 106
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA A+SDR+CI T G +S+ L++CC C G C+GG P AW +
Sbjct: 107 SCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGFG--CQGGFPPTAWDFWQTE 164
Query: 165 GVPTGG 170
G+ TGG
Sbjct: 165 GIVTGG 170
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 64/99 (64%), Gaps = 1/99 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+N ++PE FD R + NC++I +V+ QS CGSCWA++ + +SDR+C+ T+G+L LS
Sbjct: 90 TNEDIPESFDSRIVWKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSD 149
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L+CC GD CEGG AW ++ GV TGG Y
Sbjct: 150 TDILSCCGRMC-GDGCEGGYDHLAWEWVQRFGVVTGGPY 187
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 65/99 (65%), Gaps = 1/99 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN ++PE FD R+ + +C++I +++ QSNCGSCWA++ +SDR+C+ ++GR+ +S
Sbjct: 91 SNDDIPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISD 150
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L CC + G C GG +AW Y+ E GV TGG Y
Sbjct: 151 VDILACCGS-ECGRGCNGGMDHKAWEYVKEFGVVTGGRY 188
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 68/110 (61%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+EFD RK +P+C +I ++ QS+CGSCWA A+SDR+CI ++G LS+++L
Sbjct: 91 ELPKEFDARKYWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENL 150
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG---SCQRFDRGNC 183
+ CC++C G C GG P AW Y +G+ TG Y CQ ++ C
Sbjct: 151 VACCSSCGMG--CNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPYEFPPC 198
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 62/96 (64%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 24 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 83
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG P AW Y ++ G+ TGG
Sbjct: 84 DLISCCEDC--GQGCQGGFPGVAWDYWVKRGIVTGG 117
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N+ +P+ FD R +P C ++ V+ QS CGS WA+A A+ DR+CIA++G+ LS+D
Sbjct: 91 NSTIPKSFDARTNWPKCASLRTVRDQSACGSGWAVAAVGAIMDRICIASEGKQQVILSAD 150
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L+CC C G CEGG+ +AW Y +G+ TG +Y
Sbjct: 151 DILSCCTECGYG--CEGGDTYKAWNYWTTDGIVTGSNY 186
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D+ + +LP+ FD RKQ+PNC I ++ Q +CGSCWA A+SDR+C+ T ++
Sbjct: 73 DFAEDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVE 132
Query: 131 LSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LL+CC C G C GG P AW Y E G+ +GG Y S
Sbjct: 133 VSAEDLLSCCGFECGMG--CNGGYPSGAWRYWTERGLVSGGLYDS 175
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G + +
Sbjct: 74 FAEDVVLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGD-QCGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175
>gi|197129222|gb|ACH45720.1| putative cathepsin B variant 2 [Taeniopygia guttata]
Length = 236
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D+ ++ ELP+ FD R Q+PNC I ++ Q +CGSCWA A+SDR+C+ T ++
Sbjct: 73 DFAADVELPDNFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVE 132
Query: 131 LSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LL+CC C G C GG P AW Y E G+ +GG Y S
Sbjct: 133 VSAEDLLSCCGFECGMG--CNGGYPSGAWRYWTERGLVSGGLYDS 175
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y +LP EFD R+Q+P C + ++ Q +CGSCWA A+SDR+CI + G++ +
Sbjct: 73 YAGGLKLPAEFDAREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSGGKISVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LLTCC +C G C GG P AW + + G+ +GG Y S
Sbjct: 133 SSEDLLTCCDSCGMG--CNGGYPSSAWDFWTKEGLVSGGLYNS 173
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/133 (37%), Positives = 79/133 (59%), Gaps = 8/133 (6%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
+S ++ L + P+S+E +L F + +LPE FD R ++ +C +I ++ QS
Sbjct: 55 MSYIRGLLGVHPKSEE----YRLAE--FVHEEIPDDLPESFDARAKWSHCDSIHLIRDQS 108
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
CGSCWA T A+SDR+CI ++G++ +S++ LL CC C G C+GG P AW +
Sbjct: 109 TCGSCWAFGATEAMSDRICIHSKGKMQVNISAEDLLDCCDTCGHG--CKGGFPAAAWEHW 166
Query: 162 LENGVPTGGDYGS 174
E G+ +GG YG+
Sbjct: 167 KERGIVSGGLYGT 179
>gi|161343831|tpg|DAA06096.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 194
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 82/137 (59%), Gaps = 4/137 (2%)
Query: 38 SPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHV 97
+PS L+++ + + P+++ D L + + LPE +D+ + + C ++ +
Sbjct: 28 NPSLLTNVSRLMGVLPRNKLSEKDTLL---TYDSPAGSEPLPESYDVTQTWSECKSVVSI 84
Query: 98 QLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRA 157
+ QSNCGSCWA++T +A S R+CIA+ + LS +++ +CC GD C GG+P +A
Sbjct: 85 RDQSNCGSCWALSTASAFSGRLCIASNMDFNIVLSGEYINSCCNG-KCGDGCNGGHPEKA 143
Query: 158 WYYMLENGVPTGGDYGS 174
W Y+ +NG+ TGG+Y S
Sbjct: 144 WKYIKKNGLCTGGEYNS 160
>gi|149436731|ref|XP_001513125.1| PREDICTED: cathepsin B-like [Ornithorhynchus anatinus]
Length = 211
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 46/107 (42%), Positives = 64/107 (59%), Gaps = 1/107 (0%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
G S+ +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+C+ T G++
Sbjct: 71 RVGLANSDMKLPENFDARQQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCVHTNGQV 130
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LLTCC G C GG P AW Y + G+ +GG Y S
Sbjct: 131 SVEVSAEDLLTCCGLECGMG-CNGGYPTGAWTYWTKKGLVSGGLYDS 176
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 70/126 (55%), Gaps = 7/126 (5%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCG 104
L FG P+ Q + + E F S+ +P+ FD RKQ+P+C IG ++ QS+CG
Sbjct: 52 LMFGALREPEEQR-SKRPTVSHESF----SDEHIPKAFDARKQWPHCPTIGEIRDQSSCG 106
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA A+SDR+CI T G +S+ L++CC C G C+GG P AW +
Sbjct: 107 SCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGYCGFG--CQGGFPPIAWDFWQTE 164
Query: 165 GVPTGG 170
G+ TGG
Sbjct: 165 GIVTGG 170
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG P AW Y + +G+ TGG
Sbjct: 147 DLISCCENCGSG--CDGGFPGPAWDYWVSHGIVTGG 180
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG P AW Y + +G+ TGG
Sbjct: 147 DLISCCENCGSG--CDGGFPGPAWDYWVSHGIVTGG 180
>gi|221219800|gb|ACM08561.1| Cathepsin B precursor [Salmo salar]
gi|221222296|gb|ACM09809.1| Cathepsin B precursor [Salmo salar]
Length = 205
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + ELP+ FD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LL+CC +C G C GG P AW + G+ TGG Y S
Sbjct: 133 SSEDLLSCCDSCGMG--CNGGYPSAAWDFWTTEGLVTGGLYDS 173
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 68/113 (60%), Gaps = 5/113 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+T LP+ FD R ++P+C+++ ++ QS+CGSCWA A+SDR+CI + G + +LS+
Sbjct: 83 DTRLPKNFDARSKWPHCSSVSEIRDQSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAV 142
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG---DYGSCQRFDRGNCN 184
LL+CC C G C GG P AW Y +G+ TGG D C+ + C+
Sbjct: 143 DLLSCCKDCGFG--CRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCD 193
>gi|221221056|gb|ACM09189.1| Cathepsin B precursor [Salmo salar]
gi|221222300|gb|ACM09811.1| Cathepsin B precursor [Salmo salar]
Length = 207
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + ELP+ FD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LL+CC +C G C GG P AW + G+ TGG Y S
Sbjct: 133 SSEDLLSCCDSCGMG--CNGGYPSAAWDFWTTEGLVTGGLYDS 173
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 64/96 (66%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N +LP+ FD RK + NC++I ++ QS+CGSCWA ++SDR+CI ++GR+ LS+
Sbjct: 89 NIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAV 148
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+LL+CC+ C G C GG P AW Y + G+ TGG
Sbjct: 149 NLLSCCSRCGFG--CNGGIPGMAWDYWKDEGIVTGG 182
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R+Q+ NC I ++ Q +CGSCWA A+SDR+CI T GR++ +S++ LL
Sbjct: 1 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 60
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
TCC GD C GG P AW + G+ +GG Y S
Sbjct: 61 TCCGI-QCGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 96
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + ELP+ FD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YAGDVELPDTFDPRQQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LL+CC +C G C GG P AW + G+ TGG Y S
Sbjct: 133 SSEDLLSCCDSCGMG--CNGGYPSAAWDFWTTEGLVTGGLYDS 173
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ N LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G + +
Sbjct: 74 FAENMILPESFDSREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGD-QCGDGCNGGFPAEAWNFWTXXGLVSGGLYDS 175
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 64/96 (66%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N +LP+ FD RK + NC++I ++ QS+CGSCWA ++SDR+CI ++GR+ LS+
Sbjct: 89 NIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAV 148
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+LL+CC+ C G C GG P AW Y + G+ TGG
Sbjct: 149 NLLSCCSRCGFG--CNGGIPGMAWDYWKDEGIVTGG 182
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ T+ P +FD R+ + NC + ++ Q CGSCWA+A +A++DRMCI ++G+
Sbjct: 78 FSRKTKYPNQFDAREHWKNCPTLKDIRDQGGCGSCWAVAAVSAMTDRMCILSKGKEHFYF 137
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
S +L+CC C G+ CEGG RAW Y + G+ +GG Y S CQ + CN
Sbjct: 138 SIKDVLSCCGYC--GNGCEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPYTIPPCN 191
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 69/110 (62%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+ FD R ++P+C +I ++ QS+CGSCWA A+SDR+CI ++G+ LS+++L
Sbjct: 93 ELPKSFDARVEWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAENL 152
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
++CC++C G C GG P AW Y G+ TG Y + CQ ++ C
Sbjct: 153 VSCCSSCGMG--CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPC 200
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 63/98 (64%), Gaps = 1/98 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +P+ FD R Q+PNC +I ++ QS+CGSCWA++ +SDR+CIA+ G+ ++S+D
Sbjct: 94 DAAIPDSFDSRAQWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNGKTQLSISAD 153
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ CC G+ C GG P+ AW + ++ G TGG Y
Sbjct: 154 DINACCGMVC-GNGCNGGYPIEAWRHYVKKGYVTGGSY 190
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S +LPE FD R+++PNC++I + QS+C SCWA+ T +A++DR+CI + G LS+
Sbjct: 82 SENDLPESFDAREKWPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKKPRLSA 141
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G CEGG P AW Y +G+ +GG
Sbjct: 142 VDLVSCCPYCGYG--CEGGYPSMAWDYWWRHGIVSGG 176
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 69/110 (62%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+ FD RK++ +C +I ++ QS+CGSCWA A+SDR+CI ++G+ LS+++L
Sbjct: 94 ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
++CC++C G C GG P AW Y G+ TG Y + CQ ++ C
Sbjct: 154 VSCCSSCGMG--CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPC 201
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 9/113 (7%)
Query: 69 FGDYQ-------SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCI 121
FGD Q + +LPE FD +++P C ++ ++ QS CGSCWA A +DR+CI
Sbjct: 53 FGDRQLPSKTIVARGDLPESFDPVEKWPECPSLKEIRDQSVCGSCWAFGAAEAATDRLCI 112
Query: 122 ATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
A++G++ LS LLTCC +C G C+GG AW + GV TGG+YGS
Sbjct: 113 ASKGKIQDRLSEQDLLTCCDSCGFG--CDGGWLDMAWRWFQSTGVTTGGEYGS 163
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 65/103 (63%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ LP+ FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI + GR++ +
Sbjct: 74 FAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGD-ECGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 69/110 (62%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+ FD RK++ +C +I ++ QS+CGSCWA A+SDR+CI ++G+ LS+++L
Sbjct: 94 ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
++CC++C G C GG P AW Y G+ TG Y + CQ ++ C
Sbjct: 154 VSCCSSCGMG--CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPC 201
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 84/153 (54%), Gaps = 17/153 (11%)
Query: 41 FLSSLKFGLSLTPQSQEPNP-------DLQLGSEHFG-----DYQSNTELPEEFDLRKQY 88
+S L+ SL + +P P D++ + D ++P +D R +
Sbjct: 38 LVSYLRRSQSLFEVNSDPTPNFEQKIMDIKYNHQRLNLMVKEDPDPEVDIPPSYDPRDVW 97
Query: 89 PNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDV 148
NCT +++ Q+NCGSCWA++T AA+SDR+CIA++ +S+ ++TCC GD
Sbjct: 98 KNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRP-QCGDG 155
Query: 149 CEGGNPMRAWYYMLENGVPTGGDY---GSCQRF 178
CEGG P+ AW Y + +GV +GG+Y G C+ +
Sbjct: 156 CEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPY 188
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 68/106 (64%), Gaps = 5/106 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+ FD RK++ +C +I ++ QS+CGSCWA A+SDR+CI ++G+ LS+++L
Sbjct: 94 ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFD 179
++CC++C G C GG P AW Y G+ TG Y + CQ ++
Sbjct: 154 VSCCSSCGMG--CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYE 197
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 63/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D+ + ELP+ FD R Q+PNC I ++ Q +CGSCWA A+SDR+C+ T ++
Sbjct: 73 DFAGDMELPDSFDSRTQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVE 132
Query: 131 LSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LL+CC C G C GG P AW Y E G+ +GG Y S
Sbjct: 133 VSAEDLLSCCGFECGMG--CNGGYPSGAWRYWTEKGLVSGGLYDS 175
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 68/106 (64%), Gaps = 5/106 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+ FD RK++ +C +I ++ QS+CGSCWA A+SDR+CI ++G+ LS+++L
Sbjct: 94 ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFD 179
++CC++C G C GG P AW Y G+ TG Y + CQ ++
Sbjct: 154 VSCCSSCGMG--CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYE 197
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 62/103 (60%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y +LP+ FD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS LLTCC +C G C GG P AW + +G+ TGG Y S
Sbjct: 133 SSQDLLTCCDSCGMG--CNGGYPSAAWDFWTTDGLVTGGLYNS 173
>gi|60600065|gb|AAX26576.1| unknown [Schistosoma japonicum]
Length = 190
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 69/110 (62%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+ FD RK++ +C +I ++ QS+CGSCWA A+SDR+CI ++G+ LS+++L
Sbjct: 63 ELPKSFDARKEWTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENL 122
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
++CC++C G C GG P AW Y G+ TG Y + CQ ++ C
Sbjct: 123 VSCCSSCGMG--CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPC 170
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 63/93 (67%), Gaps = 2/93 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R + NCT++ H++ Q+NCGSCWA++T +ALSDR+CIA++G +SS +
Sbjct: 93 DIPESFDARTHWANCTSLRHIRDQANCGSCWAVSTASALSDRICIASKGETQLHISSIDI 152
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
++CC C G C+GG P+ A+ Y G TG
Sbjct: 153 VSCCKLCGYG--CDGGWPIEAFDYFSRQGAVTG 183
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 64/100 (64%), Gaps = 3/100 (3%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN ++PE FD R+ + NC++I +++ QSN GSCWA++ +SDR+C+ ++GR+ +S
Sbjct: 91 SNEDIPESFDSREVWKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISD 150
Query: 134 DHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L CC C G C GG +AW Y+ E GV TGG Y
Sbjct: 151 VDILACCGRECGRG--CNGGMDHKAWEYVKEFGVVTGGRY 188
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%), Gaps = 10/127 (7%)
Query: 52 TPQSQEPNPDLQLGSEHFG-----DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSC 106
TP ++ D++ ++ D + N ++PEE+D R+++ C+ +++ Q+NCGSC
Sbjct: 55 TPNFEQKIMDIKFKNQKLNFVVKNDPEPNEDIPEEYDPREKF-KCSTF-YIRDQANCGSC 112
Query: 107 WAIATTAALSDRMCIATQGRLDHTLSSDHLLTCC-AACTGGDVCEGGNPMRAWYYMLENG 165
WA++T AA+SDR+CIAT G +SS +LTCC C G C GG +RAW Y + G
Sbjct: 113 WAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQCGFG--CGGGWSIRAWEYFVYEG 170
Query: 166 VPTGGDY 172
V +GG+Y
Sbjct: 171 VVSGGEY 177
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 63/100 (63%), Gaps = 1/100 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+C+ + G + +S++
Sbjct: 78 NMKLPENFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHSNGNANVEVSAE 137
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LL+CC + GD C GG P AW + + G+ +GG Y S
Sbjct: 138 DLLSCCGS-ECGDGCNGGFPAGAWNFWTKKGLVSGGLYDS 176
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 64/99 (64%), Gaps = 1/99 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ E+PE FD R+++ C +I ++ QS+CGSCWA++ +SDR CI + G+++ LS+
Sbjct: 92 DVEIPENFDAREKWSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSAT 151
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
+L+CC T G C GG P+ AW Y + +GV TGG Y
Sbjct: 152 DILSCCGT-TCGRGCRGGYPIEAWRYFMLHGVCTGGHYA 189
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 61/93 (65%), Gaps = 2/93 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++ C +I +++ QS CGSCWA A A+SDR+CI ++G+ LS+ L
Sbjct: 89 EIPSSFDSRKKWRQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
L+CC C G C+GG P AW Y +E+G+ TG
Sbjct: 149 LSCCTECGLG--CQGGFPGAAWDYWVEDGIVTG 179
>gi|325303156|tpg|DAA34330.1| TPA_inf: cysteine proteinase cathepsin L [Amblyomma variegatum]
Length = 207
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 62/100 (62%), Gaps = 5/100 (5%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCI---ATQGRLDHTLS 132
T LPE FD R+Q+P+C IG ++ Q +CGSCWA A+SDR CI A + R++ L+
Sbjct: 103 TALPENFDAREQWPDCPTIGEIRDQGSCGSCWAFGAVEAMSDRTCIHSPARKPRVNVHLA 162
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+D +L+CC C G C GG P AW Y + +G+ GG Y
Sbjct: 163 ADDVLSCCKDCGAG--CNGGFPGAAWSYWVHHGIVDGGHY 200
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 66/104 (63%), Gaps = 3/104 (2%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ LP+ FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI + GR++ +
Sbjct: 74 FAADMILPKGFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEV 133
Query: 132 SSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC C GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGDEC--GDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 65/97 (67%), Gaps = 2/97 (2%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
++P +D R + NCT +++ Q+NCGSCWA++T AA+SDR+CIA++ +S+
Sbjct: 85 VDIPPSYDPRDVWKNCTTF-YIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATD 143
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
++TCC GD CEGG P+ AW Y + +GV +GG+Y
Sbjct: 144 IMTCCRP-QCGDGCEGGWPIEAWKYFIYDGVVSGGEY 179
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 32 PDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNC 91
P+++K FL L PQ E + + Q + SN ++PE FD R+++ +C
Sbjct: 59 PEVVKKRRQFL--------LKPQFIERSYN-QENVLPIANITSNDDIPESFDSREKWKDC 109
Query: 92 TNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEG 151
++ + QSNCGSCWA++ +SDR+CI +QGR LS+ +L CC G C+G
Sbjct: 110 PSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYG-CDG 168
Query: 152 GNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
G RAW + GV TGG Y ++GNC
Sbjct: 169 GYNARAWKWATIAGVVTGGAYK-----EKGNCK 196
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 32 PDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNC 91
P+++K FL L PQ E + + Q + SN ++PE FD R+++ +C
Sbjct: 59 PEVVKKRRQFL--------LKPQFIERSYN-QENVLPIANITSNDDIPESFDSREKWKDC 109
Query: 92 TNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEG 151
++ + QSNCGSCWA++ +SDR+CI +QGR LS+ +L CC G C+G
Sbjct: 110 PSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYG-CDG 168
Query: 152 GNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
G RAW + GV TGG Y ++GNC
Sbjct: 169 GYNARAWKWATIAGVVTGGAYK-----EKGNCK 196
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S +LPE FD R+++PNC +I ++ QS+C SCWA+++ +A++DR+CI + G+ LS+
Sbjct: 59 SENDLPESFDARQKWPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSA 118
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+++CCA C G C GG P +W Y GV TGG
Sbjct: 119 IDIVSCCAYCGYG--CNGGIPAMSWDYWTREGVVTGG 153
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 61/93 (65%), Gaps = 2/93 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++ C +I +++ QS CGSCWA A A+SDR+CI ++G+ LS+ L
Sbjct: 89 EIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
L+CC C G C+GG P AW Y +E+G+ TG
Sbjct: 149 LSCCTECGLG--CQGGFPGAAWDYWVEDGIVTG 179
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 74/126 (58%), Gaps = 7/126 (5%)
Query: 52 TPQSQEPNPDLQLGSEHFG-----DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSC 106
TP ++ D++ + D ++P +D R + NCT +++ Q+NCGSC
Sbjct: 56 TPNFEQKIMDIKYKHQKLNLMVKEDPDPEVDIPPSYDPRDVWKNCTTF-YIRDQANCGSC 114
Query: 107 WAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
WA++T AA+SDR+CIA++ +S+ ++TCC GD CEGG P+ AW Y + +GV
Sbjct: 115 WAVSTAAAISDRICIASKAEKQVNISATDIMTCCRP-QCGDGCEGGWPIEAWKYFIYDGV 173
Query: 167 PTGGDY 172
+GG+Y
Sbjct: 174 VSGGEY 179
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 65/102 (63%), Gaps = 1/102 (0%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D + +LPE +D R + NC++ ++ Q+NCGSCWA++T AA+SDR+CIAT+G+
Sbjct: 82 DNDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAVSTAAAISDRICIATKGKKQVY 141
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
S +LTCC A G C GG P+ AW + +GV +GG Y
Sbjct: 142 ASDTDILTCCGARCGLG-CRGGWPIEAWKFFEYDGVVSGGPY 182
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 61/103 (59%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y +LP+ FD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSDAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS LLTCC +C G C GG P AW + G+ TGG Y S
Sbjct: 133 SSQDLLTCCDSCGMG--CNGGYPSAAWDFWATEGLVTGGLYNS 173
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 93.6 bits (231), Expect = 3e-17, Method: Composition-based stats.
Identities = 45/105 (42%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTE-LPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
++++ TE +P FD R +P C ++ GHV+ Q +CGSCWA A+T A +DR+CI +QG+
Sbjct: 266 EFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRL 325
Query: 129 HTLSSDHLLTCCAACTGGDV-CEGGNPMRAWYYMLENGVPTGGDY 172
LS+ H +CC A C GG P AW + GV TGGD+
Sbjct: 326 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDF 370
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 93.6 bits (231), Expect = 3e-17, Method: Composition-based stats.
Identities = 45/105 (42%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTE-LPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
++++ TE +P FD R +P C ++ GHV+ Q +CGSCWA A+T A +DR+CI +QG+
Sbjct: 266 EFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKRL 325
Query: 129 HTLSSDHLLTCCAACTGGDV-CEGGNPMRAWYYMLENGVPTGGDY 172
LS+ H +CC A C GG P AW + GV TGGD+
Sbjct: 326 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDF 370
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 61/93 (65%), Gaps = 2/93 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++ C +I +++ QS CGSCWA A A+SDR+CI ++G+ LS+ L
Sbjct: 89 EIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
L+CC C G C+GG P AW Y +E+G+ TG
Sbjct: 149 LSCCTECGLG--CQGGFPGAAWDYWVEDGIVTG 179
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 61/99 (61%), Gaps = 2/99 (2%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP+ FD R+Q+PNC + ++ Q NCGSCWA A+SDR+CI + G++ +S++
Sbjct: 77 IKLPDSFDPREQWPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKISLEISAED 136
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LLTCC C G C GG P AW + G+ TGG + S
Sbjct: 137 LLTCCDECGMG--CFGGFPSAAWEFWTNKGLVTGGLFDS 173
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 65/103 (63%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +LP+ FD R+Q+PNC + ++ Q +CGSCWA + A+SDR+CI + ++ +
Sbjct: 73 YAGDMKLPKAFDSREQWPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC +C G C GG P AW + + G+ +GG Y S
Sbjct: 133 SAEDLLTCCDSCGMG--CNGGYPSAAWDFWTKEGLVSGGLYDS 173
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 93.6 bits (231), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 65/104 (62%), Gaps = 5/104 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P EFD R ++PNC IG + Q +C SCWA+A T +SDR+CI + R LS+ +LL
Sbjct: 113 IPAEFDARLRWPNCPTIGEIFEQGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLL 172
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRF 178
+CC C G C+GG P AW + ++G+ TGG Y S CQ++
Sbjct: 173 SCCKLC--GKGCKGGFPGGAWMHWSKHGIVTGGSYSSDYGCQKY 214
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 64/101 (63%), Gaps = 1/101 (0%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N++L FD R+++P C++I + S C S WA A ++SDR+CI + G +D LS+
Sbjct: 82 NSDLSPFFDARERWPECSSIPLINDISECKSSWAFAAAESMSDRLCINSGGMIDTILSAQ 141
Query: 135 HLLTCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LL+CC G+ C GGNP++AW Y ++G+PTGG Y S
Sbjct: 142 ELLSCCTGVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYES 182
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 32 PDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNC 91
P+++K FL L PQ E + + Q + SN ++PE FD R+++ +C
Sbjct: 59 PEVVKKRRQFL--------LKPQFIERSYN-QENVLPVANITSNDDIPESFDSREKWKDC 109
Query: 92 TNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEG 151
++ + QSNCGSCWA++ +SDR+CI +QGR LS+ +L CC G C+G
Sbjct: 110 PSLRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYG-CDG 168
Query: 152 GNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
G RAW + GV TGG Y ++GNC
Sbjct: 169 GYNARAWKWATIAGVVTGGAYK-----EKGNCK 196
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+ +PNC I ++ Q +CGSCWA A+SDR+CI T G ++ +
Sbjct: 74 FAEDMVLPENFDAREHWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGHVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LTCC GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLTCCGD-QCGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+++ +C +I ++ QS CGSCWA A+SDR+CI ++G + +S++ L
Sbjct: 84 DLPESFDAREKWSHCASINLIRDQSTCGSCWAFGAAEAMSDRVCIHSEGGIQVNISAEDL 143
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L CC +C G C+GG P AW Y E+G+ + G YG+
Sbjct: 144 LDCCDSCGAG--CDGGYPAAAWEYWKESGLVSDGLYGT 179
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 61/99 (61%), Gaps = 3/99 (3%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+PE FD R+Q+PNC +I V+ QS CGSCWA A+SDR+CIAT + +S++
Sbjct: 72 NLEIPESFDARQQWPNCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIATGKQT--RISTE 129
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
LLTCC T G C GG P AW Y G+ TG +G
Sbjct: 130 DLLTCC-GITCGMGCNGGFPSGAWNYFKNKGLVTGDLFG 167
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 60/96 (62%), Gaps = 3/96 (3%)
Query: 78 LPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+PE FD R+ +P C +I G ++ Q++CGSCWA A+SDR+CI + + ++S++ L
Sbjct: 84 IPESFDAREAWPECASIIGDIRDQASCGSCWAFGAAEAMSDRICIHSNATVKVSISTEDL 143
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
TCC C GD C GG P AW Y E G+ TGG Y
Sbjct: 144 NTCCYEC--GDGCNGGWPAEAWAYWAETGIVTGGKY 177
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 93.2 bits (230), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+ NC I ++ Q +CGS WA A+SDR+CI T GR++ +
Sbjct: 57 FSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRVNVEV 116
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 117 SAEDLLTCCGIQC-GDGCNGGYPSGAWNFWTRKGLVSGGVYNS 158
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G ++ +S++ LL
Sbjct: 103 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGNVNVEVSAEDLL 162
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
TCC G+ C GG P AW + + G+ +GG Y S
Sbjct: 163 TCCGF-QCGEGCNGGFPSGAWNFWTKKGLVSGGLYDS 198
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 92.8 bits (229), Expect = 5e-17, Method: Composition-based stats.
Identities = 45/105 (42%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTE-LPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
++++ TE +P FD R +P C ++ GHV+ Q +CGSCWA A+T A +DR+CI +QG+
Sbjct: 269 EFENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGL 328
Query: 129 HTLSSDHLLTCCAACTGGDV-CEGGNPMRAWYYMLENGVPTGGDY 172
LS+ H +CC A C GG P AW + GV TGGD+
Sbjct: 329 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDF 373
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 54/139 (38%), Positives = 78/139 (56%), Gaps = 14/139 (10%)
Query: 39 PSFLSSLKFGLSLTPQSQEPN-PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHV 97
P+++ L L ++P++ P+ +L D S LPE FD R+ +P CT IG +
Sbjct: 61 PAYVRGL---LGVSPENHRYRLPERRL------DLSSLGPLPENFDSRENWPECTTIGEI 111
Query: 98 QLQSNCGSCWAIATTAALSDRMCI--ATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPM 155
+ Q +CGSCWA A+SDR CI + G LS+D LL+CC C G+ C GG P
Sbjct: 112 RDQGSCGSCWAFGAVEAMSDRTCIHSPSGGPKRVHLSADDLLSCCRTC--GNGCNGGFPG 169
Query: 156 RAWYYMLENGVPTGGDYGS 174
AW + ++ G+ TGG+Y S
Sbjct: 170 SAWSFWVKTGIVTGGNYDS 188
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW ++ G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIMC-GDGCNGGYPAGAWNFLTRKGLVSGGLYDS 175
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 50/102 (49%), Positives = 66/102 (64%), Gaps = 7/102 (6%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT-LSS 133
N LPE FDLR+ YP C ++ V+ QSNCGSCWA T A+SDR+CIA+ G+ D T +SS
Sbjct: 83 NLSLPESFDLREAYPKCESLQQVRDQSNCGSCWAFGTVEAISDRICIAS-GQKDQTRISS 141
Query: 134 DHLLTCCA---ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
++LL+CC AC G C GG AW Y ++ G+ +G Y
Sbjct: 142 ENLLSCCRGTFACGMG--CNGGYTAGAWNYYVKTGLVSGNLY 181
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 65 FAEDLNLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 124
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 125 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYDS 166
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 86 FAEDLNLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 145
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 146 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYDS 187
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C + ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTVKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 80/168 (47%), Gaps = 6/168 (3%)
Query: 9 VNHSHHLLLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEH 68
VN S H L R L ++ ++ +F L P+ +
Sbjct: 14 VNASSHFLSDKFIRQ----LQSEDSTWEAGRNFNKHLSIKYFRRLMGVHPDSKFHMPKYE 69
Query: 69 FGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
N E+P+EFD R +P C IG ++ Q +CGSCWA +SDR CI ++G+ +
Sbjct: 70 AHQIPENFEMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSN 129
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
S+++L++CC C G C GG P A+ Y + +G+ +GG + S Q
Sbjct: 130 FHYSAENLVSCCHLCGFG--CNGGFPGAAFKYWVHSGIVSGGSFNSTQ 175
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 59/93 (63%), Gaps = 2/93 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R+Q+P+C IG ++ QS+CGSCWA A+SDR+CI + G +LSS L+
Sbjct: 80 IPKTFDAREQWPHCPTIGQIRDQSSCGSCWAFGAVEAMSDRLCIHSNGTFTKSLSSIDLV 139
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+CC C G C+GG P AW + G+ TGG
Sbjct: 140 SCCGYCGFG--CQGGYPPAAWDFWQAYGIVTGG 170
>gi|402583630|gb|EJW77574.1| hypothetical protein WUBG_11516 [Wuchereria bancrofti]
Length = 168
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 66/103 (64%), Gaps = 4/103 (3%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+ELP+EFD R+++P C +I +V Q CGSC+A+A SDR+CIAT G + LSSD
Sbjct: 14 SELPDEFDARRKWPLCPSIHNVPNQGGCGSCYAVAVAGVASDRICIATNGTVQVILSSDD 73
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRF 178
+++CC +C C GG+ ++A Y + G+ TGG G CQ +
Sbjct: 74 IISCCISCGA---CTGGDSLKAMIYWVNEGIVTGGRDG-CQPY 112
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + +LP FD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YADDLKLPTNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S+ LLTCC C G C GG P AW + +G+ TGG Y S
Sbjct: 133 SAQDLLTCCDGCGMG--CNGGYPSAAWDFWSSDGLVTGGLYNS 173
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 62/99 (62%), Gaps = 3/99 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+ +PNC I ++ Q +CGSCWA A+SDR+CI T G ++ +S++ L
Sbjct: 79 DLPENFDARENWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCIHTNGNVNVEVSAEDL 138
Query: 137 LTCC-AACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LTCC C GD C GG P AW + + G+ +GG Y S
Sbjct: 139 LTCCHMEC--GDGCNGGFPAGAWNFWTKKGLVSGGLYDS 175
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 62/100 (62%), Gaps = 1/100 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+C+ T G + +S++
Sbjct: 79 DMKLPENFDAREQWPKCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHTNGYITIEVSAE 138
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LL+CC G+ C GG P AW Y ++ G+ +GG Y S
Sbjct: 139 DLLSCCGL-QCGEGCNGGFPAGAWKYWIKKGLVSGGLYDS 177
>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
Length = 244
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 71/125 (56%), Gaps = 12/125 (9%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAA 114
Q+ N D L D T++P+EFD R+ + +C N IG V+ Q NC S WA+A +
Sbjct: 3 QKTNYDNWLTDRKTVDANYRTDVPKEFDARRHFVSCANVIGDVKDQGNCASSWAVAVAST 62
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDV-----CEGGNPMRAWYYMLENGVPTG 169
+DR+CIAT G+ LS+ +L++C GD C GG+ +AW + + NG+ TG
Sbjct: 63 FTDRLCIATGGKFTDNLSAQNLMSC------GDSEKFVGCHGGSAFKAWEFTMGNGIVTG 116
Query: 170 GDYGS 174
G++ S
Sbjct: 117 GNFNS 121
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 61/103 (59%), Gaps = 3/103 (2%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L HF + + LPE FD +P+C I + QS+CGSCWA+A A+SDR C+ T
Sbjct: 77 LPRRHFTEEELRAPLPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCV-T 135
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
G D +S+ LL+CC +C GD C+GG P AW Y E+G+
Sbjct: 136 GGVRDLGISAGDLLSCCTSC--GDGCDGGYPDEAWLYFTESGL 176
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 92.0 bits (227), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 60/93 (64%), Gaps = 2/93 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++ C +I +++ QS CGSCWA A+SDR+CI ++G+ LS+ L
Sbjct: 89 EIPSSFDSRKKWHQCKSISNIRDQSRCGSCWAFTAVEAMSDRICIESKGKKSVELSAVDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
L+CC C G C+GG P AW Y +E+G+ TG
Sbjct: 149 LSCCTECGLG--CQGGFPGAAWDYWVEDGIVTG 179
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIMC-GDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 63/102 (61%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N ELP+EFD R +P C IG ++ Q +CGSCWA +SDR CI ++G+ + S++
Sbjct: 76 NFELPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAE 135
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+L++CC C G C GG P A+ Y + +G+ +GG + S Q
Sbjct: 136 NLVSCCHLCGFG--CNGGFPGAAFKYWVHSGIVSGGSFNSTQ 175
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+ + +C +I ++ QS CGSC A T A+SDR+CI T+GR+ +S+ L
Sbjct: 24 DLPESFDAREAWSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKGRVQVNISAQDL 83
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LTCC C G C GG P AW Y + G+ TGG YG+
Sbjct: 84 LTCCHQCGMG--CFGGYPSAAWDYYKDEGIVTGGLYGT 119
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIMC-GDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIMC-GDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 KVEIPSHFDSRKKWPRCKSISQIRDQSRCGSSWAVSAVGAISDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG P AW Y + +G+ TGG
Sbjct: 147 DLISCCENCGSG--CDGGFPGPAWDYWVSHGIVTGG 180
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/104 (42%), Positives = 64/104 (61%), Gaps = 3/104 (2%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ N +LP+ FD R Q+PNC I ++ Q +CGSCWA ++SDR+C+ + G+ + +
Sbjct: 7 FSGNWKLPDNFDSRTQWPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGGKQNVEV 66
Query: 132 SSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LL+CC C G C GG P AW Y E G+ +GG YGS
Sbjct: 67 SAEDLLSCCGFECGMG--CNGGYPSGAWQYWTEKGLVSGGLYGS 108
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 59/93 (63%), Gaps = 2/93 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++ C +I +++ QS CG CWA A A+SDR+CI ++G+ LS+ L
Sbjct: 89 EIPSSFDSRKKWRQCKSISNIRDQSRCGPCWAFAAVEAMSDRICIQSKGKKSVELSAVDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
L+CC C G C+GG P AW Y +E G+ TG
Sbjct: 149 LSCCTECGLG--CQGGFPGAAWDYWVEEGIVTG 179
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIMC-GDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 67/116 (57%), Gaps = 3/116 (2%)
Query: 55 SQEPNPDLQLGSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
++ PDL+ D+ N E+P FD RK++P C +I ++ QS CGSCWA
Sbjct: 65 ARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVE 124
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
A+SDR CI + G+ + LS+ LL+CC +C G CEGG AW Y ++ G+ TG
Sbjct: 125 AMSDRSCIQSGGKQNVELSAVDLLSCCESCGLG--CEGGILGPAWDYWVKEGIVTG 178
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGIMC-GDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/111 (40%), Positives = 65/111 (58%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+ FD R+Q+P+C I V+ Q CGSCWA A+SDR CI ++G++ +S++ L
Sbjct: 3 DVPDHFDSREQWPHCPTIKEVRDQGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDL 62
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
L+CC C G C GG P AW + G+ TGG Y S CQ + C+
Sbjct: 63 LSCCETCGMG--CNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACD 111
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 67/106 (63%), Gaps = 4/106 (3%)
Query: 69 FGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
+ Q +LP+ FD R ++P+C ++ ++ Q+NCGSCWA + A++DR+CIA +G +
Sbjct: 78 YKQVQPRNDLPDNFDPRTKWPDCASLNEIRDQANCGSCWAFGSAEAMTDRICIAGKGNIH 137
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ + CC +C G C GG P AW + ++ GV +GG YG+
Sbjct: 138 --ISAEDINDCCKSCGMG--CNGGYPAAAWEWYVDTGVVSGGQYGT 179
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 65/112 (58%), Gaps = 5/112 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA A+SDR CI + G+ + LS+
Sbjct: 64 NVEIPSNFDSRKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAV 123
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG---DYGSCQRFDRGNC 183
LL+CC C GD EGG P AW Y ++ G+ TG ++ SCQ + C
Sbjct: 124 DLLSCCEHC--GDGFEGGFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKC 173
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 68/116 (58%), Gaps = 7/116 (6%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P LQ + FG + ELP+ FD R +PNC I V+ Q +CGSCWA A+SDR+
Sbjct: 66 PQLQ---KRFG-FADGMELPDSFDSRAAWPNCPTIREVRDQGSCGSCWAFGAVEAISDRV 121
Query: 120 CIATQGRLDHTLSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C+ T G+++ +S++ LL+CC C G C GG P AW + E G+ +GG Y S
Sbjct: 122 CVHTNGKVNVEVSAEDLLSCCGFECGMG--CNGGYPSGAWKFWTETGLVSGGLYDS 175
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 62/103 (60%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y +LP +FD R+Q+P C + ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YAGGLKLPAQFDSREQWPECPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSGSKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LLTCC AC G C GG P AW + + G+ +GG Y S
Sbjct: 133 SSEDLLTCCDACGMG--CNGGYPSAAWDFWTKEGLVSGGLYNS 173
>gi|161343881|tpg|DAA06121.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 182
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 71/122 (58%), Gaps = 6/122 (4%)
Query: 56 QEPNPD--LQLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATT 112
Q+ N D L+ + F D T++P+EFD R+ + NC N IG V+ Q NC S WA+A
Sbjct: 43 QKTNYDSWLKKNRKTF-DINYKTDIPKEFDARQYFFNCANVIGDVKDQGNCASSWAVAVA 101
Query: 113 AALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ +DR+CIAT G LS+ +L++C G C GG+ +AW ++ G+ TGG++
Sbjct: 102 STFTDRLCIATNGTFTQNLSAQNLMSCGDDEKSG--CNGGSAFKAWEFITGKGIVTGGNF 159
Query: 173 GS 174
S
Sbjct: 160 DS 161
>gi|161343847|tpg|DAA06104.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 187
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 69/122 (56%), Gaps = 5/122 (4%)
Query: 56 QEPNPDLQLGSEHFGDYQ---SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATT 112
Q+ D+ + +Y + ++PE FD R ++ C +I H+ Q NC + WAI+ T
Sbjct: 62 QKNGKDIDIIGHKVHNYDLDDGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVT 121
Query: 113 AALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+A++DR+CI ++ + S +L+CC C GD C GG AW Y ++ G+ TGGDY
Sbjct: 122 SAINDRICIKSKKNITAFYSPQKMLSCCDDC--GDGCNGGYSGAAWQYWMKRGLVTGGDY 179
Query: 173 GS 174
GS
Sbjct: 180 GS 181
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 69/122 (56%), Gaps = 5/122 (4%)
Query: 56 QEPNPDLQLGSEHFGDYQ---SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATT 112
Q+ D+ + +Y + ++PE FD R ++ C +I H+ Q NC + WAI+ T
Sbjct: 62 QKNGKDIDIIGHKVHNYDLDDGSNDMPETFDARNKWFECVSIAHIWNQGNCAADWAISVT 121
Query: 113 AALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+A++DR+CI ++ + S +L+CC C GD C GG AW Y ++ G+ TGGDY
Sbjct: 122 SAINDRICIKSKKNITAFYSPQKMLSCCDDC--GDGCNGGYSGAAWQYWMKRGLVTGGDY 179
Query: 173 GS 174
GS
Sbjct: 180 GS 181
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 60/101 (59%), Gaps = 2/101 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
+N +P FD R +P C ++ GHV+ Q +CGSCWA A+T A +DR+CI +QG+ LS
Sbjct: 164 ANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWAFASTEAFNDRLCIRSQGKGVMPLS 223
Query: 133 SDHLLTCCAACTGGDV-CEGGNPMRAWYYMLENGVPTGGDY 172
+ H +CC A C GG P AW + GV TGGD+
Sbjct: 224 TQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDF 264
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 64/103 (62%), Gaps = 5/103 (4%)
Query: 70 GDYQSNTELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
DY + E+PE FD +++P C + +++ QSNCGSCWA+++ +SDR+C+AT G++
Sbjct: 64 ADYDLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGSCWAVSSAGVMSDRICVATNGKVK 123
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
++S A+C GGD C GG A+ +ENG PTG +
Sbjct: 124 VSISG----IATASCVGGDGCNGGLEEVAFEKFIENGFPTGSE 162
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 60/93 (64%), Gaps = 2/93 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R +P+C +I ++ QS+CGSCWA A+SDR+CI + G + +LS+ LL
Sbjct: 86 IPKSFDARATWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAVDLL 145
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+CC C GD C+GG P AW + +G+ TGG
Sbjct: 146 SCCKDC--GDGCDGGFPPMAWDFWKTHGIVTGG 176
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 62/100 (62%), Gaps = 2/100 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N LP+ FD R+Q+ NC +I + QS C S WA+A+ AA+SDR+CI T G + LS+
Sbjct: 81 NILLPDYFDSREQWKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAI 140
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L++CC+ C G C G AWYY +ENG+ TG G+
Sbjct: 141 ELVSCCSKCAVG--CNFGYSESAWYYWVENGLVTGESNGN 178
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 5/103 (4%)
Query: 75 NTE---LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
NTE LP+ FD RKQ+P+C I ++ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 71 NTEGIRLPDSFDARKQWPDCRTIQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEI 130
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LL+CC C G C GG P AW + + G+ TGG GS
Sbjct: 131 SAEDLLSCCDECGMG--CSGGYPSSAWEFWTKKGLVTGGLCGS 171
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 61/99 (61%), Gaps = 1/99 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
++ ELP FD R+ +P+C +I V+ Q +CGSCWA + A+SDR CI + LSS
Sbjct: 91 ADLELPANFDSREAWPDCPSISEVRDQGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSS 150
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ LL+CC G+ C GG P AW Y ++NG+ +GG Y
Sbjct: 151 EDLLSCCGYVC-GNGCNGGFPQAAWEYWVQNGLVSGGLY 188
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 58 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 117
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 118 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 159
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/105 (43%), Positives = 66/105 (62%), Gaps = 7/105 (6%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP +FD R+ +PNC +I ++ Q +CGSCWA A+SDR+CI T ++ +S+++LL
Sbjct: 83 LPSDFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHTNKNVN--ISAENLL 140
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFD 179
+CC +C G C GG P AW Y G+ +GG YGS CQ +D
Sbjct: 141 SCCYSCGFG--CNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYD 183
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 3/116 (2%)
Query: 55 SQEPNPDLQLGSEHFGDY-QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
++ PDL+ D+ + N E+P FD RK++P C +I ++ QS CGSCWA
Sbjct: 65 ARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFGAVE 124
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
A+SDR CI + G+ + LS+ LL+CC +C G CEGG AW + ++ G+ TG
Sbjct: 125 AMSDRSCIQSGGKQNVELSAVDLLSCCESCGLG--CEGGILGPAWDFWVKEGIVTG 178
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 82/161 (50%), Gaps = 18/161 (11%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
+ + V + N + + L+ SF+ +LK +++P P + +
Sbjct: 23 IAKRVNKQQNSWVANENTPLRDYSSFIGTLK--------NKKPLPIRSIPIKR------- 67
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
ELP+EFD +++P C +I V+ QS+C SCWA +DR+CI ++G+ LS++
Sbjct: 68 -ELPKEFDSSEKWPECPSILEVRDQSSCASCWAFGVVEVATDRICIESKGKNQVRLSAED 126
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+L CC C G C+GG AW Y+ GV TGG Y S +
Sbjct: 127 VLECCKDC--GFQCQGGYSAMAWEYLRRTGVVTGGQYNSTE 165
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 68/110 (61%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+ FD RK++ +C +I ++ QS+CGS WA A+SDR+CI ++G+ LS+++L
Sbjct: 94 ELPKSFDARKEWTHCPSISEIRDQSSCGSYWAFGAVEAMSDRICIESKGKYKPFLSAENL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
++CC++C G C GG P AW Y G+ TG Y + CQ ++ C
Sbjct: 154 VSCCSSCGMG--CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPC 201
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S +LPE FD R+++ NC +I ++ QS+C SCWA+++ +A++DR+CI + G+ LS+
Sbjct: 82 SENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSA 141
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+++CCA C G C GG P +W Y GV TGG
Sbjct: 142 IDIVSCCAYCGYG--CNGGIPAMSWDYWTREGVVTGG 176
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S +LPE FD R+++ NC +I ++ QS+C SCWA+++ +A++DR+CI + G+ LS+
Sbjct: 82 SENDLPESFDARQKWANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSA 141
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+++CCA C G C GG P +W Y GV TGG
Sbjct: 142 IDIVSCCAYCGYG--CNGGIPAMSWDYWTREGVVTGG 176
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 3/116 (2%)
Query: 55 SQEPNPDLQLGSEHFGDY-QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
++ PDL+ D+ + N E+P FD RK++P C +I ++ QS CGSCWA
Sbjct: 70 ARREEPDLRRKRRPTVDHNEWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWAFGAVE 129
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
A+SDR CI + G+ + LS+ LL+CC +C G CEGG AW + ++ G+ TG
Sbjct: 130 AMSDRSCIQSGGKQNVELSAVDLLSCCESCGLG--CEGGILGPAWDFWVKEGIVTG 183
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 71/115 (61%), Gaps = 5/115 (4%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
++ ++LP+ FD R ++ C +I ++ Q +CGSCW+ +++DR+CI + G++ +
Sbjct: 76 HEDTSDLPKSFDSRDKWRMCPSIREIRDQGSCGSCWSFGAVESITDRICIHSNGKVKVHI 135
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
S++ L+TCC +C G C GG +AW+Y + NG+ TGG Y S CQ ++ C
Sbjct: 136 SAEDLMTCCTSCGMG--CNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIPKC 188
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 61/96 (63%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS C S WA++ AA+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y +++G+ TGG
Sbjct: 147 DLISCCKNCGSG--CDGGVTGYSWDYWVKHGIVTGG 180
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 64/103 (62%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ +LP +FD R Q+P C + V+ Q +CGSCWA A+SDR+CI + G ++ +
Sbjct: 73 FTADVQLPVDFDARVQWPQCPTLKEVRDQGSCGSCWAFGAAEAISDRLCIHSNGLMNVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LL+CC +C G C GG P AW + +G+ +GG Y S
Sbjct: 133 SAEDLLSCCDSCGMG--CNGGYPSAAWEFWTTDGLVSGGLYDS 173
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 7/116 (6%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P LQ + FG + ELP+ FD R +PNC I ++ Q +CGSCWA A+SDR+
Sbjct: 66 PQLQ---KRFG-FADGLELPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAVEAISDRV 121
Query: 120 CIATQGRLDHTLSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C+ T G+++ +S++ LL+CC C G C GG P AW + E G+ +GG Y S
Sbjct: 122 CVHTNGKVNVEVSAEDLLSCCGDECGMG--CNGGYPSGAWQFWTETGLVSGGLYDS 175
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 63/102 (61%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N ++P+EFD R +P C IG ++ Q +CGSCWA +SDR CI ++G+ + SS+
Sbjct: 77 NFDMPKEFDSRAAWPMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSE 136
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+L++CC C G C GG P A+ Y + +G+ +GG + S Q
Sbjct: 137 NLVSCCHLCGFG--CNGGFPGAAFKYWVHSGIVSGGSFNSTQ 176
>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 515
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN ++P +FD RK + C +I ++ QS+CGSCWA A+SDR+CI + + LS+
Sbjct: 77 SNVDIPMQFDARKYWLKCPSIREIRGQSSCGSCWAFGAVEAMSDRLCIHSGAKYQKGLSA 136
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
LL+CC C G C+GG P +AW Y +G+ TGG
Sbjct: 137 VDLLSCCWKCGYG--CDGGFPAQAWNYWSTDGIVTGG 171
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/92 (47%), Positives = 58/92 (63%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+EFD R +PNC+ IG + Q +CGSCWA +LSDR CI L+ +LS++ L
Sbjct: 99 ELPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HYGLNISLSANDL 156
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC GD C+GG P++AW Y + GV T
Sbjct: 157 LACCGFLC-GDGCDGGYPLQAWKYFVRKGVVT 187
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 71/128 (55%), Gaps = 8/128 (6%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P LQL D + +LP+ FD R Q+ NC I ++ Q +CGSCWA ++SDR+
Sbjct: 99 PHLQLPVR---DIEPRKDLPDTFDARTQWSNCPTIKEIRDQGSCGSCWAFGAVESMSDRI 155
Query: 120 CIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQ 176
CI + G+ + +S++ L +CC +C G+ C GG AW Y +G+ TGG Y S CQ
Sbjct: 156 CIKSNGQQNAHISAEDLTSCCRSC--GNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQ 213
Query: 177 RFDRGNCN 184
+ C+
Sbjct: 214 PYTVKACD 221
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 74 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLTCCGS-RCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 74/133 (55%), Gaps = 6/133 (4%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAA 114
Q+ N D S D T++P+EFD R+ + +C N IG V+ Q NC S WA+A +
Sbjct: 42 QKTNYDNWSRSRKTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVAST 101
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+DR+CIA+ G+ LS+ +L++C G C+GG+ +AW + + G+ TGG Y S
Sbjct: 102 FTDRLCIASNGKFTDNLSAQNLMSCGDDEKLG--CDGGSAYKAWEFTMGKGIVTGGPYDS 159
Query: 175 ---CQRFDRGNCN 184
CQ + C+
Sbjct: 160 NEGCQPYKNRPCD 172
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 61/96 (63%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS C S WA++ AA+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y +++G+ TGG
Sbjct: 147 DLISCCENCGSG--CDGGVTGYSWDYWVKHGIVTGG 180
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 65/103 (63%), Gaps = 3/103 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
++++ +LP FD R Q+PNC +I ++ Q+ CGSCWA A +SDR+CI + G
Sbjct: 78 EFETLIQLPTAFDSRVQWPNCNSIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPI 137
Query: 131 LSSDHLLTCC-AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+S + +L+CC ++C G C+GG + A Y + +GV TGGDY
Sbjct: 138 ISPEDILSCCGSSCNNG--CQGGYTIEAMKYWMNSGVVTGGDY 178
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 65/111 (58%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P FD R + +C+ IG ++ Q CGSCWA A+SDR+CIA++G D +++ +
Sbjct: 93 DIPATFDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDV 152
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG---SCQRFDRGNCN 184
L+CC C G+ C GG P+ A Y + G+ TGG YG +CQ + C
Sbjct: 153 LSCCLTC--GNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACE 201
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 70/132 (53%), Gaps = 7/132 (5%)
Query: 47 FGLSLTPQSQEPNPDLQLGS----EHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQS 101
G L+ S P++ LG + F + Q E ++FD R+ +P C IGHV+ Q
Sbjct: 203 MGTYLSFYSDPDKPEVPLGEPLPVKVFAETQQVLET-DKFDAREAFPQCAEVIGHVRDQG 261
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDV-CEGGNPMRAWYY 160
+CGSCWA A+T AL+DR CI + GR LS H +CC C GG P AW +
Sbjct: 262 DCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRW 321
Query: 161 MLENGVPTGGDY 172
+GV TGGDY
Sbjct: 322 FSNDGVVTGGDY 333
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 70/132 (53%), Gaps = 7/132 (5%)
Query: 47 FGLSLTPQSQEPNPDLQLGS----EHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQS 101
G L+ S P++ LG + F + Q E ++FD R+ +P C IGHV+ Q
Sbjct: 203 MGTYLSFYSDPDKPEVPLGEPLPVKVFAETQQVLET-DKFDAREAFPQCAEVIGHVRDQG 261
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDV-CEGGNPMRAWYY 160
+CGSCWA A+T AL+DR CI + GR LS H +CC C GG P AW +
Sbjct: 262 DCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRW 321
Query: 161 MLENGVPTGGDY 172
+GV TGGDY
Sbjct: 322 FSNDGVVTGGDY 333
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/111 (40%), Positives = 63/111 (56%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP+ FD R ++P+C +I ++ QS CGSCWA A+SDR+CI + G + +LS+ L
Sbjct: 85 RLPKNFDARTKWPHCPSISEIRDQSGCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDL 144
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG---DYGSCQRFDRGNCN 184
L+CC C G C GG P AW Y +G+ TGG D C+ + C
Sbjct: 145 LSCCENCGYG--CSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCE 193
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 64/102 (62%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LPE FD R+++ C ++ ++ Q CGSCWAI+ +A++DR CI ++G+ + +
Sbjct: 84 DIQLPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGAT 143
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+L CC AC GD C+GG AW + +E GV +GG Y S Q
Sbjct: 144 DMLACCHAC--GDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQ 183
>gi|402585445|gb|EJW79385.1| hypothetical protein WUBG_09708, partial [Wuchereria bancrofti]
Length = 190
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 62/97 (63%), Gaps = 4/97 (4%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
PE+FD R Q+P C ++ V Q CGSCWAI+ + +SDR+CIAT +S++ L++
Sbjct: 49 PEQFDARLQWPLCWSVHQVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLIS 108
Query: 139 CCAACTGGDVCEGGN-PMRAWYYMLENGVPTGGDYGS 174
CCA C G C+G N + A+ Y +G+ TGGDYGS
Sbjct: 109 CCAECGG---CQGSNWALSAFIYWRNHGIVTGGDYGS 142
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 64/102 (62%), Gaps = 2/102 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LPE FD R+++ C ++ ++ Q CGSCWAI+ +A++DR CI ++G+ + +
Sbjct: 84 DIQLPESFDARQKWSQCPSLNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGAT 143
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+L CC AC GD C+GG AW + +E GV +GG Y S Q
Sbjct: 144 DMLACCHAC--GDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQ 183
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS C S WA++ A+SDR+CI + G+ LS+
Sbjct: 61 KVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAI 120
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG P AW Y + +G+ TGG
Sbjct: 121 DLISCCENCGSG--CDGGFPGPAWDYWVSHGIVTGG 154
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 63/96 (65%), Gaps = 3/96 (3%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R ++ NCT+I ++ Q+ CGSCWA +T +SDR+CIAT+G T+S +L
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 138 TCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
CC +C GD C+GG P++A+ + GV TGGD+
Sbjct: 141 ACCGNSC--GDGCKGGYPIQAFRWWNSRGVVTGGDF 174
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 68/115 (59%), Gaps = 5/115 (4%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P LQ + FG + + +LP+ FD R +PNC I ++ Q +CGSCWA A+SDR+
Sbjct: 66 PQLQ---KRFG-FADDLDLPDSFDSRAAWPNCPTIREIRDQGSCGSCWAFGAVEAISDRV 121
Query: 120 CIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
C+ T G+++ +S++ LL+CC G C GG P AW + E G+ +GG Y S
Sbjct: 122 CVHTNGKVNVEVSAEDLLSCCGFKCGMG-CNGGYPSGAWRFWTETGLVSGGLYDS 175
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 74/133 (55%), Gaps = 6/133 (4%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAA 114
Q+ N D S D T++P+EFD R+ + +C N IG V+ Q NC S WA+A +
Sbjct: 6 QKTNYDNWSRSRKTADINYKTDIPKEFDARQYFISCANVIGDVKDQGNCASSWAVAVAST 65
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+DR+CIA+ G+ LS+ +L++C G C+GG+ +AW + + G+ TGG Y S
Sbjct: 66 FTDRLCIASNGKFTDNLSAQNLMSCGDDEKLG--CDGGSAYKAWEFTMGKGIVTGGPYDS 123
Query: 175 ---CQRFDRGNCN 184
CQ + C+
Sbjct: 124 NEGCQPYKNRPCD 136
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 67/112 (59%), Gaps = 6/112 (5%)
Query: 77 ELPEEFDLRKQYPN-CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP EFD RK++ + C ++ V+ Q CGSCWA A++DR+CIAT+G+ +S++
Sbjct: 77 DLPIEFDARKEWGSICPSLLEVRDQGECGSCWAFGAAEAMTDRICIATKGKNQVRISTED 136
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
LLTCC +C G C GG P AW + G+ TGG Y S CQ + C+
Sbjct: 137 LLTCCDSCGFG--CNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPYAIPACD 186
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 66/116 (56%), Gaps = 3/116 (2%)
Query: 55 SQEPNPDLQLGSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
++ PDL+ D+ N E+P FD RK++P C +I ++ QS CGSCW+
Sbjct: 65 ARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVE 124
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
A+SDR CI + G+ + LS+ LLTCC +C G CEGG AW Y ++ G+ T
Sbjct: 125 AMSDRSCIQSGGKQNVELSAVDLLTCCESCGLG--CEGGILGPAWDYWVKEGIVTA 178
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 62/100 (62%), Gaps = 3/100 (3%)
Query: 69 FGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
F + Q TELPE FD +++PNC I + QS CGSCWA++T +A+SDR C G
Sbjct: 81 FTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHC-TVGGVQQ 139
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C GD C+GG P AW Y + +G+ +
Sbjct: 140 LRISAAHLLSCCKDC--GDGCDGGYPDAAWRYYVSHGLAS 177
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 62/100 (62%), Gaps = 3/100 (3%)
Query: 69 FGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
F + Q TELPE FD +++PNC I + QS CGSCWA++T +A+SDR C G
Sbjct: 81 FTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHC-TVGGVQQ 139
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C GD C+GG P AW Y + +G+ +
Sbjct: 140 LRISAAHLLSCCKDC--GDGCDGGYPDSAWEYYVSHGLAS 177
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 58/98 (59%), Gaps = 1/98 (1%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +S++ L
Sbjct: 1 KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 60
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 61 LTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 97
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/107 (41%), Positives = 62/107 (57%), Gaps = 5/107 (4%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT---QGRL 127
D + LPE FD R+ +P+C I ++ Q +CGSCWA A+SDR CI + + R+
Sbjct: 111 DVSALRVLPENFDAREHWPDCPTIREIRDQGSCGSCWAFGAVEAISDRTCIHSPEGKPRV 170
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L++D +L+CC C G C GG P AW Y + G+ TGG+Y S
Sbjct: 171 IAHLAADDVLSCCTECGAG--CNGGFPGSAWSYWVHKGIVTGGNYDS 215
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 58/98 (59%), Gaps = 1/98 (1%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +S++ L
Sbjct: 2 KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 61
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 62 LTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 98
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 61/96 (63%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS C S WA+++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSSVGAMSDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG + +W Y + +G+ TGG
Sbjct: 147 DLISCCKNCGSG--CDGGYFLPSWDYWVSHGIVTGG 180
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 64/111 (57%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPN-CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP FD R Q+ + C ++ V+ Q+NCGSCWA A++DR CIA++G +S++
Sbjct: 88 DLPTSFDSRTQWGSMCPSVKEVRDQANCGSCWAFGAVEAMTDRTCIASKGAQTPHISAED 147
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LLTCC T GD C GG P AW Y G+ TGG Y S CQ + C
Sbjct: 148 LLTCCTF-TCGDGCNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKC 197
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
P FD R ++P C ++ ++ QSNCGSCWA +T +SDR CIA+ G +S LLT
Sbjct: 84 PLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLT 143
Query: 139 CCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
CC + G+ C+GG P RA+ + GV TGGDY
Sbjct: 144 CCGM-SCGEGCDGGFPYRAFQWWARRGVVTGGDY 176
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 62/101 (61%), Gaps = 3/101 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++PNC I + QS CGSCWA++T +A+SDR C G
Sbjct: 80 RFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRYC-TVGGVQ 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HL++CC C GD C+GG P AW Y + +G+ +
Sbjct: 139 QLRISAAHLMSCCEDC--GDGCKGGAPDSAWEYYVSHGLAS 177
>gi|268566081|ref|XP_002647468.1| Hypothetical protein CBG06540 [Caenorhabditis briggsae]
Length = 188
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 67/119 (56%), Gaps = 8/119 (6%)
Query: 61 DLQLGSEHFGDYQSNT-------ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
D++ H GD S ++P+ FD R+++ NCT+I ++ Q+NCGSCWA
Sbjct: 52 DVKYADPHPGDIPSAKLNIVLAKKIPDTFDARQKWKNCTSIKMIRNQANCGSCWAFGAAE 111
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+SDR+CI T+G +S +L CC G C+GG ++A + + NGV TGGDY
Sbjct: 112 VISDRICIVTKGARQPIISPTDMLDCCGEYCGYG-CDGGYSIQALRWWVSNGVVTGGDY 169
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 62/99 (62%), Gaps = 1/99 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N++L + FD R+++P CT+I + S C S WA A ++SDR+CI + G ++ LS+
Sbjct: 69 NSDLSQFFDARERWPECTSIPQINDISECKSSWAFAAAESMSDRLCINSGGMINTILSAQ 128
Query: 135 HLLTCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDY 172
LL+CC G+ C GGN +AW Y ++G+PTGG Y
Sbjct: 129 ELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSY 167
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 75/143 (52%), Gaps = 12/143 (8%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
L ++ + +T S E P +F + +LPE FD +++P C IG ++ QS
Sbjct: 67 LEEVRKLMGVTSMSTEAVP-----PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQS 121
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
NCGSCWAIA A+SDR C + G D +S+ +LL+CC C G C GG P AW +
Sbjct: 122 NCGSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGFG--CYGGIPAMAWLWW 178
Query: 162 LENGVPTGGDYGSCQRFDRGNCN 184
+ GV T CQ + G C+
Sbjct: 179 VWVGVTT----ELCQPYPFGPCS 197
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 61/99 (61%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++P CT++ ++ Q CGSCWAI+ +DR CI ++ + + + LL
Sbjct: 89 LPERFDARDRWPECTSLKQIRNQGCCGSCWAISAAETFTDRWCIHSEDKDQFSFGAYDLL 148
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC +C GD C+GGN AW + ++ GV +GG Y S Q
Sbjct: 149 SCCHSC--GDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQ 185
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 66/111 (59%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++P C +I +++ QS CG+ WA A A+SDR+CI ++G+ LS+ L
Sbjct: 89 EIPTSFDSRKEWPQCKSISNIRDQSRCGAGWAFAAVQAMSDRICIESKGKKSVELSAVDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG---DYGSCQRFDRGNCN 184
L+CC C G C+ G P AW Y ++ G+ TGG ++ CQ + C
Sbjct: 149 LSCCIECGLG--CQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPYPFPKCE 197
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 57/100 (57%), Gaps = 2/100 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+ FD R +P C +I ++ Q +CGSCWA +SDR+CIA+ S+ L
Sbjct: 78 QIPQNFDARIVWPQCESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQDL 137
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
L CC C G C GG RAW Y + +G+ +GGD+ + Q
Sbjct: 138 LACCKEC--GHGCGGGYSSRAWQYWVTDGIVSGGDFNTSQ 175
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 75/143 (52%), Gaps = 12/143 (8%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
L ++ + +T S E P +F + +LPE FD +++P C IG ++ QS
Sbjct: 67 LEEVRKLMGVTSMSTEAVP-----PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQS 121
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
NCGSCWAIA A+SDR C + G D +S+ +LL+CC C G C GG P AW +
Sbjct: 122 NCGSCWAIAAVEAMSDRYCTMS-GIPDRRISTTNLLSCCFICGFG--CYGGIPAMAWLWW 178
Query: 162 LENGVPTGGDYGSCQRFDRGNCN 184
+ GV T CQ + G C+
Sbjct: 179 VWVGVTT----ELCQPYPFGPCS 197
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +S++ LL
Sbjct: 1 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 60
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
TCC + GD C GG P AW + G+ +GG Y S
Sbjct: 61 TCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 96
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 63/101 (62%), Gaps = 1/101 (0%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
++ +LPE FD R Q+P C I ++ Q +CGSCWA A+SDR+CI ++G+++ +S+
Sbjct: 9 TDVKLPENFDSRTQWPKCPTIQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISA 68
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ LL+CC G C GG P AW + E G+ +GG + S
Sbjct: 69 EDLLSCCGMECGFG-CNGGYPSGAWNFWTETGLVSGGLFKS 108
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 58/98 (59%), Gaps = 1/98 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N ++P+ FD R +P C +I H++ QS CGSCWA A+SDR+CIA+ G + LS++
Sbjct: 12 NVKIPDNFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSAE 71
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L+CC G C GG P AW + +G+ T Y
Sbjct: 72 DMLSCCLVQCGMG-CNGGFPTGAWRFFKMHGLTTESKY 108
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 69/123 (56%), Gaps = 14/123 (11%)
Query: 64 LGSEHFGDYQSNTEL------------PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT 111
LGS + +Y + E+ P++FD R + +C IGH++ Q NCGSCW+ +T
Sbjct: 59 LGSRGYKNYTNEVEIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFST 118
Query: 112 TAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
T A +DR+C++T G+ + LS + L G C GG P++AW Y GV TGGD
Sbjct: 119 TGAFADRLCVSTGGKFNQLLSPEEL--AFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGD 176
Query: 172 YGS 174
YG+
Sbjct: 177 YGT 179
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/102 (46%), Positives = 61/102 (59%), Gaps = 4/102 (3%)
Query: 76 TELPEEFDLRKQYPNC-TNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT-LSS 133
+ LP+ FD R+ + NC T IGHV+ QS CGSCWA AT+ A SDR+CI + G D LS+
Sbjct: 125 SNLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPLSA 184
Query: 134 DHLLTCCAACTG--GDVCEGGNPMRAWYYMLENGVPTGGDYG 173
H CC+ G C+GG P AW + E+GV + D G
Sbjct: 185 GHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELDSG 226
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 66/111 (59%), Gaps = 5/111 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P EFD R+Q+P+C I ++ Q NCGSCWA++ + ++DR CI T+G +D SS+++
Sbjct: 75 EIPVEFDAREQWPHCPCIDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENV 134
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
CC C G+ C GG+ A+ + + G +GG + S CQ + C
Sbjct: 135 AACCTEC--GNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPYSVEECE 183
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 60/98 (61%), Gaps = 1/98 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +P+ FD R +PNC +I ++ QS+CGSCWA++ +SDR+CIA+ + ++S+D
Sbjct: 94 DAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISAD 153
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ CC G C GG P+ AW + ++ G TGG Y
Sbjct: 154 DINACCGMVCGNG-CNGGYPIEAWRHYVKKGYVTGGSY 190
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/123 (38%), Positives = 69/123 (56%), Gaps = 14/123 (11%)
Query: 64 LGSEHFGDYQSNTEL------------PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT 111
LGS + +Y + E+ P++FD R + +C IGH++ Q NCGSCW+ +T
Sbjct: 59 LGSRGYKNYTNEVEIKKYDPLYVENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFST 118
Query: 112 TAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
T A +DR+C++T G+ + LS + L G C GG P++AW Y GV TGGD
Sbjct: 119 TGAFADRLCVSTGGKFNQLLSPEEL--AFCCKDCGKGCGGGYPIKAWKYFRTQGVTTGGD 176
Query: 172 YGS 174
YG+
Sbjct: 177 YGT 179
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS C S WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y +++G+ TGG
Sbjct: 147 DLISCCKNCGSG--CDGGVTGYSWDYWVKHGIVTGG 180
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 61/95 (64%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP+ FD R Q+P+C ++ ++ Q++CGSCWA +SDR+CI + G +S++ +L
Sbjct: 95 LPDTFDARDQWPDCKSLKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDIL 154
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC + T G C+GG + A Y + +GV TGGDY
Sbjct: 155 SCCGS-TCGKGCQGGYTIEAMKYWMNSGVVTGGDY 188
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 76/136 (55%), Gaps = 10/136 (7%)
Query: 42 LSSLKFGLSLTP-QSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQ 100
LSS + G+++ +S+ P + + E + +LPE+FD R ++P C ++ ++ Q
Sbjct: 68 LSSYRVGVNMEELESKRLKPGILILKE-------DIDLPEQFDARDKWPQCPSLREIRNQ 120
Query: 101 SNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYY 160
CGSCWAI+ A +DR CI + + S L++CC +C GD C+GG AW Y
Sbjct: 121 GCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLISCCHSC--GDGCQGGVLGPAWDY 178
Query: 161 MLENGVPTGGDYGSCQ 176
++ GV +GG Y S Q
Sbjct: 179 WVQKGVSSGGPYNSKQ 194
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 79/153 (51%), Gaps = 9/153 (5%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
++RHV G W + S +S K+ L + Q P DL+ S + +
Sbjct: 45 IVRHVNEHPQAG-WKATMNPRFSNYSVSQFKYLLGV---KQTPEKDLK--STPVLSHPKS 98
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP+ FD R+ +P C +IG + Q +CGSCWA +LSDR CI ++ TLS +
Sbjct: 99 LKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFD--MNITLSVND 156
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
LL CC GD C+GG P+ AW Y + +GV T
Sbjct: 157 LLACCGFMC-GDGCDGGYPISAWRYFVRHGVVT 188
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 79/153 (51%), Gaps = 9/153 (5%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
++RHV G W + S +S K+ L + Q P DL+ S + +
Sbjct: 46 IVRHVNEHPQAG-WKATMNPRFSNYSVSQFKYLLGV---KQTPEKDLK--STPVLSHPKS 99
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP+ FD R+ +P C +IG + Q +CGSCWA +LSDR CI ++ TLS +
Sbjct: 100 LKLPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFD--MNITLSVND 157
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
LL CC GD C+GG P+ AW Y + +GV T
Sbjct: 158 LLACCGFMC-GDGCDGGYPISAWRYFVRHGVVT 189
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N LPE FD R +P+C +I ++ QS+CGSCWA A+SDR+CI ++G + +LS+
Sbjct: 636 NQHLPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAV 695
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C GG AW + +G+ TGG
Sbjct: 696 DLVSCCTECGCG--CRGGYSPIAWDFWKTHGIVTGG 729
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 57/100 (57%), Gaps = 2/100 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+ ELP+ FD R ++P+C +I ++ QS+C S WA ++SDR+CI + G + +
Sbjct: 44 EVSDEKELPKSFDARTKWPHCPSISEIRDQSSCESFWAFGAVESMSDRLCIHSNGAFNKS 103
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
LS+ LL+CC C G C G AW + +G+ TGG
Sbjct: 104 LSATDLLSCCEDCGLG--CGAGFHPMAWDFWKTHGIVTGG 141
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 61/101 (60%), Gaps = 2/101 (1%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
SN ++P+ FD R ++ +C I ++ Q +CGSCWA A+SDR CI + + L++
Sbjct: 85 SNVQIPDHFDSRHRWHDCPTIREIRDQGSCGSCWAFGAVEAMSDRHCIHSGAKNIVHLAA 144
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
D +L+CC +C G C GG P AW Y + G+ TGG+Y S
Sbjct: 145 DDVLSCCMSCGSG--CNGGFPGAAWSYWVHKGIVTGGNYDS 183
>gi|412985820|emb|CCO17020.1| cathepsin B-like cysteine proteinase [Bathycoccus prasinos]
Length = 541
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 72 YQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
++ ++LPE FD R+++P C+ IG Q CGSCWAIA T +SDR+CIA+ G++
Sbjct: 270 WEPPSDLPESFDAREKWPECSEFIGEAWDQGECGSCWAIAPTKVMSDRLCIASGGKVQER 329
Query: 131 LSSDHLLTCCAACTGGDV--CEGGNPMRAWYYMLENGVPTGGDYG 173
L++ +L+C + CEGG P A+ + E GV +GG YG
Sbjct: 330 LAASEILSCGQLVSEFSFGSCEGGMPDDAYEFAKEFGVASGGKYG 374
>gi|294877495|ref|XP_002768009.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239870149|gb|EER00727.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 180
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 59/97 (60%), Gaps = 5/97 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+CI + G LS+
Sbjct: 31 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGE 90
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ ACT C GG+P AW ++ + G+ TGGDY
Sbjct: 91 M----NACTLFFGCGGGDPYSAWSWVHDKGIATGGDY 123
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 58/106 (54%), Gaps = 2/106 (1%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D + LPE FD R ++PNC + ++ Q +CGSCWA A++DR+C + G
Sbjct: 77 DAELIASLPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCTYSDGTKHFH 136
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
S++ LL+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 137 FSAEDLLSCCPICGLG--CNGGMPTLAWEYWKHAGIVSGGSYNSTQ 180
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 74/121 (61%), Gaps = 4/121 (3%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
E ++ +H+ DY + ++PE FD R ++PNC ++ ++ Q CGSCWA+A + +S
Sbjct: 72 ESRTGFKVPIKHY-DYVYDIDIPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMS 130
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGN-PMRAWYYMLENGVPTGGDYGSC 175
DR+CI T G + ++++ L+ CCA C G+ CEGG ++ Y ++ G+ +GG Y S
Sbjct: 131 DRVCIHTNGTRNVAIAAEDLMGCCADC--GNGCEGGFLDGTSFQYWVDAGLVSGGAYNST 188
Query: 176 Q 176
+
Sbjct: 189 E 189
>gi|56758470|gb|AAW27375.1| unknown [Schistosoma japonicum]
Length = 217
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 58/94 (61%), Gaps = 2/94 (2%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
E+P FD RK++P C +I ++ QS CGSCWA A+SDR CI + G+ + LS+
Sbjct: 1 VEIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVD 60
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
LL+CC +C G CEGG AW Y ++ G+ TG
Sbjct: 61 LLSCCESCGLG--CEGGILGPAWDYWVKEGIVTG 92
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS C S WA++ A+SDR+CI + G+ LS+
Sbjct: 54 NVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAI 113
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + +G+ TGG
Sbjct: 114 DLISCCKNCGSG--CDGGVTGYSWDYWVSHGIVTGG 147
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 64/109 (58%), Gaps = 6/109 (5%)
Query: 78 LPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+PE FD R+ +P C +I G+++ Q C S WA A +SDR+CIAT G++ LS + L
Sbjct: 72 IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGN 182
+ CC C G+ C+GG AW Y + G+ +GGDY + CQ + N
Sbjct: 132 IDCCHYC--GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYSELN 178
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 63/107 (58%), Gaps = 5/107 (4%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+C+ + G LS+
Sbjct: 59 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGE 118
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGN 182
+ CA G C+GG P AW ++ + G+ TGGDY + +G+
Sbjct: 119 -MNACAPSYG---CDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGD 161
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 62/100 (62%), Gaps = 2/100 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+ FD R Q+P+C ++ V+ Q CGSCWA A +DR+CI ++G ++ LS++ L
Sbjct: 85 DLPDTFDARTQWPDCPSLKEVRDQGACGSCWAFGCVEAATDRLCIQSKGIVNAHLSAEDL 144
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G+ C GG AW Y+ +G+ TGG Y S Q
Sbjct: 145 TSCCRTC--GNGCNGGFLEGAWNYLKRDGIVTGGPYNSHQ 182
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 70/110 (63%), Gaps = 7/110 (6%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R+++ +C +I ++ Q +CGSCWA+ A+SDR C++ Q + +S+++L+
Sbjct: 82 IPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVEAMSDRYCVSFQENVH--ISAENLM 139
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
TCC C G+ C GG +AW Y +++G+ TGG YGS CQ + CN
Sbjct: 140 TCCKFC--GNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCN 187
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
Length = 301
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 59/97 (60%), Gaps = 3/97 (3%)
Query: 78 LPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+PE FD R+ +P CT+I G ++ Q++CGSCWA A+SDR+CI + + +S++ L
Sbjct: 84 IPESFDAREAWPECTSIIGEIRDQASCGSCWAFGAVEAMSDRICIHSDASVKVRISAEDL 143
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
CC C GD C GG P AW Y G+ TGG YG
Sbjct: 144 NDCCYDC--GDGCNGGWPDLAWSYWSSTGIVTGGLYG 178
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 64/109 (58%), Gaps = 6/109 (5%)
Query: 78 LPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+PE FD R+ +P C +I G+++ Q C S WA A +SDR+CIAT G++ LS + L
Sbjct: 72 IPETFDAREYWPECADIIGNIRNQGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDL 131
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGN 182
+ CC C G+ C+GG AW Y + G+ +GGDY + CQ + N
Sbjct: 132 IDCCHYC--GNQCKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYSELN 178
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 60/98 (61%), Gaps = 3/98 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+ FD RK++P C ++ ++ Q +CGSCWA A +SDR+CI + G S++ L
Sbjct: 80 DIPKTFDARKKWPKCDSLNRIRDQGSCGSCWAFAAVETMSDRICIHSSGAKKFFFSAEDL 139
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L+CC AC C GG M A+ + ++ GV +GGD S
Sbjct: 140 LSCCTACGS---CSGGYMMAAFDFYIKQGVVSGGDLNS 174
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAI 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 54 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 113
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 114 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 147
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 62/103 (60%), Gaps = 2/103 (1%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + ELP+ FD R Q+PNC + V+ Q +CGSCWA A+SDR+CI + ++ +
Sbjct: 73 YTEDMELPKNFDPRLQWPNCPTLKEVRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEI 132
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SS+ LL+CC +C G C GG P A + + G+ +GG Y S
Sbjct: 133 SSEDLLSCCESCGMG--CNGGYPSAACDFWTKEGLVSGGLYDS 173
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 57/97 (58%), Gaps = 5/97 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+CI + G LS+
Sbjct: 536 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGE 595
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ AC C GG P AW ++ + G+ TGGDY
Sbjct: 596 M----NACAPSHGCNGGFPNSAWSWVHDKGIATGGDY 628
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 59/97 (60%), Gaps = 3/97 (3%)
Query: 77 ELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
E+P+ FD R+ +P+C I G+++ QS CGSCWA A+SDR+CI + + +S++
Sbjct: 83 EIPDSFDAREAWPDCAPIIGNIRDQSTCGSCWAFGAVEAMSDRICIHSNATVKVNISAED 142
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L CC C G C GG P AW + NG+ TGG+Y
Sbjct: 143 PLDCCTICGMG--CNGGMPAMAWLHWTVNGIVTGGNY 177
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 66/112 (58%), Gaps = 5/112 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +P+EFD RKQ+PNC +I ++ Q +CGSCWA+ + G+L LS++
Sbjct: 78 DVTIPDEFDARKQWPNCPSITDIRDQGSCGSCWALELLRLCLIVFVSHSNGKLQVHLSAE 137
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
+L+TCC +C G C GG+P AW Y + G+ +GG+YGS CQ + C
Sbjct: 138 NLVTCCGSCGAG--CFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPYSIAPC 187
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 59/103 (57%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGS WA A+SDR+CI T + +
Sbjct: 1 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEV 60
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 61 SAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYES 102
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 57/97 (58%), Gaps = 5/97 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+CI + G LS+
Sbjct: 20 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSAGE 79
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ AC C GG P AW ++ + G+ TGGDY
Sbjct: 80 M----NACAPSHGCNGGFPNSAWSWVHDKGIATGGDY 112
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 69/114 (60%), Gaps = 11/114 (9%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R+++P+C IG V+ Q CGSCWA A+SDR CI+ + +++ +S+++LL
Sbjct: 86 IPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCISFKEQVN--ISAENLL 143
Query: 138 TCCAACTGGDVCEGGNPMRAWYY----MLENGVPTGGDYGS---CQRFDRGNCN 184
+CC C G C+GG P AW + +L G+ TGG Y S CQ + C+
Sbjct: 144 SCCETCGSG--CDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCD 195
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 62/100 (62%), Gaps = 3/100 (3%)
Query: 69 FGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
F + Q TELPE FD +++PNC I + QS CGSCWA++T +A+SDR C G
Sbjct: 81 FTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHC-TVGGVQQ 139
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HL++CC C GD C+GG P +W Y + +G+ +
Sbjct: 140 LRISAAHLMSCCEDC--GDGCDGGYPGTSWEYYVSHGLAS 177
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 59/96 (61%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAIGAMSDRICIQSGGKQSVKLSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCENCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 56/101 (55%), Gaps = 2/101 (1%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LPE FD R ++PNC + V+ Q +CGSCWA A++DR C + G S++
Sbjct: 85 ADLPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAED 144
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
LL+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 145 LLSCCPVCGLG--CNGGMPTLAWEYWKHFGLVSGGSYNSSQ 183
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 64/121 (52%), Gaps = 8/121 (6%)
Query: 48 GLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCW 107
G +TP + E P + E N +LP EFD RK + +C+ IG + Q +CGSCW
Sbjct: 76 GAKMTP-ANEVEPSI----ERVTHKHKNLDLPTEFDARKHWSHCSTIGDILDQGHCGSCW 130
Query: 108 AIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
A +L+DR CI + +LS + LL CC GD CEGG P+RAW Y GV
Sbjct: 131 AFGAVESLTDRFCIHLNESV--SLSENDLLACCGF-ECGDGCEGGYPIRAWQYFKRTGVV 187
Query: 168 T 168
T
Sbjct: 188 T 188
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 60/100 (60%), Gaps = 1/100 (1%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ LPE FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI + G + +S++
Sbjct: 77 DIALPESFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGLQNVEVSAE 136
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LLTCC G+ C GG P AW + + G+ +GG Y S
Sbjct: 137 DLLTCCGF-QCGEGCNGGFPSGAWNFWKKQGLVSGGLYDS 175
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 61/110 (55%), Gaps = 4/110 (3%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
EH D +N LPE FD R ++PNC + ++ Q +CGSCWA A++DR C + G
Sbjct: 73 EHDDDTINN--LPENFDPRDKWPNCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGT 130
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
S++ LL+CC C G C GG P AW Y G+ +GG+Y S Q
Sbjct: 131 KHFHFSAEDLLSCCPVCGLG--CNGGIPSFAWEYWKHFGIVSGGNYNSSQ 178
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 61/101 (60%), Gaps = 3/101 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++PNC I + QS CGSCWA++T +A+SDR C G
Sbjct: 80 RFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRYC-TVGGVQ 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C G C+GG P AW Y + +G+ +
Sbjct: 139 QLRISAAHLLSCCKDCGYG--CDGGYPGTAWEYYVSHGLAS 177
>gi|239790489|dbj|BAH71802.1| ACYPI000009 [Acyrthosiphon pisum]
Length = 178
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 63/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
D T++P EFD R+ + +C N IG V+ Q NC S WA+A + +DR+CIA+ G+
Sbjct: 57 DNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTD 116
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS+ +L++C G C+GG+ +AW + G+ TGG++ S
Sbjct: 117 NLSAQNLMSCGDGEKMG--CDGGSAFKAWELTMNKGIVTGGNFDS 159
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 56/99 (56%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++PNC + V+ Q +CGSCWA A++DR+C + G S++ LL
Sbjct: 82 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRICTYSNGTKHFHFSAEDLL 141
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 142 SCCPICGLG--CNGGMPTLAWEYWKHFGLVSGGSYNSSQ 178
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 1/101 (0%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N++L + FD R+++P C +I + S C S WA A ++SDR+CI + G ++ LS+
Sbjct: 25 NSDLSQFFDARERWPECMSIPQINDISECKSSWAFAAAESMSDRLCINSGGTINTILSAQ 84
Query: 135 HLLTCCAACTG-GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LL+CC G+ C GGN +AW Y ++G+PTGG Y S
Sbjct: 85 ELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYES 125
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L + Y + ELP+EFD R + C+ IG++ Q +CGSCWA L
Sbjct: 72 KPAPQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNILEQGHCGSCWAFGAVECLQ 131
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + LS + LL CC GD C+GG P+ AW Y ++NGV T
Sbjct: 132 DRFCIHLN--MSILLSVNDLLACCGFMC-GDGCDGGYPIEAWRYFVQNGVVT 180
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 59/97 (60%), Gaps = 3/97 (3%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y +T+LP+EFD R ++ C+ IG + Q +CGSCWA L DR CI ++ +L
Sbjct: 92 YSRSTDLPKEFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLN--MNISL 149
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
S + L+ CC GD C+GG P+ AW Y++ENGV T
Sbjct: 150 SVNDLVACCGFMC-GDGCDGGYPISAWQYLVENGVVT 185
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 56/99 (56%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++PNC + V+ Q +CGSCWA A++DR+C + G S++ LL
Sbjct: 82 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 141
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 142 SCCPICGLG--CNGGMPTLAWEYWKHFGLVSGGSYNSTQ 178
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 69 FGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
F + Q TELPE FD +++PNC I + QS CGSCWA++T +A+SDR C G
Sbjct: 81 FTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHC-TVGGVQQ 139
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C G C+GG P AW Y + +G+ +
Sbjct: 140 LRISAAHLLSCCKDCGYG--CDGGYPDAAWRYYVSHGLAS 177
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 61/112 (54%), Gaps = 3/112 (2%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
P P L + Y LP++FD RK +P CT++ + Q +CGSCWA ALSD
Sbjct: 76 PTPRNLLENVPVRTYPKGLNLPKQFDARKAWPQCTSVRTILDQGHCGSCWAFGAVEALSD 135
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
R CI +++ TLS + L+ CC GD C+GG P+ AW Y + GV T
Sbjct: 136 RFCI--HYKVNVTLSENDLVACCGF-RCGDGCDGGYPLSAWQYFISTGVVTA 184
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/96 (43%), Positives = 62/96 (64%), Gaps = 3/96 (3%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R ++ NCT+I ++ Q+ CGSCWA +T +SDR+CIAT+G T+S +L
Sbjct: 81 IPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDML 140
Query: 138 TCCA-ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
CC +C GD C+G P++A+ + GV TGGD+
Sbjct: 141 ACCGNSC--GDGCKGRYPIQAFRWWNSRGVVTGGDF 174
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 63/102 (61%), Gaps = 5/102 (4%)
Query: 78 LPEEFDLR--KQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
LP FD R ++P C + + HV+ Q +CGSCWA A++DR+CIA+ G+ + LS++
Sbjct: 214 LPTSFDPRDGSKWPACKDSLNHVRDQGSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAE 273
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
L +CC +C G CEGG P AW Y G+ TGGD+ S Q
Sbjct: 274 DLTSCCDSCGMG--CEGGYPSAAWDYFQSTGLVTGGDWNSNQ 313
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 55/99 (55%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++PNC + V+ Q +CGSCWA A++DR C + G S++ LL
Sbjct: 84 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLL 143
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 144 SCCPICGLG--CNGGMPTLAWEYWKHFGLVSGGSYNSSQ 180
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L + Y + ELP+EFD R + C+ IG++ Q +CGSCWA L
Sbjct: 72 KPAPQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNILDQGHCGSCWAFGAVECLQ 131
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + LS + LL CC GD C+GG P+ AW Y ++NGV T
Sbjct: 132 DRFCIHLN--MSILLSVNDLLACCGFMC-GDGCDGGYPIEAWRYFVQNGVVT 180
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 55/99 (55%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++PNC + V+ Q +CGSCWA A++DR C + G S++ LL
Sbjct: 84 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLL 143
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 144 SCCPICGLG--CNGGMPTLAWEYWKHFGLVSGGSYNSSQ 180
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/118 (36%), Positives = 68/118 (57%), Gaps = 6/118 (5%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
D T++P EFD R+ + +C N IG V+ Q NC S WA+A + +DR+CIA+ G+
Sbjct: 57 DNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTD 116
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
LS+ +L++C G C+GG+ +AW + G+ TGG++ S CQ + C+
Sbjct: 117 NLSAQNLMSCGDGEKMG--CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCD 172
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 63/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
D T++P EFD R+ + +C N IG V+ Q NC S WA+A + +DR+CIA+ G+
Sbjct: 57 DNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTD 116
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS+ +L++C G C+GG+ +AW + G+ TGG++ S
Sbjct: 117 NLSAQNLMSCGDGEKMG--CDGGSAFKAWELTMNKGIVTGGNFDS 159
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 65/105 (61%), Gaps = 5/105 (4%)
Query: 83 DLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAA 142
D R+Q+P+C +I ++ Q +CGSCWA A+SDR CI + G++ +S + LL+CC++
Sbjct: 1 DSREQWPDCPSISEIRDQGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCSS 60
Query: 143 CTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
C G C+GG P AW + ++ G+ TGG + S CQ ++ C
Sbjct: 61 CGMG--CDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACE 103
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 55/99 (55%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++PNC + V+ Q +CGSCWA A++DR C + G S++ LL
Sbjct: 87 LPENFDPRDKWPNCPTLNEVRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLL 146
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 147 SCCPVCGLG--CNGGMPTLAWEYWKHFGLVSGGSYNSGQ 183
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 KVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 66/112 (58%), Gaps = 4/112 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPEEFD RKQ+ C +I ++ QS CGSCWA+++ + +SDR+CI + + +S+ +
Sbjct: 80 DLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADM 139
Query: 137 LTCCAACTGG-DVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
+ CC +CT D C GG P + ++G +GG+Y S C + CN
Sbjct: 140 IECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN 191
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 56/99 (56%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++P+C + ++ Q +CGSCWA A++DR C + G SS+ LL
Sbjct: 83 LPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSSEDLL 142
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG+Y S Q
Sbjct: 143 SCCPICGLG--CNGGIPSLAWEYWKHFGIVSGGNYNSTQ 179
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 KVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 KVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 280
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD RK++PNC +IGH+ Q NC S +AI+ +A++DR+CI + + +S+ ++
Sbjct: 63 LPINFDARKRWPNCPSIGHIYNQGNCRSSYAISVASAVTDRICIHSNETKNPIMSAQQII 122
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C+GG+ +W + +G +GGDY S Q
Sbjct: 123 SCCYLCGYG--CDGGSQFESWDFYRRHGFVSGGDYNSNQ 159
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P +FD R+Q+ +C I ++ Q CGSCWA ++SDR CI + + L++D +
Sbjct: 87 DIPAQFDSRQQWQDCPTIREIRDQGACGSCWAFGAVESMSDRHCIHSGAKNIVHLAADDV 146
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L+CC C G C GG P AW Y +E G+ TGG+Y
Sbjct: 147 LSCCWGCGSG--CNGGFPGAAWSYWVEKGIVTGGNY 180
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N LPE FD R +P+C +I ++ QS+CGSCWA A+SDR+CI ++G + +LS+
Sbjct: 83 NQHLPESFDARANWPHCPSISEIRDQSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAV 142
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C GG AW +G+ TGG
Sbjct: 143 DLVSCCTECGCG--CRGGYSPIAWDLWKTHGIVTGG 176
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/96 (42%), Positives = 64/96 (66%), Gaps = 1/96 (1%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE R ++P C+++ ++ Q+NCGSCWA++T +ALSDR+CIA+ GR +S+ +
Sbjct: 1 DIPESPYSRTKWPKCSSLKPIRDQANCGSCWAVSTASALSDRICIASNGRKQVHVSATDI 60
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L+CC G C GG P++A+ Y + G TGGDY
Sbjct: 61 LSCCGNQCGYG-CNGGWPIQAFNYFSKQGAVTGGDY 95
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 55/97 (56%), Gaps = 2/97 (2%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +P C+ IGHV+ QS CG CWA T A +DR+CI + G LS+
Sbjct: 139 DLPTDFDARTAFPKCSKVIGHVRDQSACGDCWAFGVTEAFNDRLCIKSNGTFTKLLSAGE 198
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ CA C GG P AW ++ + G+ TGGDY
Sbjct: 199 -MNACAPSLKDPGCRGGFPYSAWSWVHDEGIATGGDY 234
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 68/110 (61%), Gaps = 7/110 (6%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R+++P+C I ++ Q +CGSCWA A+SDR+CI + ++ +S+++L
Sbjct: 83 QIPENFDSRQKWPHCPTISLIRDQGSCGSCWAFGAVEAMSDRLCIHSNKIVN--VSAENL 140
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
L+CC +C G C GG P AW + + G+ +GG YGS CQ + C
Sbjct: 141 LSCCYSCGFG--CNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAPC 188
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 60/101 (59%), Gaps = 6/101 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ---GRLDHT-LSS 133
LP FD RK++PNC + ++ Q +CGSCWA A+SDR+CI Q GR LS+
Sbjct: 96 LPASFDPRKKWPNCKTLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVMVRLSA 155
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
D LL+CC C G C GG P +AW + G+ +GG YG+
Sbjct: 156 DDLLSCCRDCGMG--CNGGFPSQAWNFWKHEGLVSGGLYGT 194
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 60/95 (63%), Gaps = 3/95 (3%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
P+ FD R Q+P C +IG ++ QS CGSCWA+++ A+SD +C+ + + +S +L+
Sbjct: 89 PDSFDARTQWPECRSIGTIRDQSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTDILS 148
Query: 139 CCAA-CTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
CC C G C+GG P+ A+ +M +GV TGG Y
Sbjct: 149 CCGLDCGYG--CQGGWPIEAYRWMQRDGVVTGGKY 181
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 55/92 (59%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+EFD RK +P C+ IG + Q +CGSCWA +LSDR CI L +LS + L
Sbjct: 99 ELPKEFDARKAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCI--HYNLSISLSVNDL 156
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC+ G C+GG P+ AW Y +GV T
Sbjct: 157 LACCSFLCGSG-CDGGYPIAAWRYFKRSGVVT 187
>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
Length = 422
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 4/107 (3%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
E +Y+S ++LP+ FD R+++PNC +I +V Q CGSC+A+A SDR CI + G
Sbjct: 129 EELDNYKS-SDLPKAFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGT 187
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
LS + ++ CC+ C C GG+P++A Y + G+ TGG G
Sbjct: 188 FKALLSEEDIIGCCSVCGN---CYGGDPLKALTYWVNQGLVTGGRDG 231
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 KVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 77/155 (49%), Gaps = 13/155 (8%)
Query: 30 ADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYP 89
AD L S S L ++ + +T S E P +F + +LPE FD + +P
Sbjct: 61 ADNGYLVSGKS-LEEVRKLMGVTDMSTEAVP-----PRNFSVVEMQQDLPEFFDAAEHWP 114
Query: 90 NCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVC 149
C I ++ QSNCGSCWAIA A+SDR C G D +S+ +LL+CC C G C
Sbjct: 115 MCVTISEIRDQSNCGSCWAIAAVEAISDRYC-TLGGVPDRRISTSNLLSCCFICGFG--C 171
Query: 150 EGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
GG P AW + + G+ T CQ + G C+
Sbjct: 172 YGGIPTMAWLWWVWVGITT----EVCQPYPFGPCS 202
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 58/97 (59%), Gaps = 5/97 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ I H++ QS+CGSCWA T A +DR+CI + G LS+
Sbjct: 141 DLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGE 200
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ CA G C+GG P AW ++ G+ TGGDY
Sbjct: 201 -MNACAPSFG---CDGGIPSLAWSWVHNKGIATGGDY 233
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 KVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 63/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
D T++P EFD R+ + +C N IG V+ Q NC S WA+A + +DR+CIA+ G+
Sbjct: 19 DNSYKTDIPREFDARQYFTSCANVIGDVKDQGNCASSWAVAVASTFTDRLCIASNGQFTD 78
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS+ +L++C G C+GG+ +AW + G+ TGG++ S
Sbjct: 79 NLSAQNLMSCGDGEKMG--CDGGSAFKAWELTMNKGIVTGGNFDS 121
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ LS+
Sbjct: 87 KVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAV 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++CC C G C+GG +W Y + G+ TGG
Sbjct: 147 DLISCCKYCGSG--CDGGFLGPSWDYWVLRGIVTGG 180
>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
Length = 426
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 4/107 (3%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
E +Y+S++ LP+ FD R+++PNC +I +V Q CGSC+A+A SDR CI + G
Sbjct: 133 EELENYKSSS-LPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGT 191
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
LS + ++ CC+ C C GG+P++A Y + G+ TGG G
Sbjct: 192 FKSLLSEEDIIGCCSVCGN---CYGGDPLKALTYWVNQGLVTGGRDG 235
>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
Length = 410
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 64/107 (59%), Gaps = 4/107 (3%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
E +Y+S ++LP+ FD R+++PNC +I +V Q CGSC+A+A SDR CI + G
Sbjct: 117 EELENYKS-SDLPKHFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGT 175
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
LS + ++ CC+ C C GG+P++A Y + G+ TGG G
Sbjct: 176 FKALLSEEDIIGCCSVCGN---CYGGDPLKALTYWVNQGLVTGGRDG 219
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 56/99 (56%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++P+C + V+ Q +CGSCWA A++DR+C + G S++ LL
Sbjct: 83 LPENFDPRDKWPDCPTLNEVRDQGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLL 142
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG Y S Q
Sbjct: 143 SCCPICGLG--CSGGMPRLAWEYWKHFGLVSGGSYNSSQ 179
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 86/179 (48%), Gaps = 21/179 (11%)
Query: 5 TPQYVNHSHHLLLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQL 64
T +V+H + L N G+W K S K LT + + LQ
Sbjct: 32 TKTFVDHINQL---------NGGMWKAVYNGKMQNITFSEAK---RLTGARIQKSSGLQ- 78
Query: 65 GSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
F + Q T+LPE FD + +P+C I + QS C + WA++T +A+SDR C +
Sbjct: 79 -PARFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGK 137
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
G+ +S+ HLL+CC C GD C+GG P AW Y +E G+ + SCQ + C
Sbjct: 138 GK-QLRISAAHLLSCCKDC--GDGCKGGFPGFAWRYYVEYGITS----SSCQPYPFPRC 189
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 40/98 (40%), Positives = 60/98 (61%), Gaps = 3/98 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R ++PNC ++ ++ Q CGSCWA A+ ++SDR+CI + G S + L
Sbjct: 82 DVPESFDARTKWPNCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSPEDL 141
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L+CC +C GD C GG M A + + G+ +GGD S
Sbjct: 142 LSCCTSC--GD-CGGGYMMSALDFYINEGIVSGGDVNS 176
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 56/92 (60%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+EFD R + NC+ IG + Q +CGSCWA +LSDR CI L+ +LS++ L
Sbjct: 99 ELPQEFDARVAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HYGLNISLSANDL 156
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
CC GD C+GG P++AW Y + GV T
Sbjct: 157 YACCGFLC-GDGCDGGYPLQAWKYFVRKGVVT 187
>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 218
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 43/92 (46%), Positives = 55/92 (59%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+EFD RK +P C+ IG + Q +CGSCWA +LSDR CI L +LS + L
Sbjct: 97 ELPKEFDARKAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCI--HYNLSISLSVNDL 154
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC+ G C+GG P+ AW Y +GV T
Sbjct: 155 LACCSFLCGSG-CDGGYPIAAWRYFKRSGVVT 185
>gi|303289014|ref|XP_003063795.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
gi|226454863|gb|EEH52168.1| cathepsin B-like cysteine proteinase [Micromonas pusilla CCMP1545]
Length = 390
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/103 (47%), Positives = 60/103 (58%), Gaps = 11/103 (10%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQL-QSNCGSCWAIATTAALSDRMCIATQGRLDHT------ 130
LPE FD R+++P C + L Q CGSCWA+AT A L+DR CIAT G L
Sbjct: 116 LPELFDARERWPRCARVVGTALDQGKCGSCWAVATAAVLTDRACIATNGALGGGGGGGEF 175
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
LS+ LL+C AA D CEGG+ A+ Y +GV TGG YG
Sbjct: 176 LSASQLLSCGAA----DGCEGGDERDAFEYAKTHGVVTGGAYG 214
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 57/98 (58%), Gaps = 3/98 (3%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y LP++FD R+ +P CT++ + Q +CGSCWA ALSDR CI +++ TL
Sbjct: 90 YPKGINLPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCI--HHKVNVTL 147
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
S + L+ CC GD C+GG P+ AW Y + GV T
Sbjct: 148 SENDLVACCGFMC-GDGCDGGYPISAWQYFISTGVVTA 184
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 79/153 (51%), Gaps = 9/153 (5%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
+++ V N G W + S + ++ K L + P ++ L LG QS
Sbjct: 48 IVKKVNEHPNAG-WKAAINDRFSNATVAEFKRLLGVKPTPKK----LLLGVPVVSHDQS- 101
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP+ FD R +P CT+IG + Q +CGSCWA +LSDR CI Q ++ TLS +
Sbjct: 102 LKLPKSFDARTHWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCI--QFGMNITLSVND 159
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
LL CC GD C+GG P+ AW Y +GV T
Sbjct: 160 LLACCGF-RCGDGCDGGYPISAWQYFSYSGVVT 191
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 85.5 bits (210), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 7/116 (6%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q T+LPE FD + +P+C I + QS C + WA++T +A+SDR C +G+
Sbjct: 81 RFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK- 139
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
+S+ HLL+CC C GD C+GG P AW Y +E G+ + SCQ + C
Sbjct: 140 QLRISAAHLLSCCKDC--GDGCKGGFPGFAWRYYVEYGITS----SSCQPYPFPRC 189
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 85.5 bits (210), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 7/116 (6%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q T+LPE FD + +P+C I + QS C + WA++T +A+SDR C +G+
Sbjct: 81 RFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK- 139
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
+S+ HLL+CC C GD C+GG P AW Y +E G+ + SCQ + C
Sbjct: 140 QLRISAAHLLSCCKDC--GDGCKGGFPGFAWRYYVEYGITS----SSCQPYPFPRC 189
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 85.5 bits (210), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 7/116 (6%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q T+LPE FD + +P+C I + QS C + WA++T +A+SDR C +G+
Sbjct: 81 RFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGKGK- 139
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
+S+ HLL+CC C GD C+GG P AW Y +E G+ + SCQ + C
Sbjct: 140 QLRISAAHLLSCCKDC--GDGCKGGFPGFAWRYYVEYGITS----SSCQPYPFPRC 189
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 85.5 bits (210), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 73 QSNTELPE---EFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
Q NT LP FD R ++PNC +I ++ Q+ CGSCWA +SDR+CIA+ G
Sbjct: 67 QVNTVLPSIPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQP 126
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+S LL+CC G C+G +P++A+ + + GV TGGDY
Sbjct: 127 IISPTDLLSCCGNFCGYG-CKGASPLQAFRWWNKKGVVTGGDY 168
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 85.5 bits (210), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 61/97 (62%), Gaps = 4/97 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R+ +PNC +I ++ Q +CGSCWA A+SDR+CI T ++ +S+++LL
Sbjct: 82 IPDTFDAREHWPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTHKNVN--ISAENLL 139
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+CC C G C GG P AW + G+ +GG YGS
Sbjct: 140 SCCYTCGFG--CNGGFPGAAWRFWENKGLVSGGLYGS 174
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 57/98 (58%), Gaps = 3/98 (3%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y LP++FD R+ +P CT++ + Q +CGSCWA ALSDR CI +++ TL
Sbjct: 90 YPKGMNLPKQFDAREAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCI--HHKVNVTL 147
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
S + L+ CC GD C+GG P+ AW Y + GV T
Sbjct: 148 SENDLVACCGFMC-GDGCDGGYPISAWQYFISTGVVTA 184
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 85.1 bits (209), Expect = 1e-14, Method: Composition-based stats.
Identities = 48/141 (34%), Positives = 67/141 (47%), Gaps = 14/141 (9%)
Query: 46 KFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGS 105
KF S EPN L S + P FD R +PNC +I ++ Q+ CGS
Sbjct: 57 KFAFPEEQISSEPNNSLP---------GSLSRAPTSFDARDYWPNCKSIKMIRDQAYCGS 107
Query: 106 CWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENG 165
CWA +SDR+CI + G +S + +LTCC G C+GG + A + G
Sbjct: 108 CWAFGAAEVISDRICIQSNGTDQPIISPEDILTCCTNSHG---CQGGFVLEAMKFWKSKG 164
Query: 166 VPTGGDY--GSCQRFDRGNCN 184
V TGGD+ C + G+C+
Sbjct: 165 VVTGGDFQGDGCIPYSYGSCS 185
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 57/95 (60%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R Q+ C +I ++ Q+ CGSCWA +SDR CI T+G +S D LL
Sbjct: 86 IPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLL 145
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC + + G+ CEGG P++A + GV TGGDY
Sbjct: 146 SCCGS-SCGNGCEGGYPIQALRWWDSKGVVTGGDY 179
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 60/96 (62%), Gaps = 4/96 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+ FD R Q+P+C IG ++ QSNCGSCWA T ++SDR CI + L +S+ +L
Sbjct: 79 DIPDMFDSRTQWPDCKTIGLIEDQSNCGSCWAFGATESMSDRYCIHMKMHL--LISAANL 136
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ CC C G+ CEGG AW Y + G+ TGG Y
Sbjct: 137 MECCRNC--GNGCEGGFLGAAWNYWKQEGLVTGGLY 170
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 63/112 (56%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L Y + ELP++FD R ++ C+ IG + Q +CGSCWA L
Sbjct: 73 KPTPPGLLAGVPTKTYSKSEELPKQFDARSKWSGCSTIGTILDQGHCGSCWAFGAVECLQ 132
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS++ L+ CC GD C+GG P++AW Y +++GV T
Sbjct: 133 DRFCI--HQNINISLSANDLVACCGFMC-GDGCDGGYPIKAWQYFVQSGVVT 181
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 63/112 (56%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P + + +LP+EFD R +P CT+IG++ Q +CGSCWA +LS
Sbjct: 85 KPTPKKHFLGVPIVSHDRSLKLPKEFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLS 144
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + ++ +LS + LL CC GD C+GG P+ AW Y +GV T
Sbjct: 145 DRFCI--EFGMNISLSVNDLLACCGF-RCGDGCDGGYPIAAWQYFSYSGVVT 193
>gi|312083604|ref|XP_003143931.1| hypothetical protein LOAG_08355 [Loa loa]
Length = 188
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/73 (49%), Positives = 51/73 (69%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S +PE FD RK +P C ++ +V+ QS+CGSCWA+A A+SDR+CI ++G+ TLS+
Sbjct: 116 STIFIPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSA 175
Query: 134 DHLLTCCAACTGG 146
D LL+CC C G
Sbjct: 176 DDLLSCCKTCGFG 188
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 67/118 (56%), Gaps = 6/118 (5%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
D ++P EFD R+ + +C + IG V+ Q NC S WA+A + SDR+CIA+ G+
Sbjct: 19 DISYKIDIPREFDARQYFGSCADVIGDVKDQGNCASSWAVAVASTFSDRLCIASNGQFTD 78
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
LS+ +LL+C G C+GG+ +AW + G+ TGG++ S CQ + CN
Sbjct: 79 NLSAQNLLSCGDEEKMG--CDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCN 134
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 70/130 (53%), Gaps = 2/130 (1%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTE-LPEEFDLRKQYPNCTNIGHVQLQSNC 103
+KF + ++ D + SE F + E LP+ FD R+++P+C I ++ Q+ C
Sbjct: 58 MKFKVMDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIKLIRNQATC 117
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
GSCWA +SDR+CI + G +S + +L+CC T G C+GG + A +
Sbjct: 118 GSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGT-TCGYGCKGGYSIEALRFWAS 176
Query: 164 NGVPTGGDYG 173
+G TGGDYG
Sbjct: 177 SGAVTGGDYG 186
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P +L + + LPEEFD R +P C+ IG + Q +CGSCWA +LS
Sbjct: 80 KPTPKEELRGIPVISHPKSLRLPEEFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLS 139
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + LL CC G C GG P+ AW Y + +GV T
Sbjct: 140 DRFCIHYG--MNISLSVNDLLACCGFLCGSG-CNGGYPISAWRYFVHHGVVT 188
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 54/93 (58%), Gaps = 2/93 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++ C +I + QS CGS WA A +SDR+CI ++G LS+ L
Sbjct: 89 EIPSTFDSRKKWSQCKSISSIHDQSRCGSGWAFAAVEVMSDRICIQSKGEKSVELSAVDL 148
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
L+CC C G C GG P AW Y +E GV TG
Sbjct: 149 LSCCRECGLG--CLGGFPGSAWDYWVEEGVVTG 179
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 84.7 bits (208), Expect = 1e-14, Method: Composition-based stats.
Identities = 46/116 (39%), Positives = 61/116 (52%), Gaps = 16/116 (13%)
Query: 74 SNTELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH--- 129
S+ ++P FD R+ +P C +I G V+ QS+CGSCWA A+T A +DR CIA G+ D
Sbjct: 276 SDEDIPANFDAREAFPECASIIGRVRDQSDCGSCWAFASTEAFNDRRCIAGIGKEDAAGA 335
Query: 130 ----------TLSSDHLLTCCAA--CTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
LS++ CC C C GG P AW + + GV TGGDY
Sbjct: 336 EGEATADQLLVLSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFTKTGVVTGGDYA 391
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 92/183 (50%), Gaps = 17/183 (9%)
Query: 7 QYVNHSHHLLLRHVTRDSNPG--LWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQL 64
+Y +H L+ + + N W + K S ++ +K + + QE L+
Sbjct: 27 RYKQEKYHDKLKQIIQKVNSSNSTWKAGENTKWINSDIAGVKAHMGV-KLGQESGIKLET 85
Query: 65 GSEHFGDYQSNTELPEEFDLRKQYPN-CTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
S Q+N LPEEFD R Q+ + C+++ V+ QS CGSCWA +LSDR CI
Sbjct: 86 VSA-----QANG-LPEEFDARVQWGDKCSSLWEVRDQSTCGSCWAFGAAESLSDRHCIHL 139
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDR 180
D LS+ +LLTCCAAC GD C+GG P A Y + G+ TG YG+ CQ +
Sbjct: 140 G--QDIRLSTQNLLTCCAAC--GDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTF 195
Query: 181 GNC 183
C
Sbjct: 196 APC 198
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 57/99 (57%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R ++P+C + ++ Q +CGSCWA A++DR+CI + S++ L+
Sbjct: 44 LPENFDPRDKWPDCPTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLV 103
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C GG P AW Y G+ +GG+Y S Q
Sbjct: 104 SCCPICGLG--CNGGMPTLAWEYWKHVGLVSGGNYNSSQ 140
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 77/155 (49%), Gaps = 13/155 (8%)
Query: 30 ADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYP 89
AD L S S L ++ + +T S E P +F + +LPE FD + +P
Sbjct: 56 ADNGYLVSGKS-LEEVRKLMGVTDMSTEAVP-----PRNFSVDEMQQDLPEFFDAAEHWP 109
Query: 90 NCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVC 149
C I ++ QSNCGSCWAIA A+SDR C G D +S+ +LL+CC C G C
Sbjct: 110 MCVTISEIRDQSNCGSCWAIAAVEAISDRYC-TLGGVPDRRISTSNLLSCCFICGFG--C 166
Query: 150 EGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
GG P AW + + G+ T CQ + G C+
Sbjct: 167 YGGIPTMAWLWWVWVGITT----EVCQPYPFGPCS 197
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 66/121 (54%), Gaps = 7/121 (5%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L F + ++ LP FD + +PNC I + QS CGSCWA+A +A+SDR C
Sbjct: 58 LPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFC-TM 116
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
G D +S+ LL CC+ C GD C GG+P RAW Y G+ + DY CQ + +C
Sbjct: 117 GGVQDVHISAGDLLACCSDC--GDGCNGGDPDRAWAYFSSTGLVS--DY--CQPYPFPHC 170
Query: 184 N 184
+
Sbjct: 171 S 171
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 77/155 (49%), Gaps = 13/155 (8%)
Query: 30 ADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYP 89
AD L S S L ++ + +T S E P +F + +LPE FD + +P
Sbjct: 56 ADNGYLVSGKS-LEEVRKLMGVTDMSTEAVP-----PRNFSVDEMQQDLPEFFDAAEHWP 109
Query: 90 NCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVC 149
C I ++ QSNCGSCWAIA A+SDR C G D +S+ +LL+CC C G C
Sbjct: 110 MCVTISEIRDQSNCGSCWAIAAVEAISDRYC-TLGGVPDRRISTSNLLSCCFICGFG--C 166
Query: 150 EGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
GG P AW + + G+ T CQ + G C+
Sbjct: 167 YGGIPTMAWLWWVWVGITT----EVCQPYPFGPCS 197
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 66/121 (54%), Gaps = 7/121 (5%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L F + ++ LP FD + +PNC I + QS CGSCWA+A +A+SDR C
Sbjct: 57 LPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFC-TM 115
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
G D +S+ LL CC+ C GD C GG+P RAW Y G+ + DY CQ + +C
Sbjct: 116 GGVQDVHISAGDLLACCSDC--GDGCNGGDPDRAWAYFSSTGLVS--DY--CQPYPFPHC 169
Query: 184 N 184
+
Sbjct: 170 S 170
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 57/95 (60%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R Q+ C +I ++ Q+ CGSCWA +SDR CI T+G +S D LL
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC + + G+ CEGG P++A + GV TGGDY
Sbjct: 145 SCCGS-SCGNGCEGGYPIQALRWWDSKGVVTGGDY 178
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 69 FGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
F + Q TELPE FD +++PNC I + QS CGSCWA++T +A+SDR C G
Sbjct: 81 FTEEQLRTELPESFDSAEKWPNCPTIREIADQSACGSCWAVSTASAISDRHC-TVGGVQQ 139
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HL++CC C G C+GG P +W Y + +G+ +
Sbjct: 140 LRISAAHLMSCCEDCGYG--CDGGYPGTSWEYYVSHGLAS 177
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 7/118 (5%)
Query: 61 DLQLGSEHFGDYQS---NTEL---PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAA 114
DL+ + H + ++ N EL P FD R + C +I ++ Q+ CGSCWA
Sbjct: 63 DLKYAAPHSDEIRATEVNIELDTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEV 122
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+SDR CI T+G +S D LL+CC + + G+ CEGG P++A + GV TGGDY
Sbjct: 123 ISDRTCIETKGAQQPIISPDDLLSCCGS-SCGNGCEGGYPIQALRWWDSKGVVTGGDY 179
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 66/121 (54%), Gaps = 7/121 (5%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L F + ++ LP FD + +PNC I + QS CGSCWA+A +A+SDR C
Sbjct: 80 LPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFC-TM 138
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
G D +S+ LL CC+ C GD C GG+P RAW Y G+ + DY CQ + +C
Sbjct: 139 GGVQDVHISAGDLLACCSDC--GDGCNGGDPDRAWAYFSSTGLVS--DY--CQPYPFPHC 192
Query: 184 N 184
+
Sbjct: 193 S 193
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 66/121 (54%), Gaps = 7/121 (5%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L F + ++ LP FD + +PNC I + QS CGSCWA+A +A+SDR C
Sbjct: 80 LPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFC-TM 138
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
G D +S+ LL CC+ C GD C GG+P RAW Y G+ + DY CQ + +C
Sbjct: 139 GGVQDVHISAGDLLACCSDC--GDGCNGGDPDRAWAYFSSTGLVS--DY--CQPYPFPHC 192
Query: 184 N 184
+
Sbjct: 193 S 193
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 7/118 (5%)
Query: 61 DLQLGSEHFGDYQS---NTEL---PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAA 114
DL+ + H + ++ N EL P FD R + C +I ++ Q+ CGSCWA
Sbjct: 63 DLKYAAPHSDEIRATEVNIELDTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEV 122
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+SDR CI T+G +S D LL+CC + + G+ CEGG P++A + GV TGGDY
Sbjct: 123 ISDRTCIETKGAQQPIISPDDLLSCCGS-SCGNGCEGGYPIQALRWWDSKGVVTGGDY 179
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 58/95 (61%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD R ++P C +I ++ Q+NCGSCWA +SDR+CIAT+G +S ++
Sbjct: 87 IPETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMV 146
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
CC G C+GG ++A + + +GV TGGDY
Sbjct: 147 DCCGEYCGYG-CDGGYSIQALRWWVFDGVVTGGDY 180
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 59/111 (53%), Gaps = 3/111 (2%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
P P L + Y +LP +FD RK +P+CT+ + Q +CGSCWA A ALSD
Sbjct: 75 PTPQKLLETVPVRVYPKGLKLPSKFDARKAWPHCTSTRSILDQGHCGSCWAFAAVEALSD 134
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
R CI Q ++ TLS + L+ CC G C GG P+ AW Y GV T
Sbjct: 135 RFCIHFQ--VNATLSENDLVACCGFRCGSG-CNGGFPLSAWRYFSRRGVVT 182
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 1/94 (1%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
P FD R +P C +IG ++ QS+CGSCWA+++ A+SD +C+ + + +S +L+
Sbjct: 90 PASFDARTHWPECRSIGTIRDQSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILS 149
Query: 139 CCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
CC G C+GG P+ A+ +M +GV TGG Y
Sbjct: 150 CCGISCGYG-CQGGWPIEAYKWMQRDGVVTGGKY 182
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 60/101 (59%), Gaps = 5/101 (4%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
++P+ FD R + C + IGHV+ QS CGSCWA T A + R+CI + G+L+ LS+
Sbjct: 98 VDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAA 157
Query: 135 HLLTCCAA---CTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L CC C C GGNP+ +W ++ NG+ +GG +
Sbjct: 158 DMLACCNIGHFCLSFG-CSGGNPITSWTFLHTNGIVSGGGF 197
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 65/112 (58%), Gaps = 5/112 (4%)
Query: 77 ELPEEFDLRKQY-PNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP FD R+Q+ C + ++ Q+ CGSCWA +++DR+CIA++G L +S+
Sbjct: 87 DLPTAFDSREQWGSTCPSTKEIRDQAACGSCWAFGAVESMTDRICIASKGSLRPHISAQD 146
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
L+TCC T G C GG P AW + G+ TGG+Y S CQ + NC+
Sbjct: 147 LMTCC-LFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCD 197
>gi|324514184|gb|ADY45787.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 476
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 58/96 (60%), Gaps = 3/96 (3%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD R+++ C+++ +V Q CG+C+A+A SDR CIA+ G L S + +L
Sbjct: 193 LPSEFDARRKWSYCSSLHNVPNQGGCGACYAVAAVGVASDRACIASNGTLQSMFSEEDVL 252
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
CCA C C GG+P++A Y ++ G+ TGG G
Sbjct: 253 GCCAVCGN---CYGGDPLKALVYWVDEGLVTGGRDG 285
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 86/143 (60%), Gaps = 7/143 (4%)
Query: 39 PSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQ 98
PS L+++ + + P ++ D+ L + D +S LPE +D+ + + C ++ ++
Sbjct: 50 PSLLTNVSHLMGVVPWNKLSEKDILLTYDVSIDLES---LPESYDITQTWSECKSVVSIR 106
Query: 99 LQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAW 158
QSNCGSCWA++T +A SDR+CI + ++ LS ++ + C G+ C GG+P +AW
Sbjct: 107 DQSNCGSCWALSTASAFSDRLCITSNMGVNKVLSGEY-INSCCNGKCGNGCNGGHPEKAW 165
Query: 159 YYMLENGVPTGGDYGS---CQRF 178
Y+ +NG+ TGG+YGS CQ +
Sbjct: 166 KYIKKNGLCTGGEYGSNEGCQPY 188
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 59/111 (53%), Gaps = 3/111 (2%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
P +L+ E LP+EFD RK + +C+ IG + Q +CGSCWA +L+D
Sbjct: 75 PANELEPSIERVTHKHKKLVLPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTD 134
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
R CI + +LS + LL CC GD C+GG P+RAW Y GV T
Sbjct: 135 RFCIHMNESV--SLSENDLLACCGF-ECGDGCDGGYPIRAWRYFKRTGVVT 182
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 62/112 (55%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L + + +LP+EFD R ++ C+ IG + Q +CGSCWA L
Sbjct: 75 KPTPPGLLAGVRTKTHPRSEQLPKEFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQ 134
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS++ L+ CC GD C+GG P+ AW Y ++NGV T
Sbjct: 135 DRFCI--HHNMNISLSANDLVACCGFMC-GDGCDGGYPISAWQYFVQNGVVT 183
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 2/106 (1%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D + LPE FD R ++P C + ++ Q +CGSCWA A++DR+CI +
Sbjct: 36 DVELIATLPEIFDPRDKWPECLTLNEIRDQGSCGSCWAFGAVEAMTDRVCIYSNATKHFH 95
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
S++ L++CC C G C GG P AW Y G+ +GG+Y S Q
Sbjct: 96 FSAEDLVSCCPICGLG--CNGGMPTLAWEYWKHVGLVSGGNYNSSQ 139
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 72/143 (50%), Gaps = 12/143 (8%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
L ++ + +T S E P +F + +LPE FD + +P C I ++ QS
Sbjct: 67 LEEVRKLMGVTDMSTEAVP-----PRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQS 121
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
NCGSCWAIA A+SDR C G D +S+ +LL+CC C G C GG P AW +
Sbjct: 122 NCGSCWAIAAVEAISDRYC-TLGGVPDRRISTSNLLSCCFICGFG--CYGGIPTMAWLWW 178
Query: 162 LENGVPTGGDYGSCQRFDRGNCN 184
+ G+ T CQ + G C+
Sbjct: 179 VWVGITT----EVCQPYPFGPCS 197
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 60/109 (55%), Gaps = 5/109 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FD R +P C I V+ QS CGSCWA ++SDR+CIA+ LS+ LL
Sbjct: 79 IPDTFDSRTNWPACPTIKEVRDQSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLL 138
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY---GSCQRFDRGNC 183
+CC +C GD C+GG +W Y G+ TG Y G C+ +D C
Sbjct: 139 SCCTSC--GDGCDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPAC 185
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 3/99 (3%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD + +P+C I + QS C + WA+AT +A+SDR C +G+
Sbjct: 81 RFTEEQLRTELPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGK- 139
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+S+ L+ CC C GG CEGG P AW Y + +G+
Sbjct: 140 QLRISAADLMACCKDCGGG--CEGGYPDAAWEYYVSHGI 176
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 78/153 (50%), Gaps = 9/153 (5%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
+++ V + N G W + S + ++ K L + +P P + +
Sbjct: 47 IVKKVNENPNAG-WKAAINDRFSNATVAEFKRLLGV-----KPTPKKHFLGVPIVSHDPS 100
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP+ FD R +P CT+IG++ Q +CGSCWA +LSDR CI Q ++ +LS +
Sbjct: 101 LKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCI--QFGMNISLSVND 158
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
LL CC GD C+GG P+ AW Y +GV T
Sbjct: 159 LLACCGF-RCGDGCDGGYPIAAWQYFSYSGVVT 190
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 65/105 (61%), Gaps = 8/105 (7%)
Query: 78 LPEEFDLRKQY-PNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP+ FD RKQ+ C ++ V+ QS CGSCWA A +LSDR+CI T D LS+++L
Sbjct: 97 LPKNFDSRKQWGSKCPSLNEVRDQSTCGSCWAFAAAESLSDRICIHTGE--DVRLSTENL 154
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG---GDYGSCQRF 178
++CC++C GD C GG P A Y ++ G+ TG GD CQ +
Sbjct: 155 VSCCSSC--GDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAY 197
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 56/95 (58%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R + C +I ++ Q+ CGSCWA +SDR CI T+G +S D LL
Sbjct: 122 IPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLL 181
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC + + G+ CEGG P++A + GV TGGDY
Sbjct: 182 SCCGS-SCGNGCEGGYPIQALRWWDSKGVVTGGDY 215
>gi|17510377|ref|NP_490763.1| Protein Y65B4A.2 [Caenorhabditis elegans]
gi|373220066|emb|CCD71920.1| Protein Y65B4A.2 [Caenorhabditis elegans]
Length = 421
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 61/100 (61%), Gaps = 3/100 (3%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+++++P+ FD R+++PNC +I +V Q CGSC+A+A SDR CI + G LS
Sbjct: 134 NSSDVPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSE 193
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
+ ++ CC+ C C GG+P++A Y + G+ TGG G
Sbjct: 194 EDIIGCCSVCGN---CYGGDPLKALTYWVNQGLVTGGRDG 230
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 69/130 (53%), Gaps = 2/130 (1%)
Query: 43 SSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN 102
S +KF + + + +P P+ + G +P+ FD R+ +P+C +I ++ Q+
Sbjct: 59 SEMKFKV-MDERFADPLPEEESGEILVSGEIVPEPIPDTFDARENWPDCKSIKLIRNQAT 117
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
CGSCWA +SDR+CI + G +S + +L+CC T G C+GG + A +
Sbjct: 118 CGSCWAFGAAEVISDRICIQSNGTQQPIISVEDILSCCGT-TCGKGCQGGYSIEAMRFWK 176
Query: 163 ENGVPTGGDY 172
NG TGGDY
Sbjct: 177 SNGAVTGGDY 186
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP+ FD R ++ C +IG V Q NC S +AI+ +A+SDR+CI + G + LS+ +L
Sbjct: 53 LPKSFDARAKWYMCPSIGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQIL 112
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+CC C GD C GG +W + +G+ +GG+YGS
Sbjct: 113 SCCYLC--GDGCSGGQHFESWDFYRRHGLVSGGEYGS 147
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 64/105 (60%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+Y + ++PE FD R +PNC ++ ++ Q CGSCWA+A + +SDR+CI + G ++
Sbjct: 80 EYVYDVDIPESFDARNHWPNCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVA 139
Query: 131 LSSDHLLTCCAACTGGDVCEGGN-PMRAWYYMLENGVPTGGDYGS 174
L+++ L+ CC C G C GG ++ Y ++ G+ +GG Y S
Sbjct: 140 LAAEDLMGCCVDCGNG--CNGGFLDGTSFQYWVDAGLVSGGAYNS 182
>gi|393902164|gb|EFO13452.2| hypothetical protein LOAG_15077, partial [Loa loa]
Length = 186
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 61/98 (62%), Gaps = 4/98 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP+ FD R ++P C ++ V Q CGSCWAI+ + +SDR+CIAT +S++ L+
Sbjct: 11 LPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQISAEDLI 70
Query: 138 TCCAACTGGDVCEGGN-PMRAWYYMLENGVPTGGDYGS 174
+CC C G C+G + + A+ Y +GV TGGDYGS
Sbjct: 71 SCCTECGG---CQGSHWALSAFIYWRNHGVVTGGDYGS 105
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 78/153 (50%), Gaps = 9/153 (5%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
+++ V ++ N G W + S + ++ K L + +P P + +
Sbjct: 47 IVKKVNQNPNAG-WKAAINDRFSNATVAEFKRLLGV-----KPTPKKHFLGVPVVSHDPS 100
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP+ FD R +P CT+IG + Q +CGSCWA +LSDR CI Q ++ +LS +
Sbjct: 101 LKLPKAFDARTAWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCI--QFGMNISLSVND 158
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
LL CC GD C+GG P+ AW Y +GV T
Sbjct: 159 LLACCGF-RCGDGCDGGYPIAAWQYFSYSGVVT 190
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 60/102 (58%), Gaps = 6/102 (5%)
Query: 77 ELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
E+PE FD R +P CT I G ++ QS CGSCWA A A+SDR+CI + +SS
Sbjct: 80 EIPESFDSRTAWPECTQIIGMIRDQSRCGSCWAFAAVEAMSDRICIHSNATKKLLVSSQD 139
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQR 177
LLTC A GG C GG P AW NG+ TGG YG+ ++
Sbjct: 140 LLTCGTA--GG--CNGGWPAVAW-SDWTNGIVTGGLYGALEQ 176
>gi|239788200|dbj|BAH70790.1| ACYPI000013 [Acyrthosiphon pisum]
Length = 165
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 56/84 (66%), Gaps = 2/84 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+N +P FD R+++ +C IG V+ Q +CGSCWA+AT++A +DR+C+AT G + LS+
Sbjct: 84 NNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAMATSSAFADRLCVATNGDFNELLSA 143
Query: 134 DHLLTCCAACTGGDVCEGGNPMRA 157
+ + CC C G C GG P++A
Sbjct: 144 EEITFCCHTCGFG--CNGGYPIKA 165
>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 328
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 76/162 (46%), Gaps = 19/162 (11%)
Query: 15 LLLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQS 74
+L + +R P LW I++ + K + T +S+E N L G
Sbjct: 34 ILFKQSSRHGAPFLWETEQIMRLA-------KRRVETTTKSKELNKTLDSGV------VK 80
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ + +EFD RK++P C IG + + N WA A L+DRMCIAT G + +S++
Sbjct: 81 DNRIHKEFDARKRWPQCKTIGEFRNEGNFALSWAYAAAGVLADRMCIATNGSYNQLISTE 140
Query: 135 HLLTCCAACTG--GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L++C G G V E W Y+ +G+ +GG Y +
Sbjct: 141 ELISCSGVSGGYHGIVSE----REVWEYLKSHGLVSGGKYNT 178
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 3/90 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP FD ++P C I ++ QSNCGSCWAIA A+SDR C G D +S+ HL
Sbjct: 97 ELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYC-TVAGITDLRVSTGHL 155
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+CC C G C+GG P AW + + G+
Sbjct: 156 LSCCFVCGMG--CQGGIPTMAWLWWVWVGL 183
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 57/100 (57%), Gaps = 3/100 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
++P FD R + C + IGHV QS C SCWAIA A S R+CI + G+ + LS+
Sbjct: 81 ADIPSSFDARDAFKECKDVIGHVWDQSACASCWAIAPVQAFSARLCIKSGGKFNQLLSAG 140
Query: 135 HLLTCC--AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
LL CC A C+GG AW ++ ++G+ TGGD+
Sbjct: 141 ELLACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDF 180
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 55/95 (57%), Gaps = 3/95 (3%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
ELP+EFD R ++ C+ IG + Q +CG+CWA L DR CI ++ +LS
Sbjct: 93 EKAELPKEFDARSKWSGCSTIGKILDQGHCGACWAFGAVECLQDRFCI--HHSVNVSLSV 150
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ L+ CC GD C+GG P+ AW Y +ENGV T
Sbjct: 151 NDLVACCGFLC-GDGCDGGYPIFAWQYFVENGVVT 184
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 65/103 (63%), Gaps = 3/103 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D N ++PE FD R+++P C +I ++ Q CG+CWA+AT + +SDR+CI ++G+ D
Sbjct: 78 DVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVATVSVMSDRLCIHSEGKFDVE 137
Query: 131 LSSDHLLTCCAACTGGDVCEGGN-PMRAWYYMLENGVPTGGDY 172
L+++ L+ CC C G+ C GG ++ Y ++ G+ +G Y
Sbjct: 138 LAAEDLMGCCKDC--GNGCNGGFLDGTSFQYWVDVGLVSGAAY 178
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 65/112 (58%), Gaps = 5/112 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LP+ FD R+++P C ++ ++ Q CGSCWA++ +A++DR C+ ++G+ S
Sbjct: 122 DLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSL 181
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LL+CC +C G C GG AW + +E G+ +GG S C + G C
Sbjct: 182 DLLSCCHSC--GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC 231
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 65/112 (58%), Gaps = 5/112 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LP+ FD R+++P C ++ ++ Q CGSCWA++ +A++DR C+ ++G+ S
Sbjct: 122 DLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSL 181
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LL+CC +C G C GG AW + +E G+ +GG S C + G C
Sbjct: 182 DLLSCCHSC--GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC 231
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 83.2 bits (204), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 1/96 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP+ FD R+++P+C I ++ Q+ CGSCWA +SDR+CI + G +S + +L
Sbjct: 30 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 89
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
+CC T G C+GG + A + +G TGGDYG
Sbjct: 90 SCCGT-TCGYGCKGGYSIEALRFWASSGAVTGGDYG 124
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 65/112 (58%), Gaps = 5/112 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LP+ FD R+++P C ++ ++ Q CGSCWA++ +A++DR C+ ++G+ S
Sbjct: 122 DLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSL 181
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LL+CC +C G C GG AW + +E G+ +GG S C + G C
Sbjct: 182 DLLSCCHSC--GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC 231
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/122 (40%), Positives = 64/122 (52%), Gaps = 10/122 (8%)
Query: 48 GLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCW 107
G LTP ++ L+ E LP+EFD RKQ+ +C IG + Q +CGSCW
Sbjct: 78 GAILTPANK-----LEPSIETISHKHKKLYLPKEFDARKQWSHCPTIGDILGQGHCGSCW 132
Query: 108 AIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYMLENGV 166
A +L+DR CI + +LS + LL CC C G CEGG P+RAW Y +GV
Sbjct: 133 AFGAVESLTDRFCIHLNESV--SLSENDLLACCGFECGYG--CEGGYPIRAWKYFKHSGV 188
Query: 167 PT 168
T
Sbjct: 189 VT 190
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 65/112 (58%), Gaps = 5/112 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LP+ FD R+++P C ++ ++ Q CGSCWA++ +A++DR C+ ++G+ S
Sbjct: 122 DLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSL 181
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LL+CC +C G C GG AW + +E G+ +GG S C + G C
Sbjct: 182 DLLSCCHSC--GQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGEC 231
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 3/97 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCI-ATQGRLDHTLSSDH 135
ELP EFD R+ +P C I ++ QS CGSCWA A A+SDR+CI + Q ++ LS+
Sbjct: 85 ELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATD 144
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
LL CC C G V GG AW Y +NG+ TGG+Y
Sbjct: 145 LLACCTTCGFGCV--GGWGGMAWDYWRDNGIVTGGEY 179
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 56/94 (59%), Gaps = 5/94 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+CI + G LS+
Sbjct: 141 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGAFTELLSAGE 200
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
+ ACT C GG+P AW ++ + G+ TG
Sbjct: 201 M----NACTLFFGCGGGDPYSAWSWVHDKGIATG 230
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 63/112 (56%), Gaps = 13/112 (11%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +P FD R+Q+P C + V Q CGSCWA +++ ALSDR+CIA++G+++ TLS
Sbjct: 92 DASIPSTFDAREQWPGCVH--AVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSPQ 149
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT---------GGDYGSCQR 177
L+ C G C GG P AW YM G+PT G G+CQR
Sbjct: 150 ALVAC--DDIGNQGCNGGVPQLAWEYMEWKGLPTFECYPYTAGNGTDGTCQR 199
>gi|312105965|ref|XP_003150617.1| hypothetical protein LOAG_15077 [Loa loa]
Length = 150
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 62/103 (60%), Gaps = 4/103 (3%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
+ LP+ FD R ++P C ++ V Q CGSCWAI+ + +SDR+CIAT +S
Sbjct: 3 EQKLNLPKHFDARLRWPLCWSVHVVANQGGCGSCWAISAASVMSDRLCIATNYSNQKQIS 62
Query: 133 SDHLLTCCAACTGGDVCEGGN-PMRAWYYMLENGVPTGGDYGS 174
++ L++CC C G C+G + + A+ Y +GV TGGDYGS
Sbjct: 63 AEDLISCCTECGG---CQGSHWALSAFIYWRNHGVVTGGDYGS 102
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 59/97 (60%), Gaps = 3/97 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCI-ATQGRLDHTLSSDH 135
ELP EFD R+ +P C I ++ QS CGSCWA A A+SDR+CI + Q ++ LS+
Sbjct: 85 ELPSEFDAREHWPECRTIPQIRDQSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATD 144
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
LL CC C G V GG AW Y +NG+ TGG+Y
Sbjct: 145 LLACCTTCGFGCV--GGWGGMAWDYWRDNGIVTGGEY 179
>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
Length = 273
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 54/96 (56%), Gaps = 4/96 (4%)
Query: 78 LPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LPE FD R ++P C + IG + Q NCGSCWA+A +SDR CI + G +D LS L
Sbjct: 18 LPESFDARTKWPTCAHLIGVARDQGNCGSCWAMAPAEVMSDRACIQSGGEIDAELSPFQL 77
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L C G CEGG A+ + NGV TGG +
Sbjct: 78 LACAQGSFG---CEGGESADAYEFAKSNGVVTGGGF 110
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 82.8 bits (203), Expect = 6e-14, Method: Composition-based stats.
Identities = 43/109 (39%), Positives = 57/109 (52%), Gaps = 9/109 (8%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N LP FD +Q+P C IG +Q Q+ CGSCWA ++SDR CI + LS
Sbjct: 67 NVNLPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDRFCIHKNESVQ--LSFQ 124
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
L+TC G CEGG+P A+ Y+ +NGV T +CQ + C
Sbjct: 125 DLITCDNQDNG---CEGGDPYTAYKYVQKNGVVT----SNCQPYTIPTC 166
>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
Length = 255
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 68/123 (55%), Gaps = 6/123 (4%)
Query: 66 SEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
S D T +P FD R+ + +C++ IG V+ Q NC S WA+A + +DR+CIA+
Sbjct: 14 SRKIVDNNYETVIPRTFDARQYFVSCSDVIGDVKDQGNCASSWAVAVASTFTDRLCIASN 73
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRG 181
G+ LS+ +L++C G C+GG+ +AW + G+ TGG+Y S CQ +
Sbjct: 74 GQFTDNLSAQNLMSCGNEEKMG--CDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNR 131
Query: 182 NCN 184
C+
Sbjct: 132 PCD 134
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 60/95 (63%), Gaps = 3/95 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
T+LP+ FD RKQ+ C IG +Q QSNCGSCWA+++ + + DR+CIA+ G +S+
Sbjct: 106 TKLPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWAVSSASVIQDRICIASNGEQKVHISAQ 165
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
+L+C + G C GG P A+ + ++GV TG
Sbjct: 166 DILSCATDRSQG--CNGGYPDEAFEHYAQSGVVTG 198
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/95 (43%), Positives = 57/95 (60%), Gaps = 3/95 (3%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+ +LP EFD R +P+C+ IG + Q +CGSCWA +LSDR CI L +LS
Sbjct: 79 KSMKLPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGMNL--SLSV 136
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ LL CC G C+GG+P+ AW Y +++GV T
Sbjct: 137 NDLLACCGWMCGAG-CDGGSPIDAWRYFVQSGVVT 170
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 65/105 (61%), Gaps = 3/105 (2%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D N ++PE FD R+++P C +I ++ Q CG+CWA+A + +SDR+CI ++G+ D
Sbjct: 78 DVAYNMDIPEFFDAREKWPYCKSISTIKNQGLCGACWAVAAVSVMSDRLCIHSEGKFDVE 137
Query: 131 LSSDHLLTCCAACTGGDVCEGGN-PMRAWYYMLENGVPTGGDYGS 174
L+++ L+ CC C G+ C GG ++ Y ++ G+ +G Y S
Sbjct: 138 LAAEDLMGCCKDC--GNGCNGGFLDGTSFQYWVDVGLVSGAAYNS 180
>gi|294899385|ref|XP_002776615.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239883670|gb|EER08431.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 233
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 56/94 (59%), Gaps = 5/94 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+CI + G LS+
Sbjct: 116 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGE 175
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
+ CA G C+GG P AW ++ + G+ TG
Sbjct: 176 -MNACAPSYG---CDGGYPDSAWSWVHDEGIATG 205
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 82.4 bits (202), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 64/127 (50%), Gaps = 8/127 (6%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
L ++ + +T S E P E +LPE FD + +P C I ++ QS
Sbjct: 67 LGEVRKLMGVTDMSTEAVPPRNFSVEEL-----QQDLPEFFDAAEHWPMCLTISEIRDQS 121
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
NCGSCWAIA A+SDR C G D +S+ +LL+CC C G C GG P AW +
Sbjct: 122 NCGSCWAIAAVEAISDRYC-TFGGVPDRRMSTSNLLSCCFICGLG--CHGGIPTVAWLWW 178
Query: 162 LENGVPT 168
+ G+ T
Sbjct: 179 VWVGIAT 185
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 82.4 bits (202), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q T+LPE FD + +P+C I + QS C + WA+AT +A+SDR C +G+
Sbjct: 81 RFTEEQLRTDLPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGK- 139
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ L+ CC C GG CEGG P AW Y + +G+ +
Sbjct: 140 QLRISAADLMACCKDCGGG--CEGGYPDAAWEYYVSHGIAS 178
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 82.4 bits (202), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 58/95 (61%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R +PNC +I ++ Q+ CG+CWA +SDR+CI + G +S + +L
Sbjct: 76 VPISFDARDHWPNCKSIKLIRNQAYCGACWAFGAAEIISDRICIQSGGAHQPIISVEDIL 135
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC + + G+ C+GG P+ + + +GV TGGDY
Sbjct: 136 SCCGS-SCGEGCKGGYPLEGLKFWMNSGVVTGGDY 169
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 82.0 bits (201), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 64/127 (50%), Gaps = 8/127 (6%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
L ++ + +T S E P E +LPE FD + +P C I ++ QS
Sbjct: 67 LGEVRKLMGVTDMSTEAVPPRNFSVEEL-----QQDLPEFFDAAEHWPMCLTISEIRDQS 121
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
NCGSCWAIA A+SDR C G D +S+ +LL+CC C G C GG P AW +
Sbjct: 122 NCGSCWAIAAVEAISDRYC-TFGGVPDRRMSTSNLLSCCFICGLG--CHGGIPTVAWLWW 178
Query: 162 LENGVPT 168
+ G+ T
Sbjct: 179 VWVGIAT 185
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 82.0 bits (201), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 66/138 (47%), Gaps = 12/138 (8%)
Query: 40 SFLSSLKFGLSLTPQSQ---------EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPN 90
+F+S L GL+ P +P P L Y + LP+EFD R +
Sbjct: 100 TFISCLFGGLNNPPVQTAQFKHILGVKPTPHSVLNDVPVKTYPRSLMLPKEFDARSAWSQ 159
Query: 91 CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCE 150
C IG + Q +CGSCWA L DR CI ++ +LS + L+ CC GD C+
Sbjct: 160 CNTIGTILDQGHCGSCWAFGAVECLQDRFCI--HFNMNISLSVNDLVACCGFMC-GDGCD 216
Query: 151 GGNPMRAWYYMLENGVPT 168
GG P+ AW Y + NGV T
Sbjct: 217 GGYPIMAWRYFVRNGVVT 234
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 82.0 bits (201), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 74/120 (61%), Gaps = 4/120 (3%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALS 116
P+P+ ++ ++ +PE FD R+++P C + IG ++ Q NCGSCWA A+T ++
Sbjct: 57 PDPNYKIQTKQH-KISRIISIPESFDAREKWPECKDVIGKIRNQGNCGSCWAFASTEVMT 115
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
DR+CI+++G++ S ++LLTCC C C+GG AW Y + G+ +GGDY S +
Sbjct: 116 DRLCISSKGKIKFVFSPENLLTCCKDCG--CGCKGGYIKNAWDYYINEGIASGGDYNSSE 173
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 82.0 bits (201), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 55/88 (62%), Gaps = 2/88 (2%)
Query: 85 RKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACT 144
R Q+P C I ++ Q++CGSCWA A +A+SDR+CI + G++ L++ L+CC C
Sbjct: 1 RSQWPQCWTISEIRDQASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTYC- 59
Query: 145 GGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G C GG P +AW Y + G+ TGG +
Sbjct: 60 -GQGCRGGYPPKAWDYWMREGIVTGGTW 86
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 60/103 (58%), Gaps = 3/103 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q T+LPE FD + +P+C I + QS C + WA++T +A+SDR C G+
Sbjct: 80 RFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK- 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+S+ LL+CC C GD C+GG P AW Y +E G+ + G
Sbjct: 139 QLRISAADLLSCCKQC--GDGCKGGFPGFAWLYYVEYGIASSG 179
>gi|227018340|gb|ACP18836.1| cysteine proteinase 3 [Chrysomela tremula]
Length = 190
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 2/97 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD R+ +P C +I ++ QS+CGSCWA+A AA+SDR+CI + G +S + LL
Sbjct: 83 IPENFDARENWPECESIRMIRDQSDCGSCWAVAAAAAVSDRICIYSYGANQTIVSDEDLL 142
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+CC C G C+GG AW Y +G+ +GG Y S
Sbjct: 143 SCCDDCGFG--CDGGYSWEAWNYWKNDGIVSGGPYNS 177
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 58/98 (59%), Gaps = 3/98 (3%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP F ++++P C +I + Q NCGSCWA++ + +SDR+CIA+ +S++ LL
Sbjct: 71 LPPSFSAQEKWPGCPSIELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQISAEDLL 130
Query: 138 TCCAA---CTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC G C+GG P AW Y+ +G+ TGG Y
Sbjct: 131 SCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTY 168
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 54/92 (58%), Gaps = 4/92 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+ FD R+ +P C +I + Q +CGSCWA AL+DR CI + +LS + L
Sbjct: 87 DLPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEALTDRFCILNNENV--SLSENDL 144
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ CC++C G CEGG P AW Y + GV T
Sbjct: 145 VACCSSCGFG--CEGGYPYAAWEYFAQTGVVT 174
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 56/92 (60%), Gaps = 4/92 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+ FD R+ +P C++I ++ Q +CGSCWA AL+DR CI + +LS + L
Sbjct: 98 DLPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILNNENV--SLSENDL 155
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ CC++C G C+GG P AW Y + GV T
Sbjct: 156 VACCSSCGFG--CDGGYPYAAWEYFAQTGVVT 185
>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
Length = 206
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 58/109 (53%), Gaps = 3/109 (2%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P +L S + + LP+EFD R +P C+ IG + Q +CGSCWA +LSDR
Sbjct: 84 PRNELSSIPVVTHPKSLNLPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDRF 143
Query: 120 CIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
CI +D LS + LL CC G C+GG P+ AW Y +GV T
Sbjct: 144 CI--HFGVDVPLSVNDLLACCGFLCGSG-CDGGYPISAWKYFAHHGVVT 189
>gi|290991959|ref|XP_002678602.1| predicted protein [Naegleria gruberi]
gi|284092215|gb|EFC45858.1| predicted protein [Naegleria gruberi]
Length = 286
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 61/105 (58%), Gaps = 9/105 (8%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE+FD RK++P+C I ++ Q CGSCWA + +A LSDR C+ + G + LS +++L
Sbjct: 96 LPEQFDARKEWPHC--ITPIRNQEQCGSCWAFSASAVLSDRFCVYSNGSVQVMLSPEYML 153
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGN 182
C A + C GG AW +++ G+PT SC + GN
Sbjct: 154 ECSAQ---NNACNGGTLHAAWQFLVSVGIPT----DSCVPYSSGN 191
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 62/112 (55%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P+P +L S + + +LP+ FD R + C+ IG + Q +CGSCWA +LS
Sbjct: 80 KPSPKKELRSTPVVSHPRSLKLPKSFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 139
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + LL CC G C+GG P+ AW Y+ +GV T
Sbjct: 140 DRFCIHLD--VNVSLSVNDLLACCGFLCGSG-CDGGYPLYAWRYLAHHGVVT 188
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 57/112 (50%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L Y + LP+EFD R + C IG + Q +CGSCWA L
Sbjct: 81 KPTPHSVLNDVPVKTYPRSLMLPKEFDARSAWSQCNTIGTILDQGHCGSCWAFGAVECLQ 140
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + L+ CC GD C+GG P+ AW Y + NGV T
Sbjct: 141 DRFCI--HFNMNISLSVNDLVACCGFMC-GDGCDGGYPIMAWRYFVRNGVVT 189
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 58/100 (58%), Gaps = 3/100 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
++P FD R + C + IGHV QS CGSCWAIA A + R+CI + G+ + LS+
Sbjct: 57 ADIPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAG 116
Query: 135 HLLTCCAACTGGDV--CEGGNPMRAWYYMLENGVPTGGDY 172
+L CC + + C+GG AW ++ +G+ TGGD+
Sbjct: 117 EMLACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDF 156
>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 238
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 58/98 (59%), Gaps = 5/98 (5%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
++P+ FD R + C + IGHV+ QS CGSCWA T A + R+CI + G+L+ LS+
Sbjct: 57 VDIPDSFDARDAFKECKDVIGHVRDQSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAA 116
Query: 135 HLLTCCAA---CTGGDVCEGGNPMRAWYYMLENGVPTG 169
+L CC C C GGNP+ +W ++ NG+ +G
Sbjct: 117 DMLACCNIEHFCLSFG-CSGGNPITSWTFLHTNGIVSG 153
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 55/92 (59%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+ FD R ++ C+ IG + Q +CGSCWA L DR CI ++ TLS++ L
Sbjct: 56 QLPKVFDARSKWSGCSTIGKILDQGHCGSCWAFGAVECLQDRFCI--HHNMNITLSANDL 113
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ CC GD C+GG P+ AW Y ++NGV T
Sbjct: 114 VACCGFMC-GDGCDGGYPISAWQYFVQNGVVT 144
>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
Length = 166
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 58/110 (52%), Gaps = 3/110 (2%)
Query: 59 NPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
P +L S + + LP+EFD R +P C+ IG + Q +CGSCWA +LSDR
Sbjct: 43 TPRNELSSIPVVTHPKSLNLPKEFDARTAWPQCSTIGRILDQGHCGSCWAFGAVESLSDR 102
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
CI +D LS + LL CC G C+GG P+ AW Y +GV T
Sbjct: 103 FCI--HFGVDVPLSVNDLLACCGFLCGSG-CDGGYPISAWKYFAHHGVVT 149
>gi|412992960|emb|CCO16493.1| cysteine proteinase, putative [Bathycoccus prasinos]
Length = 396
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 6/97 (6%)
Query: 78 LPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP +FD RK++ C IG V+ Q CGSCWA+A T ++DR+CIA G+ + LS +
Sbjct: 146 LPRQFDARKEWAECKGLIGTVRDQGKCGSCWAVAATEVMNDRVCIA-HGKTEE-LSPQYA 203
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
L+C +A G CEGGN + +E GVPTGG +G
Sbjct: 204 LSCYSAGAG---CEGGNVIDTLQEAIEKGVPTGGMFG 237
>gi|157058729|gb|ABV03122.1| cathepsin B-16c [Acyrthosiphon pisum]
Length = 143
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 55/95 (57%), Gaps = 2/95 (2%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
N + H Y +N +P FD R+++ +C IG V+ Q +CGSCWA T++A +D
Sbjct: 51 KNASAHMFKTHDVAYNNNGYIPRTFDARRRWRHCKTIGEVRDQGHCGSCWAFGTSSAFAD 110
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGG 152
R+C+AT G + LS++ L CC C G+ C GG
Sbjct: 111 RLCVATDGDFNELLSAEELTFCCHTC--GNGCNGG 143
>gi|56757237|gb|AAW26790.1| unknown [Schistosoma japonicum]
Length = 170
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 60/106 (56%), Gaps = 3/106 (2%)
Query: 56 QEPNPDLQLGSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAA 114
+ +P+L+ D+ E+P FD RK++P C +I ++ QS C S WA++ A
Sbjct: 67 RREDPNLRQKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGA 126
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYY 160
+SDR+CI + G+ LS+ L++CC C G C+GG P AW Y
Sbjct: 127 MSDRICIQSGGKQSVELSAIDLISCCENCGSG--CDGGFPGPAWDY 170
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P + + + +LP+EFD R + CT+IG + Q +CGSCWA +LS
Sbjct: 85 KPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLS 144
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + ++ +LS + LL CC G C GG P+ AW Y +GV T
Sbjct: 145 DRFCI--KYNMNVSLSVNDLLACCGFLCGQG-CNGGYPIAAWRYFKHHGVVT 193
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 65/120 (54%), Gaps = 3/120 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
PN + + +S LPE FD R+++PNC ++ ++ Q CGSC+ ++T AA++
Sbjct: 60 RPNESVANAVPLLENQRSVRSLPESFDSRQKWPNCPSLNQIRDQGCCGSCYVVSTAAAIT 119
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
DR CI + G+ T + L CC C C+GG + W Y +++G+ + G Y S Q
Sbjct: 120 DRYCIHSGGQKQFTFGATDYLACCTDCFK---CDGGYVGKTWQYWVDSGLTSEGPYKSGQ 176
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 60/99 (60%), Gaps = 5/99 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP F+ +++ C++ IGH++ QS CGSCWA A T A +DR+CI + G LS +
Sbjct: 138 DLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSPGN 197
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ AAC+ C GG+ + AW ++ GV TGGDY +
Sbjct: 198 V----AACSKTSGCHGGSSLDAWQWLHTTGVVTGGDYSA 232
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 77/153 (50%), Gaps = 9/153 (5%)
Query: 16 LLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
+++ V + N G W + S + ++ K L + +P P + +
Sbjct: 47 IVKKVNENPNAG-WKAAINDRFSNATVAEFKRLLGV-----KPTPKKHFLGVPIVSHDPS 100
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP+ FD R +P CT+IG++ +CGSCWA +LSDR CI Q ++ +LS +
Sbjct: 101 LKLPKAFDARTAWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCI--QFGMNISLSVND 158
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
LL CC GD C+GG P+ AW Y +GV T
Sbjct: 159 LLACCGF-RCGDGCDGGYPIAAWQYFSYSGVVT 190
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 60/99 (60%), Gaps = 4/99 (4%)
Query: 78 LPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP EFD R+++ C + IGHV+ Q CG+CWA+ T L+DR+CI + G++ LS+ ++
Sbjct: 33 LPPEFDARQKFNYCRDVIGHVRDQGRCGNCWAVCPTEVLNDRLCIKSSGKIQEILSAGYV 92
Query: 137 LTCCA---ACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC C C GG + A ++ ++GV TG D+
Sbjct: 93 TSCCNPAHGCLHAKGCNGGRLVEAMSFLRDHGVVTGNDF 131
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 59/112 (52%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P + + + +LP+EFD R + CT+IG + Q +CGSCWA +LS
Sbjct: 16 KPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLS 75
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + LL CC G C GG P+ AW Y +GV T
Sbjct: 76 DRFCIKYN--MNVSLSVNDLLACCGFLC-GQGCNGGYPIAAWRYFKHHGVVT 124
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L + + +LP+EFD R Q+ +C+ IG++ Q +CG+CWA A +L
Sbjct: 80 KPTPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQ 139
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + +LS + LL CC G C GG P+ AW Y +GV T
Sbjct: 140 DRFCIHLN--MSVSLSVNDLLACCGFLCGSG-CNGGYPISAWRYFRRSGVVT 188
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L + + +LP+EFD R Q+ +C+ IG++ Q +CG+CWA A +L
Sbjct: 80 KPTPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVESLQ 139
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + +LS + LL CC G C GG P+ AW Y +GV T
Sbjct: 140 DRFCIHLN--MSVSLSVNDLLACCGFLCGSG-CNGGYPISAWRYFRRSGVVT 188
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 77/140 (55%), Gaps = 5/140 (3%)
Query: 40 SFLSSLKFGLSLTPQSQEPNP-DLQLGSEHF-GDYQSNTELPEEFDLRKQYPNCTNIGHV 97
+F ++FG + + +P D L S+ +P+ FD R+++P C +I V
Sbjct: 51 AFRGGIRFGEFRSIKGIYESPLDFTLPSKRLHASSLDEVVIPDRFDAREKWPFCQSIHSV 110
Query: 98 QLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGN-PMR 156
+ Q CGSCWA+AT + +SDR+CI + G ++ L+++ L+ CC C G+ C GG
Sbjct: 111 RNQGTCGSCWAVATVSVMSDRLCIHSDGEVNLELATEDLMGCCKDC--GNGCNGGFLDGT 168
Query: 157 AWYYMLENGVPTGGDYGSCQ 176
A+ Y ++ G+ +G Y S +
Sbjct: 169 AFQYWVDAGLVSGAPYNSSE 188
>gi|12958837|gb|AAK09441.1|AF339098_1 cathepsin b-like precursor protein [Ancylostoma ceylanicum]
Length = 180
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 66/112 (58%), Gaps = 14/112 (12%)
Query: 51 LTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIA 110
+TP+ +E D+ +GD + PE FD R Q+P C IG ++ QS+CGSCWA+A
Sbjct: 73 VTPKKEEVLMDV------YGD-----DPPESFDARTQWPECRAIGTIRDQSSCGSCWAVA 121
Query: 111 TTAALSDRMCIATQGRLDHTLSSDHLLTCCA-ACTGGDVCEGGNPMRAWYYM 161
+ +A+SD MC+ + + +S +L+CC C G C+GG P+ A+ +M
Sbjct: 122 SASAMSDEMCVQSNSSIKLMISDTDILSCCGLECGYG--CQGGWPIEAYRWM 171
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 55/112 (49%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P L Y + +LP EFD R Q+ C+ IG + Q +CGSCWA L
Sbjct: 76 KPTPPALLAGVPTKSYSRSMKLPTEFDARSQWSGCSTIGTILDQGHCGSCWAFGAVECLQ 135
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + LL CC G C GG P+ AW Y GV T
Sbjct: 136 DRFCIHLN--MNISLSVNDLLACCGFLCGSG-CNGGYPISAWRYFRRKGVVT 184
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 54/92 (58%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+ FD R +P C+ IG + Q +CGSCWA +LSDR CI ++ +LS + L
Sbjct: 100 KLPKHFDARTAWPQCSTIGKILDQGHCGSCWAFGAVESLSDRFCI--HFGMNISLSVNDL 157
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC G C+GG P+ AW Y + +GV T
Sbjct: 158 LACCGFLCGSG-CDGGYPLYAWRYFIHHGVVT 188
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P + + + +LP+EFD R + CT++G + Q +CGSCWA +LS
Sbjct: 83 KPTPKTEFLGVPIVSHDISLKLPKEFDARTAWSQCTSVGRILDQGHCGSCWAFGAVESLS 142
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + ++ +LS + LL CC G C GG P+ AW Y +GV T
Sbjct: 143 DRFCI--KYNMNISLSVNDLLACCGFLCGQG-CNGGYPIAAWRYFKHHGVVT 191
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 59/103 (57%), Gaps = 5/103 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+PE FD R+ +P C ++ ++ QS+CGSCWA A+SDR+CI + +S++ L
Sbjct: 83 EVPESFDARENWPRCDSLKQIRDQSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVSAEDL 142
Query: 137 LTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC AC G C+GG W Y +G+ TGG Y S Q
Sbjct: 143 NSCCFGLFACGLG--CDGGYVAEPWDYWRTDGIVTGGAYNSSQ 183
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 50/74 (67%), Gaps = 2/74 (2%)
Query: 99 LQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAW 158
L+S+ GSCWA+A A+SDR+CI ++G+ TLS+D LL+CC C G C GG PM AW
Sbjct: 10 LKSSSGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCKTCGFG--CFGGEPMAAW 67
Query: 159 YYMLENGVPTGGDY 172
Y + G+ TG +Y
Sbjct: 68 KYWVLRGIVTGSEY 81
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 61/112 (54%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P +L S + + +LP+ FD R + C+ IG + Q +CGSCWA +LS
Sbjct: 79 KPTPKKELRSTPAISHPKSLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 138
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + LL CC G C+GG P+ AW Y+ +GV T
Sbjct: 139 DRFCI--HFDVNISLSVNDLLACCGFLCGSG-CDGGYPLYAWQYLAHHGVVT 187
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 80.1 bits (196), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 54/92 (58%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+EFD R Q+ +C+ IG++ Q +CG+CWA A AL DR CI + +LS + L
Sbjct: 96 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHLN--MSVSLSVNDL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC G C GG P+ AW Y +GV T
Sbjct: 154 LACCGFLCGSG-CNGGYPISAWRYFRRSGVVT 184
>gi|256052325|ref|XP_002569723.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228438|emb|CCD74609.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 198
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/92 (45%), Positives = 56/92 (60%), Gaps = 4/92 (4%)
Query: 80 EEFDLR--KQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
EE DLR K++P C +I ++ QS CGS WA A+SDR CI + G+ + LS+ LL
Sbjct: 63 EESDLRRKKKWPGCKSIATIRDQSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 122
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
+CC C GD EGG P AW Y ++ G+ TG
Sbjct: 123 SCCEHC--GDGFEGGFPALAWDYWVKEGIVTG 152
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 51/76 (67%), Gaps = 2/76 (2%)
Query: 99 LQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAW 158
QS+CGSCWA+ A++DR+CIA++G T+S+D LL+CC C G C+G +P AW
Sbjct: 1 FQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCDECGFG--CDGRDPYAAW 58
Query: 159 YYMLENGVPTGGDYGS 174
Y + NG+ TG +Y S
Sbjct: 59 SYWVSNGIVTGSNYTS 74
>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
Length = 257
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 60/110 (54%), Gaps = 8/110 (7%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
+S +P+ FD R Q+PNC I + Q CGSCWA + + LSDR+CIA+ G+ LS
Sbjct: 26 KSVGSIPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLS 83
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGN 182
L++C G C GG P AW YM +G+PT G C + GN
Sbjct: 84 PQALVSC--DIFGNQGCNGGIPQLAWEYMELHGIPTYG----CFPYTSGN 127
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 54/92 (58%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+EFD R Q+ +C+ IG++ Q +CG+CWA A AL DR CI + +LS + L
Sbjct: 96 DLPKEFDARTQWSSCSTIGNILDQGHCGACWAFAAVEALQDRFCIHL--NMSVSLSVNDL 153
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC G C GG P+ AW Y +GV T
Sbjct: 154 LACCGFLCGSG-CNGGYPISAWRYFRRSGVVT 184
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/109 (41%), Positives = 60/109 (55%), Gaps = 8/109 (7%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S +P FD R +P C + V Q CGSCWA A + +LSDR+CIA+QG ++ TLS
Sbjct: 77 SKVAVPNSFDSRTNWPGCVH--AVLNQGQCGSCWAFAASESLSDRLCIASQGAINVTLSP 134
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGN 182
L++C G C GG P AW Y+ +G+PT SC + GN
Sbjct: 135 QALVSCDIEFNQG--CNGGIPQMAWEYLELHGIPT----DSCFPYTSGN 177
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 59/104 (56%), Gaps = 1/104 (0%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D LPE FD R+++P C +IG ++ QS G CWA+++ ++DR+CI + G
Sbjct: 87 DIDLAVSLPESFDAREKWPECPSIGLIRDQSAGGGCWAVSSAEVMTDRICIQSNGTKQVY 146
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S +L+CC G C G P +A+ Y + GV +GG YG+
Sbjct: 147 VSETDILSCCGQRCGSG-CTSGVPRQAFNYAIRKGVCSGGPYGT 189
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 57/99 (57%), Gaps = 3/99 (3%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q T+LPE FD + +P+C I + QS C + WA++T +A+SDR C G+
Sbjct: 80 RFTEEQLRTKLPETFDAAEHWPHCPTIREIADQSECRASWAVSTASAISDRYCTVGGGK- 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+S+ L+ CC C GD C+GG P AW Y +E G+
Sbjct: 139 QLRISAADLMACCKQC--GDGCKGGFPGFAWLYYVEYGI 175
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P +L S + +LP+ FD R + C+ IG + Q +CGSCWA +LS
Sbjct: 80 KPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 139
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + LL CC G C+GG P+ AW Y+ +GV T
Sbjct: 140 DRFCI--HFDVNISLSVNDLLACCGFLCGSG-CDGGYPLYAWRYLAHHGVVT 188
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 79.7 bits (195), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 60/112 (53%), Gaps = 3/112 (2%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALS 116
+P P +L S + +LP+ FD R + C+ IG + Q +CGSCWA +LS
Sbjct: 80 KPMPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLS 139
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI ++ +LS + LL CC G C+GG P+ AW Y+ +GV T
Sbjct: 140 DRFCIHFD--VNISLSVNDLLACCGFLCGSG-CDGGYPLYAWRYLAHHGVVT 188
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 36/72 (50%), Positives = 49/72 (68%), Gaps = 2/72 (2%)
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
CGSCWA + + SDR+CIAT G + LS++ L TCC C G+ C+GG+P AWY+ +
Sbjct: 1 CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCCYRC--GNGCDGGSPEAAWYFFM 58
Query: 163 ENGVPTGGDYGS 174
+G+ TGGDY S
Sbjct: 59 RHGIVTGGDYES 70
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 77/162 (47%), Gaps = 9/162 (5%)
Query: 9 VNHSHHLLLRHVTR--DSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGS 66
V + H+L + R + NP + I +P F S+ G + P +L S
Sbjct: 33 VKLNSHILQESIARQINENPEAGWEATI---NPRF-SNFTVGQFKRLLGVKQTPRSELSS 88
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+ + +LP++FD R + C+ IG + Q +CGSCWA +LSDR CI
Sbjct: 89 APVVTHPKSLKLPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFD 146
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++ +LS + +L CC G C GG P AW Y+ +GV T
Sbjct: 147 MNVSLSVNDILACCGLLCGAG-CAGGTPFSAWIYLAHHGVVT 187
>gi|308804940|ref|XP_003079782.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
gi|116058239|emb|CAL53428.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
Length = 498
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 60/110 (54%), Gaps = 6/110 (5%)
Query: 78 LPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP FD R +YP C IG V+ Q CGSCWA+A T ++DR+CI++ G+ LS
Sbjct: 257 LPRHFDARDEYPKCARLIGTVRDQGKCGSCWAVAATEIMNDRLCISSGGKEVAELSPQFA 316
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG--DYGSCQRFDRGNCN 184
L+C + G CEGG+ + L GVP GG D G+C + C+
Sbjct: 317 LSCYNSGAG---CEGGDVVDTLTLALAKGVPHGGMLDKGACLPYQFEPCD 363
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 77/162 (47%), Gaps = 9/162 (5%)
Query: 9 VNHSHHLLLRHVTR--DSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGS 66
V + H+L + R + NP + I +P F S+ G + P +L S
Sbjct: 28 VKLNSHILQESIARQINENPEAGWEATI---NPRF-SNFTVGQFKRLLGVKQTPRSELSS 83
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+ + +LP++FD R + C+ IG + Q +CGSCWA +LSDR CI
Sbjct: 84 APVVTHPKSLKLPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFD 141
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++ +LS + +L CC G C GG P AW Y+ +GV T
Sbjct: 142 MNVSLSVNDILACCGLLCGAG-CAGGTPFSAWIYLAHHGVVT 182
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 59/109 (54%), Gaps = 3/109 (2%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P +L S + + +LP+EFD R + C+ IG + Q +CGSCWA +L DR
Sbjct: 85 PKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRF 144
Query: 120 CIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
CI ++ +LS + LL CC G C+GG P+ AW Y+ +GV T
Sbjct: 145 CI--HFDMNISLSVNDLLACCGFLCGAG-CDGGTPIYAWRYLAHHGVVT 190
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 59/107 (55%), Gaps = 7/107 (6%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP+EFD R +PNC IG + Q +CGSCWA+++ L DR CI ++G+ LS HL
Sbjct: 76 LPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHLT 135
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
+C C+G C GG A+ +M NG+ G D C + G C
Sbjct: 136 SCTPGCSG---CNGGWMSTAFGFMQSNGI-LGED---CIPYQMGKCK 175
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 59/109 (54%), Gaps = 3/109 (2%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P +L S + + +LP+EFD R + C+ IG + Q +CGSCWA +L DR
Sbjct: 83 PKKELLSTPVVTHPKSLKLPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRF 142
Query: 120 CIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
CI ++ +LS + LL CC G C+GG P+ AW Y+ +GV T
Sbjct: 143 CI--HFDMNISLSVNDLLACCGFLCGAG-CDGGTPIYAWRYLAHHGVVT 188
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 79.3 bits (194), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 55/93 (59%), Gaps = 2/93 (2%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
++ E+P+ FD R ++ C + Q +CGSCWA A+T LSDR+CI T+G + LSS
Sbjct: 39 TDMEIPKSFDARMEWSTCVRSHKIHDQGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSS 98
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+ LL+C A G +GG AW YM + GV
Sbjct: 99 EDLLSCDKAGRG--CSDGGRLSEAWRYMQKKGV 129
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ LPE FD RKQ+PNC I ++ Q +CGSCWA A+SDR+CI + GR++ +
Sbjct: 74 FAADVVLPESFDARKQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LT C GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLT-CCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNS 175
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 79.0 bits (193), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 63/111 (56%), Gaps = 8/111 (7%)
Query: 77 ELPEEFDLRKQYPN-CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP FD R+Q+ + CT++ V+ QSNCGSCWA +L+DR CI D LS+ +
Sbjct: 91 DLPTAFDARQQWGDKCTSLWEVRDQSNCGSCWAFGAVESLTDRHCIHLG--QDIRLSAQN 148
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY---GSCQRFDRGNC 183
+LTCCA C G C GG P A Y ++ G+ TG Y G CQ + C
Sbjct: 149 MLTCCATC--GQGCNGGYPASAMSYYVKTGLVTGDLYNTTGWCQAYSFAPC 197
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 79.0 bits (193), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 54/95 (56%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP+ FD R+Q+P C +I ++ Q+ CGSCWA +SDR+CI + +S + +L
Sbjct: 97 LPDTFDSREQWPECKSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDIL 156
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC G C+GG + A + +G TGGDY
Sbjct: 157 SCCGVSCGKG-CQGGYSIEALRFWKSSGAVTGGDY 190
>gi|294893885|ref|XP_002774682.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880102|gb|EER06498.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 121
Score = 79.0 bits (193), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/93 (40%), Positives = 55/93 (59%), Gaps = 5/93 (5%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+C+ + G LS+
Sbjct: 33 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGE 92
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ CA G C+GG P AW ++ + G+ T
Sbjct: 93 -MNACAPSYG---CDGGYPDSAWSWVHDEGIAT 121
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 57/88 (64%), Gaps = 1/88 (1%)
Query: 85 RKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACT 144
R+Q+P+C I ++ Q +CGSCWA A+SDR+CI ++G+++ +S++ LL+CC
Sbjct: 2 REQWPDCPTIKEIRDQGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKL-E 60
Query: 145 GGDVCEGGNPMRAWYYMLENGVPTGGDY 172
G+ C GG P AW + +G+ +GG Y
Sbjct: 61 CGNGCNGGYPSGAWEFWTNDGLVSGGLY 88
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 69/144 (47%), Gaps = 20/144 (13%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHV--QL 99
LS+ G +P P +L + +LP+EFD R +P+C+ IG + QL
Sbjct: 65 LSNFTVGQFKYLLGAKPTPKKELMGVPMISHPKTLKLPKEFDARTAWPHCSTIGKILGQL 124
Query: 100 ---------------QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACT 144
+ +CGSCWA +LSDR CI ++ +LS + LL CC
Sbjct: 125 LSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFCI--HFGMNISLSVNDLLACCGFLC 182
Query: 145 GGDVCEGGNPMRAWYYMLENGVPT 168
GD C+GG PM AW Y + +GV T
Sbjct: 183 -GDGCDGGYPMYAWRYFVHHGVVT 205
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 61/104 (58%), Gaps = 5/104 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD R+++ C ++ V+ Q C S +A+A + ++DR C+ ++G+ + +L
Sbjct: 131 LPMSFDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVHSEGKAQFNFGAYDVL 190
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRF 178
+CC C G C+GG P W+Y +ENG+ +GG +GS CQ +
Sbjct: 191 SCCHRCGFG--CDGGVPSAVWHYWVENGITSGGAFGSHEGCQSY 232
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 64/110 (58%), Gaps = 9/110 (8%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q N LP+ FD R Q+ +C + ++ Q+ CGSCWA A +LSDR CIA+QG+++ LS
Sbjct: 73 QINAALPDSFDSRTQWKDC--VHPIRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLS 130
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGN 182
+++C + G C GG +AW Y+ + GV + SC+ + GN
Sbjct: 131 PQDMVSCDTSNFG---CFGGYLDQAWQYLEQQGVSS----DSCEPYKSGN 173
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 62/110 (56%), Gaps = 5/110 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+EFD R + C I ++ Q +CGSCWA ++DR CI + G + S+++L
Sbjct: 78 KIPKEFDSRTAWSMCPTISEIRDQGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAENL 137
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
++CC C G C GG P A+ Y + +G+ +GG + S CQ ++ C
Sbjct: 138 VSCCHLCGFG--CNGGFPGAAFQYWVHSGIVSGGAFNSTQGCQPYEIAPC 185
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 59/109 (54%), Gaps = 3/109 (2%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRM 119
P +L S + + +LP+EFD R + C+ IG + Q +CGSCWA +L DR
Sbjct: 85 PKKELLSTPVVTHPKSLKLPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRF 144
Query: 120 CIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
C + ++ +LS + LL CC G C+GG P+ AW Y+ +GV T
Sbjct: 145 C--SHFDMNISLSVNDLLACCGFLCGAG-CDGGTPIYAWRYLAHHGVVT 190
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 3/111 (2%)
Query: 59 NPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
P L +F + + LPE FD +++PNC I + QS+CGSCWA+A +++DR
Sbjct: 71 KPVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDR 130
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
C G +S+ LL CC C G C GG+P AW Y G+ +G
Sbjct: 131 YC-TIHGVRGLRISAADLLACCGDCGYG--CLGGDPDMAWAYFSSEGIASG 178
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 80/161 (49%), Gaps = 27/161 (16%)
Query: 29 WADPDI--LKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK 86
W + DI +K+ L + K G+ L +++ N LP EFD R
Sbjct: 59 WINSDIAGVKAHMGTLLNQKSGVKLEKVNRQAN-----------------NLPSEFDSRV 101
Query: 87 QYPN-CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTG 145
Q+ + C+++ V+ QSNCGSCWA +LSDR CI D LS+ +L+TCC C
Sbjct: 102 QWGDKCSSLWEVRDQSNCGSCWAFGAAESLSDRHCIHLG--QDIRLSTQNLVTCCDECGF 159
Query: 146 GDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
G C+GG P A Y + NG+ TG YG+ CQ + C
Sbjct: 160 G--CDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAYSLAPC 198
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 53/78 (67%), Gaps = 5/78 (6%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA+++ +A+SDR+CIATQG +S +++CC C G C+GG +RAWYY E
Sbjct: 1 SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTWCGYG--CQGGWSIRAWYYFAEQ 58
Query: 165 GVPTGGDY---GSCQRFD 179
GV TGG+Y GSC+ ++
Sbjct: 59 GVVTGGNYNTKGSCRPYE 76
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 3/111 (2%)
Query: 59 NPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
P L +F + + LPE FD +++PNC I + QS+CGSCWA+A +++DR
Sbjct: 71 KPVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDR 130
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
C G +S+ LL CC C G C GG+P AW Y G+ +G
Sbjct: 131 YC-TIHGVRGLRISAADLLACCGDCGYG--CLGGDPDMAWAYFSSEGIASG 178
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 64/110 (58%), Gaps = 9/110 (8%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q N LP+ FD R Q+ +C + ++ Q+ CGSCWA A +LSDR CIA+QG+++ LS
Sbjct: 73 QINAALPDSFDSRTQWKDCVH--PIRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLS 130
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGN 182
+L+C A+ C GG AW Y+ + GV G D SC+ + GN
Sbjct: 131 PQDMLSCDAS---NFCCFGGYLDTAWQYLEQQGV--GSD--SCEPYKSGN 173
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 66/134 (49%), Gaps = 7/134 (5%)
Query: 36 KSSPSFLS-SLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNI 94
+ +P F S + K SL P+ LQ +E + E+PE FD R +PNC I
Sbjct: 30 RVNPHFKSFNQKKFRSLNSAQHNPSFSLQFKNEFV---KIEDEIPESFDARTNWPNCPTI 86
Query: 95 GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNP 154
GH+ Q +CGSCWA+ + L DR CI + G LS + +C + G C GG
Sbjct: 87 GHIYDQGHCGSCWAMCSFEVLQDRFCIHSNGSEKPWLSGQDITSCDSRSHG---CNGGWT 143
Query: 155 MRAWYYMLENGVPT 168
A+ Y + GVPT
Sbjct: 144 ETAFEYAKKAGVPT 157
>gi|312266|emb|CAA51531.1| cathepsin B-like enzyme [Gallus gallus]
Length = 156
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 86 KQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTG 145
KQ+PNC I ++ Q +CGSCWA + +SDR+C+ T ++ +S++ LL+CC
Sbjct: 1 KQWPNCPTISEIRDQGSCGSCWAFGSVEVISDRICVHTNAKVSVEVSAEDLLSCCGF-EC 59
Query: 146 GDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
G C GG P AW Y E G+ +GG Y S
Sbjct: 60 GMGCNGGYPSGAWRYWTERGLVSGGLYDS 88
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 64/103 (62%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ++ LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI + GR++ +
Sbjct: 74 FAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ +LT C GD C GG P AW + + G+ +GG Y S
Sbjct: 134 SAEDMLT-CCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNS 175
>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
Length = 228
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 64/104 (61%), Gaps = 1/104 (0%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
++ + +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G ++
Sbjct: 52 EFADDIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVE 111
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ +LT C GD C GG P AW + + G+ +GG Y S
Sbjct: 112 VSAEDMLT-CCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDS 154
>gi|390367767|ref|XP_787947.3| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 146
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 46/70 (65%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+ +PNC I V+ Q +CGSCWA A+SDR+CI ++G+ +S++ L
Sbjct: 77 DLPENFDARENWPNCPTIKEVRDQGSCGSCWAFGAVEAISDRICIKSKGQTQVHISAEDL 136
Query: 137 LTCCAACTGG 146
+TCC C G
Sbjct: 137 MTCCKTCGNG 146
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 3/103 (2%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L F + + L ++FD + +PNC I ++ QS+CGSCWA+A +A+SDR C
Sbjct: 78 LAPRQFSEAELRVRLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASAMSDRYC-TL 136
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
G D +S+ L++CC C G C GG P AW + + +G+
Sbjct: 137 GGVRDLRISAGDLMSCCDVCGYG--CNGGFPEVAWVFYVVHGL 177
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/104 (41%), Positives = 64/104 (61%), Gaps = 1/104 (0%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
++ + +LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI T G ++
Sbjct: 73 EFADDIKLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVE 132
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ +LT C GD C GG P AW + + G+ +GG Y S
Sbjct: 133 VSAEDMLT-CCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDS 175
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 52/85 (61%), Gaps = 3/85 (3%)
Query: 82 FDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA 141
FD + +PNC I ++ QS CGSCWA+A +A+SDR C G D +S+ LL+CC
Sbjct: 1 FDAGEAWPNCPTITEIRDQSGCGSCWAVAARSAMSDRYC-TRGGVRDLRISAGDLLSCCN 59
Query: 142 ACTGGDVCEGGNPMRAWYYMLENGV 166
AC G C GG+P AW Y +E G+
Sbjct: 60 ACGLG--CNGGDPDWAWLYYVETGI 82
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 58/111 (52%), Gaps = 7/111 (6%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P+ FD R Q+P+C I ++ Q CGSCWA ++SDR CI +S++ L
Sbjct: 77 DVPDMFDARMQWPDCPTIKEIRDQGACGSCWAFGAVESMSDRFCI--HFNQSAHISAEDL 134
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
+ CC C G C GG AW Y G+ TGG Y S CQ + +C+
Sbjct: 135 MACCETCGMG--CNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCD 183
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 52/97 (53%), Gaps = 3/97 (3%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ELP+ FD R +P C +I + Q +CGSCWA +L+DR CI + TL
Sbjct: 90 HSKEIELPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYGTNV--TL 147
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
S + LL CC G+ C+GG P+ AW Y GV T
Sbjct: 148 SVNDLLACCGFLC-GEGCDGGYPIAAWQYFKRTGVVT 183
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 39/97 (40%), Positives = 52/97 (53%), Gaps = 3/97 (3%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ ELP+ FD R +P C +I + Q +CGSCWA +L+DR CI + TL
Sbjct: 90 HSKEIELPKTFDARTAWPQCLSIADILDQGHCGSCWAFGAVESLTDRFCIHYGTNV--TL 147
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
S + LL CC G+ C+GG P+ AW Y GV T
Sbjct: 148 SVNDLLACCGFLC-GEGCDGGYPIAAWQYFKRTGVVT 183
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 68/138 (49%), Gaps = 10/138 (7%)
Query: 39 PSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQ 98
P +K L L P LQ + +S LP FD R+++P C ++ ++
Sbjct: 15 PEEFGVMKMSLGLNESELNNLPRLQ-------NQRSVRALPASFDARQKWPYCPSLNQIR 67
Query: 99 LQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAW 158
Q +CGSC+A++T A ++DR CI + G S L+CC C C+GG + +
Sbjct: 68 SQGSCGSCYAVSTAAVITDRYCIHSGGERQFYFGSTGYLSCCTDCY---KCDGGYVHKTF 124
Query: 159 YYMLENGVPTGGDYGSCQ 176
Y ++ G+ +GG Y S Q
Sbjct: 125 DYWVKYGLTSGGPYHSGQ 142
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD RK++PNC +IGH+ Q NC S +A+A +A SDR+CI + G + +S+ ++
Sbjct: 61 LPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQQII 120
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C+GG+ +W Y +G +GGDY S Q
Sbjct: 121 SCCYLC--GHGCDGGSLFESWDYYRRHGFVSGGDYNSNQ 157
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI + GR++ +S++ +L
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
T C GD C GG P AW + + G+ +GG Y S
Sbjct: 61 T-CCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNS 96
>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 217
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD RK++PNC +IGH+ Q NC S +A+A +A SDR+CI + G + +S+ ++
Sbjct: 61 LPINFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQQII 120
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C+GG+ +W Y +G +GGDY S Q
Sbjct: 121 SCCYLC--GHGCDGGSLFESWDYYRRHGFVSGGDYNSNQ 157
>gi|240992693|ref|XP_002404472.1| cysteine proteinase cathepsin L, putative [Ixodes scapularis]
gi|215491569|gb|EEC01210.1| cysteine proteinase cathepsin L, putative [Ixodes scapularis]
Length = 99
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 50/70 (71%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE FD R+++P+C +I ++ QS CGSCWA T A+SDR+CI ++G++ +S++ L
Sbjct: 30 DLPESFDAREKWPHCNSIHLIRDQSTCGSCWAFGATEAMSDRVCIHSEGKVQVDISAEDL 89
Query: 137 LTCCAACTGG 146
L CC +C G
Sbjct: 90 LDCCHSCGYG 99
>gi|145347486|ref|XP_001418195.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578424|gb|ABO96488.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 61/113 (53%), Gaps = 6/113 (5%)
Query: 75 NTELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+ LP FD R YP C+ + G V+ Q CGSCWA+A T ++DR+C+AT G LS
Sbjct: 109 DNRLPTSFDARVAYPKCSRLLGAVRDQGRCGSCWAVAATEVMNDRLCVATDGENADELSP 168
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG--DYGSCQRFDRGNCN 184
+ L+C + +G C+GG+ + G+P GG D +C ++ C+
Sbjct: 169 QYALSCFDSGSG---CDGGDVLDTLRIAFTKGIPYGGMLDSNACLPYEFEACD 218
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 61/131 (46%), Gaps = 36/131 (27%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCW------------------------------ 107
L E FD R+++P C IG ++ QS C CW
Sbjct: 60 LEEHFDAREKWPECKYIGFIKDQSTCSCCWVSGDFLYHYDQWKIILLFDFSSSSSHWLFI 119
Query: 108 ----AIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
A+++ + ++DR CIA +G LS + L +CC +C G C GG P+ A+ Y E
Sbjct: 120 STFKAMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCTSCGYG--CNGGFPLLAFKYWNE 177
Query: 164 NGVPTGGDYGS 174
GVPTGG YGS
Sbjct: 178 IGVPTGGPYGS 188
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 45/68 (66%), Gaps = 2/68 (2%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA+A A+SDR+CI ++G+ LS+D LL+CC C G C GG PM AW Y + +
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCKTCGFG--CFGGEPMAAWKYWVLS 220
Query: 165 GVPTGGDY 172
G+ TG DY
Sbjct: 221 GIVTGSDY 228
>gi|56756124|gb|AAW26240.1| unknown [Schistosoma japonicum]
Length = 159
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 47/71 (66%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTG 145
L++CC C G
Sbjct: 147 DLISCCEDCGG 157
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 3/95 (3%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N +LP+ FD R+Q+ +C +I + QS C S WA+A+ A++SDR CI T G + LS+
Sbjct: 81 NIQLPDYFDSREQWKDCPSINIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAI 140
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
L++C G C+ G +W Y L+NG+ TG
Sbjct: 141 ELISCSKNKLG---CQIGFSEFSWDYWLKNGLVTG 172
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 57/96 (59%), Gaps = 5/96 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD RKQ+P +I ++ Q CGSCWA + LSDR IA++ ++ TLS+ L+
Sbjct: 83 IPENFDARKQWPG--SIHPIRNQGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQQLV 140
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
C +G C GG P+ AW YM++ G+ T YG
Sbjct: 141 DCDLDNSG---CSGGWPINAWNYMVKTGLLTEQCYG 173
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 75.5 bits (184), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI + GR++ +S++ +L
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 60
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
T C GD C GG P AW + + G+ +GG Y S
Sbjct: 61 T-CCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNS 96
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 59/108 (54%), Gaps = 14/108 (12%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R Q+P +I ++ Q CGSCWA T ALSDR+ IA+ ++ LS L+
Sbjct: 81 IPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQDLV 138
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPT---------GGDYGSCQ 176
+C + G C+GG P+ AW+YM GV T GD G+CQ
Sbjct: 139 SCDSTDYG---CDGGYPINAWHYMQSLGVVTDTCYPYTSGNGDSGTCQ 183
>gi|260821942|ref|XP_002606362.1| hypothetical protein BRAFLDRAFT_67605 [Branchiostoma floridae]
gi|229291703|gb|EEN62372.1| hypothetical protein BRAFLDRAFT_67605 [Branchiostoma floridae]
Length = 572
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 69/129 (53%), Gaps = 10/129 (7%)
Query: 46 KFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTE------LPEEFDLRKQYPNCTNIGHVQL 99
K G P+S +QL S + + ++ LPE FD R+++P I V+
Sbjct: 183 KLGTDPVPESVHAMRGIQLYSNVVTNDVTTSQVSIRENLPEFFDARQRWPGL--IQDVRD 240
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q NCG+ WA +TTA L+DR+ I ++G + TLS +LL+C G C+GG RAW+
Sbjct: 241 QGNCGASWAFSTTAVLADRLAIQSRGTMTVTLSPQNLLSCNTNRQRG--CQGGRLDRAWW 298
Query: 160 YMLENGVPT 168
++ + G P
Sbjct: 299 FLRKKGFPV 307
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++ G C+GG P +AW Y ++ G+ TGG
Sbjct: 147 DLIS--CCKDCGGGCKGGFPGQAWDYWVKRGIVTGG 180
>gi|294926967|ref|XP_002779086.1| Gut-specific cysteine proteinase precursor, putative [Perkinsus
marinus ATCC 50983]
gi|239888027|gb|EER10881.1| Gut-specific cysteine proteinase precursor, putative [Perkinsus
marinus ATCC 50983]
Length = 283
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 56/101 (55%), Gaps = 4/101 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
T LP +FD R+++ +C IGHV+ Q C +CWA +T +DR+CI + G + LS
Sbjct: 141 TNLPSDFDARQKFASCAEVIGHVRDQGACHNCWATGSTGMFNDRVCIKSGGSFQNILSLG 200
Query: 135 HLLTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ +CC C CEGGN + ++ +G+ TG ++
Sbjct: 201 YFTSCCNPANGCPKAKGCEGGNLLEGLNFLKNHGIVTGNEF 241
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 55/103 (53%), Gaps = 8/103 (7%)
Query: 80 EEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTC 139
+EFD RK++P C IG V + N WA A L+DR CIAT G + LS++ L++C
Sbjct: 74 KEFDARKRWPKCKTIGEVHNEGNFAFGWAYAAAGVLADRTCIATNGGYNKLLSTEELISC 133
Query: 140 CAAC-TGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRF 178
T G+V N W Y+ +GV +GG Y S CQ F
Sbjct: 134 SGIKETNGNV----NERSIWEYLKSHGVVSGGKYNSNDGCQPF 172
>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
Length = 201
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP FD R + C+ IG + Q +CGSCWA +LSDR CI ++ +LS + L
Sbjct: 17 KLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFDVNISLSVNDL 74
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC G C GG P+ AW Y+ +GV T
Sbjct: 75 LACCGFLCGSG-CNGGYPLSAWRYLSNHGVVT 105
>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
Length = 351
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 70/133 (52%), Gaps = 13/133 (9%)
Query: 40 SFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTE------LPEEFDLRKQYPNCTN 93
S + + FG S+ + + NP L+ + H QS+ + +P EFD R ++P C
Sbjct: 78 SIMKWIPFGKSM--KGKFSNPLLEQANSHSFRLQSSQDHLLKDSIPLEFDFRTKWPQC-- 133
Query: 94 IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGN 153
+ ++ Q+NCG+CWA + L+DR+CI T G ++ LS ++ C G CEGG
Sbjct: 134 LRKIRDQANCGACWAFTGSGMLADRICILTNGTINEELSPQDMVDCSHDNFG---CEGGY 190
Query: 154 PMRAWYYMLENGV 166
M A Y++ GV
Sbjct: 191 LMNALDYLMNEGV 203
>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
unguiculata]
Length = 195
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP FD R + C+ IG + Q +CGSCWA +LSDR CI ++ +LS + L
Sbjct: 17 KLPVNFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCI--HFDVNISLSVNDL 74
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L CC G C GG P+ AW Y+ +GV T
Sbjct: 75 LACCGFLCGSG-CNGGYPLSAWRYLSNHGVVT 105
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 55/106 (51%), Gaps = 5/106 (4%)
Query: 79 PEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
PE FD ++P C IG ++ QSNCG CWA A A SDR CIAT G + LS+ +
Sbjct: 25 PEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAASDRQCIATGGAVAVPLSAQDV- 83
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
C A D C+GG + W Y+ + G TGG Y F G C
Sbjct: 84 -CFNANV--DGCDGGQIITPWTYVAKAGAVTGGQYNGTGPFGAGLC 126
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 47/70 (67%), Gaps = 1/70 (1%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA A+SDR+CIA+QG+ T+S+D +L+CC G+ CEGG P+ AW Y ++
Sbjct: 1 SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGK-KCGNGCEGGYPIEAWKYWVKT 59
Query: 165 GVPTGGDYGS 174
G+ TGG Y S
Sbjct: 60 GICTGGSYES 69
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 48/69 (69%), Gaps = 2/69 (2%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA++T AA+SDR+CIA++G +S+ +++CC C G CEGG P+ AW Y +
Sbjct: 1 SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTWCGAG--CEGGWPIEAWKYGVTE 58
Query: 165 GVPTGGDYG 173
GV TGG++G
Sbjct: 59 GVVTGGNFG 67
>gi|290995893|ref|XP_002680517.1| predicted protein [Naegleria gruberi]
gi|284094138|gb|EFC47773.1| predicted protein [Naegleria gruberi]
Length = 200
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 61/107 (57%), Gaps = 9/107 (8%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD R ++PNC ++ Q CGSC+A +T AL+ R C+A++G++ L+ D+L+
Sbjct: 1 IPESFDARTKWPNCKP--QIRHQLECGSCYAFTSTGALAHRFCVASRGKVYPNLAPDYLV 58
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
C + C+GG AW ++ + GVPT C ++ G+ N
Sbjct: 59 RCNSET---KACKGGKTTSAWDFLEKTGVPT----SECVKYRSGHWN 98
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 43 SSLKFGLSLTPQSQEPNPDLQLGSEHFG--------------DYQSNTELPEEFDLRKQY 88
++LK G ++ P S E + LG+ Y + ++ +EFD RK++
Sbjct: 37 NTLKAGENVGPHSAEEERLMLLGTRGVEAATKSKMLYKTRDPRYIIDNQIHKEFDARKRW 96
Query: 89 PNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDV 148
P C IG V + N WA A T +DRMCIAT G + LS++ L+ +C+G
Sbjct: 97 PQCKTIGEVHNEGNELLSWAYAATGVFADRMCIATNGNYNQLLSTEELI----SCSGIKE 152
Query: 149 CEGG--NPMRAWYYMLENGVPTGGDYGS 174
E G N + W Y +G+ +GG Y +
Sbjct: 153 REDGYVNRVLVWEYFKTHGLVSGGKYNT 180
>gi|161343823|tpg|DAA06092.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 152
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 45/73 (61%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D N +P FD RK++ +C IG V+ Q +CGSCWA T++A +DR+C+AT +
Sbjct: 79 DAYENVWIPRTFDARKKWRHCRTIGEVRDQGHCGSCWAFGTSSAFADRLCVATNADFNEL 138
Query: 131 LSSDHLLTCCAAC 143
LS++ + CC C
Sbjct: 139 LSAEEITFCCHTC 151
>gi|60598652|gb|AAX25875.1| unknown [Schistosoma japonicum]
Length = 195
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 54 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 113
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++ G C+GG P +AW Y ++ G+ TGG
Sbjct: 114 DLIS--CCEDCGGGCKGGFPGQAWDYWVKRGIVTGG 147
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 64/110 (58%), Gaps = 5/110 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
PE FD R+++ C ++G ++ Q C S +A+A A ++DR CI ++G+ + + +L
Sbjct: 135 FPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHSEGKSQFSFGAYDVL 194
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCN 184
+CC C G C+GG P W+Y +ENG+ +GG Y S CQ + G C
Sbjct: 195 SCCHRCGFG--CDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCK 242
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 57/96 (59%), Gaps = 5/96 (5%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q N +P+ FD R Q+ C + ++ Q+ CGSCWA A + +LSDR CIA+QG+++ LS
Sbjct: 73 QINAAVPDSFDSRTQWQGCVH--PIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLS 130
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+++C G C+GG AW Y+ + GV +
Sbjct: 131 PQDMVSCDTNNYG---CDGGYLNLAWQYLEKKGVAS 163
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 57/96 (59%), Gaps = 5/96 (5%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q N +P+ FD R Q+ C + ++ Q+ CGSCWA A + +LSDR CIA+QG+++ LS
Sbjct: 73 QINAAVPDSFDSRTQWQGCVH--PIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLS 130
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+++C G C+GG AW Y+ + GV +
Sbjct: 131 PQDMVSCDTNNYG---CDGGYLNLAWQYLEKKGVAS 163
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 73/135 (54%), Gaps = 8/135 (5%)
Query: 38 SPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHV 97
SP+ + +K + + EP + L +E FGD + P+ FD R +P C +IG +
Sbjct: 53 SPNAEAFVKARIMDSKFLVEPKKEEVL-TEVFGD-----DPPDSFDARAHWPECRSIGTI 106
Query: 98 QLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRA 157
+ QS CGSCWA+++ A+SD++C+ + +S +L+CC G CE P+ A
Sbjct: 107 RDQSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILSCCGISCGYG-CE-VLPIEA 164
Query: 158 WYYMLENGVPTGGDY 172
+ +M + V TGG Y
Sbjct: 165 YRWMQRSVVVTGGKY 179
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 54/100 (54%), Gaps = 3/100 (3%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
F + + L + FD + +P C I ++ QS+CGSCWA+A +A+SDR C G
Sbjct: 81 RQFSEEELRVPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYC-TLGGV 139
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
D +S+ L++CC C G C GG P AW Y +G+
Sbjct: 140 RDLRISAGDLMSCCDVCGYG--CNGGYPEVAWEYYAVHGI 177
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 50/78 (64%), Gaps = 5/78 (6%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA+++ AA+SDR+CIA+ G LS +L CC+ C G CEGG PM+AW Y
Sbjct: 1 SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSWCGYG--CEGGWPMKAWQYFXLE 58
Query: 165 GVPTGGDY---GSCQRFD 179
GV TGG+Y G C+ ++
Sbjct: 59 GVVTGGNYRKQGCCRPYE 76
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++ G C+GG P +AW Y ++ G+ TGG
Sbjct: 147 DLIS--CCEDCGGGCKGGFPGQAWDYWVKRGIVTGG 180
>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
Length = 314
Score = 73.6 bits (179), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 65/136 (47%), Gaps = 13/136 (9%)
Query: 47 FGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSC 106
G+ T ++ P + G E G +P FD R Q+P+C I + Q CGSC
Sbjct: 63 IGMMGTKKTAAPFKLTENGEELKG------SIPTSFDSRVQWPDC--IHPILNQEQCGSC 114
Query: 107 WAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
WA +++ LSDR+CIA+ + + S L C G D C GG P AW YM G+
Sbjct: 115 WAFSSSEVLSDRLCIASNNKTNPGALSPQTLVACDV-YGNDGCSGGIPQLAWEYMELKGL 173
Query: 167 PTGGDYGSCQRFDRGN 182
PT SC + GN
Sbjct: 174 PT----DSCVPYTAGN 185
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 72/155 (46%), Gaps = 16/155 (10%)
Query: 34 ILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGD-------YQS-------NTELP 79
I+ S +SLK G ++ P S E + L + Y++ + ++
Sbjct: 28 IIDPSDMETNSLKAGENVLPNSAEEEHQMLLETREVEAATKSKIMYKTRHPRSAIDNQIH 87
Query: 80 EEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTC 139
EEFD RK +P C IG V N WA AT L+DRMCIAT G + LS++ L+ C
Sbjct: 88 EEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEELIFC 147
Query: 140 CAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
T G+ + W Y+ +G+ +GG Y +
Sbjct: 148 GGIKTKQSGAVRGDDV--WEYLKSHGLVSGGKYNT 180
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 56/95 (58%), Gaps = 1/95 (1%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R +P C +I V+ QSNCGSCWA +SDR+CI + G+ +S++ +L
Sbjct: 70 IPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEVISDRICIHSNGKEQPVISAEDIL 129
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
TCC + G+ C+GG + A + G TGGDY
Sbjct: 130 TCCGK-SCGNGCQGGQGLEAMKFWTTYGAVTGGDY 163
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++ G C+GG P +AW Y ++ G+ TGG
Sbjct: 147 DLIS--CCEDCGGGCKGGFPGQAWDYWVKRGIVTGG 180
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 72/155 (46%), Gaps = 16/155 (10%)
Query: 34 ILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGD-------YQS-------NTELP 79
I+ S +SLK G ++ P S E + L + Y++ + ++
Sbjct: 28 IIDPSDMETNSLKAGENVLPNSAEEEHQMLLETREVEAATKSKIMYKTRHPRSAIDNQIH 87
Query: 80 EEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTC 139
EEFD RK +P C IG V N WA AT L+DRMCIAT G + LS++ L+ C
Sbjct: 88 EEFDARKGWPQCKTIGEVHDDGNTRWGWAYATAGVLADRMCIATNGSYNQLLSTEELIFC 147
Query: 140 CAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
T G+ + W Y+ +G+ +GG Y +
Sbjct: 148 GGIKTKQSGAVRGDDV--WEYLKSHGLVSGGKYNT 180
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++ G C+GG P +AW Y ++ G+ TGG
Sbjct: 147 DLIS--CCEDCGGGCKGGFPGQAWDYWVKRGIVTGG 180
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 60/96 (62%), Gaps = 2/96 (2%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G+ LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSAL 146
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
L++ G C+GG P +AW Y ++ G+ TGG
Sbjct: 147 DLIS--CCEDCGGGCKGGFPGQAWDYWVKRGIVTGG 180
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 54/98 (55%), Gaps = 3/98 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
++P FD R + C + IGHV QS C SCWAIA A + R+CI + G+ + LS+
Sbjct: 57 ADIPNSFDARDAFKECKDVIGHVWDQSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAG 116
Query: 135 HLLTCCAACTGGDV--CEGGNPMRAWYYMLENGVPTGG 170
++ CC + C+GG + AW ++ +G+ T G
Sbjct: 117 EMIACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEG 154
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 56/101 (55%), Gaps = 4/101 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
T LP FD R+++ +C + IGHV+ Q C +CWA A +DR+CI + GR+ LS
Sbjct: 143 TTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSLG 202
Query: 135 HLLTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L +CC C + C G+ +M +G+ TGG+Y
Sbjct: 203 YLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEY 243
>gi|294865522|ref|XP_002764429.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239863788|gb|EEQ97146.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 154
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 54/97 (55%), Gaps = 4/97 (4%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP FD R+++ +C IGHV+ QS C +CWA++ T L+DR+CI + G LS +
Sbjct: 36 DLPSNFDARQKFASCAGVIGHVRDQSACNNCWAVSPTGMLNDRVCIKSGGSFRDILSVGY 95
Query: 136 LLTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTG 169
+CC C C+GGN ++ +G+ TG
Sbjct: 96 FTSCCNPANGCPKARGCQGGNLFEGLNFLKNHGIVTG 132
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 53/94 (56%), Gaps = 5/94 (5%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N + P FD R + NCT IG+++ Q+ CGSCWA + DR+CI LD LS
Sbjct: 75 NIKAPASFDSRTAWSNCTTIGYIENQARCGSCWAFGAVESAQDRICI--HKGLDVQLSFL 132
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L+TC + D CEGG+ + AW ++ + GV T
Sbjct: 133 DLVTCDQS---DDGCEGGDDVSAWNFLKKQGVVT 163
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 57/100 (57%), Gaps = 4/100 (4%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP FD R+++ +C IGHV+ QS C +CW +++T L+DR+CI + G LS +
Sbjct: 36 DLPSNFDARQKFASCAGVIGHVRDQSACHNCWTVSSTGMLNDRVCIKSGGTFRDILSVGY 95
Query: 136 LLTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC C C+GGN + ++ +G+ TG ++
Sbjct: 96 FTSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEF 135
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 53/89 (59%), Gaps = 5/89 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD R+Q+ C I ++ Q CGSCWA + + +LSDR CIA+ G++D LS ++
Sbjct: 86 LPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDMV 143
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+C G C+GGN AW++M G+
Sbjct: 144 SCDYNDMG---CDGGNLDNAWWWMKNKGI 169
>gi|308488534|ref|XP_003106461.1| CRE-CPR-5 protein [Caenorhabditis remanei]
gi|308253811|gb|EFO97763.1| CRE-CPR-5 protein [Caenorhabditis remanei]
Length = 153
Score = 73.2 bits (178), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 51/84 (60%), Gaps = 5/84 (5%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
P+ D + + D +P+ FD R ++ +C +I +++ QS+CGSCWA A A+SD
Sbjct: 67 PHKDEDIVATEVAD-----AIPDSFDARDKWSSCVSINNIRDQSDCGSCWAFAAAEAISD 121
Query: 118 RMCIATQGRLDHTLSSDHLLTCCA 141
R CIA+ G ++ LSS LL+CC
Sbjct: 122 RTCIASNGAVNTLLSSQDLLSCCV 145
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 54/100 (54%), Gaps = 3/100 (3%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
F + + L + FD + +P C I ++ QS+CGSCWA+A +A+SDR C G
Sbjct: 81 RQFSEEELREPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYC-TLGGV 139
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
D +S+ L++CC C G C GG P AW Y +G+
Sbjct: 140 RDLRISAGDLMSCCDVCGYG--CNGGYPEVAWEYYAVHGI 177
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 54/100 (54%), Gaps = 3/100 (3%)
Query: 67 EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
F + + L + FD + +P C + ++ QS+CGSCWA+A +A+SDR C G
Sbjct: 81 RQFSEEELRVPLQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYC-TLGGV 139
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
D +S+ L++CC C G C GG P AW Y +G+
Sbjct: 140 RDLRISAGDLMSCCDVCGFG--CNGGYPEVAWEYYAVHGI 177
>gi|145356617|ref|XP_001422524.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582767|gb|ABP00841.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 245
Score = 72.4 bits (176), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 59/105 (56%), Gaps = 9/105 (8%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQL-QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP++FD+R+++P C + L Q CGSCWA+A ++DR+CIAT G + LS+ L
Sbjct: 2 LPKDFDVREKWPKCAALVSEALDQGECGSCWAVAPAKVMADRLCIATNGAVASHLSAMQL 61
Query: 137 LTCC--------AACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
L+C A T C+GG P A+ +G+ +GG +G
Sbjct: 62 LSCGKLENGTFDAGSTYSGSCDGGFPNEAYEKARTSGIVSGGLFG 106
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 72/143 (50%), Gaps = 20/143 (13%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCG 104
++ G L S E + L +E +LP+ FD R+++P C I ++ QSNCG
Sbjct: 252 IRMGTKLMNSSTEFDSKLSNNNEALIK-----KLPKHFDSREKWPECEWIRFIRDQSNCG 306
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA++ + ++DR CIA++G+ +S + +L C G + +P W M
Sbjct: 307 SCWAVSAASVMTDRHCIASKGQETPYISDEQILAC------GMI---PSPFNYWKKM--- 354
Query: 165 GVPTGGDYGS---CQRFDRGNCN 184
G+ TGG YG CQ + C+
Sbjct: 355 GIATGGPYGDKSCCQPYSIAPCS 377
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 54/88 (61%), Gaps = 4/88 (4%)
Query: 66 SEHFGDYQSNTEL--PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
S HF Y+ ++ PE F R+ + +C++I ++ QS CGSCWA A ++SDR+CI T
Sbjct: 73 SSHFTSYEEDSRWTCPESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHT 132
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEG 151
G++ +S++ LL CC C G C+G
Sbjct: 133 NGKVQVNISAEDLLACCHTCGHG--CDG 158
>gi|290977636|ref|XP_002671543.1| predicted protein [Naegleria gruberi]
gi|284085113|gb|EFC38799.1| predicted protein [Naegleria gruberi]
Length = 268
Score = 72.0 bits (175), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 55/94 (58%), Gaps = 5/94 (5%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N + FD R+++ +C I ++ Q CGSCWA + + A SDR+CIAT G ++ LS
Sbjct: 86 NLKAASHFDAREKWEDC--IHEIRNQEECGSCWAFSASEAFSDRLCIATNGSVNIVLSPQ 143
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++++C A G C+GG AW ++ G+P+
Sbjct: 144 YMVSCDATDYG---CDGGYLNNAWNFLANTGIPS 174
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 72.0 bits (175), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 56/90 (62%), Gaps = 4/90 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELPE FD R ++ + N V Q +CGS WA++TT SDR+ I ++GR++ +LSS L
Sbjct: 197 ELPEHFDSRDKWGHLIN--PVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSSQQL 254
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG RAW+Y+ + GV
Sbjct: 255 LSCNQHRQKG--CEGGYLDRAWWYIRKLGV 282
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 72.0 bits (175), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 31/69 (44%), Positives = 49/69 (71%), Gaps = 2/69 (2%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA+++ AA+SDR+CIA++G +S+ +++CC+ C G C+GG P++AW +
Sbjct: 1 SCWAVSSAAAMSDRICIASKGVKQVLISAQDMVSCCSYCGYG--CDGGWPIKAWQFFARE 58
Query: 165 GVPTGGDYG 173
GV TGG+YG
Sbjct: 59 GVVTGGNYG 67
>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 254
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 61/99 (61%), Gaps = 2/99 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD RK++PNC +IGH+ Q NC S +A+A +A SDR+CI + + +S+ ++
Sbjct: 63 LPTNFDSRKKWPNCPSIGHIYNQGNCRSSYAVAAASAASDRICIHSNSTKNPIMSAQQII 122
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+CC C G C+GG+ +W + +G +GG+Y S Q
Sbjct: 123 SCCYLCGYG--CDGGSLFESWDFYRRHGFVSGGEYNSNQ 159
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 4/101 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
T LP F+ + ++ +C + IGH++ Q+ C +CWA A+ +DR+CI + GR+ LS
Sbjct: 37 TTLPSNFNAQIKFASCADVIGHIRDQAECHNCWASASVGMFNDRVCIQSGGRITDILSLA 96
Query: 135 HLLTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L +CC C D C G+ +M +G+ TGG+Y
Sbjct: 97 YLTSCCNHANGCPKSDGCRRGSVAEGLIFMKNHGIVTGGEY 137
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 55/92 (59%), Gaps = 5/92 (5%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+EFD R + +CT+I ++ +CGSCWA +LSDR CI + L+ +LS++ +
Sbjct: 102 KLPKEFDARTAWSHCTSIR--RILGHCGSCWAFGAVESLSDRFCI--KYNLNVSLSANDV 157
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ CC G C GG PM AW Y +GV T
Sbjct: 158 IACCGLLCGFG-CNGGFPMGAWLYFKYHGVVT 188
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 56/90 (62%), Gaps = 4/90 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELPE FD R ++ I V Q +CGS W+++TTA SDR+ I ++GR++ TLSS L
Sbjct: 183 ELPEHFDARDKWGPL--IHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQL 240
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG RAW+Y+ + GV
Sbjct: 241 LSCNQHRQKG--CEGGYLDRAWWYIRKLGV 268
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/93 (40%), Positives = 52/93 (55%), Gaps = 6/93 (6%)
Query: 80 EEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTC 139
+EFD RK++P C IG V + N WA ATT +DRMCIAT G + LS++ L++C
Sbjct: 89 KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 148
Query: 140 --CAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
A G V +G AW Y +G+ +GG
Sbjct: 149 SGIKASANGWVRDG----LAWEYFKTHGLVSGG 177
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 59/110 (53%), Gaps = 5/110 (4%)
Query: 78 LPEEFDLRKQY-PNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP+ +D R+++ C + ++ Q +CGSCWA A +DR+CI + G + +S++ L
Sbjct: 77 LPDSYDTREKWGSTCPSTTEIRDQGSCGSCWAFGAVEAFTDRICIQSNGAKNPHISAEDL 136
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNC 183
LTCC G C GG AW + G TGG Y S CQ ++ +C
Sbjct: 137 LTCCGFWCGFG-CNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSC 185
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 56/90 (62%), Gaps = 4/90 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELPE FD R ++ + I + Q +CGS WA++TT SDR+ I ++GR++ +LSS L
Sbjct: 257 ELPEHFDARDKWGHL--IHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQL 314
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG RAW+Y+ + GV
Sbjct: 315 LSCNQHRQKG--CEGGYLDRAWWYIRKLGV 342
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 65/146 (44%), Gaps = 43/146 (29%)
Query: 72 YQSNTELPEE------FDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQG 125
Y N PE FD R+++P C++I + S+C S WA + ++SDR+CI + G
Sbjct: 69 YSKNIFSPENLDDSNFFDARERWPECSSIPIINDISDCKSSWAFSAAESMSDRLCINSGG 128
Query: 126 RLDHTLSSDHLLTCCA---ACTGGD----------------------------------V 148
++ LS+ LL+CC +C GD
Sbjct: 129 MINTVLSAQELLSCCTGVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREK 188
Query: 149 CEGGNPMRAWYYMLENGVPTGGDYGS 174
C GGN +AW Y ++G+PTGG Y S
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYES 214
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + ++ +EFD RK + C IG V N WA ATT A +DRMC+AT G + L
Sbjct: 79 YVAYGKISKEFDARKHWSQCKTIGEVYNDGNSDLSWAYATTGAFADRMCVATNGSYNQLL 138
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ L++C + + +AW + + G+ +GG Y +
Sbjct: 139 STEQLISCSGIKSNAMADD-----QAWKFFKKQGLVSGGKYNT 176
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 56/90 (62%), Gaps = 4/90 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELPE FD R ++ + I V Q +CGS WA++TT SDR+ I ++GR++ +LSS L
Sbjct: 201 ELPEHFDARDKWGHL--IHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQL 258
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG RAW+Y+ + GV
Sbjct: 259 LSCNQHRQKG--CEGGYLDRAWWYIRKLGV 286
>gi|390362268|ref|XP_782154.3| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 409
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 60/113 (53%), Gaps = 11/113 (9%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT-LSSDHL 136
+PEEFD R Q+P + VQ Q NC S WA++T A SDR+ I + G + LS HL
Sbjct: 222 IPEEFDARAQWPGL--VEGVQNQGNCASSWAMSTAATASDRLAIQSNGTFKYMHLSPQHL 279
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY------GSCQRFDRGNC 183
L+C G C GG+ RAW+YM + G+ T Y S + +GNC
Sbjct: 280 LSCNVKRQQG--CAGGHLDRAWWYMRKRGIVTEDCYPYLSGTTSDMQMRKGNC 330
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/93 (40%), Positives = 52/93 (55%), Gaps = 6/93 (6%)
Query: 80 EEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTC 139
+EFD RK++P C IG V + N WA ATT +DRMCIAT G + LS++ L++C
Sbjct: 34 KEFDARKRWPQCKTIGEVYNEGNALLSWAYATTGVFADRMCIATNGSYNKHLSTEELISC 93
Query: 140 --CAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
A G V +G AW Y +G+ +GG
Sbjct: 94 SGIKASANGWVRDG----LAWEYFKTHGLVSGG 122
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 72/164 (43%), Gaps = 19/164 (11%)
Query: 28 LWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFG----------------- 70
L+ D +I+ S +LK G ++ P S E + G+
Sbjct: 23 LFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHLMLSGTRGVEATSKSKMLHKTRNRRCF 82
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+ + ++ +EFD RK++P+C IG V N WA T +DRMCIAT G +
Sbjct: 83 SVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQL 142
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS++ L++C D N W Y+ +G+ +GG Y +
Sbjct: 143 LSTEELISCSGIKE--DEFGSVNDYYVWEYLKNHGLVSGGKYNT 184
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 54/104 (51%), Gaps = 9/104 (8%)
Query: 80 EEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTC 139
+EFD RK++P C IG V + N WA A L+DR CIAT G + LS++ L++
Sbjct: 67 KEFDARKRWPKCKTIGEVHNEGNFALGWAYAVAGVLADRTCIATNGGYNKLLSTEELIS- 125
Query: 140 CAACTGGDVCEGGNPMR--AWYYMLENGVPTGGDYGS---CQRF 178
C+G G P W Y+ +GV +GG Y S CQ F
Sbjct: 126 ---CSGIKENNGSVPSERSIWEYLKSHGVVSGGKYNSNDGCQPF 166
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 53/87 (60%), Gaps = 5/87 (5%)
Query: 101 SNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA-ACTGGDVCEGGNPMRAWY 159
+ CGSCWA +T +SDR+CIAT+G T+S +L CC +C GD CEGG P++A+
Sbjct: 59 AQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSC--GDGCEGGYPIQAFR 116
Query: 160 YMLENGVPTGGDY--GSCQRFDRGNCN 184
+ GV TGGD+ C+ + CN
Sbjct: 117 WWNSRGVVTGGDFRGSGCRPYPFAPCN 143
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 15/96 (15%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD RK++P C +I ++ QS CGS WA++ A+SDR+CI + G+ +
Sbjct: 87 NVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAVGAISDRICIQSGGKQSY----- 141
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
C G C+GG +W Y + G+ TGG
Sbjct: 142 --------CGSG--CDGGFLGPSWDYWVLRGIVTGG 167
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 58/111 (52%), Gaps = 14/111 (12%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+N + FD R ++ C + ++ Q CGSCWA + + LSDR CIA+ G +D LS
Sbjct: 79 ANLKAASSFDARTKWGKCVH--PIRDQQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSP 136
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT---------GGDYGSC 175
+++L C + G C+GG AW ++ G+P+ GD GSC
Sbjct: 137 EYMLQCDSTDYG---CDGGYLNNAWAFLAGTGIPSDKCDPYTSGNGDVGSC 184
>gi|330798471|ref|XP_003287276.1| hypothetical protein DICPUDRAFT_151351 [Dictyostelium purpureum]
gi|325082736|gb|EGC36209.1| hypothetical protein DICPUDRAFT_151351 [Dictyostelium purpureum]
Length = 317
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 71/136 (52%), Gaps = 16/136 (11%)
Query: 35 LKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNI 94
L S L L +G PQS + ++ G ++ +P+ +D+R + C I
Sbjct: 7 LIFSIVVLYKLSYGF---PQSYDACTEVTYGDKY-------DTIPDSYDVRTTWSEC--I 54
Query: 95 GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTG----GDVCE 150
++ Q +CGSCWA +T L+D+ CI T G++ TLS +++ C +CT + C+
Sbjct: 55 SPIREQKSCGSCWAQVSTGLLADKACIQTGGKIKVTLSPQYMMDCDGSCTSNSGCNNGCK 114
Query: 151 GGNPMRAWYYMLENGV 166
GG +A+ +++ NGV
Sbjct: 115 GGFVGKAFEFLINNGV 130
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 59/111 (53%), Gaps = 4/111 (3%)
Query: 66 SEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQG 125
SE G+ ++P FD R+++P+C+ IG V+ QS+CGS + SDR CIA+ G
Sbjct: 80 SEKTGNDNVLVDIPSSFDSRQKWPSCSQIGAVRDQSDCGSAAHLVAVEIASDRTCIASNG 139
Query: 126 RLDHTLSSDHLLTCCAA----CTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ LS+ L+CC C G C+G P + +G+ TGG+Y
Sbjct: 140 TFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNY 190
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 62/107 (57%), Gaps = 1/107 (0%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
H + + LPE FD R+Q+ +C I ++ Q +CGSCWA ++SDR+CI T G +
Sbjct: 70 HRIKFAEDMNLPESFDAREQWSSCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHV 129
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ +S++ +LT C G+ C GG P AW + + G+ +GG Y S
Sbjct: 130 NVEVSAEDMLT-CCGGQCGEGCNGGYPSAAWNFWTKKGLVSGGLYDS 175
>gi|294940600|ref|XP_002782826.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239894881|gb|EER14622.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 172
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 4/101 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
T LP FD R+++ +C + IGHV+ Q C +CWA A +DR+CI + G+ LS
Sbjct: 37 TTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGKTTDILSLG 96
Query: 135 HLLTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L +CC C + C G+ +M +G+ TGG+Y
Sbjct: 97 YLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEY 137
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 5/97 (5%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D +S +P+ FD R ++P C I ++ Q CGSCWA ATT SDR+CI T +
Sbjct: 81 DVKSTVAIPDTFDSRTKWPQC--IHGIRNQGQCGSCWAFATTGVFSDRLCITTNNVSNVV 138
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
+S + L+ C C+GG +W + + G+P
Sbjct: 139 ISPEFLIECDKT---SFACQGGYGYYSWKFFMNTGIP 172
>gi|239799410|dbj|BAH70626.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 265
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 72/164 (43%), Gaps = 19/164 (11%)
Query: 28 LWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFG----------------- 70
L+ D +I+ S +LK G ++ P S E + G+
Sbjct: 23 LFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHLMLSGTRGVEATSKSKMLHKTRNRRCF 82
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+ + ++ +EFD RK++P+C IG V N WA T +DRMCIAT G +
Sbjct: 83 RVEIDHQIDQEFDARKRWPHCKTIGEVPNDGNSLLSWAYVPTGVFADRMCIATNGTYNQL 142
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS++ L++C D N W Y+ +G+ +GG Y +
Sbjct: 143 LSTEELISCSG--IKEDEFGSVNDYYVWEYLKNHGLVSGGKYNT 184
>gi|239799408|dbj|BAH70625.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 214
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 72/164 (43%), Gaps = 19/164 (11%)
Query: 28 LWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFG----------------- 70
L+ D +I+ S +LK G ++ P S E + G+
Sbjct: 23 LFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHLMLSGTRGVEATSKSKMLHKTRNRRCF 82
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+ + ++ +EFD RK++P+C IG V N WA T +DRMCIAT G +
Sbjct: 83 RVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQL 142
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS++ L++C D N W Y+ +G+ +GG Y +
Sbjct: 143 LSTEELISCSG--IKEDEFGSVNDDYVWEYLKNHGLVSGGKYNT 184
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 72/164 (43%), Gaps = 19/164 (11%)
Query: 28 LWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFG----------------- 70
L+ D +I+ S +LK G ++ P S E + G+
Sbjct: 23 LFHDDNIIDKSVMGTDTLKVGENVGPNSVEEEHLMLSGTRGVEATSKSKMLHKTRNRRCF 82
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+ + ++ +EFD RK++P+C IG V N WA T +DRMCIAT G +
Sbjct: 83 RVEIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQL 142
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
LS++ L++C D N W Y+ +G+ +GG Y +
Sbjct: 143 LSTEELISCSGIKE--DEFGSVNDDYVWEYLKNHGLVSGGKYNT 184
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 49/85 (57%), Gaps = 3/85 (3%)
Query: 82 FDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA 141
FD + +P C I ++ QS+CGSCWA+A +A+SDR C G D +S+ L++CC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYC-TLGGVRDLRISAGDLMSCCD 59
Query: 142 ACTGGDVCEGGNPMRAWYYMLENGV 166
C G C GG P AW Y +G+
Sbjct: 60 VCGYG--CNGGYPEVAWEYYAVHGI 82
>gi|290980380|ref|XP_002672910.1| predicted protein [Naegleria gruberi]
gi|284086490|gb|EFC40166.1| predicted protein [Naegleria gruberi]
Length = 302
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 51/91 (56%), Gaps = 5/91 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P E+DLRK + C +G +Q + CG+ WA+A +A +SDRMCI + + LSS ++L
Sbjct: 79 IPPEYDLRKNWYQC--VGDIQNEGQCGAVWAMAPSATVSDRMCIQSNAKFQERLSSQYIL 136
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
C G C GG + + L GVPT
Sbjct: 137 ECDTRDFG---CNGGYMNTEFEFELNRGVPT 164
>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
Length = 311
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 77/153 (50%), Gaps = 10/153 (6%)
Query: 17 LRHVTRDSNPGLWADPDILKSSPS-FLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSN 75
L+ + P + D + + SPS S+ F +P S G+++ + Q
Sbjct: 30 LKKILGVKTPAGYFDANYGQQSPSKTTSAYTFSAPKSPVSARGTS----GTDYL-NRQVA 84
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
++P +D+R YP C N ++ Q+ CGSCWA ATT L R C+AT+G+ LS +
Sbjct: 85 KQMPSSYDVRTVYPMCEN--RIKDQAQCGSCWAFATTNVLEYRYCMATKGKKYPELSPQN 142
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L++C + + G C+GG + + Y+ GV T
Sbjct: 143 LISCFNSASWG--CDGGYIDQTFLYLEMMGVNT 173
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 69.7 bits (169), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 49/85 (57%), Gaps = 3/85 (3%)
Query: 82 FDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA 141
FD + +P C I ++ QS+CGSCWA+A +A+SDR C G D +S+ L++CC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYC-TLGGVRDLRISAGDLMSCCD 59
Query: 142 ACTGGDVCEGGNPMRAWYYMLENGV 166
C G C GG P AW Y +G+
Sbjct: 60 VCGYG--CNGGYPEVAWEYYAVHGI 82
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 49/85 (57%), Gaps = 3/85 (3%)
Query: 82 FDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA 141
FD + +P C I ++ QS+CGSCWA+A +A+SDR C G D +S+ L++CC
Sbjct: 1 FDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYC-TLGGVRDLRISAGDLMSCCD 59
Query: 142 ACTGGDVCEGGNPMRAWYYMLENGV 166
C G C GG P AW Y +G+
Sbjct: 60 VCGYG--CNGGYPEVAWEYYAVHGI 82
>gi|255076333|ref|XP_002501841.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226517105|gb|ACO63099.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 359
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 55/98 (56%), Gaps = 4/98 (4%)
Query: 74 SNTELPEEFDLRKQYPNCTNI-GHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
++ LP FD R+++P C I G V+ Q CGSCWA+AT ++DR+CIA+ G LS
Sbjct: 101 ADWNLPLNFDARQKWPQCRAIIGTVRDQGKCGSCWAVATAEVMNDRLCIASGGAEQRELS 160
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+ L+C GG C+GG+ A + G+ GG
Sbjct: 161 PQYPLSC---YDGGSGCQGGDVAVAMHEATTKGMVFGG 195
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 69.3 bits (168), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 49/85 (57%), Gaps = 3/85 (3%)
Query: 82 FDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA 141
FD + +P C + ++ QS+CGSCWA+A +A+SDR C G D +S+ L++CC
Sbjct: 1 FDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYC-TLGGVRDLRISAGDLMSCCD 59
Query: 142 ACTGGDVCEGGNPMRAWYYMLENGV 166
C G C GG P AW Y +G+
Sbjct: 60 VCGFG--CNGGYPEVAWEYYAVHGI 82
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 55/94 (58%), Gaps = 5/94 (5%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N +PE FD R ++PNC I ++ Q CGSCWA A++A LSDR CI ++G+++ LS
Sbjct: 122 NETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDLSPQ 179
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L++C G C GG + +++ G+ +
Sbjct: 180 DLVSCSYENFG---CSGGQLTESVDFLIYEGIVS 210
>gi|19880041|gb|AAM00234.1|AF359422_1 cathepsin B-like cysteine proteinase [Nicotiana tabacum]
Length = 110
Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP+EFD R +PNC+ IG + Q +CGSCWA +LSDR CI L+ +LS++ L
Sbjct: 42 ELPQEFDARVAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYG--LNISLSANDL 99
Query: 137 LTCCAACTG 145
L CC G
Sbjct: 100 LACCGFLCG 108
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 59/103 (57%), Gaps = 1/103 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LP+ FD R+Q+ +C I ++ Q +CGSCWA ++SDR+CI T G + +
Sbjct: 74 FAKDMNLPKSFDAREQWSHCPTIKEIRDQGSCGSCWAFGAVESISDRICIHTNGHVSVEV 133
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
S++ LLT C GD C GG P AW + G+ +GG Y S
Sbjct: 134 SAEDLLT-CCGGQCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 68.9 bits (167), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 53/99 (53%), Gaps = 4/99 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R+Q+P CT IG V+ QS+CGS + SDR CI++ G + LS+ L
Sbjct: 91 IPASFDSRQQWPECTQIGAVRDQSDCGSAAHLVAVEMASDRTCISSNGTFNWPLSAQDPL 150
Query: 138 TCCAA----CTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+CC C G C+G P + +G+ TGG+Y
Sbjct: 151 SCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNY 189
>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 422
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 55/101 (54%), Gaps = 4/101 (3%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
T LP FD R+++ +C + IGHV+ Q C +CWA A +DR+CI + GR+ LS
Sbjct: 143 TTLPSSFDARQKFASCADVIGHVREQGECNNCWASAAVGMFNDRVCIKSGGRITDILSLG 202
Query: 135 HLLTCC---AACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L +CC C + C G+ +M +G+ TG ++
Sbjct: 203 YLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGRNF 243
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 46/68 (67%), Gaps = 2/68 (2%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA A+SDR+CIA++G+ TLS+ LL+CC +C G C GG+P+ AW + ++
Sbjct: 1 SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCCRSCGFG--CNGGDPLSAWKFWVKE 58
Query: 165 GVPTGGDY 172
G+ TG ++
Sbjct: 59 GIVTGSNH 66
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 44/72 (61%), Gaps = 1/72 (1%)
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
CGSCWA A+SDR CI T GR++ +S++ LLTCC GD C GG P AW +
Sbjct: 1 CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGI-QCGDGCNGGYPSGAWNFWT 59
Query: 163 ENGVPTGGDYGS 174
+ G+ +GG Y S
Sbjct: 60 KKGLVSGGVYDS 71
>gi|170595047|ref|XP_001902227.1| Papain family cysteine protease containing protein [Brugia malayi]
gi|158590214|gb|EDP28925.1| Papain family cysteine protease containing protein [Brugia malayi]
Length = 246
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/90 (41%), Positives = 53/90 (58%), Gaps = 4/90 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP FD R+++PN I +Q Q C S WA +T A +DR+ + T GR + +LS+ +
Sbjct: 80 ELPTSFDARQKWPNF--IHPIQDQGECASSWAQSTAATSADRLALITDGRQNVSLSAQQI 137
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG RAW+Y+ + GV
Sbjct: 138 LSCNQHRQKG--CEGGYLDRAWWYIRKFGV 165
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 5/128 (3%)
Query: 47 FGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSC 106
G++ + PDL+ F + + LP+EFD R ++PNC I ++ Q +CG+C
Sbjct: 23 MGINYSELKPNVTPDLE---PPFVVSKISENLPDEFDSRVRWPNCPTIREIRDQGSCGAC 79
Query: 107 WAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
WA A A+SDR+CI + S+ +LL+CC +C G C G + AW + +++G+
Sbjct: 80 WAFAAAEAMSDRVCIHSSQTKHFHFSALNLLSCCDSCEKG--CLGCDHHLAWDHWVKHGI 137
Query: 167 PTGGDYGS 174
+GG YGS
Sbjct: 138 VSGGSYGS 145
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 53/87 (60%), Gaps = 5/87 (5%)
Query: 82 FDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA 141
FD R ++P+C + ++ Q CGSCWA + + LSDR CIA+ G++D LS ++++C +
Sbjct: 17 FDSRTKWPHCVH--PIRNQEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQYMVSCDS 74
Query: 142 ACTGGDVCEGGNPMRAWYYMLENGVPT 168
G C+GG AW ++ G+P+
Sbjct: 75 TDYG---CDGGYLNNAWAFLAGTGIPS 98
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 55/104 (52%), Gaps = 4/104 (3%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q E+P FD R+++P+C+ IG V+ QS+CGS + SDR CI + G + LS
Sbjct: 89 QVFEEIPNSFDARQKWPSCSQIGAVRDQSDCGSAAHLVAAEIASDRTCIFSNGTFNWPLS 148
Query: 133 SDHLLTCCAA----CTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ L+CC C G C+G P + +G+ TGG+Y
Sbjct: 149 AQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNY 192
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 54/96 (56%), Gaps = 10/96 (10%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWA-----IATTAALSDRMCIATQGRLDHTLS 132
LPE FD R+++P C I ++ Q CGSCWA I ++ LSDR CIA+ G+++ LS
Sbjct: 2 LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L++C G C+GG AW Y+ G+ T
Sbjct: 60 PQDLVSCNWYNAG---CDGGILWAAWIYLKHTGIVT 92
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+ P +FD R+Q+P C I ++ Q NCGSCWA + ++ L+DR CI + G+++ LS +
Sbjct: 124 DFPTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQFM 181
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENG 165
++C G C GG W +++ G
Sbjct: 182 VSCSGQNNG---CNGGFFDATWRFLVSVG 207
>gi|56757323|gb|AAW26833.1| SJCHGC00037 protein [Schistosoma japonicum]
Length = 162
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 56 QEPNPDLQLGSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAA 114
++ +P+L+ D+ E+P FD RK++P C +I ++ QS C S WA++ A
Sbjct: 67 RKEDPNLREKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAVSAVGA 126
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACT 144
+SDR+CI + G+ LS+ L++CC T
Sbjct: 127 MSDRICIQSGGKQSVELSAVDLISCCNYTT 156
>gi|343422787|emb|CCD18361.1| cysteine peptidase C (CPC), putative, (fragment) [Trypanosoma vivax
Y486]
Length = 153
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 44/77 (57%), Gaps = 1/77 (1%)
Query: 64 LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
L HF + + LPE FD +P+C I + QS+CGSCWA+A A+SDR C+ T
Sbjct: 77 LPRRHFTEEELRAPLPESFDAATAWPDCPTIKRIADQSSCGSCWAVAAATAMSDRFCV-T 135
Query: 124 QGRLDHTLSSDHLLTCC 140
G +S+ LL+CC
Sbjct: 136 GGVRALGISAGDLLSCC 152
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 62/125 (49%), Gaps = 11/125 (8%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGS-EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQ 100
++K G L + P LQ+ S + G ++P F+ + +PNCT I +Q Q
Sbjct: 47 FDNIKVGQLLGFKRSPNRPKLQIKSYDPLG-----VQIPTSFNAQTNWPNCTTISQIQNQ 101
Query: 101 SNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYY 160
+ CGSCWA T + +DR+CI + LS ++TC G CEGG+ AW +
Sbjct: 102 ARCGSCWAFGATESATDRLCIHNNENVQ--LSFMDMVTCDETDNG---CEGGDAFSAWNW 156
Query: 161 MLENG 165
+ + G
Sbjct: 157 LRKQG 161
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 54/92 (58%), Gaps = 5/92 (5%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPE F+ + +PN + ++ Q+ CGSCWA A + LSDR IA+ G ++ LS + L
Sbjct: 93 DLPESFNCYENWPNYMH--PIRDQARCGSCWAFAASEVLSDRFAIASNGTVNKILSPEDL 150
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++C G C+GG +AW Y+ NG+ T
Sbjct: 151 VSCDKGDMG---CQGGYLDKAWDYLKTNGIVT 179
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 54/110 (49%), Gaps = 8/110 (7%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSC--W-----AIATTAALSDRMCIATQ 124
+ + LPE F R+Q+P C I + Q G W A A+SDR+CI T
Sbjct: 79 FTEDLNLPESFYAREQWPQCPTIXXXRAQPGRGGLTRWGSFLQAFGAVEAISDRICIHTN 138
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+ +S++ LLTCC + GD C GG P AW + G+ +GG Y S
Sbjct: 139 AHISVEVSAEDLLTCCGSMC-GDGCNGGYPAEAWNFWTRKGLVSGGLYDS 187
>gi|452268|emb|CAA80451.1| cathepsin B-like protease [Fasciola hepatica]
Length = 104
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/72 (44%), Positives = 44/72 (61%), Gaps = 2/72 (2%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q CG+CWA A+SDR+CI ++G++ LS+ LL+CC C G C GG+P AW
Sbjct: 1 QGQCGTCWAFGAVGAMSDRVCIHSKGQMKPHLSARDLLSCCEFC--GRGCRGGSPALAWD 58
Query: 160 YMLENGVPTGGD 171
Y +G+ TGG
Sbjct: 59 YWKSSGIVTGGS 70
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 49/89 (55%), Gaps = 4/89 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD ++P + +Q Q CGS WAI T A SDR I ++GR TLS+ HLL
Sbjct: 79 LPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 136
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+C G C GG RAW Y+ + G+
Sbjct: 137 SCDR--RGQQSCNGGYLDRAWSYIRKIGL 163
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 53/90 (58%), Gaps = 4/90 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP FD R+++P+ I +Q Q +C S WA +T A +DR+ + T+GR + LS+
Sbjct: 79 ELPTSFDARQKWPDF--IHPIQDQGDCASSWAQSTAATSADRLALITEGRQNVALSAQQF 136
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG RAW+Y+ + GV
Sbjct: 137 LSCNQHRQKG--CEGGYLDRAWWYIRKFGV 164
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 54/98 (55%), Gaps = 4/98 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD R + NC+ I + +S C + WAIAT ++SDR+CI + GR+ LS+
Sbjct: 86 NMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSAR 145
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
++C + G C G+ + Y + G+ TGG Y
Sbjct: 146 DAISC--GFSPG--CFHGSEVEVLVYWITYGIVTGGSY 179
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P FD R + NC+ I + +S C + WAIAT ++SDR+CI + GR+ LS+
Sbjct: 25 NMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRISVQLSAR 84
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
++C + C G+ + Y + G+ TGG Y
Sbjct: 85 DAISCGFSPG----CFHGSEVEVLVYWITYGIVTGGSY 118
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 58/110 (52%), Gaps = 7/110 (6%)
Query: 70 GDYQSNT---ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
GD ++N ++P FD R+++P CT IG V+ QS+CGS + SDR CI + G
Sbjct: 83 GDSENNQVLLDIPTYFDSRQKWPECTQIGAVRDQSDCGSAAHLVAVELASDRTCIFSNGT 142
Query: 127 LDHTLSSDHLLTCCAA----CTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+ LS+ L+CC C G C+G P + +G+ TGG+Y
Sbjct: 143 FNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNY 192
>gi|294946296|ref|XP_002785014.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239898389|gb|EER16810.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 232
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 40/64 (62%), Gaps = 1/64 (1%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP +FD R +PNC+ IGH++ QS CGSCWA T A +DR+CI + G LS+
Sbjct: 139 DLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGE 198
Query: 136 LLTC 139
+ C
Sbjct: 199 MNAC 202
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 66/128 (51%), Gaps = 10/128 (7%)
Query: 46 KFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNCG 104
KF L P P+P L +E S T+LPE F ++P T H L Q NC
Sbjct: 181 KFRLGTLP----PSPTLLSMNEMTVTLPSQTDLPEFFISSYKWPGWT---HDPLDQKNCA 233
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
+ WA +T + +DR+ I ++GR LS +L++CC G C+GG+ RAW+Y+ +
Sbjct: 234 ASWAFSTASVAADRIAIQSKGRYTDNLSPQNLISCCVKNRHG--CKGGSIDRAWWYLRKR 291
Query: 165 GVPTGGDY 172
G+ + Y
Sbjct: 292 GLVSHACY 299
>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
Length = 247
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 51/94 (54%), Gaps = 5/94 (5%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q N +P+ FD R+Q+ NC + ++ Q+ CGSCWA + LSDR+CIA+ + D LS
Sbjct: 24 QHNDIVPKTFDSREQWGNCVH--PIRDQAQCGSCWAFGASETLSDRICIASDKKTDVILS 81
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+ L+ C G C GG AW Y+ G
Sbjct: 82 PEDLVACDGWNMG---CNGGILPWAWSYLTNTGA 112
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 50/97 (51%), Gaps = 4/97 (4%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
LPE FD R+ +P I V Q CGS WAI+T + SDR+ I + G ++ LS H
Sbjct: 195 ARLPETFDARENWPGL--IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQH 252
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
LL+C G C GG RAWY++ G + Y
Sbjct: 253 LLSC--NIRGQRGCSGGYLDRAWYHLRRAGAVSRACY 287
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 69/139 (49%), Gaps = 12/139 (8%)
Query: 41 FLSSLKFGLSLTPQSQEPNPDLQLGSEHFG---------DYQSNTELPEEFDLRKQYPNC 91
+L++++ GL SQ P+ + +L S + D +P FD+R + C
Sbjct: 39 YLNTIQ-GLFHLKDSQSPDTEKKLMSAKYKHTVDICGREDRSLALSIPPSFDVRSLWHVC 97
Query: 92 TNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEG 151
+ + ++ Q+ CGSCWA++ +SDR+C+ + + +S +L+CC G C G
Sbjct: 98 S-LNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDTDILSCCGLYCGYG-CNG 155
Query: 152 GNPMRAWYYMLENGVPTGG 170
G P+ AW + G TGG
Sbjct: 156 GFPIEAWRHFTVAGNCTGG 174
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFIASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 63/123 (51%), Gaps = 10/123 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGV 166
G+
Sbjct: 299 RGL 301
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 42/64 (65%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
N E+P +FD RK++P+C +I ++ QS CGSCWA A++DR+CI + G LS+
Sbjct: 87 NVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSAL 146
Query: 135 HLLT 138
L++
Sbjct: 147 DLIS 150
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 78/174 (44%), Gaps = 24/174 (13%)
Query: 4 PTPQYVNHSH-------HLLLRHVTRDSNPGLWADPDILKSSPSFLSSLK---FGLSLTP 53
PT NH+H L+ + + G W + + +S L+ FGLSL
Sbjct: 65 PTHPIANHTHANTPVNDKSLIDRINSNHTHG-WKATEYSRFDNMTISQLRDNLFGLSLM- 122
Query: 54 QSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
S E P + + ++ ++P FD R Q+ C + ++ Q CG+CWA +
Sbjct: 123 SSDEDTPRM-------ANIETRIDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANY 173
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
L+ R+CIAT G+ + LS ++ + C T C+GG +W ++ G P
Sbjct: 174 VLAHRLCIATNGQTNVVLSPEYQVQC---DTMNKACQGGYLKYSWTFLENTGTP 224
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 61/120 (50%), Gaps = 16/120 (13%)
Query: 47 FGLSLTPQSQEPN-PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGS 105
FGLSL S +P+ P L D + +LP FD R Q+ C I V+ Q CG+
Sbjct: 110 FGLSLL--STDPDTPRL--------DIEPRVDLPMNFDARTQWRGC--IPAVRDQQTCGA 157
Query: 106 CWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENG 165
CWA + T L+ R+CIAT G+ + LS ++ + C T C+GG AW ++ G
Sbjct: 158 CWAFSATYVLAHRLCIATNGKTNVVLSPEYQVQC---DTMNKACQGGYLKYAWSFLERTG 214
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 55/112 (49%), Gaps = 23/112 (20%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQ--------------------LQSNCGSCWAIATTAALS 116
+LP+EFD R + +CT+I + L +CGSCWA +LS
Sbjct: 102 KLPKEFDARTAWSHCTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAVESLS 161
Query: 117 DRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
DR CI + L+ +LS++ ++ CC G C GG PM AW Y +GV T
Sbjct: 162 DRFCI--KYNLNVSLSANDVIACCGLLCGFG-CNGGFPMGAWLYFKYHGVVT 210
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 77/174 (44%), Gaps = 24/174 (13%)
Query: 4 PTPQYVNHSH-------HLLLRHVTRDSNPGLWADPDILKSSPSFLSSLK---FGLSLTP 53
PT NH+H L+ + + G W + + +S L+ FGLSL
Sbjct: 13 PTHPIANHTHANTPVNDKSLIDRINSNHTHG-WKATEYSRFDNMTISQLRDNLFGLSLM- 70
Query: 54 QSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
S E P + ++ ++P FD R Q+ C + ++ Q CG+CWA +
Sbjct: 71 SSDEDTPRM-------ASIETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANY 121
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
L+ R+CIAT G+ + LS ++ + C T C+GG +W ++ G P
Sbjct: 122 VLAHRLCIATNGKTNVVLSPEYQVQC---DTMNKACQGGYLKYSWTFLENTGTP 172
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 49/89 (55%), Gaps = 4/89 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD ++P + +Q Q CGS WAI T A SDR I ++GR TLS+ HLL
Sbjct: 205 LPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 262
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+C G C GG RAW Y+ + G+
Sbjct: 263 SCDR--RGQQSCNGGYLDRAWSYIRKIGL 289
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 60/103 (58%), Gaps = 3/103 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++PNC I + QS C + WA++T +A+SDR C G+
Sbjct: 80 RFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK- 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG 170
+S+ HLL+CC C C+GG P AW Y +E G+ + G
Sbjct: 139 QLRISAAHLLSCCKQCG--GGCKGGFPGFAWLYYVEYGIASSG 179
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E T+LPE F ++P T H L Q NC
Sbjct: 176 FKFRLGTLP----PSPLLLSMNEMTASLPKTTDLPEFFVASYKWPGWT---HGPLDQKNC 228
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 229 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 286
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 287 RGLVSHACY 295
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 63/123 (51%), Gaps = 10/123 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 176 FKFRLGTLP----PSPMLLSMNEMTAPLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 228
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 229 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 286
Query: 164 NGV 166
G+
Sbjct: 287 RGL 289
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFHLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|403331769|gb|EJY64852.1| hypothetical protein OXYTRI_15000 [Oxytricha trifallax]
Length = 259
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 63/122 (51%), Gaps = 10/122 (8%)
Query: 47 FGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSC 106
FG S+ PQ + +L + + + LP FD ++P+C I + Q +CGSC
Sbjct: 7 FGKSIKPQPSSYSLNLNITQKLLA-----SNLPLSFDSTVEWPDC--IHATRNQGSCGSC 59
Query: 107 WAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+A A + +SDR+CI + G+++ LS L++C G C GG YY++ G+
Sbjct: 60 YAFAASGMMSDRLCIKSNGQINLVLSPQELVSCDYQNYG---CSGGWMTNTLYYLMSYGI 116
Query: 167 PT 168
P+
Sbjct: 117 PS 118
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 54/95 (56%), Gaps = 5/95 (5%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
++T++P+ FD R++YP+C I V Q +CGSCWA ++ A+L DR C A + T S
Sbjct: 70 ADTKVPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSP 127
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++++C G C+GG W ++ + G T
Sbjct: 128 QYVVSCDH---GDMACDGGWLQSVWRFLTKTGTTT 159
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 62/123 (50%), Gaps = 10/123 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E T+LPE F ++P T H L Q NC
Sbjct: 188 FKYRLGTLP----PSPLLLSMNEVTASLTKTTDLPEFFIASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I +QGR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHG--CNSGSVDRAWWYLRK 298
Query: 164 NGV 166
G+
Sbjct: 299 RGL 301
>gi|227018338|gb|ACP18835.1| cysteine proteinase 2 [Chrysomela tremula]
Length = 179
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R+ +P +I ++ QS+C S WA+A AA+SDR+CI +QG LS + LL
Sbjct: 83 LPENFDARQNWPESESIRMIRDQSSCASSWAVAAAAAMSDRICIYSQGTYRTILSDEELL 142
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+CC G C+GG AWYY E+G+ +GG Y S
Sbjct: 143 SCCTDLEHG--CDGGYHPDAWYYWKEHGIVSGGPYNS 177
>gi|148694398|gb|EDL26345.1| tubulointerstitial nephritis antigen, isoform CRA_b [Mus musculus]
Length = 258
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 62/123 (50%), Gaps = 10/123 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + +LPE F ++P T H L Q NC
Sbjct: 29 FKFRLGTLP----PSPMLLSMNEMTASFPPRADLPEIFIASYKWPGWT---HGPLDQKNC 81
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+++ +
Sbjct: 82 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWFLRK 139
Query: 164 NGV 166
G+
Sbjct: 140 RGL 142
>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 78/174 (44%), Gaps = 24/174 (13%)
Query: 4 PTPQYVNHSH-------HLLLRHVTRDSNPGLWADPDILKSSPSFLSSLK---FGLSLTP 53
PT NH+H L+ + + G W + + +S L+ FGLSL
Sbjct: 13 PTHPIANHTHANTPVNDKSLIDRINSNHTHG-WKATEYSRFDNMTISQLRDNLFGLSLM- 70
Query: 54 QSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
S E P + + ++ ++P FD R Q+ C + ++ Q CG+CWA +
Sbjct: 71 SSDEDTPRM-------ANIETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACWAFSANY 121
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
L+ R+CIAT G+ + LS ++ + C T C+GG +W ++ G P
Sbjct: 122 VLAHRLCIATNGQTNVVLSPEYQVQC---DTMNKACQGGYLKYSWTFLENTGTP 172
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 62/123 (50%), Gaps = 10/123 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E T+LPE F ++P T H L Q NC
Sbjct: 188 FKYRLGTLP----PSPLLLSMNEVTASLAETTDLPEFFIASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I +QGR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHG--CNSGSVDRAWWYLRK 298
Query: 164 NGV 166
G+
Sbjct: 299 RGL 301
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 54/90 (60%), Gaps = 4/90 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELP FD R+++P I V+ Q +C S W+ +TTA +DR+ I T GR++ LS+ L
Sbjct: 183 ELPSSFDAREKWP--LYIHPVRDQGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQL 240
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG RAW+Y+ + GV
Sbjct: 241 LSCNQHRQRG--CEGGYLDRAWWYIRKLGV 268
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTAPLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTAPLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 4/76 (5%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAA---CTGGDVCEGGNPMR 156
QS CGSCWA T A + R+CI + G+L+ LS+ ++L CC C C GGNP+
Sbjct: 2 QSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFG-CSGGNPIT 60
Query: 157 AWYYMLENGVPTGGDY 172
+W ++ NG+ +GG +
Sbjct: 61 SWTFLHTNGIVSGGGF 76
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + + +LPE F ++P T H L Q NC
Sbjct: 187 FKFRLGTLP----PSPTLLSMNEMTATFPARADLPEVFISSYKWPGWT---HGPLDQKNC 239
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+++ +
Sbjct: 240 AASWAFSTASVAADRIAIQSRGRYTANLSPQNLISCCAKKRHG--CNSGSIDRAWWFLRK 297
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 298 RGLVSHACY 306
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 39/128 (30%)
Query: 77 ELPEEFDLRKQYPNCTNIG------------------------------------HVQLQ 100
+LP+ FD R +P C+ IG +++ Q
Sbjct: 98 KLPKHFDARTAWPQCSTIGKILGRLLDSFSSYFDDFFCFGCTDALYFSYHLLVPFYIKDQ 157
Query: 101 SNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYY 160
+CGSCWA +LSDR CI ++ +LS + LL CC G C+GG P+ AW Y
Sbjct: 158 GHCGSCWAFGAVESLSDRFCI--HFGMNISLSVNDLLACCGFLCGSG-CDGGYPLYAWRY 214
Query: 161 MLENGVPT 168
+ +GV T
Sbjct: 215 FIHHGVVT 222
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 52/104 (50%), Gaps = 5/104 (4%)
Query: 65 GSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
G+ G + ++PE FD R++YP+C I V Q CGSCWA ++ A DR CIA
Sbjct: 62 GAAPRGTFADKDDVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGL 119
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ S ++++C G C GG AW ++ + G T
Sbjct: 120 DKKPVKYSPQYVVSCDH---GNMACNGGWLPNAWKFLTKTGTTT 160
>gi|206725499|ref|NP_001128673.1| cathepsin L like protein precursor [Bombyx mori]
gi|198041259|dbj|BAG70408.1| cathepsin L like protein [Bombyx mori]
Length = 547
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 59/109 (54%), Gaps = 7/109 (6%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP E+DLR + V+ Q +CGSCW TTAA + GRL +LS+ +
Sbjct: 331 DLPSEYDLRI----LGYVSKVKNQEDCGSCWTFGTTAAAEGALARINGGRL-LSLSNQAI 385
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
L C A GG CEGG+ A+ +M++ G+PT +YGS D G CN
Sbjct: 386 LDC-AWPYGGSGCEGGSDNAAYDWMMKFGLPTEEEYGSYTNAD-GICNI 432
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 10/113 (8%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD R ++P +I + Q CG+ WA++T SDR I ++G D LS+ HLL
Sbjct: 203 LPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGAEDVELSAQHLL 260
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV------PTGGDYGSCQRFDRGNCN 184
+C G C GG RAW +M + G+ P G C+ R N N
Sbjct: 261 SC--NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWTGRNDQCRLRKRSNLN 311
>gi|326435817|gb|EGD81387.1| cathepsin Z [Salpingoeca sp. ATCC 50818]
Length = 884
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 68/152 (44%), Gaps = 11/152 (7%)
Query: 22 RDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEE 81
R P + AD + ++P F + P ++ P + S Y +++P+
Sbjct: 316 RKQMPTMPADGQLFTTNPDIKKGAYFRYN-KPAVRKGTPTSHVVSPMPHTYLRPSDVPQT 374
Query: 82 FDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+D R N + + + N CGSCWA TT+AL+DR+ +A S
Sbjct: 375 YDPR----NINGVDYTTVNRNQHIPQYCGSCWAHGTTSALADRIKLARNRTFPDIQPSVQ 430
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
+L C CEGG+P A+ Y+L+NG+P
Sbjct: 431 VLVDCVTLNNTHGCEGGDPTAAYSYILQNGIP 462
Score = 39.7 bits (91), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 59/149 (39%), Gaps = 30/149 (20%)
Query: 46 KFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN--- 102
+ G + P + P P +Y LPE F + N + + + N
Sbjct: 30 EIGWTFGPVVKSPQPH---------EYLDVGALPESF----AWNNVSGVNFLSRSRNQHI 76
Query: 103 ---CGSCWAIATTAALSDRMCIATQGRL-DHTLSSDHLLTCCAACTGGDVCEGGNPMRAW 158
CGSCWA TT+AL+DR+ I + + L+ LL C + G W
Sbjct: 77 PQYCGSCWAHGTTSALNDRLSIMRRNAWPEINLAPQVLLNCNGGGSCGGGAP----GGVW 132
Query: 159 YYMLENGVPTGGDYGSCQRFDR--GNCNC 185
Y+ NG+P +CQ ++ G CN
Sbjct: 133 EYIYHNGIPD----ETCQNYEARDGQCNA 157
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTXPLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 61/105 (58%), Gaps = 1/105 (0%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + LP+ FD R+Q+P C+++ ++ Q CGSC ++ +A++DR CI ++G+ T
Sbjct: 56 FAEDLVLPKSFDARQQWPQCSSLNEIRTQGCCGSCAYVSGASAMTDRWCIHSKGKKQFTF 115
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQ 176
+ LL+CC C GG G W Y ++ GV +GG YGS Q
Sbjct: 116 GAFDLLSCCYECGGGCTGGGIP-GPIWSYWVKQGVSSGGPYGSNQ 159
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 51/84 (60%), Gaps = 4/84 (4%)
Query: 91 CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDV-C 149
C ++ ++ Q+NCGSCWA +T A++DRMCIA+ G + LS+ + +C GD+ C
Sbjct: 1 CPSLKEIRDQANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKL---GDMGC 57
Query: 150 EGGNPMRAWYYMLENGVPTGGDYG 173
GG P + Y +G+ GG+YG
Sbjct: 58 NGGIPSSVYSYWALSGIVDGGNYG 81
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 65/129 (50%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CC+ G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCSKNRPG--CNSGSIDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 56/111 (50%), Gaps = 10/111 (9%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD R ++ +I +V Q CG+ WAI+T +DR I ++G D LS+ HLL
Sbjct: 261 LPREFDSRTRWSR--DISNVHDQGWCGASWAISTADVATDRFSIMSKGAEDAELSAQHLL 318
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV------PTGGDYGSCQRFDRGN 182
+C G C GG RAW +M + G+ P G G C+ R N
Sbjct: 319 SC--NNRGQQGCRGGYLDRAWLFMRKFGLVDKDCYPWTGKNGQCKLRKRNN 367
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 79/174 (45%), Gaps = 24/174 (13%)
Query: 4 PTPQYVNHSH-------HLLLRHVTRDSNPGLWADPDILKSSPSFLSSLK---FGLSLTP 53
PT NH+H L+ + + G W + + +S L+ FGLSL
Sbjct: 28 PTHPIANHTHANTPVNDKSLIDRINSNHTHG-WKATEYSRFDNMTISQLRDNLFGLSLM- 85
Query: 54 QSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
+ E P ++ + ++ ++P FD R Q+ C + ++ Q CG+CWA +
Sbjct: 86 STDEDTPRME-------NIETRMDIPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANY 136
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
L+ R+CIAT G+ + LS ++ + C T C+GG +W ++ G P
Sbjct: 137 VLAHRLCIATNGQTNVVLSPEYQVQC---DTMNKACQGGYLKYSWTFLENTGTP 187
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 42/73 (57%), Gaps = 4/73 (5%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
QS CGSCWA T A +DR+CI + G LS+ + ACT C GG+P AW
Sbjct: 2 QSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEM----NACTLFFGCGGGDPYSAWS 57
Query: 160 YMLENGVPTGGDY 172
++ + G+ TGGDY
Sbjct: 58 WVHDKGIATGGDY 70
>gi|325180819|emb|CCA15230.1| cathepsinlike cysteine protease putative [Albugo laibachii Nc14]
Length = 660
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 65/120 (54%), Gaps = 25/120 (20%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIA-TQGR- 126
++ELP+ FDLR + + HV + N CGSCWA A+T++LSDR+ I ++ R
Sbjct: 45 SSELPKNFDLR----DIDGVNHVTITRNQHIPFYCGSCWAFASTSSLSDRIHIQRSRNRK 100
Query: 127 -------LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFD 179
L + + +L C GG C GG+P+ A+ Y+ ENG+P SCQR++
Sbjct: 101 EKSPVDVLREVVLAPQVLLNCDTADGG--CHGGDPLSAFRYIHENGIPD----ESCQRYE 154
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 51/115 (44%), Gaps = 17/115 (14%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQGRLDHT 130
ELP+ +D R N + V N CGSCWA TT+ALSDR+ I
Sbjct: 389 ELPKAWDWR----NVNGVSFVTWDKNQHIPHYCGSCWAQGTTSALSDRIMILRNATWPEI 444
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
S +L C GG C GGNP + Y +G+P +CQ + N NC
Sbjct: 445 ALSPQVLI---NCHGGGSCAGGNPGLVYEYAHRHGIPD----QTCQAYQAQNLNC 492
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + +LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPALLGMNEVTAALPAKIDLPEFFIASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I + GR LS +L++CCA G C GG+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSSGRYTANLSPQNLISCCARKRHG--CGGGSVDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|294879679|ref|XP_002768754.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239871591|gb|EER01472.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 194
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
Query: 78 LPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P FD R + C IGHV QS CGSCWAIA A + R+CI + G+ + LS+ +
Sbjct: 129 IPSSFDARDAFKECKGVIGHVWDQSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAGEM 188
Query: 137 LTCC 140
L CC
Sbjct: 189 LACC 192
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 4/96 (4%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
D + ++P+ FD R ++ + I + Q NC S WA +T SDR+ I + G T
Sbjct: 172 DIKMKKKIPKSFDARDKWG--SMITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMT 229
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
LS HLL+C G C GG+ RAW++M + GV
Sbjct: 230 LSPQHLLSC--NTRGQRGCSGGHIDRAWWFMRKRGV 263
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 64.3 bits (155), Expect = 2e-08, Method: Composition-based stats.
Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 14/113 (12%)
Query: 78 LPEEFDLR----KQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD-HTLS 132
LP+ +D R + Y T H+ CGSCW+ A+ +++SDR+ + T+G+ H LS
Sbjct: 43 LPKSYDPRDIDGRNYVTVTKNQHIP--QYCGSCWSFASVSSVSDRLKLMTKGKWPVHDLS 100
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
+L C G C+GG+P+ A+ YM ++GVP G C R+ N C
Sbjct: 101 PQVILNCDHNSNG---CQGGHPLTAFKYMHDHGVPEEG----CMRYMAKNMEC 146
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 34/121 (28%), Positives = 58/121 (47%), Gaps = 19/121 (15%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQG 125
Y + ++P+ +D+R N + + N CGSCWA A T+ALSDR+ + +G
Sbjct: 321 YIKSEDIPKNYDIR----NIDGVNYATWDKNQHIPQYCGSCWAQAPTSALSDRINLMRKG 376
Query: 126 RLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
+ LS+ ++ C A T C+GG+ + Y G+P +CQ ++ +
Sbjct: 377 KWPTVELSAQEVINCSNAGT----CDGGSDADVFEYAFNEGIPD----QTCQVYEAIDKE 428
Query: 185 C 185
C
Sbjct: 429 C 429
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 53/95 (55%), Gaps = 5/95 (5%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S T++P+ FD R++YP+C I V Q CGSCWA ++ A++ DR C+A + S
Sbjct: 70 SATQVPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCVAGLDKKAVRYSP 127
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++++C G C+GG W ++++ G T
Sbjct: 128 QYVVSC---DRGDMACDGGWLPSVWRFLVKTGTTT 159
>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
Length = 322
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 43/71 (60%), Gaps = 2/71 (2%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
Q +P FD R Q+PNC I V+ Q +C SCWA+ +++ L+DR+CIA+ G + LS
Sbjct: 34 QDRANIPASFDARTQWPNC--ISPVRDQGSCSSCWAMTSSSILADRLCIASGGAIKKLLS 91
Query: 133 SDHLLTCCAAC 143
+++ C C
Sbjct: 92 PQYMVDCAKNC 102
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 63/123 (51%), Gaps = 10/123 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKYRLGTLP----PSPRLLSMNEMTASLPATTDLPEFFIASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+++ +
Sbjct: 241 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWFLRK 298
Query: 164 NGV 166
G+
Sbjct: 299 RGL 301
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 66/130 (50%), Gaps = 6/130 (4%)
Query: 44 SLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSN 102
+L+ G + + P+P L +E + T+LPE F ++P T H L Q N
Sbjct: 183 TLEEGFTFRLGTLAPSPMLLSMNEVTAALPAKTDLPEFFIASYKWPGWT---HDPLDQKN 239
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
C + WA +T + +DR+ I + GR LS +L++CC G C GG+ RAW+Y+
Sbjct: 240 CAASWAFSTASVAADRIAIQSNGRYTVNLSPQNLISCCLKHRYG--CSGGSIDRAWWYLR 297
Query: 163 ENGVPTGGDY 172
+ G+ + Y
Sbjct: 298 KRGLVSHACY 307
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + +LPE F ++P T H L Q NC
Sbjct: 187 FKFRLGTLP----PSPMLLSMNEMTASFPPRADLPEIFIASYKWPGWT---HGPLDQKNC 239
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+++ +
Sbjct: 240 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWFLRK 297
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 298 RGLVSHACY 306
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E + +LPE F ++P T H L Q NC
Sbjct: 187 FKFRLGTLP----PSPMLLSMNEMTASFPPRADLPEIFIASYKWPGWT---HGPLDQKNC 239
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+++ +
Sbjct: 240 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWFLRK 297
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 298 RGLVSHACY 306
>gi|308811264|ref|XP_003082940.1| cysteine proteinase (ISS) [Ostreococcus tauri]
gi|116054818|emb|CAL56895.1| cysteine proteinase (ISS) [Ostreococcus tauri]
Length = 362
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 56/107 (52%), Gaps = 2/107 (1%)
Query: 35 LKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFG-DYQSNTELPEEFDLRKQYPNCTN 93
L SS S L +L FG + + L + + LP+ FD+R+++P C
Sbjct: 44 LSSSLSVLGTLSFGRRKSARMGSLEDRLAKTWDPTKIKLHAGGRLPDTFDVREKWPKCAA 103
Query: 94 -IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTC 139
+ Q CGSCWA+A A++DR+CIAT G ++ +S+ LL+C
Sbjct: 104 LVSEAVDQGACGSCWAVAPAKAMTDRLCIATNGAVNTHVSAIQLLSC 150
>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
Length = 562
Score = 63.9 bits (154), Expect = 3e-08, Method: Composition-based stats.
Identities = 39/113 (34%), Positives = 60/113 (53%), Gaps = 14/113 (12%)
Query: 78 LPEEFDLR----KQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLD-HTLS 132
LP+ +D R + Y T H+ CGSCW+ A+ +++SDR+ + T+G+ H LS
Sbjct: 43 LPKSYDPRDIDGRNYVTVTKNQHIP--QYCGSCWSFASVSSVSDRLKLMTKGKWPVHDLS 100
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
+L C G C+GG+P+ A+ YM ++GVP G C R+ N C
Sbjct: 101 PQVILNCDHNSNG---CQGGHPLTAFKYMHDHGVPEEG----CMRYMAKNMEC 146
Score = 49.3 bits (116), Expect = 7e-04, Method: Composition-based stats.
Identities = 38/138 (27%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 60 PDLQLGSEHF-----GDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWA 108
PD + EH Y + ++P+ +D+R N + + N CGSCWA
Sbjct: 304 PDTKEMKEHVVSPRPHTYIKSEDIPKNYDIR----NIDGVNYATWDKNQHIPQYCGSCWA 359
Query: 109 IATTAALSDRMCIATQGRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
A T+ALSDR+ + +G+ LS ++ C+G CEGG + Y G+P
Sbjct: 360 QAPTSALSDRINLMRKGKWPTVELSVQEIIN----CSGKGSCEGGWQSGVYQYAYHQGIP 415
Query: 168 TGGDYGSCQRFDRGNCNC 185
+CQ ++ + C
Sbjct: 416 D----QTCQVYEAIDKEC 429
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E T+LPE F ++P T H L Q NC
Sbjct: 80 FKYRLGTLP----PSPLLLSMNEVTASLPETTDLPEFFVASYKWPGWT---HGPLDQKNC 132
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 133 AASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 190
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 191 RGLVSHACY 199
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 48/89 (53%), Gaps = 4/89 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD R ++P I + Q CG+ WAI+TT SDR + ++G LS+ HLL
Sbjct: 202 LPREFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSAQHLL 259
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+C G C GG RAW YM + G+
Sbjct: 260 SC--NNRGQQACSGGYLDRAWLYMRKFGL 286
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++PNC I + QS C + WA++T +A+SDR C G+
Sbjct: 80 RFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK- 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C C+GG P AW Y +E G+ +
Sbjct: 139 QLRISAAHLLSCCKQCG--GGCKGGFPGFAWRYYVEYGIAS 177
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 63/127 (49%), Gaps = 9/127 (7%)
Query: 40 SFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL 99
+F LK L SQ L + + H+ + +LP EFD R Q+ N +I VQ
Sbjct: 201 TFDDGLKLRLGTINPSQSTRQMLPV-TRHY----NPNDLPREFDSRIQWGN--DITPVQD 253
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q CG+ WAI+T SDR I ++G LS HL++C G C+GG RAW
Sbjct: 254 QGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHLISCNNRGQRG--CKGGYLDRAWL 311
Query: 160 YMLENGV 166
+M + GV
Sbjct: 312 FMRKFGV 318
>gi|395829284|ref|XP_003787790.1| PREDICTED: cathepsin Z [Otolemur garnettii]
Length = 300
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 61/116 (52%), Gaps = 18/116 (15%)
Query: 62 LQLGSEHF---GDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATT 112
LQLG + +Y S + LP+ +D R N + +V + N CGSCWA A+T
Sbjct: 40 LQLGHRTYPRPHEYLSTSHLPKSWDWR----NVNGVNYVSITRNQHIPQYCGSCWAHAST 95
Query: 113 AALSDRMCIATQGRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
+A++DR+ I +G T LS H+L C A + CEGGN + W Y +G+P
Sbjct: 96 SAMADRINIKRKGAWPSTLLSVQHVLDCGDAGS----CEGGNDLPVWAYAHRHGIP 147
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++PNC I + QS C + WA++T +A+SDR C G+
Sbjct: 80 RFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASAISDRYCTVGGGK- 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C C+GG P AW Y +E G+ +
Sbjct: 139 QLRISAAHLLSCCKQCG--GGCKGGFPGFAWRYYVEYGIAS 177
>gi|254746346|emb|CAX16638.1| putative C1A cysteine protease precursor [Spodoptera frugiperda]
Length = 552
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 9/138 (6%)
Query: 48 GLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCW 107
GL P+ + N E + + +P+E+D R + HV+ Q +CGSCW
Sbjct: 309 GLLKRPEGKSGNVPFPYTEEKLNELSED--MPKEYDTRL----LGLVSHVKNQEDCGSCW 362
Query: 108 AIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
TTAA+ + G+L L++ L+ C A G C+GG A+ +M+E G+P
Sbjct: 363 TFGTTAAVEGALARINGGKL-MALANQALVDCVWAF-GAAGCDGGTDNAAYEWMMEYGLP 420
Query: 168 TGGDYGSCQRFDRGNCNC 185
T +YG D G CN
Sbjct: 421 TEAEYGPYTNKD-GECNI 437
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 192 FKYRLGTLP----PSPMLLSMNEVTPSLPATTDLPEFFIASYKWPGWT---HGPLDQKNC 244
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I + GR LS +L++CCA G C G+ RAW+Y+ +
Sbjct: 245 AASWAFSTASVAADRIAIQSNGRFTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLRK 302
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 303 RGLVSHACY 311
>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
Length = 163
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 42/65 (64%), Gaps = 2/65 (3%)
Query: 110 ATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
T++A +DR+CIAT G + LS++ L CC C G C GG P++AW + ++G+ TG
Sbjct: 1 GTSSAFADRLCIATDGEFNELLSAEELAFCCHKCGFG--CHGGYPIKAWEWFKKHGLVTG 58
Query: 170 GDYGS 174
GDY S
Sbjct: 59 GDYDS 63
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 53/88 (60%), Gaps = 5/88 (5%)
Query: 81 EFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCC 140
EFD R+++PNC + ++ Q NCGSC++ A++ +SDR CI + G ++ LS L+TC
Sbjct: 5 EFDSRQKWPNCVH--PIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCS 62
Query: 141 AACTGGDVCEGGNPMRAWYYMLENGVPT 168
G C GG P + Y+ ++G+ +
Sbjct: 63 WYSFG---CNGGIPGLVFDYIHKDGLVS 87
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
LKF L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 LKFRLGTLP----PSPMLLSMNEVTPSLPATTDLPEFFVASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I + GR LS +L++CC G C G+ RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCTKNRHG--CNSGSVDRAWWYLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 43/71 (60%), Gaps = 2/71 (2%)
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
GSCWA A+SDR+CI + G++ +SS+ LL CC +C G C GG P AW + +
Sbjct: 1 GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCDSCGMG--CNGGYPSAAWDFWTD 58
Query: 164 NGVPTGGDYGS 174
G+ +GG Y S
Sbjct: 59 VGLVSGGLYDS 69
>gi|294897635|ref|XP_002776031.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239882504|gb|EER07847.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 108
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 42/70 (60%), Gaps = 1/70 (1%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LP FD R+++ +C IGHV+ QS C +CWA++ T L+DR+CI + G LS +
Sbjct: 36 DLPSNFDARQKFASCAGVIGHVRDQSACNNCWAVSPTGMLNDRVCIKSGGSFRDILSVGY 95
Query: 136 LLTCCAACTG 145
+CC G
Sbjct: 96 FTSCCNPANG 105
>gi|187944195|gb|ACD40324.1| cathepsin 1-like protease [Helicoverpa armigera]
Length = 550
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 7/108 (6%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+E+D R + + +++ Q CGSCW TTAA+ + G+L S+ L
Sbjct: 333 IPDEYDARLE----GLVSNIKNQEECGSCWTFGTTAAVEGALARINGGKL--MALSNQAL 386
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
CA G CEGG A+ +M+E G+PT +YG D G CN
Sbjct: 387 VDCAWAYGAYGCEGGTDNAAYEWMMEYGLPTVAEYGQYTNKD-GECNI 433
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 61/123 (49%), Gaps = 10/123 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E T+LPE F ++P T H L Q NC
Sbjct: 188 FKYRLGTLP----PSPLLLSMNEVTASLTKTTDLPEFFIASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I +QGR LS +L++CCA G C + RAW+Y+ +
Sbjct: 241 AASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRRG--CNSESVDRAWWYLRK 298
Query: 164 NGV 166
G+
Sbjct: 299 RGL 301
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 5/104 (4%)
Query: 65 GSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
G+ G + ++PE FD R++YP+C I V Q CGSCWA ++ A DR C+A
Sbjct: 62 GAAPRGTFTDKDDVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGL 119
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ S ++++C G C GG W ++ + G T
Sbjct: 120 DKKPVKYSPQYVVSCDH---GDMACNGGWLPNVWKFLTKTGTTT 160
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 5/104 (4%)
Query: 65 GSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
G+ G + ++PE FD R++YP+C I V Q CGSCWA ++ A DR C+A
Sbjct: 62 GAAPRGTFTDKDDVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGL 119
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ S ++++C G C GG W ++ + G T
Sbjct: 120 DKKPVKYSPQYVVSCDH---GDMACNGGWLPNVWKFLTKTGTTT 160
>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
Length = 561
Score = 62.8 bits (151), Expect = 6e-08, Method: Composition-based stats.
Identities = 38/121 (31%), Positives = 59/121 (48%), Gaps = 16/121 (13%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y +N ELP+ +D R N + +V + N CGSCWA + +A++DR+ + T+
Sbjct: 36 EYMTNEELPKSYDPR----NIDGVSYVSVSRNQHIPQYCGSCWAFSAASAVADRLRLMTK 91
Query: 125 GRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
S ++ CA G C GG+ A+ M E GVPT G C R++ +
Sbjct: 92 NAWPTAELSPQMIVNCATTAMG--CHGGSMTSAYKLMKERGVPTEG----CMRYEAKDME 145
Query: 185 C 185
C
Sbjct: 146 C 146
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 39/134 (29%), Positives = 60/134 (44%), Gaps = 24/134 (17%)
Query: 57 EPNPDLQLGSEHFGDYQSNTELPEEFDLR----KQYPNCTNIGHVQLQSNCGSCWAIATT 112
EP P L SE ++P+ +D+R + Y H+ CGSCWA +T
Sbjct: 313 EPLPHFYLKSE---------DIPKSYDIRNIDGRNYATWDKNQHIP--QYCGSCWAQGST 361
Query: 113 AALSDRMCIATQGRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGD 171
+A++DR+ I +G+ LS ++ C TG C GG + Y E G+P
Sbjct: 362 SAIADRINIMRKGKWPTVELSVQEVINC--GNTGS--CNGGWDSGVYRYAHEEGIPD--- 414
Query: 172 YGSCQRFDRGNCNC 185
+CQ ++ N C
Sbjct: 415 -QTCQVYEARNKEC 427
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 188 FKYRLGTLP----PSPMLLSMNEMTASLPATTDLPEFFIASYKWPGWT---HGPLDQKNC 240
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I + GR LS +L++CCA G C G+ RAW+++ +
Sbjct: 241 AASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWFLRK 298
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 299 RGLVSHACY 307
>gi|290969944|ref|XP_002667994.1| predicted protein [Naegleria gruberi]
gi|284080970|gb|EFC35250.1| predicted protein [Naegleria gruberi]
Length = 191
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+ P +FD R+Q+P C I ++ Q NCGSCWA + ++ L+DR CI + G+++ LS +
Sbjct: 123 DFPTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQFM 180
Query: 137 LTCCAACTG 145
++C G
Sbjct: 181 VSCSGQNNG 189
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA++ + ++DR+C+ ++GR+ +S +L+CC G C GG +RAW +++ N
Sbjct: 1 SCWAVSAASVMTDRLCVQSKGRIKRFISDTDILSCCGRFCGYG-CRGGANIRAWKHVMRN 59
Query: 165 GVPTGG 170
GV TGG
Sbjct: 60 GVCTGG 65
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 64/131 (48%), Gaps = 14/131 (10%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGS--EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QS 101
KF L P P L L S E + T+LPE F ++P T H L Q
Sbjct: 188 FKFRLGTLP------PSLMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQK 238
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
NC + WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+
Sbjct: 239 NCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYL 296
Query: 162 LENGVPTGGDY 172
+ G+ + Y
Sbjct: 297 RKRGLVSHACY 307
>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
Length = 559
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 18/122 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y + +LP+ +D R N + V + N CGSCWA + T+A+SDR+ + T+
Sbjct: 37 NYVRSGQLPKNYDPR----NINGLNMVSVNKNQHIPVWCGSCWAFSATSAVSDRLKLMTK 92
Query: 125 GRL-DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
G +H LS ++ C G C GG+P + M E GVP G C R++ N
Sbjct: 93 GAWPEHDLSVQVVINCADNAEG---CGGGHPTDVYRLMNEMGVPAEG----CMRYEAKNM 145
Query: 184 NC 185
C
Sbjct: 146 EC 147
Score = 41.6 bits (96), Expect = 0.16, Method: Composition-based stats.
Identities = 35/121 (28%), Positives = 52/121 (42%), Gaps = 19/121 (15%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQG 125
Y +LP +D+R N + + N CGSCWA +TAALSDR+ I +G
Sbjct: 320 YLKANDLPASYDIR----NVDGVNYATWNRNQHIPVWCGSCWAQGSTAALSDRINIMRKG 375
Query: 126 RLDH-TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCN 184
L+ +L C A + C GG + Y E +P +CQ ++ +
Sbjct: 376 AWPAVNLAVQVVLNCGDAGS----CHGGWDDGVYAYAHEVDIPD----QTCQPYEAVDHE 427
Query: 185 C 185
C
Sbjct: 428 C 428
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 64/131 (48%), Gaps = 14/131 (10%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGS--EHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QS 101
KF L P P L L S E + T+LPE F ++P T H L Q
Sbjct: 188 FKFRLGTLP------PSLMLLSMNEMTASLPATTDLPEFFVASYKWPGWT---HGPLDQK 238
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
NC + WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+Y+
Sbjct: 239 NCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYL 296
Query: 162 LENGVPTGGDY 172
+ G+ + Y
Sbjct: 297 RKRGLVSHACY 307
>gi|303279765|ref|XP_003059175.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459011|gb|EEH56307.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 475
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 78 LPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP +FD R +P C IG V+ Q CGSCWA+A T ++DR+C+ T G +S+
Sbjct: 173 LPRDFDARVAFPACAALIGGVRDQGECGSCWAMAATEVMNDRLCVKTNGAERRRMSAQFT 232
Query: 137 LTCCA--ACTGGDVCEGGN 153
L C G + C GG+
Sbjct: 233 LACTVDHGNAGQNGCRGGS 251
>gi|89114252|gb|ABD61714.1| cathepsin B [Scophthalmus maximus]
Length = 122
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 34/50 (68%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCI 121
Y +LPEEFD R+Q+PNC + ++ Q +CGSCWA A+SDR+CI
Sbjct: 73 YAGGMKLPEEFDSREQWPNCPTLKEIRDQCSCGSCWAFGAAEAISDRVCI 122
>gi|338719460|ref|XP_001490326.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin Z-like [Equus caballus]
Length = 302
Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 57/104 (54%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+E+D R N I +V + N CGSCWA +T+A++DR+ I +
Sbjct: 54 EYLSPLDLPKEWDWR----NVDGINYVSVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 109
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS ++ C A + CEGGN ++ W Y E+G+P
Sbjct: 110 GAWPSTLLSVQQVIDCGQAGS----CEGGNDLQVWEYAHEHGIP 149
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 47/89 (52%), Gaps = 4/89 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD R ++P I + Q CG+ WAI+ T SDR + ++G LS+ HLL
Sbjct: 202 LPREFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGADSVLLSAQHLL 259
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+C G C GG RAW YM + G+
Sbjct: 260 SC--NNRGQQACSGGYLDRAWLYMRKFGL 286
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S T+ P+ FD R++YP+C I V Q CGSCWA ++ A++ DR C A + S
Sbjct: 70 SATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSP 127
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++++C G C+GG W ++ + G T
Sbjct: 128 QYVVSC---DRGDMACDGGWLPSVWRFLTKTGTTT 159
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 11/129 (8%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
KF L P P+P L +E Y +LPE F ++P T H L Q NC
Sbjct: 188 FKFRLGTLP----PSPMLLSMNEMTASY-PRADLPEVFIASYKWPGWT---HGPLDQKNC 239
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CCA G C G+ RAW+++ +
Sbjct: 240 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWFLRK 297
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 298 RGLVSHACY 306
>gi|73992645|ref|XP_854795.1| PREDICTED: cathepsin Z [Canis lupus familiaris]
Length = 375
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y S ++LP+ +D R Y + T H+ CGSCWA +T+A++DR+ I +G
Sbjct: 127 EYLSPSDLPKSWDWRNVNGVNYASATRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGA 184
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
T LS H+L C A + CEGGN + W Y E+G+P
Sbjct: 185 WPSTLLSVQHVLDCANAGS----CEGGNDLPVWSYAHEHGIP 222
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/72 (44%), Positives = 43/72 (59%), Gaps = 3/72 (4%)
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
CGSCWA A+SDR+CI T ++ +S++ LLTCC + GD C GG P AW +
Sbjct: 1 CGSCWAFGAVEAISDRICIHTNVSVE--VSAEDLLTCCGSMC-GDGCNGGYPAEAWNFWT 57
Query: 163 ENGVPTGGDYGS 174
G+ +GG Y S
Sbjct: 58 RKGLVSGGLYES 69
>gi|301759441|ref|XP_002915580.1| PREDICTED: cathepsin Z-like [Ailuropoda melanoleuca]
Length = 359
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y S ++LP+ +D R Y + T H+ CGSCWA +T+A++DR+ I +G
Sbjct: 111 EYLSPSDLPKSWDWRNVNGVNYASATRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGA 168
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
T LS H+L C A + CEGGN + W Y E+G+P
Sbjct: 169 WPSTLLSVQHVLDCANAGS----CEGGNDLPVWGYAHEHGIP 206
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 53/99 (53%), Gaps = 3/99 (3%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++P+C I + QS C + WA+AT +A+SDR C G+
Sbjct: 81 RFTEEQLRTELPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDRYCTVGNGKQ 140
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
++D + G CEGG P AW Y + NG+
Sbjct: 141 LRISAADLM---ACCTGCGGGCEGGYPDAAWEYYVSNGI 176
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 74 SNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
S T+ P+ FD R++YP+C I V Q CGSCWA ++ A++ DR C A + S
Sbjct: 70 SATQAPDSFDFREEYPHC--IPEVVDQGGCGSCWAFSSVASVGDRRCFAGLDKKAVKYSP 127
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++++C G C+GG W ++ + G T
Sbjct: 128 QYVVSC---DRGDMACDGGWLPSVWRFLTKTGTTT 159
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 5/88 (5%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
P+ FD R+Q+ + I ++ Q CG+CWA T ALSDR IA+ G +D S + L++
Sbjct: 77 PDNFDARQQWG--SKIHAIRDQQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVS 134
Query: 139 CCAACTGGDVCEGGNPMRAWYYMLENGV 166
C G C GG AW ++ ++GV
Sbjct: 135 CDTNDYG---CNGGYMDMAWEFLDQHGV 159
>gi|355681668|gb|AER96820.1| cathepsin Z [Mustela putorius furo]
Length = 230
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y S ++LP+ +D R Y + T H+ CGSCWA +T+A++DR+ I +G
Sbjct: 9 EYLSPSDLPKSWDWRNVNGVNYASATRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGA 66
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
T LS H+L C A + CEGGN + W Y E+G+P
Sbjct: 67 WPSTLLSVQHVLDCANAGS----CEGGNDLPVWRYAHEHGIP 104
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 54/111 (48%), Gaps = 10/111 (9%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD R ++ I V Q CG+ WAI+T SDR + ++G LS+ HLL
Sbjct: 197 LPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGTDSVLLSAQHLL 254
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV------PTGGDYGSCQRFDRGN 182
+C G C+GG RAW +M + G+ P G Y C+ R N
Sbjct: 255 SCNKKGQRG--CDGGYLDRAWLFMRKFGLVDEQCYPWKGVYEQCKLQKRTN 303
>gi|48762487|dbj|BAD23813.1| cathepsin B-N [Tuberaphis taiwana]
Length = 163
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 2/65 (3%)
Query: 110 ATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
T++A +DR+CIAT G + LS++ L CC C G C GG P+RAW ++G+ TG
Sbjct: 1 GTSSAFADRLCIATDGEFNELLSAEELAFCCHKCGFG--CSGGYPIRAWERFKKHGLVTG 58
Query: 170 GDYGS 174
G+Y S
Sbjct: 59 GNYDS 63
>gi|417398668|gb|JAA46367.1| Putative cathepsin z [Desmodus rotundus]
Length = 305
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S ++LP+ +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 57 EYLSPSDLPKSWDWR----NVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 112
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G LS H+L C A + CEGGN M W Y ++G+P
Sbjct: 113 GAWPSALLSVQHVLDCAEAGS----CEGGNDMEVWAYAHKHGIP 152
>gi|395506744|ref|XP_003757690.1| PREDICTED: cathepsin Z [Sarcophilus harrisii]
Length = 308
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 66/128 (51%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +ELP+ +D R N + + + N CGSCWA +T+AL+DR+ I +
Sbjct: 59 EYLSPSELPKAWDWR----NVNGVNYASITRNQHIPQYCGSCWAHGSTSALADRINIKRK 114
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS H++ C A + CEGGN W Y +G+P T +Y + C +
Sbjct: 115 GAWPSTLLSVQHVIDCGNAGS----CEGGNDFSVWEYANRHGIPDETCNNYQAKDQECDK 170
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 171 FNQCGTCN 178
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSNC 103
K+ L P P+P L +E + T+LPE F ++P T H L Q NC
Sbjct: 185 FKYRLGTLP----PSPMLLSMNEVTASLPATTDLPEFFIASYKWPGWT---HGPLDQKNC 237
Query: 104 GSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLE 163
+ WA +T + +DR+ I ++GR LS +L++CC G C G+ RAW+++ +
Sbjct: 238 AASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCPKNRHG--CNSGSIDRAWWFLRK 295
Query: 164 NGVPTGGDY 172
G+ + Y
Sbjct: 296 RGLVSHACY 304
>gi|402882120|ref|XP_003904600.1| PREDICTED: cathepsin Z [Papio anubis]
gi|355562994|gb|EHH19556.1| Cathepsin Z [Macaca mulatta]
gi|380808428|gb|AFE76089.1| cathepsin Z preproprotein [Macaca mulatta]
gi|383412189|gb|AFH29308.1| cathepsin Z preproprotein [Macaca mulatta]
gi|384943102|gb|AFI35156.1| cathepsin Z preproprotein [Macaca mulatta]
Length = 303
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 67/128 (52%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVNGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS H+L C A + CEGGN + W Y +G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQHVLDCANAGS----CEGGNDLPVWDYAHRHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|254746344|emb|CAX16637.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 541
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 52/97 (53%), Gaps = 6/97 (6%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LPEE DLR + I V+ Q NCGSCWA ++ AA+ + + GR + LS L
Sbjct: 325 DLPEELDLRLE----GVITPVKNQGNCGSCWAFSSVAAVEATLALKNGGR-NLELSEQSL 379
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
+ C C G +P + Y+LE+GVPT +YG
Sbjct: 380 VDCAWGFEAMG-CNGASPDSGFKYILEHGVPTDMEYG 415
>gi|38683695|gb|AAR26872.1| FirrV-1-A48 precursor [Feldmannia irregularis virus a]
Length = 373
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 43/73 (58%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q +C SCW+I+ L+DR+ ++T G++ LS +++C G +GG P +A+
Sbjct: 81 QGSCASCWSISVVQMLADRVSVSTNGKIKLKLSVQEMISCWDGHDGLACSKGGVPEKAYQ 140
Query: 160 YMLENGVPTGGDY 172
Y++ENG+ DY
Sbjct: 141 YIIENGIGLAEDY 153
>gi|294891863|ref|XP_002773776.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878980|gb|EER05592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 131
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 42/71 (59%), Gaps = 1/71 (1%)
Query: 76 TELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
T LP +FD R+++ +C IGHV+ QS C +CW +T +DR+CI + G + LS
Sbjct: 58 TNLPSDFDARQKFASCAGVIGHVRDQSACHNCWVSGSTGTFNDRLCIKSGGSFRNILSLG 117
Query: 135 HLLTCCAACTG 145
++ +CC G
Sbjct: 118 YITSCCNRANG 128
>gi|426241881|ref|XP_004014809.1| PREDICTED: cathepsin Z-like [Ovis aries]
Length = 345
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 70/150 (46%), Gaps = 23/150 (15%)
Query: 35 LKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGD------------YQSNTELPEEF 82
+ S L +L +G + Q+Q N L++ D Y S ++LP+ +
Sbjct: 49 VDSGRHALRTLAYGGCIIRQAQLQNVQLKMKMGALADQHLTKTYPRPHEYLSPSDLPKSW 108
Query: 83 DLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT-LSSDHLL 137
D R Y + T H+ CGSCWA +T+A++DR+ I +G T LS H++
Sbjct: 109 DWRNVNGVNYASVTRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVI 166
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
C A + CEGGN + W Y +G+P
Sbjct: 167 DCGDAGS----CEGGNDLPVWEYAHRHGIP 192
>gi|281348427|gb|EFB24011.1| hypothetical protein PANDA_003571 [Ailuropoda melanoleuca]
Length = 255
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 56/102 (54%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y S ++LP+ +D R Y + T H+ CGSCWA +T+A++DR+ I +G
Sbjct: 7 EYLSPSDLPKSWDWRNVNGVNYASATRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGA 64
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
T LS H+L C A + CEGGN + W Y E+G+P
Sbjct: 65 WPSTLLSVQHVLDCANAGS----CEGGNDLPVWGYAHEHGIP 102
>gi|48425699|pdb|1SP4|A Chain A, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 48
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/48 (52%), Positives = 33/48 (68%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQG 125
LPE FD R+Q+PNC I ++ Q +CGSCWA A+SDR+CI + G
Sbjct: 1 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNG 48
>gi|1763661|gb|AAB58259.1| cysteine protease [Giardia intestinalis]
Length = 198
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 5/92 (5%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++PE FD R++YP+C I V Q CGSCWA ++ A DR C+A + S ++
Sbjct: 4 DVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCVAGLDKKPVKYSPQYV 61
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
++C G C GG W ++ + G T
Sbjct: 62 VSCDH---GDMACNGGWLPNVWKFLTKTGTTT 90
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 48/89 (53%), Gaps = 4/89 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EFD R ++P +I + Q CG+ WAI++ SDR I ++G LS+ HLL
Sbjct: 200 LPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGTDAVELSAQHLL 257
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+C G C GG+ RAW +M G+
Sbjct: 258 SC--NNRGQQGCSGGHLDRAWMFMRRFGL 284
>gi|254746342|emb|CAX16636.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 545
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 54/99 (54%), Gaps = 8/99 (8%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS-SD 134
T+LP+E+D R + V+ Q NCGSCW TTAA+ + G+L LS S+
Sbjct: 328 TDLPDEYDARL----LGLVSAVKNQDNCGSCWTFGTTAAVEGALAQHNGGKL---LSLSN 380
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
L CA G C+GG+ A+ +M+E G+PT +YG
Sbjct: 381 QALIDCAWPFGVRGCDGGSDNAAYEWMMEYGLPTEAEYG 419
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 57/101 (56%), Gaps = 3/101 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++PNC I + QS C + WA++T + +SDR C G
Sbjct: 80 RFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASVISDRYC-TVGGVQ 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C C+GG P AW Y +E G+ +
Sbjct: 139 QLRISAAHLLSCCKQCG--GGCKGGFPGFAWRYYVEYGIAS 177
>gi|340385491|ref|XP_003391243.1| PREDICTED: cathepsin Z-like [Amphimedon queenslandica]
Length = 426
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 2/98 (2%)
Query: 72 YQSNTELPEEFDLRKQYPN--CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
Y ++P +D+R N T + + CGSCWA+ TT+ALSDR+ + +G
Sbjct: 187 YIKLEDIPAAYDIRNINGNDYSTVNRNQHIPQYCGSCWAMGTTSALSDRIKLMRKGAYPV 246
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
S +L C C+GG+P A+ Y+ ENGVP
Sbjct: 247 INLSPQVLVDCVTANNSHGCDGGDPTAAYSYIYENGVP 284
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 57/101 (56%), Gaps = 3/101 (2%)
Query: 68 HFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
F + Q TELPE FD +++PNC I + QS C + WA++T + +SDR C G
Sbjct: 80 RFTEEQLRTELPESFDSAEKWPNCPTIREIADQSACRASWAVSTASVISDRYC-TVGGVQ 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S+ HLL+CC C C+GG P AW Y +E G+ +
Sbjct: 139 QLRISAAHLLSCCKQCG--GGCKGGFPGFAWRYYVEYGIAS 177
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 57/112 (50%), Gaps = 10/112 (8%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAAL 115
Q P ++ H + +LPEEFD R ++ + V+ Q +C + WA +T A
Sbjct: 181 QPERPTAEMNELHL---KKREQLPEEFDARIRWSGLVH--GVRDQGDCANSWAFSTAAVA 235
Query: 116 SDRMCIATQGRLDHTLSSDHLLTCCAACTGGD--VCEGGNPMRAWYYMLENG 165
SDR+ I ++G LS L++C GG VC+GG+P R W ++L G
Sbjct: 236 SDRLSIQSRGVDKVELSPQDLMSC---LNGGRRVVCQGGHPDRGWRFLLNYG 284
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 53/98 (54%), Gaps = 7/98 (7%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+Y +T LP+ FD R+Q+P I V+ Q CGSCWA A +R+ I GR D
Sbjct: 56 NYVPDTSLPDNFDAREQWPG--KILPVRNQEQCGSCWAFAVAETTGNRLNILGCGRGD-- 111
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+S L++C G C GG+P+ +W ++ +G+ T
Sbjct: 112 MSPQDLVSCDKVDHG---CNGGSPLFSWEWVKHSGITT 146
>gi|390462766|ref|XP_003732901.1| PREDICTED: cathepsin Z, partial [Callithrix jacchus]
Length = 307
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 67/128 (52%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 59 EYLSLADLPKSWDWR----NVGGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 114
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS H++ C A + CEGGN + W Y +G+P T +Y + C +
Sbjct: 115 GAWPSTLLSVQHIIDCGNAGS----CEGGNDLSVWEYAHRHGIPDETCNNYQAKDQECDK 170
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 171 FNQCGTCN 178
>gi|403365594|gb|EJY82586.1| Cathepsin B [Oxytricha trifallax]
Length = 333
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 52/108 (48%), Gaps = 5/108 (4%)
Query: 60 PDLQLGSEHFGDYQSN-TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDR 118
P L L F ++N +P+ +D RK Y NC I V Q C +CWA A +SDR
Sbjct: 86 PSLFLADSSFYKPKANGVTIPKTYDSRKIYKNC--IHGVLDQVKCSACWAFAIAQVVSDR 143
Query: 119 MCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
CI + D LS +L++C G C+ G A+ YM + G+
Sbjct: 144 FCIVSNSTTDVVLSYQNLISCVNPKIFG--CKIGVIDVAFQYMEKTGI 189
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 6/97 (6%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQL-QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+LPE F ++P T H L Q NC + WA +T + +DR+ I ++GR LS +
Sbjct: 165 DLPEFFVAYYKWPGWT---HGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQN 221
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
L++CCA G C G+ RAW+Y+ + G+ + Y
Sbjct: 222 LISCCAKNRHG--CSSGSIDRAWWYLRKRGLVSHACY 256
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 52/89 (58%), Gaps = 3/89 (3%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+EFD R+++P + I VQ Q NCG+ +A +T+ +DR+ I + G L LS+ +L+
Sbjct: 213 MPDEFDAREEWP--SFIHPVQDQGNCGASYAFSTSTVAADRLSIHSGGELKDMLSAQYLI 270
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+C CEGG+ RAW+ + G
Sbjct: 271 SCTTD-HHQKGCEGGHVDRAWWQLRRVGT 298
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA++ +SDR+CIA+ + ++S+D + CC G+ C GG P+ AW + ++
Sbjct: 1 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVC-GNGCNGGYPIEAWRHYVKK 59
Query: 165 GVPTGGDY 172
G TGG Y
Sbjct: 60 GYVTGGSY 67
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 10/111 (9%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP EF+ R ++P +I + Q CG+ WA++T SDR I ++G LS+ HLL
Sbjct: 203 LPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSAQHLL 260
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV------PTGGDYGSCQRFDRGN 182
+C G C+GG RAW +M + G+ P G C+ R N
Sbjct: 261 SC--NNRGQQGCKGGYLDRAWLFMRKFGLVDEECYPWTGRNDQCRLRKRSN 309
>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 355
Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 44/91 (48%), Gaps = 2/91 (2%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R ++PNC +I + Q C S A ++DR CI + G T S+ L
Sbjct: 20 IPTSFDARTRWPNCPSIALIPNQGCCNSSAFQIPAAVITDRACIRSNGTSTRTYSAYDAL 79
Query: 138 TCCAACTGGDV--CEGGNPMRAWYYMLENGV 166
CC C + C GG+P++ W Y G+
Sbjct: 80 ACCTDCPFSQLFKCAGGDPLKVWNYWATTGL 110
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 59.7 bits (143), Expect = 6e-07, Method: Composition-based stats.
Identities = 39/123 (31%), Positives = 56/123 (45%), Gaps = 16/123 (13%)
Query: 64 LGSEHFGDYQSN-TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIA 122
LG D+ N T++P EF+ Q+ + ++ Q CGSCWA + LSDR I
Sbjct: 325 LGETRSQDFYDNITDVPSEFNAVTQWKGL--VQPIRDQQQCGSCWAFSAAEVLSDRNAI- 381
Query: 123 TQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT---------GGDYG 173
+ + LS + L++C G C GGN AW Y+ G+ T GGD
Sbjct: 382 QHNKAEPVLSPEDLVSCDRVDQG---CNGGNLGTAWTYLKNTGIVTDACFPYTAGGGDAP 438
Query: 174 SCQ 176
C+
Sbjct: 439 KCE 441
>gi|255078272|ref|XP_002502716.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226517981|gb|ACO63974.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 670
Score = 59.7 bits (143), Expect = 6e-07, Method: Composition-based stats.
Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 19/118 (16%)
Query: 80 EEFDLRKQYPNCTNIGHV------------QLQSNCGSCWAIATTAALSDRMCIATQGRL 127
E+ D+R P G V + CGSCWA+ TTA+LSDR+ IA
Sbjct: 79 EQLDVRNDLPTHVFWGDVDGVNYLTETRNQHIPQYCGSCWAMGTTASLSDRIKIARNATF 138
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
+ + +L C A G CEGG+P + + Y+ +G+P +CQ ++ + C
Sbjct: 139 PEVILAPQVLINCRA---GGSCEGGDPAQVYEYIAAHGIPD----ETCQAYEARDGKC 189
Score = 46.2 bits (108), Expect = 0.005, Method: Composition-based stats.
Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 6/137 (4%)
Query: 51 LTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK-QYPNCTNIGHVQ-LQSNCGSCWA 108
L+P S+E ++ H T++P +D+R N I Q + CGSCWA
Sbjct: 393 LSPPSKEVRELVRTVRPHEAPDYDKTKIPSSWDIRDVDGVNLATINRNQHIPQYCGSCWA 452
Query: 109 IATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
TT++++DR+ + G+ + +L C + G D C GG+P A ++ NGVP
Sbjct: 453 HGTTSSMADRINLMRGGKFPEIDLAPQVLVDCVSGGGTDGCNGGDPTSAHVWIAANGVPE 512
Query: 169 GGDYGSCQRFDRGNCNC 185
+CQ + C
Sbjct: 513 ----ETCQNYQAKKNEC 525
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 33/54 (61%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQG 125
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI G
Sbjct: 74 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHVNG 127
>gi|351694787|gb|EHA97705.1| Cathepsin Z [Heterocephalus glaber]
Length = 297
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 62/126 (49%), Gaps = 21/126 (16%)
Query: 47 FGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSN 102
F LS T + P P +Y S ++LP+ +D R Y + T H+
Sbjct: 34 FALSFTAWTY-PRPH---------EYLSLSDLPKTWDWRSVDGVNYASVTRNQHIP--QY 81
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
CGSCWA +T+A++DR+ I +G T LS H++ C G CEGG+ + W Y
Sbjct: 82 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVID----CGGAGSCEGGSDLLVWKYA 137
Query: 162 LENGVP 167
E+G+P
Sbjct: 138 QEHGIP 143
>gi|66810658|ref|XP_639036.1| cathepsin Z precursor [Dictyostelium discoideum AX4]
gi|60467666|gb|EAL65685.1| cathepsin Z precursor [Dictyostelium discoideum AX4]
Length = 296
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 55/99 (55%), Gaps = 15/99 (15%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQGRL- 127
N E+P+ +D R N + + ++ + N CG CWA A+T+++SDR+ I +
Sbjct: 55 NLEVPQSWDWR----NVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFP 110
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
D ++ HL+ C GG C+GG+P A+ ++ ENG+
Sbjct: 111 DVNVAPQHLID----CNGGGTCDGGDPGDAFAFINENGI 145
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 40/67 (59%), Gaps = 3/67 (4%)
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
+CGSCWA +LSDR CI L +LS + LL CC G D C+GG P+ AW Y
Sbjct: 93 HCGSCWAFGAVESLSDRFCIHYGMNL--SLSVNDLLACCGWMCG-DGCDGGYPIDAWRYF 149
Query: 162 LENGVPT 168
+++GV T
Sbjct: 150 VQSGVVT 156
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 55 SQEPNPDLQLGSEHFGDYQS-NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTA 113
++ PDL+ D+ N E+P FD RK++P C +I ++ QS CGSC A
Sbjct: 65 ARREEPDLRRTRRPTVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCCAFGAVE 124
Query: 114 ALSDRMCIATQGRLDHTLSSDHL 136
A+S+R CI + G+ + LS+ L
Sbjct: 125 AMSERSCIQSGGKQNVELSAVDL 147
>gi|431894550|gb|ELK04350.1| Cathepsin Z [Pteropus alecto]
Length = 265
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S ++LP+ +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 17 EYLSPSDLPKSWDWR----NVNGVNYASITRNQHIPQYCGSCWAHGSTSAMADRINIKRK 72
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS H++ C A + CEGGN + W Y +G+P
Sbjct: 73 GAWPSTLLSVQHVIDCGEAGS----CEGGNDLEVWEYANRHGIP 112
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 47/81 (58%), Gaps = 4/81 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R+++P + I V+ Q +C S WA +TTA +DR+ I + G+ + LS LL
Sbjct: 311 LPESFDARERWP--SFIHPVRDQGDCASSWAFSTTAVSADRLAIQSGGKFYNPLSVQQLL 368
Query: 138 TCCAACTGGDVCEGGNPMRAW 158
+C A G C GG RAW
Sbjct: 369 SCNQARQRG--CNGGYLDRAW 387
>gi|444730805|gb|ELW71178.1| Cathepsin Z [Tupaia chinensis]
Length = 410
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 55/102 (53%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y S ++LP+ +D R Y + T H+ CGSCWA +T+A++DR+ I +G
Sbjct: 162 EYLSPSDLPKSWDWRDVNGVNYASITRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGA 219
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
T LS H+L C A + CEGGN + W Y +G+P
Sbjct: 220 WPSTLLSVQHVLDCGDAGS----CEGGNDLPVWEYAHRHGIP 257
>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 487
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 57/116 (49%), Gaps = 11/116 (9%)
Query: 56 QEPNPDLQ---LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATT 112
Q P LQ L + DYQ LP FDLRK + + I Q CG+ WAI+T
Sbjct: 201 QPPEKILQVVPLKAVFHQDYQ----LPSSFDLRKVFGD--KITDPIDQGWCGASWAISTA 254
Query: 113 AALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+DR I T+G + LS HLL+C G C+GG+ AW +++ G+ T
Sbjct: 255 QVTTDRFVIMTKGLMRDALSPKHLLSCNNDLQRG--CQGGHLTSAWNWVMTFGLVT 308
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 3/92 (3%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP+EFD R + CT+I + Q +CGSCWA +LSDR CI + L+ +LS++ +
Sbjct: 102 KLPKEFDARTAWSQCTSIPRILDQGHCGSCWAFGAVESLSDRFCI--KYNLNVSLSANDV 159
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+ C G C GG PM AW Y +GV T
Sbjct: 160 VA-CCGLLCGLGCNGGFPMGAWLYFKYHGVVT 190
>gi|281210420|gb|EFA84586.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 358
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 53/107 (49%), Gaps = 16/107 (14%)
Query: 71 DYQSNTELPEEFDLR----------KQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMC 120
DY + +LP FD R + + T H L CGSCWA TT+AL DR+
Sbjct: 106 DYINYEDLPSYFDWRNITSEGYDAPRSFVTVTRNQH--LPQYCGSCWAFGTTSALGDRIK 163
Query: 121 IATQGRL-DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
IA + + L+ LL C + GD C+GG+P A+ Y+L G+
Sbjct: 164 IARNAQFPEIDLAPQVLLNCMGS---GDSCDGGDPTEAYEYILNKGI 207
>gi|437323|gb|AAB00354.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 133
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA++ +SDR+CIA+ + ++S+D + CC G+ C GG P+ AW + ++
Sbjct: 1 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVC-GNGCNGGYPIEAWRHYVKK 59
Query: 165 GVPTGGDY 172
G TGG Y
Sbjct: 60 GYVTGGSY 67
>gi|403282663|ref|XP_003932761.1| PREDICTED: cathepsin Z [Saimiri boliviensis boliviensis]
Length = 291
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 67/128 (52%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T++++DR+ I +
Sbjct: 43 EYLSPADLPKSWDWR----NVGGVNYASITRNQHIPQYCGSCWAHASTSSMADRINIKRK 98
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS H++ C A + CEGGN + W Y +G+P T +Y + C +
Sbjct: 99 GAWPSTLLSVQHVIDCGDAGS----CEGGNDLSVWEYAHRHGIPDETCNNYQAKDQECDK 154
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 155 FNQCGTCN 162
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 62/130 (47%), Gaps = 12/130 (9%)
Query: 45 LKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFD--LRKQYPNCTNIGHVQLQSN 102
+F L P P+P L +E T+LPE F L+ + + IG N
Sbjct: 186 FRFRLGTLP----PSPVLLSMNEMRATLPETTDLPEFFIAFLQMAWMDSWAIG----SKN 237
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
C + WA +T + +DR+ I + GR LS +L++CCA G C G+ RAW+Y+
Sbjct: 238 CAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHG--CNSGSIDRAWWYLR 295
Query: 163 ENGVPTGGDY 172
+ G+ + Y
Sbjct: 296 KRGLVSHACY 305
>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
Length = 487
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 57/116 (49%), Gaps = 11/116 (9%)
Query: 56 QEPNPDLQ---LGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATT 112
Q P LQ L + DYQ LP FDLRK + + I Q CG+ WAI+T
Sbjct: 201 QPPEKILQVVPLKAVFHQDYQ----LPSSFDLRKVFGD--KITDPIDQGWCGASWAISTA 254
Query: 113 AALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+DR I T+G + LS HLL+C G C+GG+ AW +++ G+ T
Sbjct: 255 QVTTDRFVIMTKGLMRDALSPKHLLSCNNDLQRG--CQGGHLTSAWNWVMTFGLVT 308
>gi|403355865|gb|EJY77523.1| Cathepsin B [Oxytricha trifallax]
Length = 299
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 50/93 (53%), Gaps = 5/93 (5%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
T LP +D R +P CT+ V Q +CGSCW+ A T+ L DR+C+ + G ++ LS
Sbjct: 74 TTLPSSYDYRTAHPGCTHA--VLNQQSCGSCWSFAATSMLQDRLCLHSNGAVNVQLSQQD 131
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+++C G C GG Y++ +GV T
Sbjct: 132 MVSCDFDNAG---CSGGWLSHTINYLVVHGVVT 161
>gi|291411144|ref|XP_002721851.1| PREDICTED: cathepsin Z [Oryctolagus cuniculus]
Length = 305
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 55/102 (53%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y S ++LP+ +D R Y + T H+ CGSCWA +T+A++DR+ I +G
Sbjct: 57 EYLSPSDLPKNWDWRNVDGVNYASVTRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGA 114
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
T LS H+L C A + CEGGN + W Y +G+P
Sbjct: 115 WPSTLLSVQHVLDCGNAGS----CEGGNDLPVWEYAHRHGIP 152
>gi|348690656|gb|EGZ30470.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 647
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 64/139 (46%), Gaps = 27/139 (19%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAI 109
+ P+ ++L S D +LP+ FD R N +V + N CGSCW+
Sbjct: 35 RSPDRSVELTSPRPHDVLDVAKLPKNFDWR----NVNGTNYVTISRNQHIPHYCGSCWSF 90
Query: 110 ATTAALSDRMCIA----------TQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
A T+AL+DR+ IA + + LS +L C G C GG+ + A+
Sbjct: 91 AATSALADRIMIAKERSPSNKPSVEVHREVVLSPQVILNCDKKDNG---CHGGDQLEAYR 147
Query: 160 YMLENGVPTGGDYGSCQRF 178
Y+ +NGVP G CQR+
Sbjct: 148 YIKKNGVPEEG----CQRY 162
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 39/83 (46%), Gaps = 7/83 (8%)
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
CGSCWA TT+ALSDR+ I S +L C A G C GGNP + Y
Sbjct: 398 CGSCWAQGTTSALSDRISILRNASWPEIALSPQVLINCHA---GGTCNGGNPGLVYEYAH 454
Query: 163 ENGVPTGGDYGSCQRFDRGNCNC 185
+G+P +CQ + N C
Sbjct: 455 RHGIPD----QTCQAYQAKNLQC 473
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 60/153 (39%), Gaps = 26/153 (16%)
Query: 17 LRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNT 76
++T D G+ PDIL + L S + QEP
Sbjct: 39 FENITEDEFRGMLIRPDILGAGSGSLPP-----SSVTEIQEPA----------------D 77
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+P +FD R +YP C + V Q +CG CWA + DR C+A + S +L
Sbjct: 78 PIPSQFDFRDEYPQC--VTPVMDQGSCGGCWAFSAIGVFGDRRCVAGIDKEGVPYSQQYL 135
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
++C G C+GG+ W ++ G T
Sbjct: 136 ISCSTENHG---CDGGDFWPTWSFLTLTGATTA 165
>gi|410953470|ref|XP_003983393.1| PREDICTED: cathepsin Z [Felis catus]
Length = 344
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 96 EYLSPRDLPKSWDWR----NVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 151
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS H++ C A + CEGGN + W Y E+G+P
Sbjct: 152 GAWPSTLLSVQHVIDCGDAGS----CEGGNDLPVWGYAHEHGIP 191
>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 48/91 (52%), Gaps = 5/91 (5%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
++P FD R Q+ C + ++ Q CG+CWA + L+ R+CIAT G+ + LS ++
Sbjct: 2 DIPMNFDARTQWRGC--VPAIRDQQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQ 59
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
+ C T C+GG +W ++ G P
Sbjct: 60 VQC---DTMNKACQGGYLKYSWTFLENTGTP 87
>gi|294911203|ref|XP_002777976.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239886071|gb|EER09771.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 85
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 4/71 (5%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDV--CEGGNPMRA 157
QS CGSCWA T A R+CI + G+L +LS +L+CC G + C GGNP+ +
Sbjct: 17 QSACGSCWAFGTVEAFKARLCIKSGGKLKQSLSDSEMLSCCNLWHGCLLFDCNGGNPVMS 76
Query: 158 WYYMLENGVPT 168
W ++ NG+ T
Sbjct: 77 WPFL--NGIVT 85
>gi|296481033|tpg|DAA23148.1| TPA: cathepsin Z precursor [Bos taurus]
Length = 304
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S ++LP+ +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 56 EYLSPSDLPKSWDWR----NVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 111
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS H+L C A + CEGGN + W Y +G+P
Sbjct: 112 GAWPSTLLSVQHVLDCGDAGS----CEGGNDLPVWEYAHRHGIP 151
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 60/132 (45%), Gaps = 5/132 (3%)
Query: 38 SPSFLSSLKFGLSLTPQSQEPNPD-LQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGH 96
S F SL G+ +Q P+ + + N LP F+ +++PN I
Sbjct: 161 SQFFGMSLDEGIRYRLGTQRPSRTVMNMNEIQMKMDPQNDHLPRYFNSSEKWPN--KIHE 218
Query: 97 VQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMR 156
Q NC + WA +T A SDR+ I + G + LS +L++C GG C GG
Sbjct: 219 PLDQGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQGG--CAGGRIDG 276
Query: 157 AWYYMLENGVPT 168
AW+Y+ GV T
Sbjct: 277 AWWYLRRRGVVT 288
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 105 SCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN 164
SCWA++ +SDR+C+ T GR LS +L CC G C GG RAW Y +
Sbjct: 1 SCWAVSAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGYG-CNGGYSARAWLYARNS 59
Query: 165 GVPTGGDY 172
GV +GG Y
Sbjct: 60 GVCSGGRY 67
>gi|332256898|ref|XP_003277555.1| PREDICTED: cathepsin Z [Nomascus leucogenys]
Length = 303
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 75/153 (49%), Gaps = 33/153 (21%)
Query: 48 GLSLTP--QSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN--- 102
G+ L P +S P P +Y S +LP+ +D R N + + + N
Sbjct: 39 GVGLAPPGRSTYPRPH---------EYLSPADLPKSWDWR----NVDGVNYASITRNQHI 85
Query: 103 ---CGSCWAIATTAALSDRMCIATQGRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAW 158
CGSCWA A+T+A++DR+ I +G T LS +++ C A + CEGGN M W
Sbjct: 86 PQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAGS----CEGGNDMSVW 141
Query: 159 YYMLENGVP--TGGDYGS----CQRFDR-GNCN 184
Y +G+P T +Y + C +F++ G CN
Sbjct: 142 DYAHRHGIPDETCNNYQAKDQECDKFNQCGTCN 174
>gi|66270083|gb|AAY43371.1| cathepsin-like cysteine protease [Phytophthora infestans]
Length = 635
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 69/156 (44%), Gaps = 32/156 (20%)
Query: 39 PSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQ 98
P SS +G + P+ + L S DY ++LP+ FD R N +V
Sbjct: 22 PELTSSGGYGYV-----RSPDRSVSLTSPRPHDYIDVSKLPKNFDWR----NVNGTRYVS 72
Query: 99 LQSN------CGSCWAIATTAALSDRMCI----------ATQGRLDHTLSSDHLLTCCAA 142
+ N CGSCW+ A T+AL+DR+ I + + LS +L C
Sbjct: 73 ISRNQHIPHYCGSCWSFAATSALADRILIFKERNPGNKPSVEVHRGVVLSPQVILNCDKK 132
Query: 143 CTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRF 178
G C GG+ + A+ Y+ E+GVP G CQR+
Sbjct: 133 DNG---CHGGDQLEAYRYIKEHGVPEEG----CQRY 161
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 50/114 (43%), Gaps = 13/114 (11%)
Query: 76 TELPEEFDLR----KQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
T+LP+ +D R K Y H+ CGSCWA TT+ALSDR+ I
Sbjct: 360 TDLPKSWDWRDVNGKNYVTWDKNQHIP--KYCGSCWAQGTTSALSDRISILRNASWPEIA 417
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
S +L C A G C GGNP + Y + +P +CQ + N C
Sbjct: 418 LSPQVLINCHA---GGTCNGGNPGLVYEYAHRHVIPD----QTCQAYQAKNLQC 464
>gi|301119245|ref|XP_002907350.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262105862|gb|EEY63914.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 710
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 69/156 (44%), Gaps = 32/156 (20%)
Query: 39 PSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQ 98
P SS +G + P+ + L S DY ++LP+ FD R N +V
Sbjct: 22 PELTSSGGYGYV-----RSPDRSVSLTSPRPHDYIDVSKLPKNFDWR----NVNGTRYVS 72
Query: 99 LQSN------CGSCWAIATTAALSDRMCI----------ATQGRLDHTLSSDHLLTCCAA 142
+ N CGSCW+ A T+AL+DR+ I + + LS +L C
Sbjct: 73 ISRNQHIPHYCGSCWSFAATSALADRILIFKERNPGNKPSVEVHRGVVLSPQVILNCDKK 132
Query: 143 CTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRF 178
G C GG+ + A+ Y+ E+GVP G CQR+
Sbjct: 133 DNG---CHGGDQLEAYRYIKEHGVPEEG----CQRY 161
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 51/114 (44%), Gaps = 13/114 (11%)
Query: 76 TELPEEFDLR----KQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
T+LP+ +D R K Y H+ CGSCWA TT+ALSDR+ I
Sbjct: 360 TDLPKSWDWRDVNGKNYVTWDKNQHIP--KYCGSCWAQGTTSALSDRISILRNASWPEIA 417
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
S +L C A G C GGNP + Y +G+P +CQ + N C
Sbjct: 418 LSPQVLINCHA---GGTCNGGNPGLVYEYAHRHGIPD----QTCQAYQAKNLQC 464
>gi|340382603|ref|XP_003389808.1| PREDICTED: hypothetical protein LOC100632176 [Amphimedon
queenslandica]
Length = 570
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 72 YQSNTELPEEFDLRKQYPN--CTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDH 129
Y ++P +D+R N T + + CGSCWA+ TT+ALSDR+ + +G
Sbjct: 327 YIKLEDIPAAYDIRNINGNDYSTVNRNQHIPQYCGSCWAMGTTSALSDRIKLMRKGAYPV 386
Query: 130 TLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
S +L CA + G C+GG+P A+ Y+ ENGVP
Sbjct: 387 INLSPQVLVDCANNSHG--CDGGDPTAAYSYIYENGVP 422
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 43/83 (51%), Gaps = 7/83 (8%)
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
CGSCWA+ TT+ALSDR+ I + ++ C GG C+GGNP + Y+
Sbjct: 78 CGSCWAMGTTSALSDRISIMRNNTYPMVQLATQVII---NCRGGGSCQGGNPGGVYEYIH 134
Query: 163 ENGVPTGGDYGSCQRFDRGNCNC 185
+G+P +CQ ++ N C
Sbjct: 135 RHGLPD----ETCQNYEARNGEC 153
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 63/163 (38%), Gaps = 31/163 (19%)
Query: 17 LRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNT 76
+VT D G+ +PD LK+ + S P + +P
Sbjct: 39 FENVTEDEFRGMLINPDRLKARSGSMPS-------APLKEINDP--------------TD 77
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP +FD R +YP+C + V Q +CG CWA + R C + S HL
Sbjct: 78 PLPAQFDFRDEYPHC--VSPVFDQGSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHL 135
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG-----DYGS 174
++C G C GG+ W ++ + G T DYGS
Sbjct: 136 ISCSTENFG---CSGGDFFPTWSFLTQTGATTAECVKYVDYGS 175
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 45/91 (49%), Gaps = 5/91 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P+ FDLR++YP C I V Q CG+CWA + T A DR C+ + S + +
Sbjct: 104 IPDAFDLREEYPQC--ITPVYDQGYCGACWAFSATGAFGDRRCMQWLDPVGVPYSQQYTV 161
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+C G C GG W ++ E+G T
Sbjct: 162 SCDDLDLG---CAGGTSFNVWTFLTEHGTTT 189
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 59/126 (46%), Gaps = 6/126 (4%)
Query: 44 SLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQL-QSN 102
+L+ G+ + P P + +E SN LP FD ++P + H L Q N
Sbjct: 168 TLEDGMRYRLGTFRPPPTVMNMNEMHMAMDSNEVLPRHFDAATKWPG---MIHEPLDQGN 224
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
C WA +T A SDR+ I + G + +LS +LL+C G C GG AW+Y+
Sbjct: 225 CAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLLSCDTRNQRG--CSGGRLDGAWWYLR 282
Query: 163 ENGVPT 168
GV T
Sbjct: 283 RRGVVT 288
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 63/163 (38%), Gaps = 31/163 (19%)
Query: 17 LRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNT 76
+VT D G+ +PD LK+ + S P + +P
Sbjct: 39 FENVTEDEFRGMLINPDRLKARSGSMPS-------APLKEINDP--------------TD 77
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP +FD R +YP+C + V Q +CG CWA + R C + S HL
Sbjct: 78 PLPAQFDFRDEYPHC--VSPVFDQGSCGGCWAFSAIGMFGSRRCAVGIDKAAVLYSQQHL 135
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGG-----DYGS 174
++C G C GG+ W ++ + G T DYGS
Sbjct: 136 ISCSTENFG---CSGGDFFPTWSFLTQTGATTAECVKYVDYGS 175
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
+S +LP FD R ++P I Q CG+ WA++T + SDR I ++G LS
Sbjct: 185 KSKGKLPNSFDARNKWPGW--ISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLS 242
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
HLL+C G C+GG+ RAW ++ + G+
Sbjct: 243 PQHLLSCNKGQRG---CQGGHLSRAWTFIRKFGL 273
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 49/95 (51%), Gaps = 8/95 (8%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD+R + +C + V+ Q +CGSCWA T+ L+DRMCI + + LS +L+
Sbjct: 46 IPASFDVRTNWGDC--MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLM 103
Query: 138 TCCAACTGGDV------CEGGNPMRAWYYMLENGV 166
C +C V C+GG A ++ G+
Sbjct: 104 DCDGSCVSDGVSGCNNGCKGGFVGLALTRLINEGI 138
>gi|327285294|ref|XP_003227369.1| PREDICTED: cathepsin Z-like [Anolis carolinensis]
Length = 306
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y + +E+P+++D R N + + N CGSCWA A T+AL+DR+ I +
Sbjct: 57 EYLNMSEIPKKWDWR----NVNGVNYASPTRNQGVPQFCGSCWAHAATSALADRINIKRK 112
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G LS H++ C+ G C+GGN M W Y +G+P
Sbjct: 113 GAWPSAFLSVQHVVD----CSRGGSCKGGNEMLVWRYAHRHGIP 152
>gi|119595855|gb|EAW75449.1| cathepsin Z, isoform CRA_b [Homo sapiens]
Length = 371
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 56/104 (53%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 123 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 178
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS +++ C A + CEGGN + W Y ++G+P
Sbjct: 179 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIP 218
>gi|76156106|gb|AAX27341.2| SJCHGC02853 protein [Schistosoma japonicum]
Length = 181
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 36/50 (72%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQ 124
N +LP+ FD RK + NC++I ++ QS+CGSCWA ++SDR+CI ++
Sbjct: 79 NIKLPKYFDSRKYWKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSK 128
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 39/59 (66%), Gaps = 2/59 (3%)
Query: 116 SDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
SDR+CI T+G++ +S++ LLTCC +C G C GG P AW + + G+ TGG YG+
Sbjct: 1 SDRICIHTKGKVQVNISAEDLLTCCDSCGSG--CNGGYPSAAWQFYKDEGIVTGGLYGT 57
>gi|327285292|ref|XP_003227368.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin Z-like [Anolis
carolinensis]
Length = 305
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 65/127 (51%), Gaps = 22/127 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y +ELP+ +D R N + +V N CGSCWA +T+AL+DR+ I +
Sbjct: 56 EYLDISELPKSWDWR----NVDGVNYVSTTRNQHIPQYCGSCWAHGSTSALADRINIKKK 111
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G LS H++ C A + CEGG+ W Y E+G+P T +Y + CQ+
Sbjct: 112 GAWPSAYLSVQHVIDCGNAGS----CEGGDDGAVWQYAHEHGIPDETCNNYQAKDQPCQK 167
Query: 178 FDR-GNC 183
F++ G C
Sbjct: 168 FNQCGTC 174
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP F+ +++ T I V Q CG+ W ++TT+ SDR I +QG+ LS+ ++
Sbjct: 186 DLPRSFNAVEKWS--TFISEVPDQGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQNI 243
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFD 179
L+C G C+GG+ AW YM +NGV Y Q+ D
Sbjct: 244 LSCTRRQQG---CDGGHLDAAWRYMHKNGVLDANCYPYIQQRD 283
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P F+ +Q+ NC+ I +Q Q+ CGSCWA ++SDR CI +G D LS L+
Sbjct: 70 VPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIH-KGE-DVLLSFQDLV 127
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
TC + G C+GG+ A ++ + G+
Sbjct: 128 TCDQSDNG---CQGGDAYTAMKFIQKKGI 153
>gi|118150788|ref|NP_001071303.1| cathepsin Z precursor [Bos taurus]
gi|145559450|sp|P05689.2|CATZ_BOVIN RecName: Full=Cathepsin Z; Flags: Precursor
gi|113912012|gb|AAI22604.1| Cathepsin Z [Bos taurus]
Length = 304
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S ++LP+ +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 56 EYLSPSDLPKSWDWR----NVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 111
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS H++ C A + CEGGN + W Y +G+P
Sbjct: 112 GAWPSTLLSVQHVIDCGDAGS----CEGGNDLPVWEYAHRHGIP 151
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y+S+ LPE FD R+Q+P I V+ Q++CGSCWA + + DR+ I GR +
Sbjct: 57 YESDNALPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCGR--GHM 112
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
S L++C G C GG +AW + +GV
Sbjct: 113 SPQDLVSCDTTDMG---CNGGYMDKAWAWTKSHGV 144
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 99 LQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAW 158
L C WA A+SDR+CI T + +S++ LLTCC + GD C GG P AW
Sbjct: 7 LSIPCRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMC-GDGCNGGYPAEAW 65
Query: 159 YYMLENGVPTGGDYGS 174
+ G+ +GG Y S
Sbjct: 66 NFWTRKGLVSGGLYES 81
>gi|340508280|gb|EGR34021.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 620
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 44/130 (33%), Positives = 62/130 (47%), Gaps = 18/130 (13%)
Query: 54 QSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAI 109
+S + P L G + YQ ++P+ FD R Y + T H+ CGSCWA
Sbjct: 318 ESPKNQPQLIKGKQ---PYQIIQKVPKSFDWRNVNGVNYLSHTRNQHIP--QYCGSCWAH 372
Query: 110 ATTAALSDRMCIATQGRL-DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
TT++LSDR+ IA D +LS ++ C A G CEGGNP + + G+P
Sbjct: 373 GTTSSLSDRINIARNKTWPDTSLSVQAIINCNA----GGSCEGGNPQTVYEFANNKGIPE 428
Query: 169 GGDYGSCQRF 178
SCQ +
Sbjct: 429 ----ESCQNY 434
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 12/95 (12%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQGRLDHTL 131
LPE F + N N + + N CGSCWA A +++LSDR+ I + L
Sbjct: 39 LPENF----SWQNVNNTNFLTVTKNQHIPQYCGSCWAQAASSSLSDRIKIVRNAQWPDIL 94
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+ +L C + G C GG+ ++ ++ EN +
Sbjct: 95 IAPQVLVSCNKYSNG--CHGGSAADSFQWIKENNI 127
>gi|428184003|gb|EKX52859.1| hypothetical protein GUITHDRAFT_101312 [Guillardia theta CCMP2712]
Length = 608
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 45/83 (54%), Gaps = 7/83 (8%)
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYML 162
CGSCWA+ATT+ +SDR+ I S +L C GG C GG+P +A +YM
Sbjct: 65 CGSCWAMATTSMMSDRIKIMRHNAQPEINLSPQVLI---NCHGGGSCRGGDPAQAMHYMF 121
Query: 163 ENGVPTGGDYGSCQRFDRGNCNC 185
ENG+P +CQ ++ N C
Sbjct: 122 ENGLPD----ETCQNYEAVNGAC 140
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 55/122 (45%), Gaps = 14/122 (11%)
Query: 70 GDYQSNTELPEEFDLRKQYPNCTNIGHV------QLQSNCGSCWAIATTAALSDRMCIAT 123
G S +P+ +D+R N + + + + S CGSCWA +TT+ALSDR+ +
Sbjct: 355 GKEVSTLGIPQSWDIR----NLSGVSYATPNRNQHIPSYCGSCWAFSTTSALSDRINLMR 410
Query: 124 QGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
+ S +L C C GG+P A+ +M E VP +CQ + +
Sbjct: 411 NNSFPRYVLSAQVLVNCVTANETRGCRGGDPTAAYEWMEEQDVPD----ETCQAYQAKDL 466
Query: 184 NC 185
C
Sbjct: 467 EC 468
>gi|440891622|gb|ELR45201.1| Cathepsin Z, partial [Bos grunniens mutus]
Length = 256
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S ++LP+ +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 8 EYLSPSDLPKSWDWR----NVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 63
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS H++ C A + CEGGN + W Y +G+P
Sbjct: 64 GAWPSTLLSVQHVIDCGDAGS----CEGGNDLPVWEYAHRHGIP 103
>gi|119595856|gb|EAW75450.1| cathepsin Z, isoform CRA_c [Homo sapiens]
Length = 284
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|410351487|gb|JAA42347.1| cathepsin Z [Pan troglodytes]
Length = 303
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|60827804|gb|AAX36814.1| cathepsin Z [synthetic construct]
gi|61368398|gb|AAX43171.1| cathepsin Z [synthetic construct]
Length = 304
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|54696690|gb|AAV38717.1| cathepsin Z [synthetic construct]
gi|61366436|gb|AAX42859.1| cathepsin Z [synthetic construct]
Length = 304
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|357624871|gb|EHJ75484.1| putative 26,29kDa proteinase [Danaus plexippus]
Length = 553
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 51/99 (51%), Gaps = 6/99 (6%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+ +LP EFD R + V+ QS CGSCW+ T A+ + ++ G L S
Sbjct: 332 SVKLPPEFDWRL----FGAVTPVKDQSVCGSCWSFGTVGAVEGALFLSNGGHL--VRLSQ 385
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
L C+ G + C+GG RA+ +++ +G+PT DYG
Sbjct: 386 QALVDCSWGFGNNGCDGGEDYRAYQWIMRHGLPTEDDYG 424
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 38/68 (55%), Gaps = 3/68 (4%)
Query: 101 SNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYY 160
+CGSCWA L DR CI ++ +LS + L+ CC G D C+GG P+ AW Y
Sbjct: 1 GHCGSCWAFGAVECLQDRFCI--HFNMNISLSVNDLVACCGFMCG-DGCDGGYPIMAWRY 57
Query: 161 MLENGVPT 168
+ NGV T
Sbjct: 58 FVRNGVVT 65
>gi|54696692|gb|AAV38718.1| cathepsin Z [Homo sapiens]
Length = 303
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|389611850|dbj|BAM19484.1| cathepsin L [Papilio xuthus]
Length = 342
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 52/101 (51%), Gaps = 6/101 (5%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
+ + +LP EFD R + V+ QS CGSCW+ T A+ + + G L
Sbjct: 119 EMSVKLPPEFDWRL----FGAVTPVKDQSVCGSCWSFGTVGAVEGALFLHNGGHL--VRL 172
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
S L C+ G + C+GG RA+ +++++G+PT DYG
Sbjct: 173 SQQTLIDCSWGFGNNGCDGGEDFRAYQWIMKHGLPTEEDYG 213
>gi|61358271|gb|AAX41539.1| cathepsin Z [synthetic construct]
Length = 303
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 6/93 (6%)
Query: 75 NTELPEEFDLRKQYPNCTNIGHVQL-QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
N LP FD +++P H L Q NC WA +T A SDR+ I ++G + +LS
Sbjct: 54 NVVLPRNFDAAQKWPGLI---HEPLDQGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSP 110
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
+LL+C G C GG RAW ++ G+
Sbjct: 111 QNLLSCNTRHQQG--CNGGRLDRAWSFLRRRGL 141
>gi|3294548|gb|AAC39839.1| cathepsin Z precursor [Homo sapiens]
Length = 303
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|22538442|ref|NP_001327.2| cathepsin Z preproprotein [Homo sapiens]
gi|114682841|ref|XP_001140086.1| PREDICTED: cathepsin Z [Pan troglodytes]
gi|397479059|ref|XP_003810850.1| PREDICTED: cathepsin Z [Pan paniscus]
gi|12643324|sp|Q9UBR2.1|CATZ_HUMAN RecName: Full=Cathepsin Z; AltName: Full=Cathepsin P; AltName:
Full=Cathepsin X; Flags: Precursor
gi|6467380|gb|AAF13145.1|AF136273_1 cathepsin Z precursor [Homo sapiens]
gi|6467389|gb|AAF13148.1| cathepsin Z precursor [Homo sapiens]
gi|27503311|gb|AAH42168.1| Cathepsin Z [Homo sapiens]
gi|60816128|gb|AAX36371.1| cathepsin Z [synthetic construct]
gi|117646862|emb|CAL37546.1| hypothetical protein [synthetic construct]
gi|119595854|gb|EAW75448.1| cathepsin Z, isoform CRA_a [Homo sapiens]
gi|189067455|dbj|BAG37437.1| unnamed protein product [Homo sapiens]
gi|261859516|dbj|BAI46280.1| cathepsin Z [synthetic construct]
gi|410255516|gb|JAA15725.1| cathepsin Z [Pan troglodytes]
Length = 303
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|197322475|ref|YP_002154748.1| putative cysteine protease [Feldmannia species virus]
gi|197130542|gb|ACH46878.1| putative cysteine protease [Feldmannia species virus]
Length = 362
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 40/73 (54%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q C SCW+I+ L+DR+ + T G++ LS +++C G +GG P A+
Sbjct: 77 QGKCASCWSISVVQMLADRVSVYTGGKVRKRLSVQEMISCWDGHDGLACSKGGVPEEAYQ 136
Query: 160 YMLENGVPTGGDY 172
Y++ENG+ DY
Sbjct: 137 YIVENGIGMDEDY 149
>gi|3719219|gb|AAC63141.1| preprocathepsin P [Homo sapiens]
Length = 293
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 45 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 100
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 101 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 156
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 157 FNQCGTCN 164
>gi|7245728|pdb|1DEU|A Chain A, Crystal Structure Of Human Procathepsin X: A Cysteine
Protease With The Proregion Covalently Linked To The
Active Site Cysteine
gi|7245729|pdb|1DEU|B Chain B, Crystal Structure Of Human Procathepsin X: A Cysteine
Protease With The Proregion Covalently Linked To The
Active Site Cysteine
Length = 277
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 29 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 84
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 85 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 140
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 141 FNQCGTCN 148
>gi|426392305|ref|XP_004062496.1| PREDICTED: cathepsin Z [Gorilla gorilla gorilla]
Length = 303
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 68/128 (53%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +LP+ +D R N + + + N CGSCWA A+T+A++DR+ I +
Sbjct: 55 EYLSPADLPKSWDWR----NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRK 110
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS +++ C A + CEGGN + W Y ++G+P T +Y + C +
Sbjct: 111 GVWPSTLLSVQNVIDCGNAGS----CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDK 166
Query: 178 FDR-GNCN 184
F++ G CN
Sbjct: 167 FNQCGTCN 174
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 44/91 (48%), Gaps = 5/91 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P FD R++YP C I V Q +CGSCWA + T+A DR C+ S + +
Sbjct: 78 IPASFDFREEYPQC--ITPVYDQGHCGSCWAFSATSAFGDRRCMQGLDSAGVPYSQQYTI 135
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+C G C GG W ++ E+G T
Sbjct: 136 SCDYLDLG---CAGGLSFSVWTFLTEHGTTT 163
>gi|167537940|ref|XP_001750637.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770933|gb|EDQ84610.1| predicted protein [Monosiga brevicollis MX1]
Length = 624
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Query: 72 YQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL 127
Y + +LPE +D R Y H+ CGSCWA TT+AL+DR+ + +G
Sbjct: 367 YLTPEDLPETYDPRNINGMDYTTANRNQHIP--QYCGSCWAHGTTSALADRIKLLRKGAF 424
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
S +L C CEGG+P A ++ ENG+P
Sbjct: 425 PDIQPSVQVLVNCVTANETHGCEGGDPTAAHNWIYENGIP 464
Score = 45.4 bits (106), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 19/122 (15%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y + +LP F + N + + ++ N CGSC A ATT++L+DRM I +
Sbjct: 50 EYINVEDLPTTF----SWANVSGVNYLTRSRNQHIPEYCGSCVAFATTSSLNDRMAILRR 105
Query: 125 GRL-DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNC 183
+ L+ LL C A G CEGGN + ++ NGVP +CQ ++ +
Sbjct: 106 KAWPEINLAPQVLLNCNA----GVSCEGGNAGPVFEHIHRNGVPD----ETCQNYEARDG 157
Query: 184 NC 185
C
Sbjct: 158 EC 159
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 48/93 (51%), Gaps = 3/93 (3%)
Query: 76 TELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
+ LP FD R+++ +C + Q Q C SCWA+ T L+DR+C+A+ G++ LS
Sbjct: 31 SNLPASFDSRQKWSDCFSPVRDQGQ-KCSSCWAMTATGVLADRLCVASGGKVKKVLSPQE 89
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
L+ C G C GG Y +NGV T
Sbjct: 90 LIDCDR--NGNLGCGGGRLDTPLAYFRDNGVVT 120
>gi|254746348|emb|CAX16639.1| putative C1A cysteine protease precursor [Spodoptera frugiperda]
Length = 539
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/97 (39%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
ELPE FDLR + + V+ Q +CGSCWA TTAA+ + A RL S+
Sbjct: 323 ELPENFDLRME----GAVTPVKNQGHCGSCWAFCTTAAVEGAVARANGDRL--VDLSEQS 376
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
L CA C GG A Y+L +G+PT +YG
Sbjct: 377 LVDCAWGYENQGCNGGTLDGAMKYVLTHGIPTEEEYG 413
>gi|334312335|ref|XP_001377465.2| PREDICTED: cathepsin Z-like [Monodelphis domestica]
Length = 401
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 64/128 (50%), Gaps = 22/128 (17%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y + + LP+ +D R N + + + N CGSCWA TT+AL+DR+ I +
Sbjct: 152 EYMARSSLPKAWDWR----NVNGVNYASITRNQHIPQYCGSCWAHGTTSALADRINIKRK 207
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP--TGGDYGS----CQR 177
G T LS H++ C A + CEGG + W Y +G+P T +Y + C +
Sbjct: 208 GAWPSTLLSVQHVIDCGNAGS----CEGGMDIPVWEYAHMHGIPDETCNNYQAKDQECDK 263
Query: 178 FDR-GNCN 184
F+ G CN
Sbjct: 264 FNECGTCN 271
>gi|118388356|ref|XP_001027276.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89309046|gb|EAS07034.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 323
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 39/79 (49%)
Query: 94 IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGN 153
+ V+ Q CGSCW + T AL + IA + LS L+ CC A G C GGN
Sbjct: 127 VTAVKQQGGCGSCWTFSATGALESALIIAGKAEQTIDLSEQQLIDCCGASYGNLGCRGGN 186
Query: 154 PMRAWYYMLENGVPTGGDY 172
+A+ Y+ N + T +Y
Sbjct: 187 KDQAFRYVESNPITTEKNY 205
>gi|11499528|ref|NP_070770.1| cysteine proteinase [Archaeoglobus fulgidus DSM 4304]
gi|2648597|gb|AAB89309.1| cysteine proteinase, putative [Archaeoglobus fulgidus DSM 4304]
Length = 1088
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 19/118 (16%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD R + T + V+ Q +CGSCWA + AAL + + + LS HLL
Sbjct: 594 LPSRFDWR----DYTGLSAVRDQGSCGSCWAHSAVAALESALIVESGASSSIDLSEQHLL 649
Query: 138 TCCAAC----------TGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFDRGNCNC 185
+C C + GD C+GG P +A +++ NGVP SC + N NC
Sbjct: 650 SCEQDCEVGIGDWCWASSGD-CDGGWPHKALNFIINNGVPD----ESCFPYTATNGNC 702
>gi|301122279|ref|XP_002908866.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
gi|262099628|gb|EEY57680.1| cathepsin, cysteine protease family C01A, putative [Phytophthora
infestans T30-4]
Length = 396
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 52/98 (53%), Gaps = 9/98 (9%)
Query: 77 ELPEEFDLRKQYPN----CTNIGHVQLQSNCGSCWAIATTAALSDRMCIA---TQGRLD- 128
+ PE +D R T++ + + CGSCWA AT +ALSDR+ IA T GRLD
Sbjct: 139 DFPERWDWRDYNKTGISLTTSVMNQMVPRACGSCWAFATVSALSDRIRIARFKTTGRLDT 198
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L S +L C + G C GG+P A ++ ENG+
Sbjct: 199 EVLLSPQVLLDCGMRSFGS-CHGGDPRYAHKWIHENGI 235
>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
Length = 375
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 54/112 (48%), Gaps = 19/112 (16%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R++ + V++Q CGSCWA +TT A+ I+T+ L TLS L+
Sbjct: 146 LPESFDWREK----GAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLL--TLSEQQLV 199
Query: 138 TCCAACTGGDV------CEGGNPMRAWYYMLENG-------VPTGGDYGSCQ 176
C C D CEGG A+ Y++E G P G +G C+
Sbjct: 200 DCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTGKHGECK 251
>gi|62751833|ref|NP_001015747.1| cathepsin L1 precursor [Xenopus (Silurana) tropicalis]
gi|58477061|gb|AAH89683.1| MGC107932 protein [Xenopus (Silurana) tropicalis]
Length = 333
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 65/128 (50%), Gaps = 15/128 (11%)
Query: 51 LTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN-CGSCWAI 109
L P+ + NP + +E + Y S T +P+E D RK NC + V+ Q CGSCWA
Sbjct: 93 LLPREKSLNP---VKAESYS-YTSIT-IPKEVDWRK--SNC--VTPVKNQGTFCGSCWAF 143
Query: 110 ATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
AT + R CI T+ L+ LS L+ C G C GG P++A Y+ ++GV
Sbjct: 144 ATVGVMESRYCIRTKELLN--LSEQQLVDCDEINEG---CCGGFPIKALEYVAQHGVMRN 198
Query: 170 GDYGSCQR 177
+Y Q+
Sbjct: 199 KEYEYSQK 206
>gi|290974021|ref|XP_002669745.1| predicted protein [Naegleria gruberi]
gi|284083296|gb|EFC37001.1| predicted protein [Naegleria gruberi]
Length = 335
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 11/109 (10%)
Query: 63 QLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIAT---TAALSDRM 119
++G H ++N ++P FD+R+++P C V +N SC A+ T ++SDR
Sbjct: 99 RIGKRHVYHQENNDDIPLTFDVREKWPGC-----VFPANNIMSCSAVGTFTIVDSISDRF 153
Query: 120 CIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
CIAT G+ LSS ++L C G C+G + Y+ NG T
Sbjct: 154 CIATGGKFKKLLSSQYMLECDRDRQG---CQGAVESNIFSYLEGNGTTT 199
>gi|290973645|ref|XP_002669558.1| predicted protein [Naegleria gruberi]
gi|284083107|gb|EFC36814.1| predicted protein [Naegleria gruberi]
Length = 343
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 49/95 (51%), Gaps = 13/95 (13%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWA--------IATTAALSDRMCIATQGRLDHT 130
P FD R+++P C + V+ Q +CGSCWA ++ T LSDR CIA+ G ++
Sbjct: 85 PTNFDSRQKWPQCVHT--VRNQLDCGSCWAFWIEFNDLVSATKVLSDRFCIASNGSVNVI 142
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENG 165
+S + + C G C GG+ + W ++ G
Sbjct: 143 MSPQYQIDCNMDNLG---CSGGSLPKTWNFLTNVG 174
>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
Length = 321
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 54/112 (48%), Gaps = 19/112 (16%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R++ + V++Q CGSCWA +TT A+ I+T+ L TLS L+
Sbjct: 92 LPESFDWREK----GAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLL--TLSEQQLV 145
Query: 138 TCCAACTGGDV------CEGGNPMRAWYYMLENG-------VPTGGDYGSCQ 176
C C D CEGG A+ Y++E G P G +G C+
Sbjct: 146 DCDHMCDIRDKXACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTGKHGECK 197
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 59/114 (51%), Gaps = 10/114 (8%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
Y + ELP+ D RK+ + HV+ Q CGSCWA + AA+ I T+ + +L
Sbjct: 114 YHKHGELPKSIDWRKK----GAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLV--SL 167
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDYGSCQRFDRGNCN 184
S L+ C +G + CEGG+ A+ Y+ ++ G+ T +Y R GNCN
Sbjct: 168 SEQQLIDCDIK-SGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGR--DGNCN 218
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 3/97 (3%)
Query: 77 ELPEEFDLRKQYPNCTN-IGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
++P +FD R+Q+ + + G + ++SDR CI + + L++D
Sbjct: 87 DIPAQFDSRQQWQDWPHHPGDPGTKERADPVGHFGAVESMSDRHCIHSGAKNIVHLAADD 146
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
+L+CC C G C GG P AW Y ++ G+ TGG+Y
Sbjct: 147 VLSCCWGCGSG--CNGGFPAAAWSYWVDKGIVTGGNY 181
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 110 ATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
A++DR+CI + + +SS LL+CC +C G C GG P RAW + +ENG+ TG
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISSTDLLSCCESCGFG--CHGGFPPRAWDFWMENGLVTG 58
Query: 170 GDY---GSCQRFDRGNCN 184
G C+ + CN
Sbjct: 59 GSKENPSGCRSYPFPKCN 76
>gi|178057125|ref|NP_001116576.1| cathepsin Z precursor [Sus scrofa]
gi|147223312|emb|CAN13205.1| cathepsin Z [Sus scrofa]
Length = 304
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S ++LP +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 56 EYLSPSDLPRSWDWR----NVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRK 111
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS H++ C A + CEGG+ + W Y +G+P
Sbjct: 112 GAWPSTLLSVQHVIDCGNAGS----CEGGDDLPVWAYAHRHGIP 151
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 10/108 (9%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCI-----ATQGR 126
+ + LPE FD +Q+P+ ++ Q + G CWA+ A+SD +CI QG
Sbjct: 87 FAXDINLPESFDPXEQWPDXPX-REIRDQGSYGFCWALGALEAISDWICIHPNVGGAQGG 145
Query: 127 LDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
+S++ LTC GD C GG P W + G+ +GG Y S
Sbjct: 146 NHVEVSAEDKLTCLC----GDGCNGGXPNEGWNFWTGKGLVSGGLYDS 189
>gi|34328540|ref|NP_899159.1| cathepsin Z precursor [Rattus norvegicus]
gi|34978341|sp|Q9R1T3.2|CATZ_RAT RecName: Full=Cathepsin Z; AltName: Full=Cathepsin Y; Flags:
Precursor
gi|28971937|dbj|BAA82844.2| cathepsin Y [Rattus norvegicus]
gi|60688149|gb|AAH91110.1| Cathepsin Z [Rattus norvegicus]
gi|149029992|gb|EDL85084.1| cathepsin Z, isoform CRA_b [Rattus norvegicus]
Length = 306
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 55/102 (53%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y S +LP+ +D R Y + T H+ CGSCWA +T+AL+DR+ I +G
Sbjct: 57 EYLSPADLPKNWDWRNVNGVNYASVTRNQHIP--QYCGSCWAHGSTSALADRINIKRKGA 114
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
T LS +++ C A + CEGGN + W Y ++G+P
Sbjct: 115 WPSTLLSVQNVIDCGNAGS----CEGGNDLPVWEYAHKHGIP 152
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 53/102 (51%), Gaps = 5/102 (4%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP +F+ +++ + I V Q CGS W ++TT+ SDR I +QG+ LS ++L
Sbjct: 187 LPRKFNAVERWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNIL 244
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGSCQRFD 179
+C G CEGG+ AW Y+ + GV Y QR D
Sbjct: 245 SCTRRQQG---CEGGHLDAAWRYLHKKGVVDETCYPYTQRRD 283
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 51/90 (56%), Gaps = 5/90 (5%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
+LP +F+ +++ + I V Q CGS W ++TT+ SDR I +QG+ LS+ ++
Sbjct: 186 DLPRKFNAVEKWS--SYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSAQNI 243
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
L+C G CEGG+ AW Y+ + GV
Sbjct: 244 LSCTRRQQG---CEGGHLDAAWRYLHKKGV 270
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 59/125 (47%), Gaps = 19/125 (15%)
Query: 42 LSSLKFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQS 101
L KF L++ P Q+ Y NT LP FD R ++ + + V+ Q
Sbjct: 164 LEPEKFVLAMHPIKQK--------------YDRNT-LPMSFDGRIEWRD--TLQDVRDQG 206
Query: 102 NCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYM 161
CG+ WA +T A +DR+ I ++G + LS +LL C G C GG+ RAW YM
Sbjct: 207 WCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLLAC--NNRGQQGCNGGHLDRAWNYM 264
Query: 162 LENGV 166
GV
Sbjct: 265 RRFGV 269
>gi|66814230|ref|XP_641294.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|60469326|gb|EAL67320.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 291
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 12/113 (10%)
Query: 60 PDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTA 113
P + S+ +Y LP ++D R N + ++ + N CGSCWA TT+
Sbjct: 31 PTSIIKSQLPSEYIDEDTLPTQYDWR----NISGSSYITITRNQHLPQYCGSCWAHGTTS 86
Query: 114 ALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGV 166
AL DR+ I +G + + +L CA + C+GG+P A+ YM G+
Sbjct: 87 ALGDRIKIGRKGTFPEVVLAPQVLLNCAG--PDNTCDGGDPTEAYAYMAAKGI 137
>gi|999908|pdb|1HUC|A Chain A, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999910|pdb|1HUC|C Chain C, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421163|pdb|1CSB|A Chain A, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421166|pdb|1CSB|D Chain D, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920710|pdb|2IPP|A Chain A, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 47
Score = 55.5 bits (132), Expect = 9e-06, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 30/46 (65%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIAT 123
LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T
Sbjct: 1 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHT 46
>gi|348552534|ref|XP_003462082.1| PREDICTED: cathepsin Z-like [Cavia porcellus]
Length = 303
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 54/102 (52%), Gaps = 11/102 (10%)
Query: 71 DYQSNTELPEEFDLRK----QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGR 126
+Y + ++LP+ +D R Y + T H+ CGSCWA +T+A++DR+ I +G
Sbjct: 54 EYLTPSDLPKSWDWRNMNGVNYASVTRNQHIP--QYCGSCWAHGSTSAMADRINIKRKGA 111
Query: 127 LDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
LS H++ C A + CEGG + W Y E+G+P
Sbjct: 112 WPSALLSVQHVIDCGNAGS----CEGGEDLLVWKYAHEHGIP 149
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 42/89 (47%), Gaps = 5/89 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LP FD +P +G + Q CGS WA++TT SDR I ++GR L+ LL
Sbjct: 296 LPSHFDAADHWPRL--VGEARDQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLL 353
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGV 166
C C GG+ AW Y+ GV
Sbjct: 354 ACVRR---QQACSGGHLDTAWQYLRRVGV 379
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 47/91 (51%), Gaps = 8/91 (8%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+PE FD R+Q+ + I ++ Q CGSCWA T A SDR I + D LS + L+
Sbjct: 76 VPENFDARQQWG--SKIHAIRDQQQCGSCWAFGATEAFSDRFAINGK---DVILSPEDLV 130
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPT 168
+C G C GG AW Y+ ++G T
Sbjct: 131 SCDTNDYG---CNGGYMDVAWEYLADHGAAT 158
>gi|354468711|ref|XP_003496795.1| PREDICTED: cathepsin Z [Cricetulus griseus]
gi|11863537|emb|CAC18798.1| cathepsin Z [Cricetulus griseus]
gi|344237122|gb|EGV93225.1| Cathepsin Z [Cricetulus griseus]
Length = 306
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y S +++P+ +D R N + + + N CGSCWA +T+A++DR+ I +
Sbjct: 57 EYLSPSDIPKNWDWR----NVKGVNYASITRNQHIPQYCGSCWAHGSTSAMADRINIKRK 112
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G T LS +++ C A + CEGGN + W Y ++G+P
Sbjct: 113 GAWPSTLLSVQNVIDCGNAGS----CEGGNDLPVWAYAHKHGIP 152
>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
Length = 257
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 54/112 (48%), Gaps = 19/112 (16%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
LPE FD R++ + V++Q CGSCWA +TT A+ I+T+ L TLS L+
Sbjct: 17 LPESFDWREK----GAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLL--TLSEQQLV 70
Query: 138 TCCAACTGGDV------CEGGNPMRAWYYMLENG-------VPTGGDYGSCQ 176
C C D CEGG A+ Y++E G P G +G C+
Sbjct: 71 DCDHMCDIRDKTACDSGCEGGLMTNAYKYLIEAGGLEEESSYPYTGKHGECK 122
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 45/79 (56%), Gaps = 5/79 (6%)
Query: 108 AIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
A++ A+SDR+CI + G+ LS+ L++CC C G C+GG P AW Y + +G+
Sbjct: 42 AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSG--CDGGFPGPAWDYWVSHGIV 99
Query: 168 TGG---DYGSCQRFDRGNC 183
TGG ++ CQ + C
Sbjct: 100 TGGSKENHTGCQPYPFPKC 118
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLL 137
+P +FD R +YP C + Q +CGSCWA + DR C + + S HL+
Sbjct: 79 IPPQFDFRDEYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLI 136
Query: 138 TCCAACTGGDVCEGGNPMRAWYYMLENGVPTG 169
+C G C+GG+ W ++ G T
Sbjct: 137 SCSLENFG---CDGGDFQPTWSFLTFTGATTA 165
>gi|18676596|dbj|BAB84950.1| FLJ00196 protein [Homo sapiens]
gi|119628012|gb|EAX07607.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_c [Homo
sapiens]
Length = 284
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 47/97 (48%), Gaps = 6/97 (6%)
Query: 78 LPEEFDLRKQYPNCTNIGHVQL-QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
LP F+ +++PN H L Q NC WA +T A SDR+ I + G + LS +L
Sbjct: 69 LPTAFEASEKWPNLI---HEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNL 125
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYG 173
L+C G C GG AW+++ G GD G
Sbjct: 126 LSCDTHQQQG--CRGGRLDGAWWFLRRRGYAATGDVG 160
>gi|290980579|ref|XP_002673009.1| predicted protein [Naegleria gruberi]
gi|284086590|gb|EFC40265.1| predicted protein [Naegleria gruberi]
Length = 218
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 46/80 (57%), Gaps = 5/80 (6%)
Query: 74 SNTELPEEFDLRK--QYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
S+ ++P + L +Y NCT + ++ QS C +C+A ++DR CI++QG+++ L
Sbjct: 18 SSLDIPTNYTLTTDPKYMNCTQLHKIRDQSQCAACYAFGVAEMVADRYCISSQGKVNTIL 77
Query: 132 SSDHLLTC---CAACTGGDV 148
S +L+C C GGD+
Sbjct: 78 SPQFILSCDEYEGNCYGGDI 97
>gi|327288646|ref|XP_003229037.1| PREDICTED: cathepsin Z-like [Anolis carolinensis]
Length = 309
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 15/104 (14%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSN------CGSCWAIATTAALSDRMCIATQ 124
+Y +E+P+ +D R N + + N CGSCWA TT+AL+DR+ I +
Sbjct: 60 EYLKMSEIPKRWDWR----NVNGVNYASPTRNQGVPQFCGSCWAHGTTSALADRINIKRK 115
Query: 125 GRLDHT-LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVP 167
G LS H++ C+G C+GG W+Y +G+P
Sbjct: 116 GAWPSAFLSVQHVVD----CSGAGSCKGGYDYYVWFYAHNHGIP 155
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.445
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,408,134,951
Number of Sequences: 23463169
Number of extensions: 144234522
Number of successful extensions: 259795
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1181
Number of HSP's successfully gapped in prelim test: 3669
Number of HSP's that attempted gapping in prelim test: 255818
Number of HSP's gapped (non-prelim): 5009
length of query: 185
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 51
effective length of database: 9,215,130,721
effective search space: 469971666771
effective search space used: 469971666771
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 72 (32.3 bits)