BLASTP 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025695
(249 letters)

Database: nr
23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 439 bits (1130), Expect = e-121, Method: Compositional matrix adjust.
Identities = 199/237 (83%), Positives = 221/237 (93%), Gaps = 2/237 (0%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
+GHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCGDGCDGGYP+ AWRYFV
Sbjct: 140 EGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFV 199
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
HHGVVTEECDPYFD+ GCSHPGCEP +PTPKCVRKC+ KNQLWR SKHYS++AYRI+SDP
Sbjct: 200 HHGVVTEECDPYFDNIGCSHPGCEPGFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDP 259
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
D+MAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG+VMGGHAVKLIGWGTSD+GEDYW+
Sbjct: 260 HDVMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWL 319
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN--LVKEITSADMFEDASA 249
LANQWNR WG DGYFKI+RG+NECGIE+D VAGLPS++N LV+E+ S D EDA A
Sbjct: 320 LANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLPSARNLDLVREVASMDALEDAFA 376
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 198/238 (83%), Positives = 217/238 (91%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIH+GMN+SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGYPISAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YFVHHGVVTEECDPYFD GCSHPGCEP YPTPKC RKCV KNQLW+ SKHY + YRI+
Sbjct: 180 YFVHHGVVTEECDPYFDDIGCSHPGCEPGYPTPKCARKCVNKNQLWKKSKHYGVKPYRID 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDPE IMAEIYKNGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGTS+DGE
Sbjct: 240 SDPESIMAEIYKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDGEA 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
YW+LANQWNR WG DGYFKI+RG+NECGIE DVVAGLPS++NLV+E+ S D EDASA
Sbjct: 300 YWLLANQWNRGWGDDGYFKIRRGTNECGIEGDVVAGLPSTRNLVREVVSVDAREDASA 357
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 432 bits (1112), Expect = e-119, Method: Compositional matrix adjust.
Identities = 192/229 (83%), Positives = 214/229 (93%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QGHCGSCWAFGAVE+LSDRFCIHF MN++LSVNDLLACCGF+CGDGCDGGYPIS
Sbjct: 118 IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPIS 177
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AWRYFV HGVVTE+CDPYFD+TGCSHPGCEPAYPTP+CVR CV KNQ+WR +KHY +SAY
Sbjct: 178 AWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAY 237
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R+ DP DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+DD
Sbjct: 238 RVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDD 297
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
GEDYW+LANQWNR WG DGYFKI+RG+NECGIEEDVVAGLPS+KN+ +E
Sbjct: 298 GEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLPSTKNIARE 346
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 432 bits (1112), Expect = e-119, Method: Compositional matrix adjust.
Identities = 192/229 (83%), Positives = 214/229 (93%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QGHCGSCWAFGAVE+LSDRFCIHF MN++LSVNDLLACCGF+CGDGCDGGYPIS
Sbjct: 117 IGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLACCGFMCGDGCDGGYPIS 176
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AWRYFV HGVVTE+CDPYFD+TGCSHPGCEPAYPTP+CVR CV KNQ+WR +KHY +SAY
Sbjct: 177 AWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVDKNQIWRKTKHYGVSAY 236
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R+ DP DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT+DD
Sbjct: 237 RVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDD 296
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
GEDYW+LANQWNR WG DGYFKI+RG+NECGIEEDVVAGLPS+KN+ +E
Sbjct: 297 GEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLPSTKNIARE 345
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 192/237 (81%), Positives = 215/237 (90%), Gaps = 2/237 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEPAY TPKCV+KCV NQ+W+ SKHYS+SAYR+N
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVSAYRVN 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+YKNGPVEV+FTVYEDFA+YKSGVYKHITG +GGHAVKLIGWGT+DDGED
Sbjct: 240 SDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGED 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW+LANQWNR WG DGYFKI+RG+NECGIEEDV AGLPS+KNLV+E+T DM DA+
Sbjct: 300 YWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKNLVREVT--DMDADAA 354
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 422 bits (1085), Expect = e-116, Method: Compositional matrix adjust.
Identities = 193/237 (81%), Positives = 212/237 (89%)
Query: 13 VIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
V GHCGSCWAFGAVE+LSDRFCIH+GMNLSLSVNDLLACCG++CGDGCDGGYPI AWRY
Sbjct: 89 VPLGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRY 148
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
FV GVVTEECDPYFD GCSHPGCEP +PTPKC RKC KN+LW SKH+S++AYRI+S
Sbjct: 149 FVQSGVVTEECDPYFDDIGCSHPGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRIDS 208
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
DP IMAE+ NGPVEV+FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY
Sbjct: 209 DPHSIMAEVSMNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 268
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
W+LANQWNR WG DGYFKI+RG+NECGIEEDVVAGLPS++NLV+E+ D E ASA
Sbjct: 269 WLLANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLPSTRNLVREVAKIDAHEHASA 325
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 422 bits (1084), Expect = e-116, Method: Compositional matrix adjust.
Identities = 187/232 (80%), Positives = 210/232 (90%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+L DRFCIHF MN+SLSVNDLLACCGFLCG GCDGG PI AWR
Sbjct: 122 ILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWR 181
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+
Sbjct: 182 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVK 241
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP+DIMAE+YKNGPVEV+FTV+EDFAHYKSGVYKHITG +GGHAVKLIGWGTSD+GED
Sbjct: 242 SDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 301
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADM 243
YW+LANQWN +WG DGYFKIKRG+NECGIE+DV AGLPS+KN+V+E+T D+
Sbjct: 302 YWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDV 353
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 187/232 (80%), Positives = 210/232 (90%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+L DRFCIHF MN+SLSVNDLLACCGFLCG GCDGG PI AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVK 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP+DIMAE+YKNGPVEV+FTV+EDFAHYKSGVYKHITG +GGHAVKLIGWGTSD+GED
Sbjct: 240 SDPQDIMAEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADM 243
YW+LANQWN +WG DGYFKIKRG+NECGIE+DV AGLPS+KN+V+E+T D+
Sbjct: 300 YWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDV 351
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 190/231 (82%), Positives = 212/231 (91%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWR
Sbjct: 125 ILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWR 184
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF HHGVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV NQLWR SKHY +SAY++
Sbjct: 185 YFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVR 244
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGED
Sbjct: 245 SHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGED 304
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
YW+LANQWNRSWG DGYFKI+RG+NECGIE VVAGLPS +N+VK IT++D
Sbjct: 305 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSD 355
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 189/231 (81%), Positives = 211/231 (91%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWR
Sbjct: 123 ILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGYPIAAWR 182
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF HHGVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV NQLWR SKHY +SAY++
Sbjct: 183 YFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVR 242
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGED
Sbjct: 243 SHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGED 302
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
YW+LANQWNRSWG DGYFKI+RG+NECGIE VVAGLPS +N+ K IT++D
Sbjct: 303 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKGITTSD 353
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 419 bits (1076), Expect = e-115, Method: Compositional matrix adjust.
Identities = 188/237 (79%), Positives = 214/237 (90%), Gaps = 2/237 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AW+
Sbjct: 119 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWQ 178
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEPAY TPKCV+KCV NQ+W+ SKHYS++AYR++
Sbjct: 179 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVNAYRVS 238
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIM E+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGT++DGED
Sbjct: 239 SDPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHAVKLIGWGTTEDGED 298
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW+LANQWNR WG DGYFKI+RG+NECGIEEDV AGLPS+KNLV+E+T DM DA+
Sbjct: 299 YWLLANQWNREWGDDGYFKIRRGTNECGIEEDVTAGLPSTKNLVREVT--DMDADAA 353
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 190/231 (82%), Positives = 212/231 (91%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI + MN+SLSVNDLLACCGFLCG GC+GGYPI+AWR
Sbjct: 56 ILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWR 115
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF HHGVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV NQLWR SKHY +SAY++
Sbjct: 116 YFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVR 175
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGED
Sbjct: 176 SHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGED 235
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
YW+LANQWNRSWG DGYFKI+RG+NECGIE VVAGLPS +N+VK IT++D
Sbjct: 236 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVVKGITTSD 286
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 185/232 (79%), Positives = 208/232 (89%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+L DRFC HF MN+SLSVNDLLACCGFLCG GCDGG PI AWR
Sbjct: 122 ILDQGHCGSCWAFGAVESLQDRFCSHFDMNISLSVNDLLACCGFLCGAGCDGGTPIYAWR 181
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEPAY TPKCVRKCVK NQ+W+ SKHYS+ AYR+
Sbjct: 182 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKRSKHYSVKAYRVK 241
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP+DIM E+YKNGPVEV+FTV+EDFAHYKSGVYKHITG +GGHAVKLIGWGTSD+GED
Sbjct: 242 SDPQDIMTEVYKNGPVEVAFTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTSDEGED 301
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADM 243
YW+LANQWN +WG DGYFKIKRG+NECGIE+DV AGLPS+KN+V+E+T D+
Sbjct: 302 YWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLPSTKNIVREVTDMDV 353
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 416 bits (1070), Expect = e-114, Method: Compositional matrix adjust.
Identities = 188/224 (83%), Positives = 207/224 (92%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGYPLYAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF+HHGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKC +NQLWR +K Y SAYRI+
Sbjct: 180 YFIHHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQLWRKAKRYGQSAYRIS 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP IMAE+YKNGPVEV+FTVYEDFAHY+SGVY++ TGDVMGGHAVKLIGWGT+DDGED
Sbjct: 240 SDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGED 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
YWILANQWNR+WG DGYF I+RG NECGIEE VVAGLPSSKNL+
Sbjct: 300 YWILANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLM 343
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 188/221 (85%), Positives = 205/221 (92%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QGHCGSCWAFGAVE+LSDRFCIHFGMN+SLSVNDLLACCGFLCG GCDGGYP+ AWRYF+
Sbjct: 157 QGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFI 216
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
HHGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKC +NQLWR +K Y SAYRI+SDP
Sbjct: 217 HHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQLWRKAKRYGQSAYRISSDP 276
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
IMAE+YKNGPVEV+FTVYEDFAHY+SGVY++ TGDVMGGHAVKLIGWGT+DDGEDYWI
Sbjct: 277 YQIMAEVYKNGPVEVAFTVYEDFAHYESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWI 336
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
LANQWNR+WG DGYF I+RG NECGIEE VVAGLPSSKNL+
Sbjct: 337 LANQWNRNWGDDGYFMIRRGVNECGIEEGVVAGLPSSKNLM 377
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 416 bits (1068), Expect = e-114, Method: Compositional matrix adjust.
Identities = 188/238 (78%), Positives = 212/238 (89%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+
Sbjct: 125 ILDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQ 184
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF + GVVTEECDPYFD TGCSHPGCEPAYPTPKC+RKCV NQLW SKHYS+S Y +
Sbjct: 185 YFSYSGVVTEECDPYFDDTGCSHPGCEPAYPTPKCMRKCVSGNQLWSQSKHYSVSTYTVK 244
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGT+D+GED
Sbjct: 245 SNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEGED 304
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
YW+LANQWNRSWG DGYF I+RG+NECGIE++ VAGLPSS+N+ K IT +D AS
Sbjct: 305 YWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFKVITGSDDLSVASV 362
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 185/244 (75%), Positives = 215/244 (88%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGG 64
N + ++ QGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDLLACCGFLCGDGCDGG
Sbjct: 112 NCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDGCDGG 171
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
YP+ AW+YFV GVVT+ECDPYFD+ GCSHPGCEPAYPTPKC RKCVK+N LW SKH+
Sbjct: 172 YPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKSKHFG 231
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
++AY I+SDP IM E+YKNGPVEVSFTVYEDFAHYKSGVYKH+TGDVMGGHAVKLIGWG
Sbjct: 232 VNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWG 291
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMF 244
TS+DGEDYW+LANQWNR WG DGYFKI+RG++EC IE++VVAGLPS++NL E+ +D F
Sbjct: 292 TSEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEIEDEVVAGLPSARNLNMELDVSDAF 351
Query: 245 EDAS 248
DA+
Sbjct: 352 LDAA 355
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 184/248 (74%), Positives = 217/248 (87%)
Query: 1 MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG 60
+ ++N + ++ QGHCGSCWAFGAVE+LSDRFCIH+G+N+SLS NDL ACCGFLCGDG
Sbjct: 108 VAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDG 167
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
CDGGYP+ AW+YFV GVVT+ECDPYFD+ GCSHPGCEPAYPTPKC RKCVK+N LW S
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRS 227
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
KH+ ++AY I+SDP IM E+YKNGPVEVSFTVYEDFAHYKSGVYKH+TGD+MGGHAVKL
Sbjct: 228 KHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDIMGGHAVKL 287
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITS 240
IGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG+NEC IE++VVAGLPS++NL E+
Sbjct: 288 IGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEIEDEVVAGLPSARNLNVELDV 347
Query: 241 ADMFEDAS 248
+D F DA+
Sbjct: 348 SDAFLDAA 355
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 187/238 (78%), Positives = 209/238 (87%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIH+GMNLSLSVNDLLACCG++CG GCDGG PI AWR
Sbjct: 102 ILDQGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLACCGWMCGAGCDGGSPIDAWR 161
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YFV GVVTEECDPYFD GCSHPGCEP +PTPKC RKC KN+LW SKH+S++AYRI+
Sbjct: 162 YFVQSGVVTEECDPYFDDIGCSHPGCEPGFPTPKCERKCADKNKLWAESKHFSVNAYRID 221
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP IMAE+ NGPVEV+FTVYEDFAHYKSGVYKHITGD MGGHAVKLIGWGTS+DGED
Sbjct: 222 SDPHSIMAEVSSNGPVEVAFTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTSEDGED 281
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDASA 249
YW+LANQWNR WG DGYFKIKRG+NECGIE VVAGLPS++NLV+E+ D E A+A
Sbjct: 282 YWLLANQWNRGWGDDGYFKIKRGTNECGIEGAVVAGLPSTRNLVREVAGIDGHEHATA 339
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 182/237 (76%), Positives = 208/237 (87%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIH +N+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHLDVNVSLSVNDLLACCGFLCGSGCDGGYPLYAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEPAY TPKCVRKCVK NQ+W+ SK++S++AY +
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVRKCVKGNQIWKKSKYFSVNAYSVK 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGT+D+GED
Sbjct: 240 SDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHAVKLIGWGTTDEGED 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW++ANQWNRSWG DGYF I+RG+NECGIEEDV AGLPS+KN+ + + D D S
Sbjct: 300 YWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGLPSTKNMGRWVMDMDADADVS 356
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 183/237 (77%), Positives = 211/237 (89%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+
Sbjct: 122 ILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQ 181
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF + GVVTEECDPYFD+TGCSHPGCEPAYPTP+C+RKCV N+LW SKHYS+S Y +N
Sbjct: 182 YFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPRCLRKCVSDNKLWSESKHYSVSTYTVN 241
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S P+DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTS++GED
Sbjct: 242 SSPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEGED 301
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW++ANQWNR WG DGYF I+RG+NECGIE++ VAGLPSS+N+ K T ++ AS
Sbjct: 302 YWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSRNVFKVDTGSNDLPVAS 358
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 180/233 (77%), Positives = 206/233 (88%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QGHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI
Sbjct: 163 IGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIM 222
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AWRYFV +GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+W KH+S++AY
Sbjct: 223 AWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAY 282
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D
Sbjct: 283 RVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDA 342
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+ SA
Sbjct: 343 GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVRNYDSA 395
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 182/237 (76%), Positives = 204/237 (86%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW
Sbjct: 114 ILDQGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWI 173
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEP Y TPKCV+KCV NQLW SKHYS+ AY +N
Sbjct: 174 YLAHHGVVTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVN 233
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKL+GWGTS +GED
Sbjct: 234 SDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGED 293
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW+LANQWN +WG DGYFKIKRG+NECGIE V AGLPS+KN+V+E+T D+ D S
Sbjct: 294 YWLLANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVS 350
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 182/237 (76%), Positives = 204/237 (86%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHF MN+SLSVND+LACCG LCG GC GG P SAW
Sbjct: 119 ILDQGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWI 178
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEP Y TPKCV+KCV NQLW SKHYS+ AY +N
Sbjct: 179 YLAHHGVVTEECDPYFDQIGCSHPGCEPTYRTPKCVKKCVNGNQLWETSKHYSVKAYTVN 238
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKL+GWGTS +GED
Sbjct: 239 SDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEGED 298
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW+LANQWN +WG DGYFKIKRG+NECGIE V AGLPS+KN+V+E+T D+ D S
Sbjct: 299 YWLLANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLPSTKNIVREVTDMDVDADVS 355
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 183/237 (77%), Positives = 209/237 (88%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+
Sbjct: 122 ILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQ 181
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF + GVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV N+LW SKHYS+S Y +
Sbjct: 182 YFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVK 241
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S+P+DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTS +GED
Sbjct: 242 SNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGED 301
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW++ANQWNR WG DGYF I+RG+NECGIE++ VAGLPSSKN+ + T ++ AS
Sbjct: 302 YWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVFRVDTGSNDLPVAS 358
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 180/233 (77%), Positives = 206/233 (88%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QGHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI
Sbjct: 118 IGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIM 177
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AWRYFV +GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+W KH+S++AY
Sbjct: 178 AWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAY 237
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D
Sbjct: 238 RVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDA 297
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+ SA
Sbjct: 298 GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVRNYDSA 350
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 402 bits (1033), Expect = e-110, Method: Compositional matrix adjust.
Identities = 182/221 (82%), Positives = 203/221 (91%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI FGMN++LSVNDLLACCGF CGDGCDGGYPISAW+
Sbjct: 123 ILDQGHCGSCWAFGAVESLSDRFCIQFGMNITLSVNDLLACCGFRCGDGCDGGYPISAWQ 182
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF + GVVTEECDPYFD TGCSHPGCEPAY TP+C+RKCV +NQLW SKHYSI+ Y +
Sbjct: 183 YFSYSGVVTEECDPYFDQTGCSHPGCEPAYNTPQCLRKCVGRNQLWSESKHYSINTYVVE 242
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S+P+DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGT+DDGED
Sbjct: 243 SNPQDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDGED 302
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
YW+LANQWNRSWG DGYF I+RG+NECGIE++ VAGLPSSK
Sbjct: 303 YWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLPSSK 343
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 402 bits (1032), Expect = e-110, Method: Compositional matrix adjust.
Identities = 180/234 (76%), Positives = 206/234 (88%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
I I GHCGSCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+
Sbjct: 117 TSIRRILGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMG 176
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW SKHY + AY
Sbjct: 177 AWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAY 236
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
RIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG +GGHAVKLIGWGTSDD
Sbjct: 237 RINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDD 296
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
GEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 297 GEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 350
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 182/233 (78%), Positives = 206/233 (88%)
Query: 16 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
GHCGSCWAFGAVE+LSDRFCI FGMN+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF +
Sbjct: 126 GHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSY 185
Query: 76 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 135
GVVTEECDPYFD+TGCSHPGCEPAYPTPKC RKCV N+LW SKHYS+S Y + S+P+
Sbjct: 186 SGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQ 245
Query: 136 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 195
DIMAE+YKNGPVEVSFTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTS +GEDYW++
Sbjct: 246 DIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLM 305
Query: 196 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
ANQWNR WG DGYF I+RG+NECGIE++ VAGLPSSKN+ + T ++ AS
Sbjct: 306 ANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLPSSKNVFRVDTGSNDLPVAS 358
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 178/229 (77%), Positives = 205/229 (89%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
+ GHCGSCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ AW YF
Sbjct: 144 LLGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYF 203
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
+HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW SKHY + AYRIN D
Sbjct: 204 KYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINPD 263
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
P+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG +GGHAVKLIGWGTSDDGEDYW
Sbjct: 264 PQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYW 323
Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 324 LLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 372
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 181/230 (78%), Positives = 201/230 (87%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE L DRFCIH M++ LSVNDLLACCGF+CGDGCDGGYPI AWR
Sbjct: 112 ILEQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWR 171
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YFV +GVVT+ECDPYFD GC HPGCEPAYPTPKC +KC ++NQ+W+ KH+SI AYRIN
Sbjct: 172 YFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGTSD GED
Sbjct: 232 SDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGED 291
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
YW+LANQWNR WG DGYFKI RG NECGIEE VVAG+PS+KN+V A
Sbjct: 292 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMPSTKNMVPNFGGA 341
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 181/230 (78%), Positives = 201/230 (87%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE L DRFCIH M++ LSVNDLLACCGF+CGDGCDGGYPI AWR
Sbjct: 112 ILDQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWR 171
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YFV +GVVT+ECDPYFD GC HPGCEPAYPTPKC +KC ++NQ+W+ KH+SI AYRIN
Sbjct: 172 YFVQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQVWQEKKHFSIDAYRIN 231
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGTSD GED
Sbjct: 232 SDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSDAGED 291
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
YW+LANQWNR WG DGYFKI RG NECGIEE VVAG+PS+KN+V A
Sbjct: 292 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVVAGMPSTKNMVPNFGGA 341
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 399 bits (1024), Expect = e-109, Method: Compositional matrix adjust.
Identities = 177/228 (77%), Positives = 203/228 (89%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QGHCGSCWAFGAVE L DRFCIH MN+SLSVNDL+ACCGF+CGDGCDGGYPIS
Sbjct: 114 IGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLVACCGFMCGDGCDGGYPIS 173
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW+Y V +GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+W+ KH+SI+AY
Sbjct: 174 AWQYLVENGVVTDECDPYFDQVGCKHPGCEPAYPTPACEKKCKVQNQVWQEKKHFSINAY 233
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVY+HITG++MGGHAVKLIGWGTS D
Sbjct: 234 RVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYEHITGEMMGGHAVKLIGWGTSAD 293
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
G+DYW+LANQWNR WG DGYFKI RG NECGIEEDVVAG+PS+KN V+
Sbjct: 294 GKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNTVR 341
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 398 bits (1023), Expect = e-109, Method: Compositional matrix adjust.
Identities = 181/237 (76%), Positives = 205/237 (86%), Gaps = 2/237 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIH+ +++SLSVNDLLACC FLCG GCDGGYPI+AWR
Sbjct: 119 ILDQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWR 178
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVTEECDPYFD+TGCSHPGCEP YPTPKC RKCVK N LWR SKHY ++AYR++
Sbjct: 179 YFKRSGVVTEECDPYFDTTGCSHPGCEPLYPTPKCHRKCVKGNVLWRKSKHYGVNAYRVS 238
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
DP+ IMAE+YKNGPVEVSFTVYEDFAHYKSGVYKH+TG MGGHAVKLIGWGTS+ GED
Sbjct: 239 HDPQSIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHAVKLIGWGTSEQGED 298
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
YW++ N WNR WG DGYFKI+RG+NECGIE VVAGLPS++NL E+ D DAS
Sbjct: 299 YWLIVNSWNRGWGEDGYFKIRRGTNECGIEHSVVAGLPSARNLNVEL--GDAVLDAS 353
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 179/226 (79%), Positives = 202/226 (89%)
Query: 16 GHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
GHCGSCWAFGAVE L DRFCIHF MN+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV
Sbjct: 1 GHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVR 60
Query: 76 HGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 135
+GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+W KH+S++AYR+NSDP
Sbjct: 61 NGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPH 120
Query: 136 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 195
DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+L
Sbjct: 121 DIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLL 180
Query: 196 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
ANQWNR WG DGYFKI RG+NECGIEEDVVAG+PS+KN+V+ SA
Sbjct: 181 ANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMPSTKNMVRNYDSA 226
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 395 bits (1015), Expect = e-108, Method: Compositional matrix adjust.
Identities = 177/223 (79%), Positives = 200/223 (89%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCG+CWAF AVE+L DRFCIH M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 120 ILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC +NQ+W+ +KH+S++AYR++
Sbjct: 180 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQVWKKNKHFSVNAYRVH 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 240 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+KN+
Sbjct: 300 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSTKNM 342
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 177/223 (79%), Positives = 199/223 (89%), Gaps = 1/223 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+
Sbjct: 115 ILDQGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQ 174
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVT ECDPYFD TGCSHPGCEPAYPTP C +KCVKKN LW SKH+S++AYR+N
Sbjct: 175 YFKRTGVVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVN 234
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SD IM E+Y NGP EVSFTVYEDFAHYKSGVYKH+TG MGGHAVKLIGWGTS+DGED
Sbjct: 235 SDQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGED 294
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
YW+LANQWNRSWG DGYFKI RG+NECGI EDV AG+PS+KNL
Sbjct: 295 YWLLANQWNRSWGDDGYFKIIRGTNECGI-EDVTAGMPSTKNL 336
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 177/223 (79%), Positives = 199/223 (89%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCG+CWAF AVE+L DRFCIH M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 120 ILDQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC +NQ+W+ +KH S++AYR++
Sbjct: 180 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQVWKKNKHSSVNAYRVH 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 240 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+KN+
Sbjct: 300 YWLLANQWNRGWGGDGYFKIIRGKNECGIEEDVTAGMPSTKNM 342
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 178/224 (79%), Positives = 197/224 (87%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE L DRFCIH MN+SLS NDL+ACCGF+CGDGCDGGYPISAW+
Sbjct: 115 ILDQGHCGSCWAFGAVECLQDRFCIHHNMNISLSANDLVACCGFMCGDGCDGGYPISAWQ 174
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YFV +GVVTEECDPYFD GC HPGCEPAYPTP C +KC +NQ+W+ KH+SI AY++N
Sbjct: 175 YFVQNGVVTEECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWQEKKHFSIDAYQVN 234
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 235 SDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 294
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS KN+
Sbjct: 295 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSMKNIA 338
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 177/223 (79%), Positives = 198/223 (88%), Gaps = 1/223 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+L+DRFCIH+G N++LSVNDLLACCGFLCG+GCDGGYPI+AW+
Sbjct: 115 ILDQGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQ 174
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVT ECDPYFD TGCSHPGCEPAYPTP C +KCVKKN LW SKH+S++AYR+N
Sbjct: 175 YFKRTGVVTSECDPYFDQTGCSHPGCEPAYPTPACEKKCVKKNLLWSESKHFSVNAYRVN 234
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SD IM E+Y NGP EVSFTVYEDFAHYKSGVYKH+TG MGGHAVKLIGWGTS+DGED
Sbjct: 235 SDQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHAVKLIGWGTSEDGED 294
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
YW+LANQWNRSWG DGYFKI RG+NECGI EDV AG PS+KNL
Sbjct: 295 YWLLANQWNRSWGGDGYFKIIRGTNECGI-EDVTAGTPSTKNL 336
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 178/233 (76%), Positives = 200/233 (85%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QGHCGSCWAFGAVE L DRFCIH +N+SLS NDL+ACCGF+CGDGCDGGYPI
Sbjct: 110 IGTILDQGHCGSCWAFGAVECLQDRFCIHQNINISLSANDLVACCGFMCGDGCDGGYPIK 169
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW+YFV GVVTEECDPYFD GC HPGCEPAY TPKC +KC +NQ+W KH+SI+AY
Sbjct: 170 AWQYFVQSGVVTEECDPYFDQVGCKHPGCEPAYDTPKCEKKCKVQNQVWEEKKHFSINAY 229
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R+NSDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG VMGGHAVKLIGWGTSD
Sbjct: 230 RVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGGVMGGHAVKLIGWGTSDA 289
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
GEDYW+LANQWNR WG DGYFKI RG NECGIEE+VVAG+PS+KN+ SA
Sbjct: 290 GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEEVVAGMPSTKNMAGNHGSA 342
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 176/233 (75%), Positives = 199/233 (85%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QGHCGSCWAFGAVE L DRFCIH MN+SLSVNDLLACCGFLCG GC+GGYPIS
Sbjct: 113 IGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLLACCGFLCGSGCNGGYPIS 172
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AWRYF GVVT+ECDPYFD GC HPGCEPAY TPKC +KC +N++W+ KH+S+ AY
Sbjct: 173 AWRYFRRKGVVTDECDPYFDQVGCKHPGCEPAYRTPKCEKKCKVQNEVWKEQKHFSVDAY 232
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R++S+P DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD
Sbjct: 233 RVHSNPHDIMAEVYTNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDA 292
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSA 241
GEDYW+LANQWNR WG DGYFKI RG NECGIEEDVVAG+PS+KN+ + A
Sbjct: 293 GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVVAGMPSTKNMARNYDDA 345
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 175/224 (78%), Positives = 197/224 (87%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE L DRFCIH MN++LS NDL+ACCGF+CGDGCDGGYPISAW+
Sbjct: 76 ILDQGHCGSCWAFGAVECLQDRFCIHHNMNITLSANDLVACCGFMCGDGCDGGYPISAWQ 135
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YFV +GVVT+ECDPYFD GC HPGCEPAYPTP C +KC +NQ+W KH+SI+AY++N
Sbjct: 136 YFVQNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWEEKKHFSINAYQVN 195
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+Y NGPVEV+FTVYEDFAHYKSGVYKHITG VMGGHAVKLIGWGTSD GED
Sbjct: 196 SDPHDIMAEVYNNGPVEVAFTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAGED 255
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
YW+LANQWNR WG DGYFKI RG NECGIEEDV AG+PS+KN+
Sbjct: 256 YWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVTAGMPSTKNIA 299
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 172/208 (82%), Positives = 191/208 (91%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GCDGGYP+ AWR
Sbjct: 120 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWR 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y HHGVVTEECDPYFD GCSHPGCEPAY TPKCV+KCV NQ+W+ SKHYS+SAYR+N
Sbjct: 180 YLAHHGVVTEECDPYFDQIGCSHPGCEPAYRTPKCVKKCVSGNQVWKKSKHYSVSAYRVN 239
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+YKNGPVEV+FTVYEDFA+YKSGVYKHITG +GGHAVKLIGWGT+DDGED
Sbjct: 240 SDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDGED 299
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECG 219
YW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 300 YWLLANQWNREWGDDGYFKIRRGTNECG 327
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 174/227 (76%), Positives = 196/227 (86%), Gaps = 2/227 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCG+CWAF AVEAL DRFCIH M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 116 ILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 175
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC +NQ W+ +KH+S++AYR++
Sbjct: 176 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVENQAWKENKHFSVNAYRVH 235
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
S+P DIMAE+YKNGPVEV+FT + DFAHYKSGVYKHITG VMGGHAVKLIGWGTSD G
Sbjct: 236 SNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAG 295
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
EDYW+LANQWNR WG DGYFKI RG NECGIE DV AG+PS+KN +
Sbjct: 296 EDYWLLANQWNRGWGDDGYFKIIRGENECGIEGDVTAGMPSTKNTAR 342
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/231 (78%), Positives = 209/231 (90%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCI + +N+SLS ND++ACCG LCG GC+GG+P+ AW
Sbjct: 122 ILDQGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVVACCGLLCGLGCNGGFPMGAWL 181
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF +HGVVTEECDPYFD+TGCSHPGCEP YPTPKCVRKCV +NQLW SKHY +SAYRIN
Sbjct: 182 YFKYHGVVTEECDPYFDNTGCSHPGCEPGYPTPKCVRKCVSENQLWGESKHYGVSAYRIN 241
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHITG +GGHAVKLIGWGTSDDGED
Sbjct: 242 HDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDGED 301
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
YW+LANQWNRSWG DGYFKI+RG+NECGIE VVAGLPS +N+ K++T++D
Sbjct: 302 YWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVFKDVTTSD 352
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 169/224 (75%), Positives = 194/224 (86%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCG+CWAFGAVE L DRFCIH +N+SLSVNDL+ACCGFLCGDGCDGGYPI AW+
Sbjct: 116 ILDQGHCGACWAFGAVECLQDRFCIHHSVNVSLSVNDLVACCGFLCGDGCDGGYPIFAWQ 175
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YFV +GVVT+ECDP+FD GC HPGCEPAYPTP C +KC +NQ+W KH+SI AY++N
Sbjct: 176 YFVENGVVTDECDPFFDQVGCQHPGCEPAYPTPVCEKKCKVQNQVWEEKKHFSIDAYQVN 235
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP DIMAE+YKNGPVEVSF +YEDFAHYKSGVYK ITG ++GGHA KLIGWGTSD GED
Sbjct: 236 SDPHDIMAEVYKNGPVEVSFIIYEDFAHYKSGVYKQITGRMVGGHAAKLIGWGTSDAGED 295
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
YW+LANQWNR WG DGYFKI RG+NECGIE DV AG+PS+KN+
Sbjct: 296 YWLLANQWNRGWGDDGYFKIIRGTNECGIEGDVNAGMPSTKNIA 339
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 169/229 (73%), Positives = 197/229 (86%), Gaps = 1/229 (0%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
V+ ++ QGHCGSCWAFGAVEALSDRFCIH +N++LS NDL+ACCGF+CGDGCDGGYPIS
Sbjct: 112 VQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPIS 171
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW+YF+ GVVT ECDPYFD GC HPGCEP YPTP+CV++C +NQ W NSK +S +AY
Sbjct: 172 AWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAY 231
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
RI+S P DIMAE+Y NGPVEVSF+VYEDFAHYKSGVYK+ GD MGGHAVKL+GWGT +D
Sbjct: 232 RISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-ED 290
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
G DYW++AN WN +WG DGYFKI RGSNECGIE DVVAG+PS+KNLV +
Sbjct: 291 GTDYWLVANSWNTAWGEDGYFKIARGSNECGIEGDVVAGMPSTKNLVMD 339
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 169/229 (73%), Positives = 197/229 (86%), Gaps = 1/229 (0%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
V+ ++ QGHCGSCWAFGAVEALSDRFCIH +N++LS NDL+ACCGF+CGDGCDGGYPIS
Sbjct: 112 VQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDGGYPIS 171
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW+YF+ GVVT ECDPYFD GC HPGCEP YPTP+CV++C +NQ W NSK +S +AY
Sbjct: 172 AWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDENQKWGNSKRFSATAY 231
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
RI+S P DIMAE+Y NGPVEVSF+VYEDFAHYKSGVYK+ GD MGGHAVKL+GWGT +D
Sbjct: 232 RISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVYKYTKGDYMGGHAVKLVGWGT-ED 290
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
G DYW++AN WN +WG DGYFKI RGSNECGIE DVVAG+PS+KNLV +
Sbjct: 291 GTDYWLVANSWNTAWGEDGYFKIARGSNECGIEGDVVAGMPSTKNLVMD 339
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 167/229 (72%), Positives = 194/229 (84%), Gaps = 1/229 (0%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
V ++ QGHCGSCWAFGAVEALSDRFCIH+ +N++LS NDL+ACCGF CGDGCDGGYP+S
Sbjct: 112 VRTILDQGHCGSCWAFGAVEALSDRFCIHYKVNVTLSENDLVACCGFRCGDGCDGGYPLS 171
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW+YF+ GVVT ECDPYFD GC HPGCEP YPTP+CV++C +NQ W NSK +S +AY
Sbjct: 172 AWQYFISTGVVTAECDPYFDEAGCQHPGCEPLYPTPQCVKQCKDENQNWGNSKRFSATAY 231
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
RI S P DIMAE+Y GPVEV F VYEDFAHYKSGVYK+ITGD +GGHAVKLIGWGT ++
Sbjct: 232 RITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYKSGVYKYITGDFLGGHAVKLIGWGT-EN 290
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
G DYW++AN WN +WG DGYFKI RGSNEC IEEDVVAG+PS+KNLV +
Sbjct: 291 GTDYWLVANSWNTAWGEDGYFKIARGSNECSIEEDVVAGMPSTKNLVMD 339
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 365 bits (936), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 162/228 (71%), Positives = 195/228 (85%), Gaps = 1/228 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAF AVEALSDRFCIHF +N +LS NDL+ACCGF CG GC+GG+P+SAWR
Sbjct: 114 ILDQGHCGSCWAFAAVEALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGFPLSAWR 173
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVT+ECDPYFD+ GC+HPGCEP+YPTP+CV+ C K NQ W +SKHYS +AYRI
Sbjct: 174 YFSRRGVVTDECDPYFDNDGCNHPGCEPSYPTPRCVKNC-KDNQRWSHSKHYSANAYRIK 232
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
SDP +IMAE++ NGPVEVSF+VYEDFAHY++GVYKH+ G +GGHAVKLIGWGT+DDG D
Sbjct: 233 SDPYNIMAEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGID 292
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 239
YW++AN WN +WG GYFKI RG NECGIE D VAG+PS+KNL+++ T
Sbjct: 293 YWLIANSWNTAWGEGGYFKIARGVNECGIERDPVAGMPSAKNLIQDPT 340
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 352 bits (902), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 160/202 (79%), Positives = 178/202 (88%)
Query: 40 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 99
M++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEP
Sbjct: 1 MSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEP 60
Query: 100 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 159
AYPTPKC +KC ++NQ+W+ KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 61 AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAH 120
Query: 160 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 219
YKSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECG
Sbjct: 121 YKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECG 180
Query: 220 IEEDVVAGLPSSKNLVKEITSA 241
IEE VVAG+PS+KN+V A
Sbjct: 181 IEEGVVAGMPSTKNMVPNFGGA 202
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 337 bits (863), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 156/230 (67%), Positives = 185/230 (80%), Gaps = 2/230 (0%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
++ ++ QGHCGSCWAFGAVEAL+DRFCI N+SLS NDL+ACC CG GCDGGYP +
Sbjct: 115 IKNILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVACCS-SCGFGCDGGYPYA 173
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW YF GVVT +CDPYFD GC HPGCEP Y TP CV++CV N+ WR+SKH+++ Y
Sbjct: 174 AWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEYDTPVCVKQCVD-NEQWRDSKHFTVQTY 232
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
+NSD DI AEIYKNGPVEVS+TVYEDFAHYKSGVYKH+ G+V+GGHAVK IGWGT+DD
Sbjct: 233 AVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTTDD 292
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
G+DYWI+AN WNRSWG DG+F+I RGSNECGIE + VAG+P K +I
Sbjct: 293 GKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 342
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 336 bits (861), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 155/230 (67%), Positives = 184/230 (80%), Gaps = 2/230 (0%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
++ ++ QGHCGSCWAFGAVEAL+DRFCI N+SLS NDL+ACC CG GC+GGYP +
Sbjct: 104 IKTILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVACCS-SCGFGCEGGYPYA 162
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
AW YF GVVT +CDPYFD GC HPGCEP Y TP CV++CV N+ WR+SKH+++ Y
Sbjct: 163 AWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEYDTPVCVKQCVD-NEQWRDSKHFTVQTY 221
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
+NSD DI AEIYKNGPVEVS+TVYEDFAHYKSGVYKH+ G V+GGHAVK IGWGT+DD
Sbjct: 222 AVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTTDD 281
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
G+DYWI+AN WNRSWG DG+F+I RGSNECGIE + VAG+P K +I
Sbjct: 282 GKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIPLKKTGFSDI 331
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 335 bits (860), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 152/195 (77%), Positives = 171/195 (87%), Gaps = 2/195 (1%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCG+CWAF AVEAL DRFCIH M++SLSVNDLLACCGFLCG GC+GGYPISAWR
Sbjct: 116 ILDQGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWR 175
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVTEECDPYFD TGC HPGCEPAYPTPKC RKC +NQ W+ +KH+S++AYR++
Sbjct: 176 YFRRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVENQAWKENKHFSVNAYRVH 235
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYE--DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
S+P DIMAE+YKNGPVEV+FT + DFAHYKSGVYKHITG VMGGHAVKLIGWGTSD G
Sbjct: 236 SNPHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSDAG 295
Query: 190 EDYWILANQWNRSWG 204
EDYW+LANQWNR WG
Sbjct: 296 EDYWLLANQWNRGWG 310
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 333 bits (855), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 149/223 (66%), Positives = 180/223 (80%), Gaps = 1/223 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+L+DRFCIH ++SLS NDLLACCGF CG GC+GGYPI AW+
Sbjct: 122 ILGQGHCGSCWAFGAVESLTDRFCIHLNESVSLSENDLLACCGFECGYGCEGGYPIRAWK 181
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF H GVVT +CDPYFD GC+HPGC P Y TPKC ++CV ++ W SKH ++AY ++
Sbjct: 182 YFKHSGVVTNKCDPYFDQKGCAHPGCYPTYETPKCEKQCVD-DEFWVQSKHLGVNAYEMS 240
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
+PED+MAE+Y NGPVEV+F VYEDFAHYK+GVYKH+ G MGGHAVKLIGWGT+DDG D
Sbjct: 241 MEPEDLMAELYTNGPVEVAFEVYEDFAHYKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVD 300
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
YW + N WN +WG DG F+I RG++ECGIE + VAGLPS K L
Sbjct: 301 YWTIVNSWNTNWGEDGLFRIVRGNDECGIESNAVAGLPSRKGL 343
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 332 bits (852), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 150/223 (67%), Positives = 176/223 (78%), Gaps = 1/223 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGA E+L+DRFCIH ++SLS NDLLACCGF CGDGCDGGYPI AWR
Sbjct: 114 ILDQGHCGSCWAFGAAESLTDRFCIHMNESVSLSENDLLACCGFECGDGCDGGYPIRAWR 173
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVT +CDPYFD GC HPGC P Y TPKCV+ CV ++LW SKH S++AY ++
Sbjct: 174 YFKRTGVVTSKCDPYFDQIGCGHPGCYPTYRTPKCVKHCVD-DELWVKSKHLSVNAYEVS 232
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
+PED+MAE+Y NGP+EVSF V+EDFAHYK+GVYKH+ G +GGHAVKLIGWGT+DDG D
Sbjct: 233 KEPEDLMAELYTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVD 292
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
YW + N WN +WG G F+I RG NECGIE VAGLP K L
Sbjct: 293 YWTIVNSWNTNWGEHGLFRIARGGNECGIESYAVAGLPFDKGL 335
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 329 bits (843), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 145/223 (65%), Positives = 181/223 (81%), Gaps = 1/223 (0%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+L+DRFCIH ++SLS NDLLACCGF CGDGC+GGYPI AW+
Sbjct: 120 ILDQGHCGSCWAFGAVESLTDRFCIHLNESVSLSENDLLACCGFECGDGCEGGYPIRAWQ 179
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
YF GVVT +CDPYFD GC HPGC P Y TPKC ++CV ++LW +SKH +SAY ++
Sbjct: 180 YFKRTGVVTSKCDPYFDQKGCGHPGCYPTYDTPKCFKRCVD-DELWVSSKHLGVSAYEVS 238
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
+PE++MAE++ NGP+EV+F V+EDFAHYK+GVYKH+ G +GGHAVKL+GWGT+DDG D
Sbjct: 239 MEPEELMAELFTNGPIEVAFDVFEDFAHYKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVD 298
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
YW + N WN +WG DG F+I RG +ECGIE + VAGLPS+K L
Sbjct: 299 YWSMVNSWNTNWGEDGTFRILRGKDECGIESNAVAGLPSNKGL 341
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 319 bits (817), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 144/200 (72%), Positives = 164/200 (82%)
Query: 49 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 108
L F G GGYP+ AWRY HHGVVTEECDPYFD GCSHPGCEPAY TPKCVR
Sbjct: 9 FLHAVAFSVGLAVMGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVR 68
Query: 109 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
KCVK NQ+W+ SKH+S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHI
Sbjct: 69 KCVKGNQIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHI 128
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
TG +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGIEEDV AGL
Sbjct: 129 TGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGL 188
Query: 229 PSSKNLVKEITSADMFEDAS 248
PS+KN+ + + D D S
Sbjct: 189 PSTKNMGRWVMDMDADADVS 208
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 310 bits (793), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 140/176 (79%), Positives = 158/176 (89%)
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW SKHY +
Sbjct: 1 MGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVG 60
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG +GGHAVKLIGWGTS
Sbjct: 61 AYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTS 120
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
DDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 121 DDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 176
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 300 bits (769), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 134/166 (80%), Positives = 150/166 (90%)
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
CDGGYPISAW+YF HHGVVTEECDPYFD GCSHPGCEP Y TPKCVRKCVK NQ+W+ S
Sbjct: 1 CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
KHYS+ Y++NSDP++IM E+YKNGPVEV+F+VYEDFAHYKSGVYKHITG +GGHAVKL
Sbjct: 61 KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
GWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIEEDV A
Sbjct: 121 NGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVTA 166
>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
Length = 201
Score = 293 bits (750), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 131/161 (81%), Positives = 148/161 (91%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GC+GGYP+SAWR
Sbjct: 37 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCNGGYPLSAWR 96
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y +HGVVTEECDPYFD TGCSHPGCEPAY TPKCV+KCV NQLW+ SKHYS+SAY++
Sbjct: 97 YLSNHGVVTEECDPYFDQTGCSHPGCEPAYRTPKCVKKCVSGNQLWKKSKHYSVSAYKVK 156
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG V
Sbjct: 157 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGYV 197
>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
unguiculata]
Length = 195
Score = 292 bits (747), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 130/159 (81%), Positives = 147/159 (92%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHF +N+SLSVNDLLACCGFLCG GC+GGYP+SAWR
Sbjct: 37 ILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCNGGYPLSAWR 96
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y +HGVVTEECDPYFD TGCSHPGCEPAY TPKCV+KCV NQLW+ SKHYS+SAY++
Sbjct: 97 YLSNHGVVTEECDPYFDQTGCSHPGCEPAYRTPKCVKKCVSGNQLWKKSKHYSVSAYKVK 156
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
S+P DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKH+TG
Sbjct: 157 SNPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTG 195
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 239 bits (610), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 122/232 (52%), Positives = 152/232 (65%), Gaps = 18/232 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR+CI + + +S DLL+CCGF CGDGC+GG+P SAW+Y
Sbjct: 134 QGSCGSCWAFGAVEAMSDRYCIRSNGKIQVEISAEDLLSCCGFECGDGCNGGFPGSAWKY 193
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+VT C PY C H P C TP CV KC + +
Sbjct: 194 WNSDGLVTGGLYGSKTGCLPY-QIKPCEHHVPGDRPKCSEGGGTPSCVSKCKGNTTIHYN 252
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +S+Y + SDP I EI +GPVE +FTVY DF YKSGVYKH+TG V+GGHA+
Sbjct: 253 QDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGVYKHVTGGVLGGHAI 312
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+++GWG S++G YW++AN WN WG GYFKI RGS+ECGIE VVAG+P
Sbjct: 313 RILGWG-SENGVAYWLVANSWNTDWGDKGYFKILRGSDECGIESSVVAGIPQ 363
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 235 bits (599), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 118/241 (48%), Positives = 161/241 (66%), Gaps = 19/241 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR C+H + + +S DLL+CCG CG+GC+GG+P AW+Y
Sbjct: 104 QGSCGSCWAFGAVEAISDRICVHTNGYITIEVSAEDLLSCCGLQCGEGCNGGFPAGAWKY 163
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKCVRKC-VKKNQLW 117
++ G+V+ C PY C H G PA TPKC +KC + +
Sbjct: 164 WIKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACTGEGGDTPKCNKKCEAGYSPDY 222
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
++ KHY +AY + S ++IMAEIYKNGPVE +F VY DF YKSGVY+H+TGD++GGHA
Sbjct: 223 KDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSGVYQHVTGDMLGGHA 282
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 237
++++GWG +DG YW+ AN WN WG +G+FKI RG + CGIE ++VAG+P ++ K+
Sbjct: 283 IRVLGWGV-EDGVPYWLAANSWNTDWGDNGFFKILRGKDHCGIESEMVAGIPRTEQYWKK 341
Query: 238 I 238
I
Sbjct: 342 I 342
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 123/241 (51%), Positives = 155/241 (64%), Gaps = 30/241 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAF A E+LSDR CIH G ++ LS +L++CC CGDGC+GGYP +A +YFV
Sbjct: 120 QSTCGSCWAFAAAESLSDRICIHTGEDVRLSTENLVSCCSS-CGDGCNGGYPEAAMQYFV 178
Query: 75 HHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-----VKK-- 113
G+VT + C Y C+H P C+ PTP+C +KC VK+
Sbjct: 179 KTGLVTGDLFGDNNFCQAY-SFPPCAHHVASTKYPPCKGEVPTPECKKKCDDDSKVKRPY 237
Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
L++ K YS+S SDP+ IM EI NGPVEV+FTVYEDF YKSGVY+H+TG+
Sbjct: 238 NEDLYKGQKSYSVS-----SDPKAIMTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQ 292
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+GGHAVK+IGWG +D YW++ N WN +WG G FKI RGSNECGIE++VV LP K
Sbjct: 293 LGGHAVKMIGWGVEND-TPYWLIVNSWNETWGDQGTFKILRGSNECGIEDEVVTALPQKK 351
Query: 233 N 233
Sbjct: 352 Q 352
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 234 bits (597), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 234 bits (597), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 234 bits (597), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 LTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 116/234 (49%), Positives = 153/234 (65%), Gaps = 16/234 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNF 161
Query: 73 FVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRN 119
+ G+V+ C PY S P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKE 221
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH GDVMGGHA++
Sbjct: 222 DKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIR 281
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VAG+P +++
Sbjct: 282 ILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPRTQD 334
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 118/246 (47%), Positives = 163/246 (66%), Gaps = 18/246 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYP 66
V+ + QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP
Sbjct: 96 VKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYP 155
Query: 67 ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK 113
AW ++ G+V+ C PY C H P C TPKC + C
Sbjct: 156 AEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPG 214
Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++
Sbjct: 215 YSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEM 274
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
MGGHA++++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 275 MGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 333
Query: 233 NLVKEI 238
++I
Sbjct: 334 QYWEKI 339
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 234 bits (596), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 233 bits (595), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 119/252 (47%), Positives = 160/252 (63%), Gaps = 17/252 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR C+H +S+ V+ DLL+CCGF CG G
Sbjct: 90 WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GGYP AWRY+ G+V+ C PY G P TP+C
Sbjct: 150 CNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGETPRCS 209
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
R C + ++ KHY I++Y + ++IMAEIYKNGPVE +F VYEDF YKSGVY+
Sbjct: 210 RHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQ 269
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG+ +GGHA++L+GWG D+G YW+ AN WN WG +G+FKI RG + CGIE ++VA
Sbjct: 270 HVTGEQVGGHAIRLLGWGV-DNGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVA 328
Query: 227 GLPSSKNLVKEI 238
G+PS++ K +
Sbjct: 329 GIPSTERYWKRV 340
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 233 bits (595), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 160/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 24 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 83
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 84 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 142
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 143 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 202
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 203 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 261
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 233 bits (594), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 116/233 (49%), Positives = 157/233 (67%), Gaps = 18/233 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 86 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 145
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 146 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 204
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 205 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 264
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 265 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 316
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 233 bits (593), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 116/240 (48%), Positives = 157/240 (65%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 93 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 152
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 153 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYSPTYK 211
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +Y ++++ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 212 QDKHYGYDSYSVSNNERDIMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAI 271
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++ N WN WG +G+FKI RG + CGIE +VVAG+P + + I
Sbjct: 272 RILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWRNI 330
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 233 bits (593), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 116/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGP E +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 232 bits (592), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 116/233 (49%), Positives = 157/233 (67%), Gaps = 18/233 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 25 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 84
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 85 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 143
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 144 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 203
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 204 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 255
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 232 bits (592), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 159/240 (66%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSRCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 232 bits (592), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 116/233 (49%), Positives = 157/233 (67%), Gaps = 18/233 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 23 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 82
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 83 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 141
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 142 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 201
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 202 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 253
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 118/246 (47%), Positives = 158/246 (64%), Gaps = 18/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 17 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 76
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GGYP AW ++ G+V+ C PY C H P C TPKC
Sbjct: 77 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPCTGEGDTPKCN 135
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYK
Sbjct: 136 KMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYK 195
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VA
Sbjct: 196 HEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVA 254
Query: 227 GLPSSK 232
G+P ++
Sbjct: 255 GIPRTQ 260
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 118/246 (47%), Positives = 158/246 (64%), Gaps = 18/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 11 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 70
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GGYP AW ++ G+V+ C PY C H P C TPKC
Sbjct: 71 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPCTGEGDTPKCN 129
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYK
Sbjct: 130 KMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYK 189
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VA
Sbjct: 190 HEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVA 248
Query: 227 GLPSSK 232
G+P ++
Sbjct: 249 GIPRTQ 254
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/246 (47%), Positives = 157/246 (63%), Gaps = 16/246 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TPKC +
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 MCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSSKN 233
+P ++
Sbjct: 329 IPRTQQ 334
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 232 bits (591), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 119/235 (50%), Positives = 153/235 (65%), Gaps = 16/235 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH LS DLL+CCG++CG+GC+GG+P +AW Y
Sbjct: 117 QGSCGSCWAFGASEAISDRTCIHSNAAFTFDLSSEDLLSCCGYVCGNGCNGGFPQAAWEY 176
Query: 73 FVHHGVVT------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRN 119
+V +G+V+ C PY + G P TPKC KCV +
Sbjct: 177 WVQNGLVSGGLYHGTGCQPYAIEPCEHHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQ 236
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY AYRI ++ + IM EIYKNGPVE +F VYEDF YKSGVY H TG +GGHA++
Sbjct: 237 DKHYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIR 296
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
++GWG ++GE YW+ N WN WG +G+FKIKRG NECGIE ++V G+P+S++L
Sbjct: 297 VLGWG-EENGEKYWLCGNSWNTDWGNNGFFKIKRGVNECGIESEMVGGIPASESL 350
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 232 bits (591), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/246 (47%), Positives = 157/246 (63%), Gaps = 16/246 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TPKC +
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 MCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSSKN 233
+P ++
Sbjct: 329 IPRTQQ 334
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 231 bits (590), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/246 (47%), Positives = 157/246 (63%), Gaps = 16/246 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 22 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 81
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TPKC +
Sbjct: 82 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNK 141
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 142 MCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 201
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 202 EAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 260
Query: 228 LPSSKN 233
+P ++
Sbjct: 261 IPRTQQ 266
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 231 bits (589), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 117/252 (46%), Positives = 161/252 (63%), Gaps = 18/252 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KHY S+Y ++ + ++IMAEIYKNGPVE +FTVY DF YKSGVY+
Sbjct: 209 KICEPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG++MGGHAV+++GWG +DG YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVA 327
Query: 227 GLPSSKNLVKEI 238
G+P + K+I
Sbjct: 328 GIPCTDQYWKKI 339
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 231 bits (589), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 114/240 (47%), Positives = 155/240 (64%), Gaps = 17/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR C+H +N+ +S DLL+CCGF CG GC+GGYP AW+Y
Sbjct: 92 QGSCGSCWAFGAVEAISDRVCVHTNGKVNVEISAEDLLSCCGFECGMGCNGGYPSGAWKY 151
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY + G P TP+CV+KC ++
Sbjct: 152 WTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGTRPPCSGEGGETPECVKKCEDGYTPAYK 211
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +++Y I ++IMAEIYKNGPVE +F VY DF YKSGVY+H++G+ +GGHA+
Sbjct: 212 QDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAI 271
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG D+G YW+ AN WN WG DG+F+I RG + CGIE ++VAG+P + K +
Sbjct: 272 RILGWGV-DNGTPYWLAANSWNTDWGEDGFFRILRGQDHCGIESEIVAGIPKTSEYWKML 330
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 231 bits (589), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 115/233 (49%), Positives = 154/233 (66%), Gaps = 18/233 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 114 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 173
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C ++
Sbjct: 174 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYTPTYK 232
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 233 QDKHYGYNSYSVSNSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 292
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+++GWG ++G YW++ N WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 293 RILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 344
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 231 bits (588), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 119/232 (51%), Positives = 155/232 (66%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF-GMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA++DR CI G N + +S DLL CC CG GC+GGYP SAW +
Sbjct: 101 QGECGSCWAFGAAEAMTDRICIATKGKNQVRISTEDLLTCCD-SCGFGCNGGYPQSAWEF 159
Query: 73 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKK-NQLW 117
F G+VT PY GC S C + PTPKC + C K N +
Sbjct: 160 FKTKGIVTG--GPYNSHKGCQPYAIPACDHHVPHSKNPCNGSLPTPKCEKVCEKGYNITY 217
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+N KHY +++Y IN+D +IM EI NGPVE +FTV+ DF +YKSGVY+H++G+ +GGHA
Sbjct: 218 KNDKHYGVTSYSINNDQNEIMREIMTNGPVEAAFTVFADFPNYKSGVYQHVSGEELGGHA 277
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+K++GWG ++ YW++AN WN SWG +G+FKI RGS+ECGIE++VVAGLP
Sbjct: 278 IKILGWGVENN-TPYWLVANSWNPSWGDNGFFKILRGSDECGIEDEVVAGLP 328
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 230 bits (587), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 115/239 (48%), Positives = 159/239 (66%), Gaps = 16/239 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS WAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 29 QGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNF 88
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH-----PGCEPAYPTPKCVRKCVKK-NQLWRN 119
+ G+V+ C PY +H P C TPKC + C + ++
Sbjct: 89 WTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKICEPGYSPTYKQ 148
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++
Sbjct: 149 DKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIR 208
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 209 ILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 266
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 229 bits (583), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 117/239 (48%), Positives = 155/239 (64%), Gaps = 17/239 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q CGSCWAFGA EA+SDR CIH + + +S DLL CC CG GC+GGYP
Sbjct: 101 IHLIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVDISAEDLLDCCDS-CGAGCNGGYP 159
Query: 67 ISAWRYFVHHGVVT-------EECDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK- 113
+AW Y+ G+VT + C PY + T S P C PTPKCV C K
Sbjct: 160 AAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCVHLCRKGY 219
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ +++ KH+ Y I+SD + I EI+KNGPVE FTVY DF YKSGVY+H +GDV+
Sbjct: 220 GKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHQSGDVL 279
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GGHA++++GWGT ++G YW++AN WN WG GYFKI RG +ECGIE+D+ AG+P ++
Sbjct: 280 GGHAIRILGWGT-ENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKNE 337
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 115/243 (47%), Positives = 153/243 (62%), Gaps = 16/243 (6%)
Query: 1 MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG 60
M + + ++ + QG CGSCWAFGAVE++SDRFCIHF + +S DL+ACC CG G
Sbjct: 86 MQWPDCPTIKEIRDQGACGSCWAFGAVESMSDRFCIHFNQSAHISAEDLMACCE-TCGMG 144
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDST------GCSHPGCEPAYPTPKCV 107
C+GGY +AWRYF H G+VT E C PY ++ G P TP+C
Sbjct: 145 CNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQPCASKEEHTPRCS 204
Query: 108 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + + KH+ SAY + S E I EI NGPVE +FTVY DF YKSGVY+
Sbjct: 205 KTCEAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVYADFPTYKSGVYQ 264
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H +G ++GGHA++++GWGT ++G YW++AN WN WGA GYFKI RG ++CGIE + A
Sbjct: 265 HTSGAMLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGAMGYFKIIRGKDDCGIESQITA 323
Query: 227 GLP 229
G+P
Sbjct: 324 GMP 326
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 117/239 (48%), Positives = 156/239 (65%), Gaps = 17/239 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q CGSCWAFGA EA+SDR CIH G+ +++S DLL CC CG GCDGGYP
Sbjct: 101 INLIRDQSTCGSCWAFGAAEAMSDRVCIHSEGGIQVNISAEDLLDCCDS-CGAGCDGGYP 159
Query: 67 ISAWRYFVHHGVVTEE-------CDPYFDS-----TGCSHPGCEPAYPTPKCVRKCVKK- 113
+AW Y+ G+V++ C PY + T S P C PTPKCV C K
Sbjct: 160 AAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCVHLCRKGY 219
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ +++ KH+ Y I+S+ + I EI+KNGPVE FTVY DF YKSGVY+H +GDV+
Sbjct: 220 GKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGVYQHHSGDVL 279
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GGHA++++GWGT ++G YW++AN WN WG GYFKI RG +ECGIE+D+ AG+P +
Sbjct: 280 GGHAIRILGWGT-ENGTPYWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAGIPKDE 337
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 117/247 (47%), Positives = 157/247 (63%), Gaps = 18/247 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGS WAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 73 WSNCPTIAQIRDQGSCGSSWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 132
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GGYP AW ++ G+V+ C PY C H P C TPKC
Sbjct: 133 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CEHHVNGARPPCTGEGDTPKCN 191
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYK
Sbjct: 192 KMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYK 251
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N CGIE ++VA
Sbjct: 252 HEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDNGFFKILRGENHCGIESEIVA 310
Query: 227 GLPSSKN 233
G+P ++
Sbjct: 311 GIPRTQQ 317
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 228 bits (582), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 119/231 (51%), Positives = 147/231 (63%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWAFGAVEA++DR CI +S DLL CC F CGDGC+GGYP +AW Y
Sbjct: 112 QANCGSCWAFGAVEAMTDRTCIASKGAQTPHISAEDLLTCCTFTCGDGCNGGYPAAAWEY 171
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
+ + G+VT + C PY +TG P C PTP C R C + N +
Sbjct: 172 WKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKP-CGDIVPTPACKRSCRQGYNVTYP 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
N KH+ S+Y + + I EI NGPVE +FTVY DF YKSGVY+H +G +GGHA+
Sbjct: 231 NDKHFGASSYGVRG-VDQIATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAI 289
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K+IGWG DG DYWI+AN WN SWG DG+F IK+G++ECGIE VVAGLP
Sbjct: 290 KIIGWGVQ-DGTDYWIVANSWNDSWGNDGFFWIKKGTDECGIESQVVAGLP 339
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 228 bits (582), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/237 (49%), Positives = 152/237 (64%), Gaps = 19/237 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSL--SVNDLLACCGFLCGDGCDGGYP 66
++ + QG CGSCWAFGA EA+SDR CIH G +SL S DLL+CC CG GC GGYP
Sbjct: 93 IQQIRDQGSCGSCWAFGAAEAISDRLCIHSGSKISLEISAEDLLSCCD-ECGMGCSGGYP 151
Query: 67 ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK 113
SAW ++ G+VT C PY + C H P C+ TPKC +KC+
Sbjct: 152 SSAWEFWTKKGLVTGGLCGSEVGCRPYSIAP-CEHHVNGTRPPCQGTQETPKCEKKCIDG 210
Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ KH+ +Y + S E IM E+YKNGPVE +FTVY DF YK+GVY+H+TG+V
Sbjct: 211 YLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYADFLLYKTGVYQHVTGEV 270
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GGHA+K++GWG + G YW+ AN WN WG G+FKIKRG++ECGIE ++VAG P
Sbjct: 271 LGGHAIKILGWG-EESGTPYWLAANSWNGDWGDKGFFKIKRGNDECGIESEMVAGTP 326
>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
Length = 142
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 102/134 (76%), Positives = 120/134 (89%)
Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
+KC +NQ+W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKH
Sbjct: 1 KKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKH 60
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
ITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG
Sbjct: 61 ITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAG 120
Query: 228 LPSSKNLVKEITSA 241
+PS+KN+V+ SA
Sbjct: 121 MPSTKNMVRNYDSA 134
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 228 bits (580), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 115/247 (46%), Positives = 155/247 (62%), Gaps = 17/247 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI + +N+ +S DLL CCGF CG+G
Sbjct: 113 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGNVNVEVSAEDLLTCCGFQCGEG 172
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY G P TPKC
Sbjct: 173 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGSTPKCS 232
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
R C ++ KH+ S+Y + S +IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 233 RICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGVYQ 292
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG++MGGHAV+++GWG +DG YW++ N WN WG G+FKI RG + CGIE ++VA
Sbjct: 293 HVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDSGFFKILRGQDHCGIESEIVA 351
Query: 227 GLPSSKN 233
GLP ++
Sbjct: 352 GLPCTEQ 358
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 227 bits (579), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 118/230 (51%), Positives = 145/230 (63%), Gaps = 19/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA++DR CI + N LS DL +CC CG GC+GGYP +AW Y
Sbjct: 239 QGSCGSCWAFGAAEAMTDRICIASNGQNNFYLSAEDLTSCCDS-CGMGCEGGYPSAAWDY 297
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F G+VT + C PY TG P C PTP C C + N W +
Sbjct: 298 FQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP-CGDIQPTPACANSC-QNNATWSS 355
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KH+ S+Y + +D + IM EIY NGPVE S+ VY DF YKSGVY+H+TGD +GGHAVK
Sbjct: 356 DKHFGASSYSVGTDQQSIMTEIYTNGPVEASYDVYADFVSYKSGVYQHVTGDYLGGHAVK 415
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+IGWG D YWI+AN WN WG +G+F I RGS+ECGIE+ +VAG+P
Sbjct: 416 IIGWGV-DGSTPYWIVANSWNNDWGNNGFFNILRGSDECGIEDGIVAGIP 464
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 227 bits (579), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 118/239 (49%), Positives = 150/239 (62%), Gaps = 22/239 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC--GFLCGDGCDGGYPISAW 70
Q CGSCWA A E +SDR CI +N+ +S DLL+CC G+ CGDGC+GGYPI AW
Sbjct: 97 QSDCGSCWAVAAAETISDRTCIASNGEVNVLISAEDLLSCCTGGYNCGDGCEGGYPIQAW 156
Query: 71 RYFVHHGVVT-------EECDPYFDS------TGCSHPGCEP-AYPTPKCVRKCVKKNQL 116
RY+VH+G+VT C PY + G + P C TP+CV++C K+
Sbjct: 157 RYWVHNGLVTGGSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDY 216
Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ KHY SAY I + I EI +NGPVEV F VY DF YKSG+YKH+ G +
Sbjct: 217 AVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGREL 276
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GGHAVK++GWG ++G YW+ AN WN +WG GYF+I+RG+NECGIE VVAG+P K
Sbjct: 277 GGHAVKILGWGV-ENGTPYWLAANSWNVNWGEKGYFRIRRGTNECGIESSVVAGIPDLK 334
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 227 bits (579), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 114/232 (49%), Positives = 146/232 (62%), Gaps = 24/232 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q +CGSCWAFGAVE+L+DR CIH G ++ LS ++L CC CG GC+GGYP SA Y+V
Sbjct: 115 QSNCGSCWAFGAVESLTDRHCIHLGQDIRLSAQNMLTCCA-TCGQGCNGGYPASAMSYYV 173
Query: 75 HHGVVTEECDPYFDSTG---------CSH-------PGCEPAYPTPKCVRKC-VKKNQLW 117
G+VT + +++TG C+H P C PTPKC + C Q +
Sbjct: 174 KTGLVTGD---LYNTTGWCQAYSFAPCAHHVDTPLYPACTGELPTPKCAKTCDSGSGQTY 230
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ H AY + E IM EI NGPVE +FTVYEDF +YKSGVYKH+TG +GGHA
Sbjct: 231 --TVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHA 288
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+K++GWG ++ YWI+ N WN++WG +G FKI RG NECGIE VV LP
Sbjct: 289 IKIVGWGVENN-TPYWIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTALP 339
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 227 bits (578), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 158/248 (63%), Gaps = 18/248 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C ++ KH+ S+Y I+ + ++IMAEIYKNGPVE +FTVY DF YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TGD+MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLPSSKNL 234
G+P + +
Sbjct: 328 GIPCTPHF 335
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 227 bits (578), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 113/231 (48%), Positives = 143/231 (61%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVE+++DR CI +L +S DL+ CC F CG GC GGYP +AW +
Sbjct: 111 QAACGSCWAFGAVESMTDRICIASKGSLRPHISAQDLMTCCLFTCGSGCSGGYPSAAWSW 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
F G+VT + C PY C H P C PTP C + C N +
Sbjct: 171 FKTTGIVTGGNYNSSQGCQPY-SLPNCDHHVSGQYPACSGEGPTPACKKSCEAGYNNTYS 229
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
N KH+ +AY + + + I EI NGPVE +FTVYED YKSGVY+H TG V+GGHA+
Sbjct: 230 NDKHFGATAYSVAGEADKIATEIMTNGPVEGAFTVYEDLLTYKSGVYQHTTGQVLGGHAI 289
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K+IGWG + G DYW +AN WN WG +G+FKIK+G +ECGIE +VAG+P
Sbjct: 290 KIIGWGV-ESGVDYWWVANSWNNDWGDNGFFKIKKGVDECGIESQIVAGMP 339
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 226 bits (577), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 158/248 (63%), Gaps = 18/248 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C ++ KH+ S+Y I+ + ++IMAEIYKNGPVE +FTVY DF YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TGD+MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLPSSKNL 234
G+P + +
Sbjct: 328 GIPCTPHF 335
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 116/246 (47%), Positives = 160/246 (65%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL+CCG LCG+G
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAMSDRLCIHTNGHVNVEVSAEDLLSCCGPLCGEG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
C+GGYP AW+Y+ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPY-SIPPCEHHVNGTRPKCTGEGGDTPKC 208
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+ C + ++ K+Y S+Y + S ++IMAEIYKNGPVE +F+V+ DF YKSGVY
Sbjct: 209 SKTCEPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGVY 268
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
KH+ G+V+GGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE +VV
Sbjct: 269 KHVAGEVLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGDNGFFKILRGEDHCGIESEVV 327
Query: 226 AGLPSS 231
AG+P +
Sbjct: 328 AGIPRT 333
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 226 bits (576), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 157/247 (63%), Gaps = 17/247 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR C+H +S+ V+ DLL+CCGF CG G
Sbjct: 90 WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GGYP AWRY+ G+V+ C PY G P TP+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCTGEGGETPRCS 209
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
R C + ++ KHY I++Y + ++IMAEIYKNGPVE +F VYEDF YKSGVY+
Sbjct: 210 RHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQ 269
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G+ +GGHA++++GWG ++G YW+ AN WN WG +G+FKI RG + CGIE ++VA
Sbjct: 270 HVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVA 328
Query: 227 GLPSSKN 233
G+P ++
Sbjct: 329 GVPRTEQ 335
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 226 bits (576), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 157/244 (64%), Gaps = 16/244 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N ++ + QG CGSCWAFGAV A+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIKQIRDQGSCGSCWAFGAVGAMSDRLCIHTNGHVNVEVSAEDLLTCCGSQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW +++ G+V+ C PY S P C TPKC +
Sbjct: 150 CNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEHHVNGSRPQCTGEGDTPKCTK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KHY ++Y ++++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GD+MGGHA++++GWG ++ YW++AN WN WG +G FKI RG + CGIE ++VAG
Sbjct: 270 EAGDIMGGHAIRILGWGV-ENSVPYWLVANSWNVDWGDNGLFKILRGEDHCGIESEIVAG 328
Query: 228 LPSS 231
+P +
Sbjct: 329 IPRT 332
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 226 bits (576), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 122/245 (49%), Positives = 153/245 (62%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
+ N V+ + QG CGSCWAFGAVEA+SDR CI + +N +S DLLACC CG+G
Sbjct: 105 WPNCPTVKEVRDQGDCGSCWAFGAVEAMSDRICIASNGKVNAEISAEDLLACCSS-CGEG 163
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKC 106
C GG+P AWRY+ G+VT + C PY C H P + TPKC
Sbjct: 164 CQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYM-IPACDHHVVGHLQPCPKEEAKTPKC 222
Query: 107 VRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+KC N +++ KHY ++Y ++S E IM EI NGPVE +FTVYEDF YKSGVY
Sbjct: 223 SKKCEANYNVTYKDDKHYGKNSYSVDSV-EKIMTEIMTNGPVEAAFTVYEDFLSYKSGVY 281
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+H TG +GGHAVK++GWG D+G YWI+AN WN WG G+F I RG +ECGIE +V
Sbjct: 282 QHRTGQELGGHAVKILGWG-EDNGTPYWIVANSWNPDWGNQGFFNILRGKDECGIESQIV 340
Query: 226 AGLPS 230
AGLP
Sbjct: 341 AGLPK 345
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 156/244 (63%), Gaps = 16/244 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TP+C +
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GD+MGGHA++++GWG ++G YW+ AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSS 231
+P +
Sbjct: 329 IPRT 332
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 113/246 (45%), Positives = 161/246 (65%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR C+H N+ +S DLL+CCG CGDG
Sbjct: 91 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICVHSNGNANVEVSAEDLLSCCGSECGDG 150
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKC 106
C+GG+P AW ++ G+V+ C PY C H G PA TP C
Sbjct: 151 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACTGEEGDTPTC 209
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+KC + + +++ K+Y ++Y + S ++IMAEIYKNGPVE +F+VYEDF HYKSGVY
Sbjct: 210 RKKCEEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGVY 269
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+H+ G+++GGHA++++GWG ++G YW+ AN WN WG +G+FK RG N CGIE +++
Sbjct: 270 QHVAGEMLGGHAIRILGWGV-ENGIRYWLAANSWNIDWGDNGFFKFLRGKNHCGIESEII 328
Query: 226 AGLPSS 231
AG+P +
Sbjct: 329 AGIPRT 334
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 116/233 (49%), Positives = 144/233 (61%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAFGA E+LSDR CIH G ++ LS +LL CC CGDGCDGG+P +A Y+V
Sbjct: 116 QSTCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLLTCCA-ACGDGCDGGWPEAAMDYYV 174
Query: 75 HHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LW 117
+ G+VT + C Y + C+H P C PTP C+ C + +
Sbjct: 175 NTGLVTGDLYGNNSWCQAYTFAP-CAHHVTSDIYPPCTGELPTPPCINSCDSNSTHTIPY 233
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
H AY I D + IMAEIYKNGP+EV+ TVYEDF YK+GVY+H+TGD +GGHA
Sbjct: 234 SKDIHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHA 293
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
VK++GWG ++G YW + N WN SWG G FKI RG NECGIE V LP+
Sbjct: 294 VKMVGWGV-ENGTPYWTIVNSWNESWGDKGTFKILRGKNECGIESSCVTALPA 345
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 225 bits (573), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 118/236 (50%), Positives = 145/236 (61%), Gaps = 22/236 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
Q CGSCWAF A EA+SDR CI + +N LS DLL+CC F CG+GC+GGYPI AW
Sbjct: 104 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAW 163
Query: 71 RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
+++V HG+VT C PY + G P C E PTPKCV C KN
Sbjct: 164 KWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNY 223
Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ KH+ +AY + E I EI NGP+EV+FTVYEDF Y +GVY H G +
Sbjct: 224 ATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHAVK++GWG D+G YW++AN WN +WG GYF+I RG NECGIE VAG+P
Sbjct: 284 GGHAVKILGWGV-DNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 118/245 (48%), Positives = 158/245 (64%), Gaps = 19/245 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG+CGSCWAFGA EA+SDR CI G ++L +S DLL CC CG G
Sbjct: 89 WPNCPTLKQIRDQGNCGSCWAFGAAEAISDRICIQSGGKISLEISAEDLLTCCD-ECGMG 147
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C GG+P +AW ++ + G+VT C PY + C H P C+ TPKCV
Sbjct: 148 CFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAP-CEHHVNGSRPPCQGEVETPKCV 206
Query: 108 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+C L + KH+ +Y I S E IM E+YKNGPVE +F+VY DF YK+GVY+
Sbjct: 207 TQCNNGYSLSYPKDKHFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQ 266
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TGD++GGHAVK++GWG ++G YW++AN WN WG G+FKIKRG++ECGIE ++VA
Sbjct: 267 HVTGDMLGGHAVKILGWG-EENGTPYWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVA 325
Query: 227 GLPSS 231
G P S
Sbjct: 326 GAPLS 330
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 114/252 (45%), Positives = 158/252 (62%), Gaps = 18/252 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICILTNGHVNVEVSAEDMLTCCGDQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C ++ KHY ++Y +++ ++IMAEIYKNGPVE +F+V+ DF YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAFSVFSDFLQYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG++MGGHAV+++GWG +D YW++ N WN WG G+FKI RG + CGIE +VVA
Sbjct: 269 HVTGEMMGGHAVRILGWGVEND-TPYWLVGNSWNTDWGDHGFFKILRGRDHCGIESEVVA 327
Query: 227 GLPSSKNLVKEI 238
G+P ++ K I
Sbjct: 328 GIPCTEQYWKRI 339
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 224 bits (572), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/231 (51%), Positives = 154/231 (66%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVE++SDR CIH G + L+ +D+L+CC + CG GC+GG+P +AW Y
Sbjct: 110 QGACGSCWAFGAVESMSDRHCIHSGAKNIVHLAADDVLSCC-WGCGSGCNGGFPGAAWSY 168
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+V G+VT E C PY C H C PTPKCVR C K N ++
Sbjct: 169 WVEKGIVTGGNYDTDEGCMPY-PVPSCDHHVNGTLGPCGQDPPTPKCVRLCRKGYNIDFK 227
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ KHY S+Y ++S+ I EI KNGPVE +FTVY DF YKSGVYK + D +GGHA+
Sbjct: 228 DDKHYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAI 287
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++G +W++AN WN WG GYFKI RGSNECGIEED+VAG+P
Sbjct: 288 RILGWGV-ENGVPFWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIP 337
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 112/243 (46%), Positives = 158/243 (65%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDMLTCCGSECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + +++ KH+ S+Y ++S+ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 209 KICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG +D YW++ N WN WG G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVA 327
Query: 227 GLP 229
G+P
Sbjct: 328 GMP 330
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 112/243 (46%), Positives = 158/243 (65%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSKGRVNVEVSAEDMLTCCGSECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + +++ KH+ S+Y ++S+ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 209 KICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG +D YW++ N WN WG G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDKGFFKILRGQDHCGIESEIVA 327
Query: 227 GLP 229
G+P
Sbjct: 328 GMP 330
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 224 bits (570), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 114/233 (48%), Positives = 152/233 (65%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR C+H +S+ V+ DLL+CCGF CG GC+GGYP AWRY
Sbjct: 102 QGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRY 161
Query: 73 FVHHGVVTEECDPYFDSTGC---SHPGCE------------PAYPTPKCVRKCVKK-NQL 116
+ G+V+ Y GC + P CE TP+C R C +
Sbjct: 162 WTERGLVSGGL--YDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPS 219
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KHY I++Y + ++IMAEIYKNGPVE +F VYEDF YKSGVY+H++G+ +GGH
Sbjct: 220 YKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGH 279
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWG ++G YW+ AN WN WG G+FKI RG + CGIE ++VAG+P
Sbjct: 280 AIRILGWGV-ENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAGVP 331
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 223 bits (569), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 118/236 (50%), Positives = 147/236 (62%), Gaps = 22/236 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFL--CGDGCDGGYPISAW 70
Q CGSCWAF A EA+SDR CI + +N LS DLL+CC L CG+GC+GGYPI AW
Sbjct: 105 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSQDLLSCCTGLLSCGNGCEGGYPIQAW 164
Query: 71 RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
+++V HG+VT C PY + G + P C + PTPKCV C N
Sbjct: 165 KWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKCVEACTSNNTY 224
Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ KH+ +AY + E I EI KNGPVEV+FTVYEDF Y +GVY H +G +
Sbjct: 225 PTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVAFTVYEDFYQYTTGVYVHTSGASL 284
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHAVK++GWG D+G YW++AN WN +WG GYF+I RG NECGIE VAG+P
Sbjct: 285 GGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIEHSAVAGIP 339
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 223 bits (568), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 119/230 (51%), Positives = 145/230 (63%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA +DR CI N +S DLL CCGF CG GC+GG AW +
Sbjct: 100 QGSCGSCWAFGAVEAFTDRICIQSNGAKNPHISAEDLLTCCGFWCGFGCNGGRLGPAWNF 159
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
F + G VT E C PY ++G P CE + PTPKC R C + N +
Sbjct: 160 FKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP-CEGSEPTPKCKRSCREGYNVSYS 218
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ KH S Y I +D E I EIY NGPVE +FTVY DF +YKSGVYK+ TG+ +GGHA+
Sbjct: 219 DDKHKVSSHYSIANDEEQIKNEIYLNGPVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAI 278
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
K++GWG ++ YW++AN WN WG G+FKI RGSNECGIE VVAG+
Sbjct: 279 KILGWGVENN-VPYWLVANSWNPDWGDKGFFKILRGSNECGIEASVVAGM 327
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 223 bits (568), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 119/231 (51%), Positives = 148/231 (64%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH +N+ +S DLL CC CG GC+GG+P SAW Y
Sbjct: 105 QGSCGSCWAFGAAEAMSDRHCIHSNGKVNVEISAEDLLTCCD-SCGMGCNGGFPGSAWEY 163
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+V G+VT C PY ++ C H P C TP+CV C K N +R
Sbjct: 164 WVDKGLVTGGLYNSHVGCQPYTIAS-CEHHTKGKLPPCGDIVDTPQCVHMCEKGYNVSYR 222
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K++ +Y I+ + I EI NGPVE +FTVY DF YKSGVY+H+TG+ MGGHAV
Sbjct: 223 ADKYFGKKSYSIDEQEDQIKTEISTNGPVEAAFTVYADFVTYKSGVYRHVTGEEMGGHAV 282
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWGT + G YW++AN WN WG GYFKI RGS+ECGIE +VAGLP
Sbjct: 283 RILGWGT-ESGTPYWLVANSWNTDWGDKGYFKILRGSDECGIESSIVAGLP 332
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 223 bits (567), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 115/245 (46%), Positives = 145/245 (59%), Gaps = 19/245 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
++ + + + Q HCGSCWA A E +SDR CIH +N+ LS D+L+CCG CG G
Sbjct: 105 WSQCDSIRTIRDQSHCGSCWAVSAAETMSDRTCIHSDGKINVGLSATDILSCCGTTCGRG 164
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPK 105
C GGYPI AWRYF+ HGV T + C PY C H E Y PTP+
Sbjct: 165 CRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHP-CGHHRNEIYYGECPKEIFPTPQ 223
Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C + C + + K Y SAY + ++ + I EI NGPV+ +F VYEDF+ Y+SG+
Sbjct: 224 CTQSCQAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRSGI 283
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
Y H G GGHAVKLIGWG DDG YW+ AN WN WG +GYF+I RG + CGIE V
Sbjct: 284 YVHTAGRREGGHAVKLIGWGVDDDGNKYWLAANSWNSDWGENGYFRIVRGVDHCGIESAV 343
Query: 225 VAGLP 229
VAG+P
Sbjct: 344 VAGMP 348
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 223 bits (567), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 112/238 (47%), Positives = 151/238 (63%), Gaps = 19/238 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q CGSCWAFGA EA+SDR CIH + +++S DLL CC CG GC+GGYP
Sbjct: 100 IHVIRDQSTCGSCWAFGATEAMSDRVCIHSKGKVQVNISAEDLLTCCD-SCGAGCNGGYP 158
Query: 67 ISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK 113
+AW ++ G+VT + C PY+ C H P C PTP+CVR C K
Sbjct: 159 AAAWEFYKTDGIVTGGLYGTDDGCQPYYFPP-CEHHTVGPLPNCTGIKPTPQCVRDCRKG 217
Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ + KHY+ Y +++D I EI+KNGPVE FTVY DF YKSGVY+ + D
Sbjct: 218 YEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVEADFTVYADFVSYKSGVYQRHSDDA 277
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+GGHA++++GWGT ++G YW++AN WN WG GYFKI RG++ECGIE+D+ AG+P
Sbjct: 278 LGGHAIRILGWGT-ENGVPYWLVANSWNEDWGDKGYFKILRGNDECGIEDDINAGIPK 334
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 223 bits (567), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 116/253 (45%), Positives = 160/253 (63%), Gaps = 19/253 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CC CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRVCIHTNGNVNVEVSAEDLLTCCHMECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
C+GG+P AW ++ G+V+ C PY C H P C+ TPKC
Sbjct: 150 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCKGEGGETPKC 208
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+ C + ++ KHY S+Y + S ++IMAEIYKNGPVE +F+VY DF YKSGVY
Sbjct: 209 SKTCEPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNGPVEGAFSVYTDFLVYKSGVY 268
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+H+TG+ +GGHA++++GWG ++G YW+ AN WN WG +G+FKI RG + CGIE ++V
Sbjct: 269 QHVTGEEVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIV 327
Query: 226 AGLPSSKNLVKEI 238
AG+P + K+I
Sbjct: 328 AGIPRTDQYWKKI 340
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 222 bits (566), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 117/243 (48%), Positives = 151/243 (62%), Gaps = 19/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+ N + + + QG CGSCWAFGAVE++SDR CIH S +S DLL+CC CG G
Sbjct: 85 WPNCKTLSQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLLSCCD-QCGFG 143
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
C GG+P AW Y+ G+VT C PY C H P C TPKC
Sbjct: 144 CSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPY-SIAPCEHHVNGTRPPCSGEQDTPKCT 202
Query: 108 RKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
C+ K + ++ KH+ Y + SD + IM E+Y NGPVE +FTVYEDF YKSGVY+
Sbjct: 203 GVCIPKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQ 262
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG +GGHAVK++GWG ++G +W++AN WN WG +GYFKI RG +ECGIE ++VA
Sbjct: 263 HLTGSALGGHAVKILGWG-EENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVA 321
Query: 227 GLP 229
GLP
Sbjct: 322 GLP 324
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 222 bits (566), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 155/244 (63%), Gaps = 16/244 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TP+C +
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GD+MGGHA++++ WG ++G YW+ AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILVWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSS 231
+P +
Sbjct: 329 IPRT 332
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 154/244 (63%), Gaps = 16/244 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C TP+C +
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KH+ ++Y +++ ++IMAEIYKN PVE +FTV+ DF YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GD+MGGHA++++GWG +G YW+ AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILGWGVG-NGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSS 231
+P +
Sbjct: 329 IPRT 332
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 115/244 (47%), Positives = 151/244 (61%), Gaps = 19/244 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGA EA+SDR CIH ++ +S DLL+CC CG G
Sbjct: 88 WPNCPTIQDIRDQGSCGSCWAFGAAEAISDRLCIHSNAKITVEISAEDLLSCCE-ECGMG 146
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
C GGYP +AW Y+ G+VT + C PY C H P C+ TPKC
Sbjct: 147 CFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPY-SIPPCEHHVNGTRPPCQGEGDTPKCQ 205
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
KC+ + K++ Y + S E IM E+YKNGPVE +F+VYEDF YKSGVY+
Sbjct: 206 TKCIDGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSGVYQ 265
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TGD++GGHA+K++GWG ++ YW+ AN WN WG G+FKI RG +ECGIE +VVA
Sbjct: 266 HLTGDMLGGHAIKILGWG-KENNTPYWLAANSWNTDWGNQGFFKILRGGDECGIESEVVA 324
Query: 227 GLPS 230
G+P
Sbjct: 325 GIPQ 328
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 222 bits (565), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 116/246 (47%), Positives = 159/246 (64%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR C+H +N+ +S DLL+CCGF CG G
Sbjct: 90 WPNCPTIREVRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFECGMG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKC 106
C+GGYP AW+++ G+V+ C PY C H G PA TPKC
Sbjct: 150 CNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPY-SIPPCEHHVNGSRPACKGEEGDTPKC 208
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
V++C ++ + KH+ ++Y + S ++IMAEIYKNGPVE +F VY DF YKSGVY
Sbjct: 209 VKQCEDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADFPMYKSGVY 268
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+H TG+ +GGHA+K++GWG ++G YW+ AN WN WG +G+FKI RG + CGIE ++V
Sbjct: 269 QHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIV 327
Query: 226 AGLPSS 231
AG+P +
Sbjct: 328 AGIPKN 333
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 222 bits (565), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 112/237 (47%), Positives = 155/237 (65%), Gaps = 18/237 (7%)
Query: 18 CGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVH 75
C WAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++
Sbjct: 11 CRMSWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 70
Query: 76 HGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSK 121
G+V+ C PY C H P C TPKC + C + ++ K
Sbjct: 71 KGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDK 129
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
HY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++
Sbjct: 130 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 189
Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 190 GWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 245
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 222 bits (565), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 155/244 (63%), Gaps = 16/244 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVR 108
C+GGYP AW ++ G+V+ C PY S P C T +C +
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTHRCNK 209
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ KH+ ++Y +++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH
Sbjct: 210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GD+MGGHA++++GWG ++G YW+ AN WN WG +G+FKI RG N CGIE ++VAG
Sbjct: 270 EAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAG 328
Query: 228 LPSS 231
+P +
Sbjct: 329 IPRT 332
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 110/243 (45%), Positives = 157/243 (64%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CC CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCDGECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLP 229
G+P
Sbjct: 328 GMP 330
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 150/248 (60%), Gaps = 22/248 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC--GFLCG 58
F+ V + Q HCGSCWA A EA+SDR CI +N LS D+L CC + CG
Sbjct: 91 FSQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGVVNTLLSAEDILTCCIGEYYCG 150
Query: 59 DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGCEPA-YPTP 104
DGC+GGYPI AW+Y+V +G+VT C PY + G + P C + TP
Sbjct: 151 DGCEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTP 210
Query: 105 KCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 161
KCV C + + KHY +AY ++ + I +EI KNGPVEV FTVY DF YK
Sbjct: 211 KCVDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGFTVYADFYQYK 270
Query: 162 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
SGVY H+ G +GGHAVKL+GWG D+G YW+ AN WN +WG +GYF+I RG NECGIE
Sbjct: 271 SGVYVHVAGPELGGHAVKLLGWGV-DNGTPYWLAANSWNTNWGENGYFRILRGVNECGIE 329
Query: 222 EDVVAGLP 229
VVAG+P
Sbjct: 330 SQVVAGMP 337
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 221 bits (563), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 119/234 (50%), Positives = 151/234 (64%), Gaps = 22/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG----MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
QG CGSCWAFGAVEA+SDR CIH + LS +DLL+CC CG+GC+GG+P SAW
Sbjct: 114 QGSCGSCWAFGAVEAMSDRTCIHSPSGGPKRVHLSADDLLSCC-RTCGNGCNGGFPGSAW 172
Query: 71 RYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL 116
++V G+VT + C PY C H P + PTP+CV C K +
Sbjct: 173 SFWVKTGIVTGGNYDSDDGCMPY-PIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDV 231
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ + KHY S+Y + S+ + I AEI NGPVE FTVY DF HYKSGVY+ T + +GG
Sbjct: 232 DYHDDKHYGKSSYSVPSEEKQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGG 291
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HA++L+GWG ++G YW+ AN WN WG G+FKI RGS+ECGIE+DVVAGLP
Sbjct: 292 HAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGLP 344
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 221 bits (563), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/235 (51%), Positives = 149/235 (63%), Gaps = 23/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG-----MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
QG CGSCWAFGAVEA+SDR CIH + L+ +D+L+CC CG GC+GG+P SA
Sbjct: 140 QGSCGSCWAFGAVEAISDRTCIHSPEGKPRVIAHLAADDVLSCC-TECGAGCNGGFPGSA 198
Query: 70 WRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ 115
W Y+VH G+VT E C PY C H P + PTP+CVR C K
Sbjct: 199 WSYWVHKGIVTGGNYDSDEGCMPY-PIKACDHHVNGTLGPCDKTIPPTPRCVRMCRKGYD 257
Query: 116 L-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ + + KHY AY + + + I AEI NGPVE FTVYEDF HYKSGVY+ T +G
Sbjct: 258 VDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADFTVYEDFLHYKSGVYQRHTDSALG 317
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GHA++L+GWG ++G YW+ AN WN WG G+FKI RGS+ECGIE D+VAGLP
Sbjct: 318 GHAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGFFKILRGSDECGIESDIVAGLP 371
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 220 bits (561), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/234 (48%), Positives = 153/234 (65%), Gaps = 19/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CI N+ +S DLL+CC CG GC+GG+P +AW Y
Sbjct: 144 QGSCGSCWAFGAVEAMSDRICIASKGNVHAHISSEDLLSCCSS-CGMGCNGGFPPAAWEY 202
Query: 73 FVHHGVVT-------EECDPYFDS------TGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F G+V+ + C PY + G P C PTPKC R C K ++ +
Sbjct: 203 FRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP-CSGEGPTPKCERTCEKGYKVKYE 261
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ K++ +AY +++D + IM EI NGPVE +FTVY DF YKSGVY+H++G +GGHA+
Sbjct: 262 DDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAFTVYADFPTYKSGVYQHVSGGELGGHAI 321
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+++GWG +DG YW++AN WN WG +G+FKI RG NECGIE ++VAGLP +
Sbjct: 322 RVLGWGV-EDGTPYWLVANSWNSDWGDNGFFKILRGQNECGIEGEIVAGLPKKQ 374
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 220 bits (561), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 112/246 (45%), Positives = 159/246 (64%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAV A+SDR CIH +N+ +S DLL+CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVGAMSDRVCIHTNGHVNVEVSAEDLLSCCGLECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
C+GGYP +AW+Y+ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGTRPQCTGEGGDTPKC 208
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+ C + ++ KH+ +Y ++S+ ++IMAEIYKNGPVE +FTV+ DF YK+GVY
Sbjct: 209 SKTCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNGPVEGAFTVFSDFLMYKTGVY 268
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
KH+ G+++GGHA++++GWG ++G YW++ N WN WG G+FKI RG + CGIE ++V
Sbjct: 269 KHLAGEMLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGDSGFFKIVRGEDHCGIESEIV 327
Query: 226 AGLPSS 231
AG+P +
Sbjct: 328 AGIPRT 333
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 220 bits (561), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 117/252 (46%), Positives = 160/252 (63%), Gaps = 18/252 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C ++ KHY S+Y ++S ++IMAEIYKNGPVE +FTVY DF YKSGVY+
Sbjct: 209 KFCEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG++MGGHAV+++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGEMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVA 327
Query: 227 GLPSSKNLVKEI 238
G+P + K+I
Sbjct: 328 GIPCTDQYWKKI 339
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 117/231 (50%), Positives = 149/231 (64%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR+CI + +S DLL+CC CG GC+GGYP SAW +
Sbjct: 26 QGACGSCWAFGAVEAMSDRYCIKSEGKVMPHISAEDLLSCC-ETCGMGCNGGYPESAWDH 84
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
+ G+VT + C PY C H C+ PTPKC RKC N +
Sbjct: 85 WKSKGLVTGGQYDSHKGCQPY-KIAACDHHVVGKLKPCKGDSPTPKCERKCEAGYNVSYS 143
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ KH+ SAY + SDP +I EI NGPVE +FTVY DF YKSGVY+H +G +GGHA+
Sbjct: 144 DDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVYADFPTYKSGVYQHTSGSALGGHAI 203
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKIKRG++ECGIE +V GLP
Sbjct: 204 KILGWG-EENGTPYWLVANSWNSDWGDEGFFKIKRGNDECGIESGIVGGLP 253
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 118/246 (47%), Positives = 154/246 (62%), Gaps = 19/246 (7%)
Query: 2 PFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD 59
P NS H ++ Q CGSCWAFGA EA+SDR CIH + +++S DLL CC CG
Sbjct: 96 PHCNSIH--LIRDQSTCGSCWAFGAAEAMSDRVCIHSKGKIQVNISAEDLLDCCDS-CGA 152
Query: 60 GCDGGYPISAWRYFVHHGVVT-------EECDPYFDS-----TGCSHPGCEPAYPTPKCV 107
GC+GG P +AW Y+ G+VT + C PY + T S P C PTPKCV
Sbjct: 153 GCNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEHHTKGSLPNCTGTVPTPKCV 212
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
C K + +++ KH+ Y I+SD + I EI+KNGPVE F V DF YKSGVY+
Sbjct: 213 HLCRKGYGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGVYQ 272
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H + DV+GGHA++++GWGT ++G YW+ AN WN WG GYFKI RG +ECGIEED+ A
Sbjct: 273 HHSDDVIGGHAIRILGWGT-ENGTPYWLAANSWNEDWGDHGYFKILRGKDECGIEEDINA 331
Query: 227 GLPSSK 232
G+P ++
Sbjct: 332 GIPKNR 337
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 118/254 (46%), Positives = 158/254 (62%), Gaps = 22/254 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
++N + ++ + Q CGSCWAFGAVEA+SDR CI + + ++LS +DLL+CC CG G
Sbjct: 131 WSNCQSIKNIRDQSSCGSCWAFGAVEAMSDRICIASNEKIQVTLSADDLLSCCR-TCGFG 189
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPK 105
C+GG P+ AW+Y+V HG+VT + C PY C H P YPTPK
Sbjct: 190 CEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPY-PFPPCEHHSNKTRFDPCRHDLYPTPK 248
Query: 106 CVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
C +KCV K + + + + Y +AY + +D I EI +GPVEV+F VYEDF HY G
Sbjct: 249 CSKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTHGPVEVAFEVYEDFLHYAGG 308
Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
+Y H G + GGHAVKLIGWG D G YW++AN WN WG +G+F+I RG +ECGIE
Sbjct: 309 IYVHTGGKLGGGHAVKLIGWGI-DQGTPYWLIANSWNTDWGEEGFFRILRGVDECGIESG 367
Query: 224 VVAGLPSSKNLVKE 237
VV G+P S N+ +
Sbjct: 368 VVGGIPKSTNIQRR 381
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 115/246 (46%), Positives = 158/246 (64%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR C+H +N+ +S DLL+CCGF CG G
Sbjct: 90 WPNCPTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFKCGMG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
C+GGYP AWR++ G+V+ C PY C H P C+ TPKC
Sbjct: 150 CNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPSCKGEEGDTPKC 208
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
++ C + + + KH+ ++Y + S ++IMA+IYKNGPVE +F VY DF YKSGVY
Sbjct: 209 MKTCEEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVY 268
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+H TG+ +GGHA+K++GWG ++G YW+ AN WN WG +G+FKI RG + CGIE +VV
Sbjct: 269 QHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEVV 327
Query: 226 AGLPSS 231
AG+P +
Sbjct: 328 AGIPKN 333
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 220 bits (560), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 110/240 (45%), Positives = 151/240 (62%), Gaps = 17/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CI N+ +S DLL CCGF CG+GC+GG+P AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIRSNGLQNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNF 161
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY G P TPKC + C + ++
Sbjct: 162 WKKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYK 221
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ Y + SD ++IM EIYKNGPVE +F+VY DF YKSGVY+H+TG+++GGHAV
Sbjct: 222 EDKHFGCDTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAV 281
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VAG+P + + + I
Sbjct: 282 RILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTGHYSERI 340
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 220 bits (560), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 116/236 (49%), Positives = 145/236 (61%), Gaps = 22/236 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
Q CGSCWAF A EA+SDR CI + +N LS DLL+CC F CG+GC+GGYPI AW
Sbjct: 104 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAW 163
Query: 71 RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQ- 115
+++ HG+VT C PY + G + P C E PTPKCV C +
Sbjct: 164 KWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTY 223
Query: 116 --LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ KH+ +AY + E I EI KNGP+EV+FTVYEDF Y +GVY H G +
Sbjct: 224 PTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHAVK++GWG D+G YW++AN WN +WG GYF+I RG NECGIE VAG+P
Sbjct: 284 GGHAVKILGWGV-DNGTPYWLVANSWNINWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 220 bits (560), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 116/236 (49%), Positives = 145/236 (61%), Gaps = 22/236 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
Q CGSCWAF A EA+SDR CI + +N LS DLL+CC F CG+GC+GGYPI AW
Sbjct: 104 QSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAW 163
Query: 71 RYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKCVRKCVKKNQ- 115
+++ HG+VT C PY + G + P C E PTPKCV C +
Sbjct: 164 KWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTY 223
Query: 116 --LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ KH+ +AY + E I EI KNGP+EV+FTVYEDF Y +GVY H G +
Sbjct: 224 PTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHAVK++GWG D+G YW++AN WN +WG GYF+I RG NECGIE VAG+P
Sbjct: 284 GGHAVKILGWGV-DNGTPYWLVANSWNINWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 219 bits (559), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 113/233 (48%), Positives = 145/233 (62%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q +CGSCWAFGA E+LSDR CIH G ++ LS +L+ CC CG GCDGG+P +A Y+V
Sbjct: 116 QSNCGSCWAFGAAESLSDRHCIHLGQDIRLSTQNLVTCCD-ECGFGCDGGWPEAAMDYYV 174
Query: 75 HHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL---W 117
++G+VT + C Y C+H P C PTP CV+ C + +
Sbjct: 175 NNGLVTGDLYGNNSWCQAY-SLAPCAHHVTSDVYPPCTGELPTPPCVKSCDSNSTYTIPY 233
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
H AY I+ + + IM EI NGP+EV+FTVYEDF YKSGVY+H+TG +GGHA
Sbjct: 234 PKDLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHA 293
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
VK++GWG ++G YWI+ N WN SWG G FKI RG NECGIE + V LP+
Sbjct: 294 VKMVGWGV-ENGTPYWIIVNSWNESWGDKGTFKILRGQNECGIESECVTALPA 345
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 219 bits (558), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 149/231 (64%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH + + LS +L++CC CG GCDGGYP SAW Y
Sbjct: 103 QGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLVSCCDS-CGFGCDGGYPASAWDY 161
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ + G+V+ + C PY + C H P C TP C +C K++ + +
Sbjct: 162 WQNVGIVSGGNYGSKQGCQPYSIAP-CEHHVPGPRPACSGEGSTPDCRNQCDKRSGISYD 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+Y SAY + + + I AEI KNGPVE +FTVYED +YK GVY+H+ G V+GGHA+
Sbjct: 221 KDLYYGESAYSLEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG +D YW++AN WN WG +G+FKI RG +ECGIE DV AGLP
Sbjct: 281 KILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGKDECGIEIDVSAGLP 330
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 219 bits (558), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 112/240 (46%), Positives = 151/240 (62%), Gaps = 18/240 (7%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGG 64
N + ++ QG CGSCWAFGA EA+SDR CIH N+++S +LL+CC + CG GC+GG
Sbjct: 94 NCPSIRLIRDQGSCGSCWAFGAAEAMSDRVCIHTHKNVNISAENLLSCC-YTCGFGCNGG 152
Query: 65 YPISAWRYFVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCV 111
+P +AWR++ + G+V+ + C PY G P C TPKC + C
Sbjct: 153 FPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKP-CAEGGRTPKCHKTCD 211
Query: 112 KKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
KN K S S+Y I SDP+ I +I NGPVE +F+VY DF YKSGVY+H+
Sbjct: 212 NKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAAFSVYSDFMSYKSGVYRHVK 271
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G ++GGHA++++GWG + G YW++AN WN WG +G FKI RGS+ CGIE+ VVAGLP
Sbjct: 272 GSLLGGHAIRILGWGM-EKGTPYWLVANSWNTDWGDNGTFKILRGSDHCGIEDSVVAGLP 330
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 116/252 (46%), Positives = 156/252 (61%), Gaps = 32/252 (12%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N + + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDG
Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC-------------- 106
C+GGYP AW ++ G+V+ Y+DS H GC P Y P C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVS---GGYYDS----HIGCLP-YTIPPCEHHVNGSRPPCTGE 201
Query: 107 --VRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 159
R+C K + ++ KH+ ++Y +++ + IMAEIYKNGPVE +FTV+ DF
Sbjct: 202 GDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIYKNGPVEGAFTVFSDFLT 261
Query: 160 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 219
YKSGVYKH GD+MGGHA++++ WG ++G YW AN WN WG +G+FKI RG N CG
Sbjct: 262 YKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWAAANSWNLDWGDNGFFKILRGENHCG 320
Query: 220 IEEDVVAGLPSS 231
IE ++VAG+P +
Sbjct: 321 IESEIVAGIPRT 332
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 218 bits (556), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 111/232 (47%), Positives = 153/232 (65%), Gaps = 18/232 (7%)
Query: 23 AFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
AFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 122 AFGAVEAISDRICIHTNAHISVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 181
Query: 81 EE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSIS 126
C PY C H P C TPKC + C + ++ KHY +
Sbjct: 182 GGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYN 240
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
+Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+HITG++MGGHA++++GWG
Sbjct: 241 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGV- 299
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
++G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 300 ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 351
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 218 bits (556), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 115/231 (49%), Positives = 148/231 (64%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH G +S+ ++ DLL CC CG GC+GGYP SAW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSGGKISVEISSEDLLTCCDS-CGMGCNGGYPSSAWDF 159
Query: 73 FVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKC-VKKNQLWR 118
+ G+V+ C PY S G P TP+C+ +C + ++
Sbjct: 160 WTKEGLVSGGLYNSHIGCRPYTISPCEHHVNGSRPPCTGEGGDTPECISRCEAGYSPSYK 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY S+Y + E I AEI KNGPVE +FTVYEDF YKSGVY+H++G V+GGHA+
Sbjct: 220 QDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG +DG YW+ AN WN WG +G+FKI RGSN CGIE ++VAG+P
Sbjct: 280 KVLGWG-EEDGIPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEIVAGIP 329
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 218 bits (555), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/232 (50%), Positives = 147/232 (63%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH G + L+ +D+L+CC CG GC+GG+P +AW Y
Sbjct: 111 QGSCGSCWAFGAVEAMSDRHCIHSGAKNIVHLAADDVLSCC-MSCGSGCNGGFPGAAWSY 169
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKK-NQLW 117
+VH G+VT E C PY C H P + PTP+CVR C K N +
Sbjct: 170 WVHKGIVTGGNYDSDEGCMPY-PIKACDHHVNGTLGPCDKSIPPTPRCVRMCRKGYNVDF 228
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ KHY +Y + S+ I EI NGPVE FTVY DF YKSGVY+ T +GGHA
Sbjct: 229 ADDKHYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVYADFPLYKSGVYQRHTDQALGGHA 288
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++L+GWG + G YW+ AN WN WG G+FKI RGS+ECGIE+DVVAG+P
Sbjct: 289 IRLLGWGV-EKGVPYWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAGIP 339
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 218 bits (554), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 117/249 (46%), Positives = 153/249 (61%), Gaps = 24/249 (9%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG GC+GG
Sbjct: 118 DSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGG 176
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
P++AWRY+V G+VT Y + GC P CE YPTPKC +
Sbjct: 177 DPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 234
Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y GVY
Sbjct: 235 KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 294
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H G + GGHAVKLIGWG DDG YW +AN WN WG DG+F+I RG +ECGIE VV
Sbjct: 295 HTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVG 353
Query: 227 GLPSSKNLV 235
G+P +L
Sbjct: 354 GIPKLNSLT 362
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 218 bits (554), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 117/249 (46%), Positives = 153/249 (61%), Gaps = 24/249 (9%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG GC+GG
Sbjct: 119 DSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGG 177
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
P++AWRY+V G+VT Y + GC P CE YPTPKC +
Sbjct: 178 DPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 235
Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y GVY
Sbjct: 236 KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 295
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H G + GGHAVKLIGWG DDG YW +AN WN WG DG+F+I RG +ECGIE VV
Sbjct: 296 HTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVG 354
Query: 227 GLPSSKNLV 235
G+P +L
Sbjct: 355 GIPKLNSLT 363
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 117/249 (46%), Positives = 153/249 (61%), Gaps = 24/249 (9%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGAVEA+SDR CI H + ++LS +DLL+CC CG GC+GG
Sbjct: 109 DSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGG 167
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
P++AWRY+V G+VT Y + GC P CE YPTPKC +
Sbjct: 168 DPLAAWRYWVKDGIVTGS--NYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 225
Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
KCV ++ + K + SAY + D E I E+ +GP+E++F VYEDF +Y GVY
Sbjct: 226 KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 285
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H G + GGHAVKLIGWG DDG YW +AN WN WG DG+F+I RG +ECGIE VV
Sbjct: 286 HTGGKLGGGHAVKLIGWGI-DDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVG 344
Query: 227 GLPSSKNLV 235
G+P +L
Sbjct: 345 GIPKLNSLT 353
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 217 bits (553), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 116/232 (50%), Positives = 145/232 (62%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CI N+ +S DL +CC CG+GC+GG+P +AW Y
Sbjct: 111 QGACGSCWAFGAVEAMSDRICIKSQGKENVHISAEDLTSCC-RTCGNGCEGGFPSAAWSY 169
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLW 117
+ G+VT + C PY C H P + PTPKC C N +
Sbjct: 170 YKRDGLVTGGQYNSHQGCQPY-TIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTY 228
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +SAY ++ E IM EI NGPVE +FTVY DF YKSGVYKH TG +GGHA
Sbjct: 229 EKDKHYGMSAYSVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHA 287
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+K++GWGT ++G+DYW++AN WN WG G+FKI RG +ECGIE + AG P
Sbjct: 288 IKILGWGT-ENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEP 338
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 217 bits (552), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 116/231 (50%), Positives = 148/231 (64%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH + + +S DLL+CC CG GCDGG+P SAW +
Sbjct: 18 QGSCGSCWAFGAVEAMSDRHCIHSNGKVKIEVSPEDLLSCCS-SCGMGCDGGFPPSAWEF 76
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+V G+ T C PY + C H P C TPKCV C K N +R
Sbjct: 77 WVDKGIATGGLWNSHIGCQPY-EIPACEHHTTGDRPPCSDIVDTPKCVHLCEKGYNTSYR 135
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ KH+ +Y I S + I EI+KNGPVE +F+VY DF +YKSGVY+H +G+ +GGHA+
Sbjct: 136 DDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAI 195
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG +D YW+ AN WN WG GYFKI RGS+ECGIE +VAG+P
Sbjct: 196 RVLGWGYEND-VPYWLCANSWNTDWGDKGYFKILRGSDECGIESSIVAGIP 245
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 217 bits (552), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 146/231 (63%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH +++ +S DLL CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRICIHTKGKVSVEISSQDLLTCCDS-CGMGCNGGYPANAWEF 159
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWR 118
+ G+VT C PY G P TP+CV +C ++
Sbjct: 160 WTEQGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQ 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y + S+ E I +EIYKNGPVE +F VYEDF YKSGVY+H+TG +GGHA+
Sbjct: 220 KDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGVYQHVTGSALGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K+IGWG ++G YW+ AN WN WG +G+FKI RGSN CGIE +VVAG+P
Sbjct: 280 KMIGWG-EENGVPYWLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAGIP 329
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 217 bits (552), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 114/246 (46%), Positives = 158/246 (64%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR C+H +N+ +S DLL+CCG CG G
Sbjct: 90 WPNCPTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGDECGMG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--PGCEPAYP-----TPKC 106
C+GGYP AW+++ G+V+ C PY C H G PA TPKC
Sbjct: 150 CNGGYPSGAWQFWTETGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPACKGEEGDTPKC 208
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
V++C + + + KH+ ++Y + + ++IMAEIYKNGPVE +F VY DF YKSGVY
Sbjct: 209 VKQCEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVY 268
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+H TG+ +GGHA+K++GWG ++G YW+ AN WN WG +G+FKI RG + CGIE ++V
Sbjct: 269 QHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIV 327
Query: 226 AGLPSS 231
AG+P +
Sbjct: 328 AGVPKN 333
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 216 bits (550), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 115/251 (45%), Positives = 160/251 (63%), Gaps = 18/251 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI ++S+ V+ D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C ++ KHY S+Y ++S ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 209 KICEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG++MGGHAV+++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVTGEMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLPSSKNLVKE 237
G+P + K+
Sbjct: 328 GIPCTDQYWKK 338
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 216 bits (550), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 114/234 (48%), Positives = 143/234 (61%), Gaps = 22/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A E +SDR CI + LS+S +D+ ACCG +CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASNGKTQLSISADDINACCGMVCGNGCNGGYPIEAWRH 178
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
+V G VT Y + TGC +P CE P+ YPT KC R C L
Sbjct: 179 YVKKGYVTG--GSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYAL 236
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ H+ SAY ++ +I EI +GPVEV+F+VYEDF HY GVY H G +GG
Sbjct: 237 TYTQDLHFGQSAYAVSKKVTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVYVHTAGASLGG 296
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK++GWG D+G YW+ AN WN WG +GYF+I RG NECGIE VV G+P
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGGIP 349
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 216 bits (550), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 111/223 (49%), Positives = 140/223 (62%), Gaps = 19/223 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI + LS D+L+CC CG GC+GG+P AWR+
Sbjct: 37 QSTCGSCWAFGAVEAMSDRLCIASNGTVKDELSAEDMLSCCLVQCGMGCNGGFPTGAWRF 96
Query: 73 FVHHGVVTEECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI 125
F HG+ TE PY C H C P+ PTPKCVR KK +++
Sbjct: 97 FKMHGLTTESKYPYVFPP-CEHHINKTHYKPCGPSQPTPKCVRASEKK------PRYHGK 149
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
S Y ++ P I AEI NGPVE +FTVY+DF Y+SGVY+H++G +GGHA+K++GWG
Sbjct: 150 SVYSVS--PAKIQAEIMTNGPVEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGV 207
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+ G YW++AN WN WG G FKI RG +ECGIE VVAG+
Sbjct: 208 -EAGNKYWLVANSWNEDWGDKGTFKIARGDDECGIESSVVAGM 249
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 216 bits (549), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 113/234 (48%), Positives = 142/234 (60%), Gaps = 22/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A E +SDR CI + +S+S +D+ ACCG +CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASNGKTQISISADDINACCGMVCGNGCNGGYPIEAWRH 178
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
+V G VT Y + +GC +P CE P+ YPT KC C L
Sbjct: 179 YVKKGYVTG--GSYQEKSGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPL 236
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ H+ SAY ++ P +I EI +GPVEV+FTVYEDF HY GVY H G +GG
Sbjct: 237 TYTQDLHFGQSAYAVSKKPAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGG 296
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK++GWG D+G YW+ AN WN WG +GYF+I RG NECGIE VV G P
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGGTP 349
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 111/230 (48%), Positives = 138/230 (60%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGA EA+SDR CIH + +S+S DL CC + CGDGC+GG+P AW Y
Sbjct: 107 QASCGSCWAFGAAEAMSDRICIHSNATVKVSISTEDLNTCC-YECGDGCNGGWPAEAWAY 165
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQLWRN 119
+ G+VT + C Y C H P C PTP+C ++C +
Sbjct: 166 WAETGIVTGGKYETKDGCKAY-TVPPCEHHTEGDLPACGDIVPTPQCKKECDAGVDIEYK 224
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
S SAY+ +SD I EI NGPVE F VYEDF +YKSGVY+ TG+ GGHA+K
Sbjct: 225 SDLRKGSAYQTSSDESQIQTEIMTNGPVEADFDVYEDFLNYKSGVYQQTTGNYAGGHAIK 284
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++GWG +DG YW+ AN WN WG GYFKI RG NECGIE D++ G+P
Sbjct: 285 ILGWGV-EDGTPYWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGGIP 333
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 110/231 (47%), Positives = 147/231 (63%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCW+FGAVE+++DR CIH + + +S DL+ CC CG GC+GG+ AW Y
Sbjct: 104 QGSCGSCWSFGAVESITDRICIHSNGKVKVHISAEDLMTCCT-SCGMGCNGGFLPQAWHY 162
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+V++G+VT + C PY + C H C PTPKC +KC N+ +
Sbjct: 163 WVNNGIVTGGQYHSHKGCQPY-EIPKCEHHVKGPFKACGKELPTPKCSQKCQPGYNKTFN 221
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y I ++ + I EI NGPVE +FTVY DF YKSGVY+H TG +GGHAV
Sbjct: 222 QDKHFGKKSYSITNNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAV 281
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWGT ++ YW++AN WN +WG GYFKI RG +ECGIE +VAG+P
Sbjct: 282 KILGWGTENN-TPYWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMP 331
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/252 (45%), Positives = 158/252 (62%), Gaps = 17/252 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CI ++S+ V+ D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVSVEVSAEDMLTCCGDQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY G P TPKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCS 209
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KHY S+Y ++S ++IMAEI+KNGPVE +FTVY DF YKSGVY+
Sbjct: 210 KICEPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQ 269
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+ GD+MGGHAV+++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 270 HVAGDMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 328
Query: 227 GLPSSKNLVKEI 238
G+P + K I
Sbjct: 329 GIPCTDQYWKRI 340
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 118/234 (50%), Positives = 144/234 (61%), Gaps = 23/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA GAVEA++DR CI N +++S +DLL+CC CG GCDGG P +AW Y
Sbjct: 117 QSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGGDPYAAWSY 175
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL 116
+V +G+VT Y +GC +P CE YPT C KC +
Sbjct: 176 WVSNGIVTGS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSI 233
Query: 117 WRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
NS KHY S Y + D I EI NGPVEV+F VYEDF HY SG+YKH TGD +GG
Sbjct: 234 SYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGG 293
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK++GWGT ++G DYWI AN WN WG +G+F+I RG +EC IE VVAG P
Sbjct: 294 HAVKMLGWGT-ENGTDYWICANSWNSDWGENGFFRILRGVDECQIESSVVAGEP 346
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/235 (49%), Positives = 146/235 (62%), Gaps = 19/235 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI LS +L++CC CG GC+GG+P SAW Y
Sbjct: 116 QSSCGSCWAFGAVEAMSDRICIKSKGKHKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLY 174
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ + G+VT + C PY + C H P C+ TP C C N +
Sbjct: 175 WKNQGIVTGDLYNTTNGCQPY-EFPPCEHHVIGPLPSCDGDVETPSCKTNCQPGYNIPYE 233
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y YRI+S+PE IM E+ +NGPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 234 KDKWYGEKVYRIHSNPEAIMLELMRNGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAV 293
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
+L+GWG ++ YW++AN WN WG GYFKI RG NECGIE DV AG+P KN
Sbjct: 294 RLLGWG-EENNVPYWLIANSWNSDWGDKGYFKIVRGKNECGIESDVNAGIPKIKN 347
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 214 bits (546), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/233 (49%), Positives = 152/233 (65%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY-- 72
QG CGSCWAFGAVEA+SDR+CI F +++S +LL+CC CG GCDGGYP +AWR+
Sbjct: 108 QGACGSCWAFGAVEAMSDRYCISFKEQVNISAENLLSCCE-TCGSGCDGGYPAAAWRHWA 166
Query: 73 --FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQL 116
++ G+VT C PY C H PG C + TP C R C+ ++
Sbjct: 167 DKLLYEGIVTGGQYDSNAGCQPY-TIPKCDHHEPGPYENCSGSQSTPSCKRSCISSYDKS 225
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+R+ KHY ++Y I+SD I EI NGPVE +F+VY DF Y SGVY+H TG +GGH
Sbjct: 226 YRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGH 285
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A+K++GWGT ++G YW++AN WN SWG G+FKI RG +ECGIE +VAG+P
Sbjct: 286 AIKILGWGT-ENGVPYWLVANSWNPSWGDSGFFKIIRGKDECGIESSIVAGMP 337
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 214 bits (546), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 112/242 (46%), Positives = 148/242 (61%), Gaps = 18/242 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 62
+ N + ++ QG CGSCWAFGA EA+SDR CIH N+++S +LL+CC + CG GC+
Sbjct: 93 WPNCPSIRLIRDQGSCGSCWAFGAAEAMSDRICIHTNKNVNISAENLLSCC-YSCGFGCN 151
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRK 109
GG+P +AW+Y+ G+V+ C PY D C H C TPKC R
Sbjct: 152 GGFPGAAWKYWTSKGLVSGGLYGSHSGCQPY-DIEPCEHHVNGTRQPCAEGGRTPKCHRT 210
Query: 110 CVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C +N K S S+Y I SDP+ I EI NGPVE +F+VY DF + KSGVY+H
Sbjct: 211 CENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSVYSDFMNDKSGVYRH 270
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ G ++GGHA++++GWG + G YW++AN WN WG G FKI RGS+ CGIE VV G
Sbjct: 271 VKGSLLGGHAIRILGWGV-EKGTPYWLVANSWNTDWGDKGTFKILRGSDHCGIEGSVVTG 329
Query: 228 LP 229
LP
Sbjct: 330 LP 331
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 214 bits (546), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/230 (48%), Positives = 148/230 (64%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH + + LS +L++CC CG GCDGG+P SAW Y
Sbjct: 103 QGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLVSCCDS-CGYGCDGGFPASAWDY 161
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PGCEPAYP----TPKCVRKCVKKNQLWRN 119
+ + G+V+ + C PY + C H PG PA TP C +C + + + +
Sbjct: 162 WQNEGIVSGGNYGSKQGCQPYSIAP-CEHHVPGSRPACSGGGDTPDCRNQCDEGSGISYD 220
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
HY + + I AEI KNGPVE +FTVYED +YK GVY+H+ G+ +GGHA+K
Sbjct: 221 QDHYYGETVYTLDEAKQIQAEILKNGPVEAAFTVYEDLLNYKEGVYQHVAGEALGGHAIK 280
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++GWG +D YW++AN WN WG +G+FKI RGS+ECGIE+ +VAGLP
Sbjct: 281 ILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGLP 329
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 214 bits (545), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/233 (46%), Positives = 148/233 (63%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH N S +L++CC + CG GC+GG+P +AW Y
Sbjct: 113 QGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNY 171
Query: 73 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-W 117
+ G+V+ PY + GC + C+ TP CV+KC + ++ +
Sbjct: 172 WKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
H+ SAY I +D + I EIY NGPVE +FTVYEDF Y++GVYKH+ G +GGHA
Sbjct: 230 AQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
++++GWG + YW++AN WN WG+DG+FKI RGS+ECGIE + AGLP+
Sbjct: 290 IRILGWGVQNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 342
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/231 (48%), Positives = 148/231 (64%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH N S +DL++CC + CG GC+GGYP +AW Y
Sbjct: 99 QGSCGSCWAFGAVEAMSDRVCIHSNGESNFHFSSDDLVSCC-WTCGMGCNGGYPGAAWHY 157
Query: 73 FVHHGVVT-------EECDPYF-----DSTGCSHPGCEPAY-PTPKCVRKCVKKNQL-WR 118
+V G+V+ + C PY T S P C+ + TPKC + C ++ +
Sbjct: 158 WVRKGLVSGGQYGTKQGCRPYEIPPCEHHTNGSRPACDASEGNTPKCAKSCESNYKINYS 217
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
N H+ AY I+SD + I AEI +NGPVE +F+VY DF +YK+GVY+HI G +GGHA+
Sbjct: 218 NDLHFGSKAYSISSDVKQIQAEILQNGPVEGAFSVYADFVNYKTGVYQHIKGQFLGGHAI 277
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++ GWG ++ YW++AN WN WG G FKI RGS+ CGIE +VAGLP
Sbjct: 278 RIFGWGVENN-TPYWLIANSWNTDWGDSGTFKILRGSDHCGIESGIVAGLP 327
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 109/245 (44%), Positives = 153/245 (62%), Gaps = 19/245 (7%)
Query: 1 MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 58
+ + N ++ + QG CGSCWAFGA EA+SDR+CIH +++ +S DLL+CC CG
Sbjct: 87 LQWPNCPTIKEIRDQGSCGSCWAFGAAEAISDRYCIHSNGKVSVEISAEDLLSCCD-ACG 145
Query: 59 DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPK 105
GC GG+P +AW Y+ G+VT C PY + C H P C TPK
Sbjct: 146 MGCMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAP-CEHHVNGTRPPCTGEGDTPK 204
Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
CV +C ++ K + Y + + IM E+YKNGPVE +F+VYEDF YK+GV
Sbjct: 205 CVSECNAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGV 264
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
Y+H+TG ++GGHA+K++GWG ++ YW++AN WN WG +G+FKI RG +ECGIE ++
Sbjct: 265 YQHVTGQMLGGHAIKILGWG-KENNTPYWLVANSWNTDWGDNGFFKILRGKDECGIESEI 323
Query: 225 VAGLP 229
VAG+P
Sbjct: 324 VAGIP 328
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 214 bits (544), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 116/232 (50%), Positives = 143/232 (61%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CI N +S DL +CC CG+GC+GG+P +AW Y
Sbjct: 111 QGACGSCWAFGAVEAMSDRICIKSQGKENTHISAEDLTSCC-RTCGNGCEGGFPSAAWSY 169
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKC-VKKNQLW 117
+ G+VT + C PY C H P + PTPKC C N +
Sbjct: 170 YKKDGLVTGGQYNSHQGCLPY-TIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTY 228
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY SAY ++ E IM EI NGPVE +FTVY DF YKSGVYKH TG +GGHA
Sbjct: 229 EKDKHYGSSAYSVHG-VEKIMTEIMTNGPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHA 287
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+K++GWGT ++G+DYW++AN WN WG G+FKI RG +ECGIE + AG P
Sbjct: 288 IKILGWGT-ENGDDYWLVANSWNPDWGDQGFFKILRGQDECGIESQISAGEP 338
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 214 bits (544), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/231 (48%), Positives = 147/231 (63%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH +N+ +S DLL CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSNGKVNVEISSEDLLTCCDS-CGMGCNGGYPSAAWDF 159
Query: 73 FVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY + G P TP+CVR+C +
Sbjct: 160 WASEGLVSGGLYESHIGCRPYTIAPCEHHVNGSRPPCTGEGGDTPECVRQCESGYTPSYI 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y + SD + I EIYKNGPVE +FTVYEDF YK+GVY+H++G +GGHA+
Sbjct: 220 QDKHYGKTSYSVPSDEQQIQTEIYKNGPVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW+ AN WN WG +GYFKI RGS+ CGIE ++VAG+P
Sbjct: 280 KVLGWG-EENGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEIVAGIP 329
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 114/223 (51%), Positives = 141/223 (63%), Gaps = 18/223 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAFGAVE+ DR CIH G+++ LS DL+ C DGC+GG +SAW +
Sbjct: 100 QARCGSCWAFGAVESAQDRICIHKGLDVQLSFLDLVTC--DQSDDGCEGGDDVSAWNFLK 157
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRNSKHYSIS 126
GVVT+EC PY + P C PA TP CV++C + L + KH
Sbjct: 158 KQGVVTQECKPY------TIPTCPPAQQPCLNFVNTPNCVKQCESNSTLIYSQDKHKMAK 211
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
Y INS E IM EI NGPVE F+VYEDF YKSGVY+H TG +GGH VK+ G+GT
Sbjct: 212 IYSINS-VEAIMQEISTNGPVEACFSVYEDFLGYKSGVYQHTTGKFLGGHCVKIFGYGTL 270
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+G +YW +AN W SWG +G F IKRGS+ECGIE++VVAG+P
Sbjct: 271 -NGVNYWSVANSWTTSWGDNGIFLIKRGSDECGIEDEVVAGIP 312
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 115/245 (46%), Positives = 151/245 (61%), Gaps = 19/245 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + Q CGSCWAFGAVE++SDR CIH +++ LS +LL+CC CG G
Sbjct: 102 WKNCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLLSCCS-RCGFG 160
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCE-PAYPTPKC 106
C+GG P AW Y+ G+VT C PY ST +H CE Y TP+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220
Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+ C + + N K+Y S+Y + SD IM EI NGPVE +F V++DF +YK+GVY
Sbjct: 221 YQTCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTGVY 280
Query: 166 KHITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
K++TG ++GGHA+++IGWG S + YW+ AN WN+ WG GYFKI RGSNECGIE V
Sbjct: 281 KYVTGSLLGGHAIRIIGWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRGSNECGIESMV 340
Query: 225 VAGLP 229
AGLP
Sbjct: 341 TAGLP 345
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 213 bits (543), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 116/242 (47%), Positives = 149/242 (61%), Gaps = 22/242 (9%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
E ++ + Q CGSCWAFGAVEA+SDR CI H + +SLS +DLL+CC CG GC+GG
Sbjct: 134 ESIKAIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSCGFGCNGG 192
Query: 65 YPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRK 109
P++AWRY+V G+VT C PY C H P YPTPKC ++
Sbjct: 193 DPLAAWRYWVKDGIVTGSNFTANSGCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKR 251
Query: 110 CVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + ++ + K Y SAY + D E I E+ +GP+E++F VYEDF +Y GVY H
Sbjct: 252 CNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVH 311
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
G + GGHAVKLIGWG +DG YW +AN WN WG DG+F+I RG +ECGIE VV G
Sbjct: 312 TGGKLGGGHAVKLIGWGI-EDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGG 370
Query: 228 LP 229
+P
Sbjct: 371 IP 372
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 213 bits (543), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 113/234 (48%), Positives = 141/234 (60%), Gaps = 22/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A E +SDR CI +S+S +D+ ACCG CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASKGQTQVSISADDINACCGMACGNGCNGGYPIEAWRH 178
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
+V +G VT Y + TGC +P CE P+ YPT KC R C L
Sbjct: 179 YVKNGYVTG--GSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKCERSCQAGYSL 236
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ H+ SAY ++ +I EI NGPVEV+FTVY DF Y GVY H G +GG
Sbjct: 237 TYKQDLHFGQSAYAVSKKATEIQKEIMTNGPVEVAFTVYADFEVYSGGVYVHTAGASLGG 296
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK++GWG D+G YW+ AN WN WG +GYF+I RG NECGIE VV G+P
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIEHGVVGGIP 349
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 213 bits (543), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 110/229 (48%), Positives = 144/229 (62%), Gaps = 17/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCWA GAVEA+SDR+C+ F N+ +S +L+ CC F CG+GC GG+ AW Y+V
Sbjct: 104 QGSCGSCWALGAVEAMSDRYCVSFQENVHISAENLMTCCKF-CGNGCAGGFLQQAWEYWV 162
Query: 75 HHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNS 120
G+VT E C PY C+H PG C TP+C R C +
Sbjct: 163 KDGLVTGGQYGSDEGCQPYLIPK-CNHHEPGPYENCTGEGKTPQCERTCRSGYTTSYEAD 221
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
HY AY ++ + E I EI NGPVE +FTVY DF YKSGVY+H+ G +GGHA+++
Sbjct: 222 LHYGEKAYAVHREVEAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRI 281
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GWGT ++G YW++AN WN SWG GYFK+ RG ++CGIE ++VAG P
Sbjct: 282 LGWGT-ENGVPYWLIANSWNPSWGDKGYFKMIRGKDDCGIESNIVAGTP 329
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 213 bits (542), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 114/230 (49%), Positives = 143/230 (62%), Gaps = 17/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH SLS DL++CCG+ CG GC GGYP +AW +
Sbjct: 102 QSSCGSCWAFGAVEAMSDRLCIHSNGTFTKSLSSIDLVSCCGY-CGFGCQGGYPPAAWDF 160
Query: 73 FVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRN 119
+ +G+VT + DP + CSH G + Y TPKCV KC N +
Sbjct: 161 WQAYGIVTGGSKEDPMGCRSYPFPKCSHHGSKKYPPCPHRIYDTPKCVPKCDTPNIDYET 220
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
K + Y + IM EI NGPVE +F VYEDF YK GVY H TG+ +GGHA++
Sbjct: 221 DKTRANITYNVQRSQMAIMKEIMINGPVEAAFEVYEDFFGYKQGVYFHSTGEFIGGHAIR 280
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++GWG ++G YW++AN WN WG DGYFK+ RG NECGIE++V AGLP
Sbjct: 281 ILGWG-EENGTPYWLIANSWNEGWGEDGYFKMLRGKNECGIEDEVTAGLP 329
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 213 bits (542), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 112/231 (48%), Positives = 144/231 (62%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 104 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 162
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
+ H G+V T+ C PY + C H PG C TPKC++KC N ++
Sbjct: 163 WKHFGLVSGGSYNSTQGCRPY-EIPPCEHHVPGNRLPCSGDTKTPKCIKKCEDNYNVAYK 221
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY Y + + I AE+YKNGPVE +FTVY D YKSGVYKH+ GD +GGHA+
Sbjct: 222 QDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVAGDALGGHAI 281
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VAG P
Sbjct: 282 KIMGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 213 bits (542), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 111/243 (45%), Positives = 150/243 (61%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGA EA+SDR CIH +S+ ++ DLL CC CG G
Sbjct: 89 WPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNARVSVEISSEDLLTCCES-CGMG 147
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GGYP +AW ++ G+VT C PY G P TP+C+
Sbjct: 148 CNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCI 207
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+C ++ KHY ++Y + ++ I EIYKNGPVE +F VYEDF YKSGVY+
Sbjct: 208 NQCESGYTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGVYQ 267
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G ++GGHA+K++GWG +DG YW+ AN WN WG +GYFKI RGS+ CGIE +VVA
Sbjct: 268 HVSGSLIGGHAIKILGWGV-EDGVPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEVVA 326
Query: 227 GLP 229
G+P
Sbjct: 327 GIP 329
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 213 bits (542), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 117/234 (50%), Positives = 150/234 (64%), Gaps = 19/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI G++ LS +L+ACC CG GC+GG+P SAW Y
Sbjct: 114 QSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSY 172
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+VT + C PY + C H P CE TPKC C N +
Sbjct: 173 WKRSGIVTGDLYNPTDGCQPY-EFPPCEHHVVGPRPSCEGDVETPKCKTTCQPGYNIPYN 231
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y + YR++S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 232 KDKWYGKTVYRVHSNQEAIMKEVKEHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAV 291
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+L+GWG ++G YW++AN WN WG +GYFKI RG NECGIE DV AG+P K
Sbjct: 292 RLLGWG-EENGVPYWLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAGIPKLK 344
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 213 bits (542), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 111/234 (47%), Positives = 145/234 (61%), Gaps = 18/234 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH + +S +DLL+CCG CG GC+GG P +AWRY
Sbjct: 106 QGTCGSCWAFGATEAMSDRICIHSEGKEVVRISADDLLSCCGLFCGFGCNGGLPENAWRY 165
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY + C H P C+ TPKC R+CV+ + ++
Sbjct: 166 WAIDGIVSGGLYGSHVGCRPY-EIPPCEHHTSGNRPDCKGNSKTPKCQRQCVESFDGKYQ 224
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH++ + Y + + EDIM EI GPVE F VY DF YKSGVY+H+ G +GGHAV
Sbjct: 225 ADKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYADFLTYKSGVYQHVKGGFLGGHAV 284
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
K++GWG ++G YW+ AN WN WG G+FKI RG N C IE D+ AG+P +
Sbjct: 285 KILGWG-EENGVPYWLCANSWNTDWGDGGFFKILRGYNHCKIEADINAGIPKIR 337
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 213 bits (542), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 143/230 (62%), Gaps = 12/230 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI +S +DLL+CCG CG+GC+GG
Sbjct: 99 KSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 158
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
YPI A R++ GVVT C PY + C+ C P TP C C + +
Sbjct: 159 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTPSCSMSCQSGYSTAY 216
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KH+ +SAY + + I AEIY NGPVE +F+VYEDF YKSGVYKH G +GGHA
Sbjct: 217 AKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHA 276
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWGT + G YW++AN W +WG G+FKI RG ++CGIE VVAG
Sbjct: 277 IKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAG 325
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 213 bits (541), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 110/231 (47%), Positives = 146/231 (63%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH MN+ +S DLL+CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRLCIHSNGLMNVEISAEDLLSCCDS-CGMGCNGGYPSAAWEF 159
Query: 73 FVHHGVVTEE-------CDPYFDS------TGCSHPGCEPAYPTPKCVRKC-VKKNQLWR 118
+ G+V+ C PY + G P TP+C +KC +
Sbjct: 160 WTTDGLVSGGLYDSHIGCRPYSIAPCEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYT 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +Y ++ ++I EIYKNGPVE +FTVYEDF YK+GVY+H+TG +GGHA+
Sbjct: 220 QDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW+ AN WN WG +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 280 KVLGWG-EENGTPYWLCANSWNTDWGDNGFFKILRGSDHCGIESEIVAGIP 329
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 212 bits (540), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 111/230 (48%), Positives = 144/230 (62%), Gaps = 14/230 (6%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q +CGSCWAFGA E +SDR CI +S D++ CCG CG GCDGG
Sbjct: 101 KSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMVDCCGEYCGYGCDGG 160
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
Y I A R++V GVVT + C PY C+ GC P TP+C C K N +
Sbjct: 161 YSIQALRWWVFDGVVTGGDYQGDGCKPY---QFCNSAGC-PDAVTPECALSCQSKYNTEY 216
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K++ SAY + I +I NGPVE SF VYEDF YKSGVYK+I G ++GGHA
Sbjct: 217 AKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYEDFYKYKSGVYKYIAGKMLGGHA 276
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWGT ++G YW++AN W WG +G+FKI+RG NECGIE +VVAG
Sbjct: 277 IKIIGWGT-ENGTAYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAG 325
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/230 (49%), Positives = 141/230 (61%), Gaps = 20/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFG+ EA+SDR CI H + LS +D+L+CC + CGDGCDGGYPISAW Y
Sbjct: 116 QSQCGSCWAFGSAEAMSDRVCIASHGNKTVELSADDILSCC-YDCGDGCDGGYPISAWEY 174
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQL-W 117
FV GVVT + C PY + C H E Y TP CV C + +
Sbjct: 175 FVETGVVTGGLYGTKDSCRPY-EIPPCGHHRNETFYGNCTQIADTPDCVTTCQAGYPISY 233
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K + +Y I S I EI GPV +F VYEDF HY G+YKH++G GGHA
Sbjct: 234 DDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHA 293
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
V+++GWG + G YW++AN WN WG +GYF+I RGSNECGIEE+VVAG
Sbjct: 294 VRILGWG-EEKGTAYWLVANSWNTDWGENGYFRILRGSNECGIEENVVAG 342
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 116/245 (47%), Positives = 150/245 (61%), Gaps = 24/245 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI H + +SLS +DLL+CC CG GC+GG P++AWRY
Sbjct: 128 QSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSCGFGCNGGDPLAAWRY 186
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVK--KN 114
+V G+VT Y ++GC P CE YPTPKC +KC+ +
Sbjct: 187 WVKDGIVTGS--NYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTD 244
Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ + K Y SAY + D E I E+ +GP+E++F VYEDF +Y GVY H G + G
Sbjct: 245 KTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGG 304
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
GHAVKLIGWG +DG YW AN WN WG DG+F+I RG +ECGIE VV G+P ++
Sbjct: 305 GHAVKLIGWGI-EDGIPYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSV 363
Query: 235 VKEIT 239
++
Sbjct: 364 SSRLS 368
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 113/243 (46%), Positives = 150/243 (61%), Gaps = 19/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI-HFGMNLS-LSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWA AVEA+SDR C+ G ++ +S DL +CC CG+G
Sbjct: 90 WANCPTIKEVRDQGSCGSCWALAAVEAMSDRICVASKGSTMAHISAEDLNSCCKS-CGNG 148
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P +AW Y+ G+VT + C PY + C H P C PTP+C
Sbjct: 149 CNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPY-EIKPCEHHINGSRPACGKLEPTPRCK 207
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C N + KHY+ +AY ++S + I EI NGPVE +FTVY DF HYKSGVY+
Sbjct: 208 KSCESGYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQ 267
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H +G +GGHAVK+IGWGT + YW++AN WN WG G+FKI RG +ECGIE D+VA
Sbjct: 268 HESGAELGGHAVKMIGWGT-EGSTPYWLIANSWNTDWGNMGFFKILRGQDECGIERDIVA 326
Query: 227 GLP 229
G P
Sbjct: 327 GEP 329
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 144/231 (62%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N LS +L++CC + CG GC+GG+P +AW +
Sbjct: 111 QGSCGSCWAFGAVEAMSDRICIHSKGEVNAHLSAENLVSCC-YTCGFGCNGGFPGAAWSH 169
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT + C PY C H P C TPKC++ C + +
Sbjct: 170 WVKKGIVTGGNFNSSQGCQPYI-IPACEHHTTGDRPPCSEGGGTPKCLKTCEDGYTVDYT 228
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
HY S+Y ++ EDI EI NGPVE + TVYEDF YKSGVY+H+ G +GGHA+
Sbjct: 229 QDLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVYQHVHGKALGGHAI 288
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++G YW++AN WN WG +GY K+ RG + CGIE + AGLP
Sbjct: 289 RILGWGV-EEGVPYWLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLP 338
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 106/236 (44%), Positives = 151/236 (63%), Gaps = 18/236 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ QG CGSCWAFGAVEA+SDR CIH +++S +LL+CC + CG GC+GG+P +
Sbjct: 100 ISLIRDQGSCGSCWAFGAVEAMSDRLCIHSNKIVNVSAENLLSCC-YSCGFGCNGGFPGA 158
Query: 69 AWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKN- 114
AW ++ G+V+ + C PY + C H P C TPKC C ++
Sbjct: 159 AWSFWKKKGLVSGGLYGSHKGCQPYAIAP-CEHHANGTRPPCSGGGRTPKCHTFCENEDY 217
Query: 115 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ K + S+Y + SDP+ I EI NGPVE +F+VY DF +YKSGVY+H+ G ++
Sbjct: 218 SLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVYRHVKGSLL 277
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHA++++GWG ++G YW++AN WN WG +G FKI +GS+ CGIE +VAGLP
Sbjct: 278 GGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVAGLP 332
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 142/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWAFGA A+SDR CI G +S DL+ CC CG GC GGYP AW Y
Sbjct: 110 QSNCGSCWAFGAAGAISDRICIASGGKHQPRISPEDLVDCCAD-CGMGCQGGYPAQAWEY 168
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP------TPKCVRKCVKK-NQLWR 118
+V +G+VT + C PY C H P P TP+CV+KC + + +
Sbjct: 169 WVRNGLVTGDLYNTTDTCRPY-SFPPCEHHVVGPRKPCTGDPTTPQCVKKCQPEYPKTYE 227
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
N K Y + AY I+SD E IM ++ GP+EV F VY DF Y SGVY+H+ G ++GGHAV
Sbjct: 228 NDKWYGLKAYSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGVYRHVAGGLLGGHAV 287
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+L+GWG +DG DYW++AN WN WG GYFKI+RG NECGIE D AG P
Sbjct: 288 RLVGWGV-EDGADYWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAGHP 337
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/234 (47%), Positives = 144/234 (61%), Gaps = 22/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWGY 168
Query: 73 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL- 116
+V G+V+ PY S GC + P CE Y TP+C KC ++
Sbjct: 169 WVRKGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVD 226
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KH+ AY I+ + DI EI NGPVE +FTVYED YK GVY+H+ G +GGH
Sbjct: 227 YKTDKHFGSRAYSISKNVRDIQGEIMTNGPVEGAFTVYEDLILYKDGVYEHVHGKELGGH 286
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
A+++IGWG D YW++AN WN WG +G+FKI RG + CGIE + AGLP
Sbjct: 287 AIRIIGWGVEKD-TPYWLIANSWNTDWGNNGFFKILRGKDHCGIESSISAGLPK 339
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/231 (48%), Positives = 146/231 (63%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 104 QGSCGSCWAFGAVEAMTDRICTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 162
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
+ H G+V+ + C PY + C H PG C TPKCV++C ++ ++
Sbjct: 163 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRLPCSGDTKTPKCVKECESGYKVPYK 221
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY Y + + I AE+YKNGPVE +FTVY D YKSGVYKH+TGD +GGHA+
Sbjct: 222 QDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVYKHVTGDALGGHAI 281
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VAG P
Sbjct: 282 KIMGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 331
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 107/236 (45%), Positives = 146/236 (61%), Gaps = 21/236 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIH------FGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
QG CGSCWAFGA EA+SDR CI + + LS +DLL+CC CG GC+GG+P
Sbjct: 118 QGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVMVRLSADDLLSCCRD-CGMGCNGGFPSQ 176
Query: 69 AWRYFVHHGVVTE------------ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
AW ++ H G+V+ E P + P CE PTPKC C ++ ++
Sbjct: 177 AWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHHVNGTRPPCEGDAPTPKCKNVCQEEYKV 236
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ KHY++ Y ++S+ + I E+ +GPVE F VY DF YKSGVY+H++G ++GG
Sbjct: 237 PYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPTYKSGVYQHVSGALLGG 296
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
HA+KL+GWG +DG YW+ AN WN WG G+FKI RG N CGIE D+VAG+P +
Sbjct: 297 HAIKLMGWG-EEDGVPYWLCANSWNTDWGEGGFFKILRGKNHCGIESDIVAGIPQN 351
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/232 (47%), Positives = 148/232 (63%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH +S+ ++ DLL CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLTCC-MSCGMGCNGGYPSAAWDF 159
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLW 117
+ G+V+ C PY + C H P C TP+C+ KC +
Sbjct: 160 WTKEGLVSGGLYDSHIGCRPYTIAP-CEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSY 218
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ KH+ ++Y + SD E I +EI+KNGPVE +F VYEDF YKSGVY+H++G +GGHA
Sbjct: 219 KEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHA 278
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+K++GWG +DG YW+ AN WN WG +G+FK RGS+ CGIE +VVAG+P
Sbjct: 279 IKILGWGV-EDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAGIP 329
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 115/246 (46%), Positives = 151/246 (61%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+T+ + + Q CGSCWAFGAVEA+SDR CI LS +L++CC CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P SAW Y+ + G+VT + C PY + C H P C+ TP C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHNTLGPLPVCDGDVETPPCK 222
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
R C N + N K Y YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341
Query: 227 GLPSSK 232
G+P K
Sbjct: 342 GIPKIK 347
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 115/246 (46%), Positives = 151/246 (61%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+T+ + + Q CGSCWAFGAVEA+SDR CI LS +L++CC CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P SAW Y+ + G+VT + C PY + C H P C+ TP C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCK 222
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
R C N + N K Y YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341
Query: 227 GLPSSK 232
G+P K
Sbjct: 342 GIPKIK 347
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/237 (46%), Positives = 151/237 (63%), Gaps = 24/237 (10%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
++ + Q CGSCWAFGA E +SDR CI +S D+L+CCG CG GC GGY
Sbjct: 111 LKFIRNQASCGSCWAFGAAEVISDRVCIQSNGTQQPIISAEDILSCCGSTCGKGCQGGYT 170
Query: 67 ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPA----YPTPKCVRKCVKKNQL 116
I A +Y+++ GVVT C PY S P C+ + + TP C C +K
Sbjct: 171 IEAMKYWMNSGVVTGGDYNGAGCMPY------SFPPCKKSPCVEFSTPSCKTTCQEKYTT 224
Query: 117 --WRNSKHYSISAYRINSDPE---DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
++N KH++ SAY++++ I EIY NGPVE S+ V+EDF YKSGVY H++G+
Sbjct: 225 ADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHNGPVEASYRVFEDFYQYKSGVYHHVSGN 284
Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++GGHAVK+IGWGT ++G DYW++AN W S+G G+FKI+RG+NEC IE ++VAGL
Sbjct: 285 LVGGHAVKIIGWGT-ENGVDYWLVANSWGTSFGEKGFFKIRRGTNECQIESNIVAGL 340
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 113/231 (48%), Positives = 144/231 (62%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH G +S+ ++ DLL CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSGSKVSVEISSEDLLTCCD-ACGMGCNGGYPSAAWDF 159
Query: 73 FVHHGVVTEE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWR 118
+ G+V+ C PY S P C TPKCV C + +
Sbjct: 160 WTKEGLVSGGLYNSHIGCRPYTIPPCEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYT 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY S+Y + + E I AEI +NGPVE +F VYEDF YKSGVY+H TG +GGHA+
Sbjct: 220 KDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG +DG YW+ AN WN WG +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 280 KVLGWG-EEDGVPYWLCANSWNTDWGENGFFKILRGSDHCGIESEIVAGIP 329
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/244 (45%), Positives = 154/244 (63%), Gaps = 20/244 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR CIH +N+ LS +DL++CC + CG G
Sbjct: 90 WPNCPTIREIRDQGSCGSCWAFGAVEAMSDRVCIHSKGAVNVRLSADDLVSCC-YSCGMG 148
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH--PGCEPA-----YPTPKC 106
C+GG+P +AW Y+V+ G+V+ + C PY + C H G P TP C
Sbjct: 149 CNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPY-EIAPCEHHVNGTRPPCTGDDNKTPSC 207
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
++C K N ++ K++ AY I+S+ + I EI NGPVE +F VYED YK GVY
Sbjct: 208 KQQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVY 267
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+H+ G+ +GGHA++++GWGT + G YW++AN WN WG +G FKI RG + CGIE +V
Sbjct: 268 QHVKGEALGGHAIRILGWGT-EKGTPYWLIANSWNSDWGDNGTFKILRGEDHCGIESSIV 326
Query: 226 AGLP 229
AG+P
Sbjct: 327 AGIP 330
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 115/246 (46%), Positives = 151/246 (61%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+T+ + + Q CGSCWAFGAVEA+SDR CI LS +L++CC CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P SAW Y+ + G+VT + C PY + C H P C+ TP C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCK 222
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
R C N + N K Y YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341
Query: 227 GLPSSK 232
G+P K
Sbjct: 342 GIPKIK 347
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 211 bits (537), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/235 (47%), Positives = 142/235 (60%), Gaps = 23/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A E +SDR CI + +N +S DLL+CC CGDGCDGGYP+ AWRY
Sbjct: 100 QSDCGSCWAVAAAETISDRLCIASNGSINTFVSAEDLLSCCTS-CGDGCDGGYPLQAWRY 158
Query: 73 FVHHGVVT-------EECDPYFDS------TGCSHPGCEPAY--PTPKCVRKCVKKNQL- 116
+V G+V+ C PY + G + P C PA TP+C C K+
Sbjct: 159 WVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKC-PAQEEATPECASHCTSKSSYS 217
Query: 117 --WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ KHY +SAY + I EI ++GPVE F VY DF YKSG+Y H++G +G
Sbjct: 218 VAYEKDKHYGLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSGIYTHVSGQELG 277
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GHAVK++GWG ++G YW++AN WN +WG GYF+I RG NECGIE VVAG+P
Sbjct: 278 GHAVKILGWGV-ENGTKYWLVANSWNINWGEKGYFRILRGRNECGIESAVVAGIP 331
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 113/230 (49%), Positives = 141/230 (61%), Gaps = 17/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH N SLS DLL+CC CGDGCDGG+P AW +
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCK-DCGDGCDGGFPPMAWDF 166
Query: 73 FVHHGVVT----EE---CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
+ HG+VT EE C PY S G P YPTPKCV+ C ++
Sbjct: 167 WKTHGIVTGGSKEEPTGCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQK 226
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
K + ++Y ++ IM EI NGPVE +F V+EDF YKSG+Y H G +GGHA++
Sbjct: 227 DKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIR 286
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++GWG ++G YW++AN WN WG GY + RG NECGIEE+ AGLP
Sbjct: 287 ILGWG-EENGVPYWLIANSWNEDWGEKGYLRFLRGHNECGIEEEATAGLP 335
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 145/237 (61%), Gaps = 19/237 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A+SDR CI +++S D++ CC CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
F++ GVV+ + C PY C H G C PTP C RKC +++
Sbjct: 168 FIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMY 226
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
R K Y AY + + I +EI KNGPV SF VYEDF HYKSG+YKH G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
VK+IGWG +++ D+W++AN W+ WG GYF+I RGSN+CGIE + AG+ +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEGTIAAGIVDTESL 342
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 149/244 (61%), Gaps = 24/244 (9%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
+ + + Q CGSCWAFGAVEA+SDR CI H + +SLS +DLL+CC CG GC+GG
Sbjct: 119 QSIRNIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVSLSADDLLSCC-RSCGFGCNGG 177
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVR 108
P++AWRY+V G+VT Y ++GC P CE YPTPKC +
Sbjct: 178 DPLAAWRYWVKDGIVTGS--NYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK 235
Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
KC+ ++ + K Y SAY + D E I E+ +GP+E++F VYEDF +Y GVY
Sbjct: 236 KCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYV 295
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H G + GGHAVKL+GWG ++G YW AN WN WG DG+F+I RG +ECGIE VV
Sbjct: 296 HTGGKLGGGHAVKLVGWGI-ENGIPYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVG 354
Query: 227 GLPS 230
G+P
Sbjct: 355 GVPK 358
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 111/231 (48%), Positives = 141/231 (61%), Gaps = 20/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH + +++S D L CC +CG GC+GG P AW +
Sbjct: 107 QSTCGSCWAFGAVEAMSDRICIHSNATVKVNISAEDPLDCC-TICGMGCNGGMPAMAWLH 165
Query: 73 FVHHGVVTEECDPYFDSTGCSH--------------PGCEPAYPTPKCVRKCVKKNQLWR 118
+ +G+VT Y D+ GC P C P PTP C ++C + L
Sbjct: 166 WTVNGIVTG--GNYEDTNGCKAYSFAPCEHHVDGDLPPCGPTKPTPDCKKECDSGSSLTY 223
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ S Y I+ P+ I EI NGPVE SF+VYEDF YKSGVY+H+ G+ GGHA+
Sbjct: 224 QNDLTHGSNYGIDPYPKQIQTEIMTNGPVEASFSVYEDFLSYKSGVYQHLEGEYAGGHAI 283
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG +D YW++AN WN WG GYFKI RGSNECGIE +VAG+P
Sbjct: 284 KILGWGVEND-TPYWLVANSWNEDWGDKGYFKILRGSNECGIEGSIVAGIP 333
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 210 bits (535), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 115/241 (47%), Positives = 145/241 (60%), Gaps = 25/241 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA AVEA+SDR CI ++LS +DLL+CC CG GC GG P++AW+Y
Sbjct: 145 QSSCGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKY 203
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQ 115
+V G+VT Y + +GC P CE YPTPKCV+KC K +
Sbjct: 204 WVLRGIVTG--SEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGK 261
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ K+Y Y + S+ E I EI GPVE SF VY DF +Y G+YKH+ G + GG
Sbjct: 262 SYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGG 321
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
HAVK++GWG D G YW+ AN WN WG DGYF+I RG NECGIE ++AG+P K L
Sbjct: 322 HAVKVLGWGI-DQGVPYWLAANSWNTDWGEDGYFRILRGVNECGIESGIIAGIP--KQLA 378
Query: 236 K 236
K
Sbjct: 379 K 379
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 210 bits (534), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 116/234 (49%), Positives = 148/234 (63%), Gaps = 19/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI G++ LS +L+ACC CG GC+GG+P SAW Y
Sbjct: 114 QSSCGSCWAFGAVEAMSDRICIESKGLHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSY 172
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+VT + C PY + C H P C TPKC C N +
Sbjct: 173 WKRSGIVTGDLYNTTDGCQPY-EFPPCEHHVVGPRPSCGGDVETPKCKTTCQPGYNIPYN 231
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y + YR++S+ E IM E+ +GPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 232 KDKWYGKTVYRVHSNQEAIMKEVMDHGPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAV 291
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+L+GWG ++G YW++AN WN WG +GYFKI RG NECGIE DV AG+P K
Sbjct: 292 RLLGWG-EENGVPYWLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAGIPKLK 344
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 210 bits (534), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 111/229 (48%), Positives = 144/229 (62%), Gaps = 18/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCW+FGAVEA+SDR CI G N+ LS DLL CC CG GC+GG AW Y
Sbjct: 111 QSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDY 169
Query: 73 FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C+PY T +P C Y TP+C + C +K + +
Sbjct: 170 WVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYT 229
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+
Sbjct: 230 QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAI 289
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++IGWG ++ YW++AN WN WG +GYF+I RG +EC IE +V+AG
Sbjct: 290 RIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 210 bits (534), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 111/231 (48%), Positives = 145/231 (62%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR+C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 105 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSSEDLLSCCP-ICGLGCNGGIPSLAWEY 163
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
+ H G+V T+ C PY + C H PG C TPKC + C N +++
Sbjct: 164 WKHFGIVSGGNYNSTQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKCQKNCENGYNVMYK 222
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y Y +++ + I AE+YKNGPVE +FTVY D YKSGVYKHI GD +GGHA+
Sbjct: 223 KDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAFTVYADLLAYKSGVYKHIQGDALGGHAI 282
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG +D + YW++AN WN WG +G+FKI RG N CGIE ++AG P
Sbjct: 283 KILGWGVENDNK-YWLVANSWNTDWGDNGFFKILRGENHCGIEGSIIAGEP 332
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 210 bits (534), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 138/230 (60%), Gaps = 12/230 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI +S +DLL+CCG CG+GC+GG
Sbjct: 100 KSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 159
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
YPI A R++ GVVT C PY + C+ C P TP C C +
Sbjct: 160 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTPACSLSCQSGYTTAY 217
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KH+ SAY + I EI NGPVE +FTVYEDF YKSGVYKH G +GGHA
Sbjct: 218 AKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 277
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWGT + G YW++AN W SWG G+FKI RG ++CGIE VVAG
Sbjct: 278 IKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAG 326
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 210 bits (534), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 108/233 (46%), Positives = 144/233 (61%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH N S +L++CC CG GC+GG+P +AW Y
Sbjct: 111 QGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLVSCC-RTCGFGCNGGFPGAAWHY 169
Query: 73 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKCVRKCVKKNQL-W 117
+ G+V+ PY GC + C+ TP CV+KC ++ +
Sbjct: 170 WKTKGIVSG--GPYGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPY 227
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
H SAY + +D + I EIY NGPVE +FTVYEDF Y++GVYKH+ G +GGHA
Sbjct: 228 AQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHA 287
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
++++GWG + YW++AN WN WG+DG+FKI RGS+ECGIE + AGLP+
Sbjct: 288 IRILGWGVQNGEIPYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAGLPA 340
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 209 bits (533), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 138/230 (60%), Gaps = 12/230 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI +S +DLL+CCG CG+GC+GG
Sbjct: 100 KSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 159
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
YPI A R++ GVVT C PY + C+ C P TP C C +
Sbjct: 160 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGSC-PESKTPACSLSCQPGYTTAY 217
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KH+ SAY + I EI NGPVE +FTVYEDF YKSGVYKH G +GGHA
Sbjct: 218 AKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 277
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWGT + G YW++AN W SWG G+FKI RG ++CGIE VVAG
Sbjct: 278 IKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAG 326
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 209 bits (533), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 112/229 (48%), Positives = 143/229 (62%), Gaps = 18/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI G N+ LS DLL+CC CG GC+GG AW Y
Sbjct: 111 QSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGILGPAWDY 169
Query: 73 FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C+PY T +P C Y TP+C + C KK + +
Sbjct: 170 WVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYT 229
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH S+Y + +D + I EI K GPVE FTVYEDF +YKSG+YKHITG+ +GGHA+
Sbjct: 230 QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAI 289
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++IGWG ++ YW++AN WN WG +GYF+I RG +EC IE +V AG
Sbjct: 290 RIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 337
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 209 bits (533), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 112/229 (48%), Positives = 145/229 (63%), Gaps = 18/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI G N+ LS DLL+CC CG GC+GG AW +
Sbjct: 116 QSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGILGPAWDF 174
Query: 73 FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C+PY T +P C Y TP+C + C KK + +
Sbjct: 175 WVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYT 234
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+
Sbjct: 235 QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAI 294
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++IGWG ++ YW++AN WN WG +GYF+I RG +EC IE +V+AG
Sbjct: 295 RIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIAG 342
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 209 bits (533), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 139/236 (58%), Gaps = 22/236 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
Q HCGSCWA A EA+SDR CI + +N LS D+L CC F CGDGC+GGYPI AW
Sbjct: 95 QSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDILTCCTGKFNCGDGCEGGYPIQAW 154
Query: 71 RYFVHHGVVT-------EECDPYFDST------GCSHPGCEPAYP-TPKCVRKCVKKNQL 116
RY+V +G+VT C PY + G + P C TPKC C N
Sbjct: 155 RYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKCEHHCTGNNSY 214
Query: 117 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ KH+ SAY I + I EI +GPVEV F VYEDF YK+G+Y H+ G +
Sbjct: 215 PIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGEL 274
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHAVK++GWG D+G YW+ AN WN WG GYF+I RG +ECGIE VAG+P
Sbjct: 275 GGHAVKMLGWGV-DNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMP 329
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 209 bits (533), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 111/231 (48%), Positives = 142/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH + + LS +LL+CC CGDGC GG P SAW Y
Sbjct: 104 QGSCGSCWAFGAVEAMSDRLCIHSNGKLQVHLSAENLLSCCD-SCGDGCLGGSPESAWEY 162
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY C H P C TPKC ++C K + +
Sbjct: 163 WHKFGIVSGGNYGSKQGCQPY-SIAPCEHSIHGSSPACGGVTDTPKCKKQCEKGYSIPYD 221
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ +Y Y I +D + I AEI KNGP+ SF VYED YK GVY+H+ G+ +GGH +
Sbjct: 222 KAFYYGQPGYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGVYQHVAGEFLGGHVI 281
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K+ GWG ++G YW++AN WN WG +G+FKI RG +ECGIE DV AGLP
Sbjct: 282 KIFGWGI-ENGTPYWLVANSWNTDWGNNGFFKIPRGKDECGIEIDVSAGLP 331
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 209 bits (532), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 108/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 108 QGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 166
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ H G+V+ E C PY + C H P C TP+C+ KC + +
Sbjct: 167 WTHKGIVSGGSYGSKEGCRPY-EVEPCEHHVNGTRPPCHSG-STPRCMHKCESGYSVDYA 224
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ AY +N +P DI EI NGPVE +FTVYED YK+GVY+H+ G +GGHA+
Sbjct: 225 KDKHFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGRQLGGHAI 284
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG D+ YW++ N WN WG +G+F+I RG + CGIE + AGLP
Sbjct: 285 RILGWGVWGDNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIESAISAGLP 336
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 209 bits (532), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 112/243 (46%), Positives = 147/243 (60%), Gaps = 19/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWA A EA+SDR C+ + + + LS +L+ACC CG G
Sbjct: 90 WANCPTIKEVRDQGSCGSCWAEAAAEAMSDRTCVASNGKVQVHLSSENLMACCE-TCGMG 148
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
C GG+P +AW Y+ G+VT + C PY + C H P C PTP+C
Sbjct: 149 CHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPY-EIAPCEHHINGSRPACGKIEPTPRCK 207
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C N + KHY+ SAY ++S + I EI NGPVE +FTVY DF HYKSGVY+
Sbjct: 208 KTCESGYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQ 267
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H +G +GGHAVK+IGWG + YW++AN WN WG G+FKI RG +ECGIE D+VA
Sbjct: 268 HESGAELGGHAVKMIGWGM-EGSTPYWLIANSWNSDWGDMGFFKILRGQDECGIERDIVA 326
Query: 227 GLP 229
G P
Sbjct: 327 GEP 329
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 209 bits (532), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 115/246 (46%), Positives = 149/246 (60%), Gaps = 25/246 (10%)
Query: 10 EILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPI 67
+I +++ GSCWA AVEA+SDR CI ++LS +DLL+CC CG GC GG P+
Sbjct: 6 DIYILKSSSGSCWAVAAVEAMSDRICIMSKGKKQVTLSADDLLSCCK-TCGFGCFGGEPM 64
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCV 111
+AW+Y+V G+VT Y + +GC P CE YPTPKCV+KC
Sbjct: 65 AAWKYWVLRGIVTG--SEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCD 122
Query: 112 KK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
K + ++ K+Y S Y + S+ E I EI GPVE SF VY DF +Y G+YKH+ G
Sbjct: 123 KNYGKSYKADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGGIYKHVAG 182
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+ GGHAVK++GWG D G YW+ AN WN WG DGYF+I RG NECGIE ++AG+P
Sbjct: 183 SMGGGHAVKVLGWGI-DQGVPYWLAANSWNTDWGEDGYFRILRGVNECGIESGIIAGIP- 240
Query: 231 SKNLVK 236
K L K
Sbjct: 241 -KQLAK 245
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 209 bits (532), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 113/242 (46%), Positives = 145/242 (59%), Gaps = 22/242 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI + + +SLS +DLL+CC CG GCDGG P++AW+Y
Sbjct: 102 QSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLLSCCK-SCGFGCDGGDPMAAWKY 160
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQ 115
+V G+VT + C PY C H P YPTPKC +KC + +
Sbjct: 161 WVKEGIVTGSNFTMKQGCKPY-PFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEK 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ K + +AY + D I EI +GPVEV+F VYEDF Y G+Y H G + GG
Sbjct: 220 TYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGG 279
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
HAVK++GWG + G YW++AN WN WG DG+F+I RG +ECGIE VV GLP
Sbjct: 280 HAVKMLGWGV-EQGVPYWLVANSWNTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTY 338
Query: 236 KE 237
K+
Sbjct: 339 KK 340
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 209 bits (532), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 105/237 (44%), Positives = 145/237 (61%), Gaps = 19/237 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A+SDR CI +++S D++ CC CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
F++ GVV+ + C PY C H G C PTP C RKC +++
Sbjct: 168 FIYDGVVSGGEYLTKDVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMY 226
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
R K Y AY + + I +EI +NGPV SF VYEDF HYKSG+YKH G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
VK+IGWG +++ D+W++AN W+ WG GYF+I RG+N+CGIE + AG+ +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 209 bits (532), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 110/231 (47%), Positives = 144/231 (62%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CI +S +S DLL CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDS-CGMGCNGGYPSAAWDF 159
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+VT C PY G P TP C KC + L++
Sbjct: 160 WTTDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYK 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ ++Y + S+ IMAE++KNGPVE +FTVYEDF YKSGVY+H++G +GGHA+
Sbjct: 220 EDKHFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW+ AN WN WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 280 KILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/243 (45%), Positives = 149/243 (61%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGA EA+SDR CIH +S +S DLL CC CG G
Sbjct: 89 WPNCPTLKEIRDQGSCGSCWAFGASEAISDRLCIHSNAKVSVEISAEDLLTCCD-SCGMG 147
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GGYP +AW ++ G+V+ C PY G P TP+C+
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEHHVNGSRPPCTGEGGDTPQCL 207
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+C +R KHY ++Y + SD +I EIYKNGPVE +FTVYEDF YKSGVY+
Sbjct: 208 SQCEAGYTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGVYQ 267
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G +GGHA+K++GWG ++G YW+ AN WN WG +G+FK RGS+ CGIE ++VA
Sbjct: 268 HVSGSAVGGHAIKVLGWG-EENGVPYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEIVA 326
Query: 227 GLP 229
G+P
Sbjct: 327 GIP 329
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/230 (47%), Positives = 137/230 (59%), Gaps = 12/230 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI +S +DLL+CCG CG+GC+GG
Sbjct: 136 KSIKLIRDQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 195
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
YPI A R++ GVVT C PY C+ C P TP C C +
Sbjct: 196 YPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCTSGNC-PESKTPSCSLSCQSGYTTAY 253
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KH+ SAY + I EI NGPVE +FTVYEDF YKSGVYKH G +GGHA
Sbjct: 254 AKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 313
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWGT + G YW++AN W SWG G+F+I RG ++CGIE VVAG
Sbjct: 314 IKIIGWGT-ESGSPYWLVANSWGNSWGESGFFRIFRGDDQCGIESAVVAG 362
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/234 (47%), Positives = 144/234 (61%), Gaps = 22/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 168
Query: 73 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL- 116
+V G+V+ PY S GC + P CE Y TP+C KC ++
Sbjct: 169 WVRKGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVD 226
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KH+ AY I+ + DI EI +GPVE +FTVYED YK GVY+H+ G +GGH
Sbjct: 227 YKTDKHFGSRAYSISKNVHDIQEEIMTHGPVEGAFTVYEDLILYKDGVYEHVHGKELGGH 286
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
A+++IGWG D YW++AN WN WG +G+FKI RG + CGIE + AGLP
Sbjct: 287 AIRIIGWGVEKD-IPYWLVANSWNTDWGNNGFFKILRGKDHCGIESSISAGLPK 339
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/244 (45%), Positives = 149/244 (61%), Gaps = 18/244 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGA EA+SDR CIH +S +S DLL CC CG G
Sbjct: 89 WPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISAQDLLTCCDG-CGMG 147
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GGYP +AW ++ G+VT C PY G P TP C
Sbjct: 148 CNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCD 207
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
C + ++ KH+ ++Y + S+ +DIM E+YKNGPVE +FTVYEDF YKSGVY+
Sbjct: 208 MSCEPGYSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQ 267
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G +GGHA+K++GWG ++G YW+ AN WN WG +GYFKI RG + CGIE ++VA
Sbjct: 268 HVSGPALGGHAIKILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVA 326
Query: 227 GLPS 230
G+P
Sbjct: 327 GIPQ 330
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/238 (46%), Positives = 141/238 (59%), Gaps = 17/238 (7%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGG 64
N E ++ + Q CGSCWAFGA EA+SDR CI G +S DLL CCG CG GC+GG
Sbjct: 87 NCESIKEVRDQSTCGSCWAFGAAEAMSDRLCIATGKQTRISTEDLLTCCGITCGMGCNGG 146
Query: 65 YPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-------PGCEPAYPTPKCVRKC 110
+P AW YF + G+VT + C PY C H C + PTP CV+ C
Sbjct: 147 FPSGAWNYFKNKGLVTGDLFGDNSWCRPY-TFPPCDHHVDDGKYGPCGDSQPTPACVKSC 205
Query: 111 VKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
++ + + + K SI +Y ++S E I EI GPVE SFTVYEDF YKSGVY+++
Sbjct: 206 TAQSGRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEASFTVYEDFLTYKSGVYQNVA 265
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
G +GGHAVK+IGWG + YW++ N WN WG +G FKI RGSN GIE + AG
Sbjct: 266 GANLGGHAVKIIGWGVEKN-VPYWLVVNSWNEGWGENGLFKILRGSNHVGIEGGIYAG 322
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 113/242 (46%), Positives = 145/242 (59%), Gaps = 22/242 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI + + +SLS +DLL+CC CG GCDGG P++AW+Y
Sbjct: 143 QSSCGSCWAFGAVEAMSDRICIASNGKIQVSLSADDLLSCCK-SCGFGCDGGDPMAAWKY 201
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQ 115
+V G+VT + C PY C H P YPTPKC +KC + +
Sbjct: 202 WVKEGIVTGSNFTMKQGCKPY-PFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEK 260
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ K + +AY + D I EI +GPVEV+F VYEDF Y G+Y H G + GG
Sbjct: 261 TYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGG 320
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
HAVK++GWG + G YW++AN WN WG DG+F+I RG +ECGIE VV GLP
Sbjct: 321 HAVKMLGWGV-EQGVPYWLVANSWNTDWGEDGFFRIIRGIDECGIESSVVGGLPKLNRTY 379
Query: 236 KE 237
K+
Sbjct: 380 KK 381
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/232 (46%), Positives = 145/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR C+ G ++ S DL++CC CG GC+GG+P +AW Y
Sbjct: 107 QGSCGSCWAFGAVEAMSDRVCVASGGKIHFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSY 165
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKK-NQLW 117
+V G+V+ C PY + C H P CE TPKCV+KC + N +
Sbjct: 166 WVRKGLVSGGPFGSNLGCQPYAIAP-CEHHVNGTRPSCEGEGGKTPKCVKKCQESYNVPY 224
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K + S+Y I I EI NGPVE +FTVYED HYK GVY+H+TG ++GGHA
Sbjct: 225 QKDKRFGASSYSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHA 284
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++++GWG ++G YW++AN WN WG +G+FKI RG + GIE + AGLP
Sbjct: 285 IRILGWGV-ENGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSISAGLP 335
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/237 (47%), Positives = 145/237 (61%), Gaps = 18/237 (7%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGG 64
+ + + Q CGSCWAFGAVEA+SDR CI G N+ LS DLL+CC CG GC+GG
Sbjct: 17 KSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCE-SCGLGCEGG 75
Query: 65 YPISAWRYFVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCV 111
AW Y+V G+VT C+PY T +P C Y TP+C + C
Sbjct: 76 ILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQ 135
Query: 112 KKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
KK + + KH S+Y + +D + I EI K GPVE FTVYEDF +YKSG+YKHITG
Sbjct: 136 KKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITG 195
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ +GGHA+++IGWG + YW++AN WN WG +GYF+I RG +EC IE +V AG
Sbjct: 196 ETLGGHAIRIIGWGVENKA-PYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 251
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/235 (49%), Positives = 143/235 (60%), Gaps = 23/235 (9%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
Q CGSCWA GAVEA++DR CI N +++S +DLL+CC CG GCDG P +AW
Sbjct: 1 FQSSCGSCWAVGAVEAMTDRICIASKGNQKVTISADDLLSCCD-ECGFGCDGRDPYAAWS 59
Query: 72 YFVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQ 115
Y+V +G+VT Y +GC +P CE YPT C KC
Sbjct: 60 YWVSNGIVTGS--NYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYS 117
Query: 116 LWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ NS KHY S Y + D I EI NGPVEV+F VYEDF HY SG+YKH TGD +G
Sbjct: 118 ISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLG 177
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GHAVK++GWGT ++G DYWI AN WN WG +G+F+I RG +EC IE VVAG P
Sbjct: 178 GHAVKMLGWGT-ENGTDYWICANSWNSDWGENGFFRILRGVDECEIESGVVAGEP 231
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/233 (50%), Positives = 148/233 (63%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A EA SDRFCI + +N LS D+L+CC CG GCDGGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCDGGYPINAWKY 161
Query: 73 FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCV--KKNQL 116
V G T C PY ++ G + P C + Y TP CV KC K N
Sbjct: 162 LVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPACVNKCTNTKYNTA 221
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+++ KH+ +AY + I AEI +GPVE +FTVYEDF YKSGVY H TG +GGH
Sbjct: 222 YKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGH 281
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWGT D+G YW++AN WN +WG +GYF+I RG+NECGIE VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 111/232 (47%), Positives = 148/232 (63%), Gaps = 25/232 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q +CGSCWA AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++
Sbjct: 120 QSNCGSCWAIAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWW 178
Query: 74 VHHGVVTEECDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQL----WRNS 120
V G+ TE+C PY FD CSH G YP TPKC C ++N++ ++ S
Sbjct: 179 VWVGIATEDCQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTC-ERNEMDLVKYKGS 235
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
YS+ + ++M E+ NGP+E++ VY DF YKSGVYKH+ GD +GGHAVKL
Sbjct: 236 TSYSVKGEK------ELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGDFLGGHAVKL 289
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+GWGT DG YW +AN WN WG GYF I+RG+NEC IE VAG+P+ +
Sbjct: 290 VGWGT-QDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIPAQE 340
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 142/231 (61%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N S +DL+ CC CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVTCC-HTCGFGCNGGFPGAAWSY 168
Query: 73 FVHHGVV-------TEECDPYFDSTGCSHPGCEPAYP-----TPKCVRKCVKKNQL-WRN 119
+ G+V TE C PY + C H P P TP C +C + +
Sbjct: 169 WTTRGIVSGGSYNSTEGCRPY-EVEPCEHHVDGPRPPCHSGSTPHCKHQCQPNYSVDYEK 227
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KH+ S+Y IN +P +I EI NGPVE +FTVYED YK+GVY+H+ G +GGHA++
Sbjct: 228 DKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVYQHVHGKQLGGHAIR 287
Query: 180 LIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+IGWG + + YW++AN WN WG +G+F+I RG + CGIE + AGLP
Sbjct: 288 IIGWGVWGESKVPYWLIANSWNTDWGDNGFFRILRGKDHCGIESQISAGLP 338
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 207 bits (528), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/230 (47%), Positives = 139/230 (60%), Gaps = 12/230 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI +S +DLL+CCG CG+GC+GG
Sbjct: 100 KSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 159
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
YPI A R++ GVVT C PY + C+ C P TP C C + +
Sbjct: 160 YPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CTSGNC-PESKTPACSLSCQSGYSTAY 217
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KH+ SAY + I EI NGPVE +FTVYEDF YKSGVYKH G +GGHA
Sbjct: 218 AKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYKSGVYKHTAGKALGGHA 277
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWGT + G YW++AN W +WG G+FKI RG ++CGIE VVAG
Sbjct: 278 IKIIGWGT-ESGSPYWLVANSWGTNWGESGFFKILRGDDQCGIEGAVVAG 326
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/234 (47%), Positives = 146/234 (62%), Gaps = 18/234 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ + Q CGSCWAFGAVEA+SDR CI G N+ LS DLL+CC CG GC+GG
Sbjct: 105 IATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCES-CGLGCEGGIL 163
Query: 67 ISAWRYFVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKK 113
AW ++V G+VT C+PY T +P C Y TP+C + C KK
Sbjct: 164 GPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKK 223
Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ + KH S+Y + +D + I EI K GPVE SFTVYEDF +YKSG+YKHITG+
Sbjct: 224 YKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEA 283
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
+GGHA+++IGWG ++ YW++AN WN WG +GYF+I RG +EC IE +V+A
Sbjct: 284 LGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGENGYFRIVRGRDECFIESEVIA 336
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 114/246 (46%), Positives = 150/246 (60%), Gaps = 19/246 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+T+ + + Q CGS WAFGAVEA+SDR CI LS +L++CC CG G
Sbjct: 105 WTHCPSISEIRDQSSCGSYWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMG 163
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P SAW Y+ + G+VT + C PY + C H P C+ TP C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCK 222
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
R C N + N K Y YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+
Sbjct: 223 RTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQ 282
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYFKI RG NECGIE DV A
Sbjct: 283 HVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNA 341
Query: 227 GLPSSK 232
G+P K
Sbjct: 342 GIPKIK 347
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/231 (47%), Positives = 144/231 (62%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR+C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 106 QGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 164
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
+ H G+V+ + C PY + C H PG C TPKC + C N +R
Sbjct: 165 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYR 223
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y + ++S + I AE++KNGPVE +FTVY D +YK+GVYKH GD +GGHAV
Sbjct: 224 KDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAV 283
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VAG P
Sbjct: 284 KILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 333
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 109/231 (47%), Positives = 144/231 (62%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR+C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 106 QGSCGSCWAFGAVEAMTDRYCTYSNGTQHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 164
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
+ H G+V+ + C PY + C H PG C TPKC + C N +R
Sbjct: 165 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYR 223
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y + ++S + I AE++KNGPVE +FTVY D +YK+GVYKH GD +GGHAV
Sbjct: 224 KDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAV 283
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VAG P
Sbjct: 284 KILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 333
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 207 bits (527), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/231 (48%), Positives = 146/231 (63%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVE +SDR CIH N S +L++CC LCG GC+GG+P +A++Y
Sbjct: 102 QGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSENLVSCC-HLCGFGCNGGFPGAAFKY 160
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+VH G+V T+ C PY + C H P C TPKCV++C + +
Sbjct: 161 WVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVPGPRPKCSEGGGTPKCVKRCENGYTVDYE 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ H+ AY I D + I EI KNGPVE +FTVY DF HYKSGVY+H G +GGHA+
Sbjct: 220 SDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++G YW+ AN WN WG +G FKI RGS+ CGIE ++ AGLP
Sbjct: 280 RILGWG-EENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLP 329
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 207 bits (526), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 114/237 (48%), Positives = 143/237 (60%), Gaps = 26/237 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA AVEA+SDR CI + LS +DLL+CC CG GC GG P++AW+Y
Sbjct: 143 QSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKY 201
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQ 115
+V G+VT Y + +GC P CE YPTPKC ++C K +
Sbjct: 202 WVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTK 259
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ K+Y AY + +D E I EI GPVE SF VY DF HY SG+YKH+ G V GG
Sbjct: 260 SYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGG 319
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGAD---GYFKIKRGSNECGIEEDVVAGLP 229
HAVK++GWG D G YW+ AN WN WG D GYF+I RG++ECGIE +VAG+P
Sbjct: 320 HAVKILGWGI-DQGVSYWLAANSWNNDWGEDVFSGYFRILRGADECGIESGIVAGIP 375
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 207 bits (526), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 113/223 (50%), Positives = 144/223 (64%), Gaps = 19/223 (8%)
Query: 24 FGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT- 80
FGAVE++SDR CIH G + L+ +D+L+CC + CG GC+GG+P +AW Y+V G+VT
Sbjct: 120 FGAVESMSDRHCIHSGAKNIVHLAADDVLSCC-WGCGSGCNGGFPAAAWSYWVDKGIVTG 178
Query: 81 ------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 127
E C PY C H C PTPKCVR C K N +++ KHY S+
Sbjct: 179 GNYDTDEGCMPY-PVPSCDHHVNGTLGPCGQDPPTPKCVRLCRKGYNVDFKDDKHYGKSS 237
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + S+ I EI KNGPVE +FTVY DF YKSGVYK + D +GGHA++++GWG +
Sbjct: 238 YSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVEN 297
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
D YW++AN WN WG GYFKI RGSNECGIEED+VAG+P
Sbjct: 298 D-VPYWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAGIPK 339
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 207 bits (526), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 141/212 (66%), Gaps = 18/212 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFK 210
+++GWG ++G YW++AN WN WG +G+FK
Sbjct: 281 RILGWGV-ENGTPYWLVANSWNTDWGDNGFFK 311
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 207 bits (526), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 106/233 (45%), Positives = 138/233 (59%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A A+SDR CIH M L+ D L+CC + CG GC GGYP AW Y
Sbjct: 108 QASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-CGQGCRGGYPPKAWDY 166
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
++ G+VT C P+ T C H G YPTP C R C N+
Sbjct: 167 WMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYPTPPCARACQTGYNKT 225
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ K Y S+Y + IM EI KNGPVEV+F +++DF Y+SG+Y H+ G +G H
Sbjct: 226 YEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRH 285
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
AV++IGWG ++G +YW++AN WN WG +GYF++ RG NECGIE +VVAG+P
Sbjct: 286 AVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMP 337
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 207 bits (526), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 110/237 (46%), Positives = 143/237 (60%), Gaps = 20/237 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CI ++ + D+L+CC CG+GC+GGYP++A Y
Sbjct: 116 QGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVLSCC-LTCGNGCNGGYPLAAMEY 174
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVK--KNQLW 117
FV G+VT + C PY C H P C TPKC +C+ + +
Sbjct: 175 FVTRGLVTGGLYGTKDTCQPY-TLEACEHHVPGDRPPCTEGGGTPKCSHQCIPDYTTKAY 233
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
++ K + AY + +D I EI GPVE +FTVY DF YKSGVY+H +G +GGHA
Sbjct: 234 KDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYSDFPSYKSGVYRHTSGSELGGHA 293
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
+K+IGWGT + G+DYW++ N WN WG G FKI RGSNECGIE +VVA + L
Sbjct: 294 IKIIGWGT-EGGDDYWLINNSWNSDWGDKGTFKILRGSNECGIEGEVVAATVDASTL 349
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 206 bits (525), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/223 (48%), Positives = 135/223 (60%), Gaps = 22/223 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A E +SDR CI LS+S +D+ ACCG +CG+GC+GGYPI AWR+
Sbjct: 119 QSSCGSCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRH 178
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKNQL 116
+V G VT Y D TGC +P CE P+ YPT KC R C L
Sbjct: 179 YVKKGYVTG--GSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYAL 236
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ H+ SAY ++ +I EI +GPVEV+FTVYEDF HY GVY H G +GG
Sbjct: 237 TYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGG 296
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 218
HAVK++GWG D+G YW+ AN WN WG +GYF+I RG NEC
Sbjct: 297 HAVKMLGWGV-DNGTPYWLCANSWNEDWGENGYFRIIRGVNEC 338
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 206 bits (525), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CI G ++ S DL++CC CG GC+GG+P +AW Y
Sbjct: 105 QGSCGSCWAFGAVEAMSDRVCIASGGKIHFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSY 163
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKKNQL-W 117
+VH G+V+ C PY + C H P CE TPKCV+KC + +
Sbjct: 164 WVHKGLVSGGPFGSNLGCQPYAIAP-CEHHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPY 222
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K Y +Y I + I EI NGPVE +FTVYED HYK GVY+H+TG ++GGHA
Sbjct: 223 AKDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHA 282
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++++GWG ++ + YW++AN WN WG +G+FKI RG + GIE + AGLP
Sbjct: 283 IRILGWGVENNTK-YWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLP 333
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 206 bits (525), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 110/231 (47%), Positives = 144/231 (62%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH +S +S DLL CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRVCIHSDAKVSVEISSQDLLTCCDS-CGMGCNGGYPSAAWDF 159
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+VT C PY G P TP C KC + ++
Sbjct: 160 WATEGLVTGGLYNSHIGCRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYK 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ ++Y + S+ IMAE++KNGPVE +FTVYEDF YKSGVY+H++G +GGHA+
Sbjct: 220 QDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW+ AN WN WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 280 KILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 206 bits (524), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 110/232 (47%), Positives = 145/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++ +S DL++CC CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRICIHSEGKVHFRVSSEDLVSCC-HTCGFGCNGGFPGAAWSY 168
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLW 117
+V G+V+ + C PY + C H P CE TPKCV+KC N +
Sbjct: 169 WVRKGLVSGGPFGSDQGCQPYAIAP-CEHHVNGSRPSCEGEGGKTPKCVKKCQASYNVPY 227
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K Y S+Y I + + I EI NGPVE +FTVYED +YK GVY H+ G ++GGHA
Sbjct: 228 AKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHA 287
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++++GWG +DG YW++AN WN WG +G+FKI RG + GIE + AGLP
Sbjct: 288 IRILGWGV-EDGTKYWLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAGLP 338
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 206 bits (524), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 113/252 (44%), Positives = 151/252 (59%), Gaps = 39/252 (15%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH +S LS DLL CC CG GC+GGYP SAW +
Sbjct: 101 QGSCGSCWAFGASEAMSDRVCIHSNAKVSVELSAQDLLTCCNS-CGMGCNGGYPSSAWNF 159
Query: 73 FVHHGVVTE-------------------ECDPYFDSTGC--------------SHPGCE- 98
+V G+V+ D F S GC S P C
Sbjct: 160 WVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCEHHVNGSRPSCSG 219
Query: 99 PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 157
TP+C+ +C + ++ KH+ ++Y ++S+ ++I EIYKNGPVE +FTVYEDF
Sbjct: 220 EGGDTPECIFRCEAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDF 279
Query: 158 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 217
YKSGVY+H++G +GGHA+K++GWG ++G YW+ AN WN WG +G+FKI RG++
Sbjct: 280 VLYKSGVYQHVSGSALGGHAIKMLGWG-EENGVPYWLCANSWNTDWGDNGFFKILRGADH 338
Query: 218 CGIEEDVVAGLP 229
CGIE ++VAG P
Sbjct: 339 CGIESEIVAGNP 350
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 206 bits (524), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 108/235 (45%), Positives = 145/235 (61%), Gaps = 17/235 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH + +S DL++CCG+ CG GC GG+P +AW +
Sbjct: 102 QSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPTAWDF 160
Query: 73 FVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRN 119
+ G+VT + +P + CSH G + Y TP CV+KC + +
Sbjct: 161 WQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYAT 220
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
K + Y + + IM EI NGPVE +F VYEDF YKSGVY H G ++GGHA++
Sbjct: 221 DKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIR 280
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
++GWG ++G YW++AN WN WG DGYFK+ RG NECGIE++V AGLP ++
Sbjct: 281 ILGWG-EENGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPELSSI 334
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 206 bits (524), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 107/231 (46%), Positives = 141/231 (61%), Gaps = 13/231 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI + +SV D+L+CCG CG GC GG
Sbjct: 111 KSIKLIRNQATCGSCWAFGAAEIISDRICIQSNATQTPIISVEDILSCCGVSCGKGCQGG 170
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK--KNQL 116
Y I A R++ G VT C PY C C TP C C K
Sbjct: 171 YSIEALRFWKSSGAVTGGDYNGAGCMPY-SFAPCKKDSCAQG-TTPSCKTTCQSSYKTAE 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KH+ +AY+I + I EIY NGPVE SF VYEDF YKSGVY++ +G ++GGH
Sbjct: 229 YTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKSGVYQYTSGKLVGGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
AVK+IGWGT ++G DYW++AN W ++G G+FK++RG+NE GIE +VVAG
Sbjct: 289 AVKIIGWGT-ENGVDYWLIANSWGTTFGDSGFFKMRRGTNEVGIEGNVVAG 338
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 206 bits (524), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 109/243 (44%), Positives = 149/243 (61%), Gaps = 19/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR CIH ++ +S DL++CC CG G
Sbjct: 93 WPNCPTIREIRDQGSCGSCWAFGAVEAMSDRICIHSKGKVHFRVSAEDLVSCC-HTCGFG 151
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDS------TGCSHPGCEPAYPTPKCV 107
C+GG+P +AW Y+V G+V+ + C PY S G P C TPKCV
Sbjct: 152 CNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGP-CNGEGKTPKCV 210
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+KC N + K + S+Y I S + I E++ NGPVE +FTVYED +YK GVY+
Sbjct: 211 KKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAFTVYEDLLNYKEGVYQ 270
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H G ++GGHA++++GWG +D + +W++AN WN WG +GYFKI RGS+ GIE + A
Sbjct: 271 HTAGKMLGGHAIRILGWGVENDTK-FWLIANSWNSDWGDNGYFKILRGSDHLGIESSIAA 329
Query: 227 GLP 229
GLP
Sbjct: 330 GLP 332
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 206 bits (523), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 107/243 (44%), Positives = 148/243 (60%), Gaps = 17/243 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+++ + + ++ Q CGSCWAFGA EA+SDR CIH M +++S DLL CC CG G
Sbjct: 95 WSHCDSIHLIRDQSTCGSCWAFGATEAMSDRICIHSKGKMQVNISAEDLLDCCD-TCGHG 153
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDS-----TGCSHPGCEPAYPTPKCVR 108
C GG+P +AW ++ G+V+ + C PY + T C P C P TP+CV
Sbjct: 154 CKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPNCIPIVHTPECVH 213
Query: 109 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C K ++ ++ KH+ Y I+ D + I EI+ NGPVE F VY DF YKSGVY+
Sbjct: 214 HCRKGYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQR 273
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ D G HA++++GWGT ++G YW+ AN WN +WG GYFKI R +NECGIEE + AG
Sbjct: 274 HSNDGRGMHAIRILGWGT-ENGTPYWLAANSWNENWGDKGYFKILRRTNECGIEEHIYAG 332
Query: 228 LPS 230
+P
Sbjct: 333 IPK 335
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 206 bits (523), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 108/233 (46%), Positives = 141/233 (60%), Gaps = 12/233 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q CGSCWAFGA E +SDR CI +SV D+L+CCG CG GC GG
Sbjct: 107 KSIKLIRNQATCGSCWAFGAAEVISDRICIQSNGTQQPIISVEDILSCCGTTCGKGCQGG 166
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR 118
Y I A R++ +G VT C PY + P E PT K + +
Sbjct: 167 YSIEAMRFWKSNGAVTGGDYNGNGCMPYSFAPCQKSPCVESTTPTCKTTCQSSYTTANYT 226
Query: 119 NSKHYSISAYRI---NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
KHY SAYR+ N+ I EIY NGPVE S+ VYEDF YKSGVY +++G ++GG
Sbjct: 227 TDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYHYVSGKLVGG 286
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
HAVK+IGWGT +D DYW++AN W +G G+FKI+RG+NEC IE +VVAG+
Sbjct: 287 HAVKIIGWGTEND-VDYWLVANSWGIKFGEGGFFKIRRGTNECQIESNVVAGV 338
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 206 bits (523), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 111/232 (47%), Positives = 144/232 (62%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR+C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 104 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGIPSFAWEY 162
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
+ H G+V+ + C PY + C H PG C TPKC R C K+ ++
Sbjct: 163 WKHFGIVSGGNYNSSQGCLPY-EIPPCEHHVPGNRIPCNGETSTPKCHRSCRKEYTNSYK 221
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ K Y Y + E I AEI+KNGPVE +FTVY D YKSGVYKH G+ +GGHA+
Sbjct: 222 SDKKYGKHVYSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVYKHTEGEALGGHAI 281
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
K++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VAG PS
Sbjct: 282 KIMGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEPS 332
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/233 (49%), Positives = 147/233 (63%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A EA SDRFCI + +N LS D+L+CC CG GC+GGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKY 161
Query: 73 FVHHGVVTEE-------CDPYF-----DSTG-CSHPGCEP-AYPTPKCVRKCVKKNQ--L 116
V G T C PY ++ G + P C Y TP CV KC N
Sbjct: 162 LVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPACVNKCTNSNYNVA 221
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+++ KH+ +AY + I AEI +GPVE +FTVYEDF YKSGVY H TG+ +GGH
Sbjct: 222 YKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGVYVHTTGEELGGH 281
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWGT D+G YW++AN WN +WG +GYF+I RG+NECGIE VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 112/231 (48%), Positives = 145/231 (62%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVE +SDR CIH N S +L++CC LCG GC+GG+P +A++Y
Sbjct: 101 QGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCC-HLCGFGCNGGFPGAAFKY 159
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+VH G+V T+ C PY + C H P C TPKC + C K + +
Sbjct: 160 WVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVPGPRPKCSEGGGTPKCAKTCEKGYIVDYE 218
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ H+ AY I D + I EI KNGPVE +FTVY DF HYKSGVY+H G +GGHA+
Sbjct: 219 SDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAI 278
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++G YW+ AN WN WG +G FKI RGS+ CGIE ++ AGLP
Sbjct: 279 RVLGWG-EENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLP 328
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 142/230 (61%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWAFGAVEA+SDR CI +S DLL+CC +CG GC GG P AW +
Sbjct: 124 QSNCGSCWAFGAVEAISDRICIATDGRQKPHISSTDLLSCCK-ICGFGCQGGDPHQAWSF 182
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
+V +G+VT + C PY S G P PTP C + C ++ N
Sbjct: 183 WVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYN 242
Query: 120 S-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K+Y + AY +++ D+ E+ NGP+EV+F VYEDF YK+GVY+H TG V+GGHAV
Sbjct: 243 KDKYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAV 302
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+L+GWG ++G YW+LAN WN WG G+FKI RG NECGIE + VAGL
Sbjct: 303 RLLGWG-EENGVPYWLLANSWNTEWGDKGFFKIYRGRNECGIESEAVAGL 351
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/233 (49%), Positives = 147/233 (63%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A EA SDRFCI + +N LS D+L+CC CG GC+GGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKY 161
Query: 73 FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--L 116
V G T C PY ++ G + P C + Y TP CV KC N
Sbjct: 162 LVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIA 221
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+++ KH+ +AY + I AEI +GPVE +FTVYEDF YKSGVY H TG +GGH
Sbjct: 222 YKDDKHFGSTAYAVGKKVAQIQAEILAHGPVEAAFTVYEDFYQYKSGVYVHTTGQELGGH 281
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWGT D+G YW++AN WN +WG +GYF+I RG+NECGIE VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 110/231 (47%), Positives = 140/231 (60%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG CWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y
Sbjct: 112 QSRCGPCWAFAAVEAMSDRICIQSKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170
Query: 73 FVHHGVVTEE-------CDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C E Y TPKC +KC K + ++
Sbjct: 171 WVEEGIVTGSSKENHTGCQPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYK 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K+Y +Y + S + I EI +GPVE +FTVY DF +YKSG+YKH+ G V+GGHAV
Sbjct: 231 KDKYYGKLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAV 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++IGWG + YW++AN WN WG GYF+I RG + CGIE V AGLP
Sbjct: 291 RIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAGLP 340
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 109/231 (47%), Positives = 141/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR C + + S DLL+CC +CG GC GG P AW Y
Sbjct: 105 QGSCGSCWAFGAVEAMTDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCSGGMPRLAWEY 163
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
+ H G+V+ + C PY + C H PG C TPKC +KC + ++
Sbjct: 164 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYK 222
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y Y ++ D + I AE++KNGPVE +FTVY D YKSGVYKH GD +GGHAV
Sbjct: 223 QDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAV 282
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG +D + YW++AN WN WG +G+FKI RG + CGIE +V G P
Sbjct: 283 KILGWGVENDNK-YWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVTGEP 332
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 108/243 (44%), Positives = 150/243 (61%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGA EA+SDR CIH +S+ ++ DLL+CC CG G
Sbjct: 89 WPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCDS-CGMG 147
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCV 107
C+GGYP +AW ++ G+VT C PY G P TP+C
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEHHVNGTRPPCTGEEGDTPQCS 207
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+C ++ KH+ ++Y + S+ + IMAE+ KNGPVE +FTVYEDF YKSGVY+
Sbjct: 208 NQCETGYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQ 267
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G +GGHA+K++GWG + G YW+ AN WN WG +G+FKI RG + CGIE ++VA
Sbjct: 268 HVSGSAVGGHAIKVLGWG-EEGGTPYWLAANSWNTDWGENGFFKILRGKDHCGIESEMVA 326
Query: 227 GLP 229
G+P
Sbjct: 327 GVP 329
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 109/231 (47%), Positives = 144/231 (62%), Gaps = 23/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q +CGSCWA AVEA+SDR+C G+ + +S ++LL+CC F+CG GC GG P AW ++
Sbjct: 120 QSNCGSCWAIAAVEAISDRYCTFGGVPDRRMSTSNLLSCC-FICGLGCHGGIPTVAWLWW 178
Query: 74 VHHGVVTEECDPY-FDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ---LWRNSK 121
V G+ TE+C PY FD CSH G YP TPKC C + ++ S
Sbjct: 179 VWVGIATEDCQPYPFDP--CSHHGNSEKYPPCPSTIYDTPKCNTTCERSEMDLVKYKGST 236
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
YS+ + ++M E+ NGP+E++ VY DF YKSGVYKH+ G+ +GGHAVKL+
Sbjct: 237 SYSVKGEK------ELMIELMTNGPLELTMQVYSDFVGYKSGVYKHVLGEFLGGHAVKLV 290
Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GWGT DG YW +AN WN WG GYF I+RG+NEC IE VAG+P+ +
Sbjct: 291 GWGT-QDGVPYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIPAQE 340
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 111/230 (48%), Positives = 143/230 (62%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G S LS DL++CC CGDGC GG+P AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCKGGFPGQAWDY 170
Query: 73 FVHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT EE C PY F T +P C Y TP+C + C K + +
Sbjct: 171 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 231 QDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++IGWG + G+ YW++AN WN WG G F++ RG +EC IE VVAGL
Sbjct: 291 RIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/245 (46%), Positives = 148/245 (60%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
F +EH + V Q +CGSCWA AVEA+SDR+C G+ + +S ++LL+CC F
Sbjct: 107 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 165
Query: 56 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
+CG GC GG P AW ++V G+ TE C PY CSH G YP TPKC
Sbjct: 166 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 224
Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C K K+ ++Y + + E +M E+ NGP+EV+ VY DF YKSGVYKH
Sbjct: 225 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKH 281
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGSNECGIE VAG
Sbjct: 282 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 340
Query: 228 LPSSK 232
P+ +
Sbjct: 341 TPAQE 345
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/245 (46%), Positives = 148/245 (60%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
F +EH + V Q +CGSCWA AVEA+SDR+C G+ + +S ++LL+CC F
Sbjct: 102 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 160
Query: 56 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
+CG GC GG P AW ++V G+ TE C PY CSH G YP TPKC
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 219
Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C K K+ ++Y + + E +M E+ NGP+EV+ VY DF YKSGVYKH
Sbjct: 220 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKH 276
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGSNECGIE VAG
Sbjct: 277 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 335
Query: 228 LPSSK 232
P+ +
Sbjct: 336 TPAQE 340
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/244 (44%), Positives = 144/244 (59%), Gaps = 23/244 (9%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
N ++ + Q CGSCWAFGA EA++DR CI + ++S +DLL+CC CG GCD
Sbjct: 105 NCPSIKSIRDQSSCGSCWAFGAAEAMTDRICIASKGAIQFTVSADDLLSCCD-ECGFGCD 163
Query: 63 GGYPISAWRYFVHHGVVTEECDPYFDSTGCS----------------HPGCEPAYPTPKC 106
GG+P +AW Y+V G+V+ Y +GC HP + YPT C
Sbjct: 164 GGFPYAAWNYWVEKGIVSG--GSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTC 221
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
KC + N K Y AY + + + I EI +GPVEV++ VYEDF HY G+Y
Sbjct: 222 EHKCQSGYATAYTNDKRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKGIY 281
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
KH G +GGHAVK+IGWGT ++G YWI +N WN WG +G+F+I RG++ECGIE VV
Sbjct: 282 KHTAGSYLGGHAVKMIGWGT-ENGIPYWICSNSWNSDWGENGFFRILRGTDECGIESGVV 340
Query: 226 AGLP 229
AGLP
Sbjct: 341 AGLP 344
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 108/230 (46%), Positives = 143/230 (62%), Gaps = 17/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH + + LS +LL+CC CG GC GG +AW Y
Sbjct: 103 QGSCGSCWAFGAVEAMSDRICIHSNGKLQVHLSAENLLSCCDS-CGYGCLGGSAENAWEY 161
Query: 73 FVHHGVVT-------EECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRN 119
+ G+V+ + C PY S S P CE TPKC ++C K + + +
Sbjct: 162 WHKFGIVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACEGVRDTPKCKKQCEKGYGIPYGD 221
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
Y Y I +D + I AEI KNGP+ S VYED YK+GVY+H+ G+V+GGH +K
Sbjct: 222 DLCYGQPGYTIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGVYQHVAGEVLGGHVIK 281
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++GWG +D YW++AN WN WG +G+FKI RGS+ECGIE+ +VAG+P
Sbjct: 282 ILGWGVEND-TPYWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAGIP 330
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/245 (46%), Positives = 148/245 (60%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
F +EH + V Q +CGSCWA AVEA+SDR+C G+ + +S ++LL+CC F
Sbjct: 102 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 160
Query: 56 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
+CG GC GG P AW ++V G+ TE C PY CSH G YP TPKC
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 219
Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C K K+ ++Y + + E +M E+ NGP+EV+ VY DF YKSGVYKH
Sbjct: 220 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGVYKH 276
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGSNECGIE VAG
Sbjct: 277 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 335
Query: 228 LPSSK 232
P+ +
Sbjct: 336 TPAQE 340
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/231 (48%), Positives = 141/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGA EA +DR CI + LS DLL CC CG GC+GG+P AW +
Sbjct: 91 QSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSDQDLLTCCE-SCGFGCNGGWPSMAWSW 149
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
F GV T + C+ Y + C H P C PTP+CV KC + + ++
Sbjct: 150 FHSTGVTTGGEYGSKDWCNAY-EFPKCDHHVEGKYPPCGETQPTPECVEKCQEGYPVEYK 208
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ AY + S+ E I E+ NGP+EV F+VYEDF YKSG+Y+H+ G +GGHAV
Sbjct: 209 KDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMTYKSGIYQHVAGKYLGGHAV 268
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
KL+GWG +DG +YW +AN WN WG +GYF+I G NECGIE D VAG+P
Sbjct: 269 KLVGWGV-EDGVEYWKIANSWNEDWGENGYFRIIAGKNECGIESDGVAGIP 318
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 105/237 (44%), Positives = 140/237 (59%), Gaps = 18/237 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q CG+CWAFGAVEA+SDR CIH + +++S DLL CC + C GC GG P
Sbjct: 99 IHVIRDQSSCGACWAFGAVEAISDRICIHTKGSVQVNISAQDLLTCCDY-CRTGCKGGVP 157
Query: 67 ISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
AW ++ G+VT + C PY + +TG P P P C R+C K
Sbjct: 158 SYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPINDLSPMPPCKRECRKS 217
Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ + KHY Y ++ D I EI+KNGPVE F VY DF YKSGVY+ +
Sbjct: 218 YGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFYSYKSGVYQAHSRVR 277
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G HA++++GWGT ++G YW+ AN W WG GYFKI+RG+NECGIEED+ AG+P
Sbjct: 278 CGSHAIRILGWGT-ENGVPYWLAANSWTEHWGDKGYFKIRRGNNECGIEEDINAGIP 333
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 106/233 (45%), Positives = 138/233 (59%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A A+SDR CIH M L+ D L+CC + CG GC GGYP AW Y
Sbjct: 16 QASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-CGQGCRGGYPPKAWDY 74
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
++ G+VT C P+ T C H G YPTP C R C N+
Sbjct: 75 WMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYPTPPCARACQTGYNKT 133
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ K Y S+Y + IM EI KNGPVEV+F +++DF Y+SG+Y H+ G +G H
Sbjct: 134 YEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRH 193
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
AV++IGWG ++G +YW++AN WN WG +GYF++ RG NECGIE +VVAG+P
Sbjct: 194 AVRMIGWGV-ENGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMP 245
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 112/224 (50%), Positives = 139/224 (62%), Gaps = 15/224 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q +CGSCWA AVEA+SDR+C G+ + +S +LL+CC F+CG GC GG P AW ++
Sbjct: 120 QSNCGSCWAIAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWW 178
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSI 125
V GV TE C PY CSH G YP TPKC C N K+ +
Sbjct: 179 VWVGVTTELCQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGV 235
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
S+Y I + E +M E+ NGP+EV+ VY DF YKSGVYKH++GD +GGHAVKL+GWG
Sbjct: 236 SSYSIKGERE-LMVELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGV 294
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
DG YW +AN WN WG GYF I+RG++ECGIE VAG P
Sbjct: 295 -KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/233 (49%), Positives = 146/233 (62%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A EA SDRFCI + +N LS D+L+CC CG GC+GGYPI+AW+Y
Sbjct: 103 QSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN-CGYGCEGGYPINAWKY 161
Query: 73 FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQ--L 116
V G T C PY ++ G + P C + Y TP CV KC KN
Sbjct: 162 LVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVA 221
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KH+ +AY + I AEI +GPVE +FTVYEDF YK+GVY H TG +GGH
Sbjct: 222 YTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGH 281
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWGT D+G YW++AN WN +WG +GYF+I RG+NECGIE VV G+P
Sbjct: 282 AIRILGWGT-DNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 204 bits (519), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/231 (48%), Positives = 144/231 (62%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVE +SDR CIH N S +L++CC LCG GC+GG+P +A++Y
Sbjct: 101 QGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCC-HLCGFGCNGGFPGAAFKY 159
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+VH G+V T+ C PY + C H P C TPKC + C K + +
Sbjct: 160 WVHSGIVSGGSFNSTQGCQPY-EIAPCEHHVSGPRPKCSEGGGTPKCAKTCEKGYIVDYE 218
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ H+ AY I D + I EI NGPVE +FTVY DF HYKSGVY+H G +GGHA+
Sbjct: 219 SDLHHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAI 278
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++G YW+ AN WN WG +G FKI RGS+ CGIE ++ AGLP
Sbjct: 279 RVLGWG-EENGTPYWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLP 328
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 204 bits (519), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 113/250 (45%), Positives = 149/250 (59%), Gaps = 22/250 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
++++ Q CGSCWAFGA E +SDR CI +SV D+L+CCG CG GC GGY
Sbjct: 108 IKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYS 167
Query: 67 ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWR 118
I A R++ G VT C PY S C P TP C C K + ++
Sbjct: 168 IEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTPSCKTTCQSSYKTEEYK 224
Query: 119 NSKHYSISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
KHY SAY++ + +I EIY GPVE S+ VYEDF HYKSGVY + +G ++GGH
Sbjct: 225 KDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGH 284
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
AVK+IGWG ++G DYW++AN W S+G G+FKI+RG+NEC IE +VVAG + K
Sbjct: 285 AVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAG------IAK 337
Query: 237 EITSADMFED 246
T ++ +ED
Sbjct: 338 LGTHSETYED 347
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 143/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR+C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
+ H G+V+ + C PY + C H PG C TPKC + C N +
Sbjct: 168 WKHFGLVSGGSYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDSKTPKCHKTCESSYNVDYH 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y Y ++S + I AE+YKNGPVE +FTVY D +YK+GVYKH G+ +GGHA+
Sbjct: 227 KDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAI 286
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VAG P
Sbjct: 287 KILGWGV-ENGNKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 142/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 106 QGSCGSCWAFGAVEAMTDRVCTYSDGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEY 164
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH--PG----CEPAYPTPKCVRKC-VKKNQLWR 118
+ H G+V T+ C PY + C H PG C TPKC + C N ++
Sbjct: 165 WKHAGIVSGGSYNSTQGCIPY-EVPPCEHHVPGNRLPCNGDTKTPKCQKTCEAGYNVPFK 223
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY Y ++ + ++I AE++KNGPVE +FTVY D YKSGVY+H G +GGHAV
Sbjct: 224 KDKHYGKHVYSVSGNEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAV 283
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +V G P
Sbjct: 284 KILGWGV-ENGSKYWLIANSWNSDWGDNGFFKILRGEDHCGIESSIVTGEP 333
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 103/237 (43%), Positives = 144/237 (60%), Gaps = 19/237 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A+SDR CI +++S D++ CC CGDGC+GG+PI AW+Y
Sbjct: 108 QANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKY 167
Query: 73 FVHHGVVTE-------ECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
F++ GVV+ C PY C H G C PTP C ++C +++
Sbjct: 168 FIYDGVVSGGEYLTKGVCRPY-PIHPCGHHGNDTYYGECRGTAPTPPCKKECRPGVRKVY 226
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
R K Y AY + + I +EI +NGPV SF VYEDF HYKSG+YKH G++ G HA
Sbjct: 227 RIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
VK+IGWG +++ D+W++AN W+ WG GYF+I RG+N+CGIE + AG+ +++L
Sbjct: 287 VKMIGWG-NENNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 142/230 (61%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH + S DLL CC CG GC+GG P +AW Y
Sbjct: 115 QGSCGSCWAFGAVEAMSDRVCIHSKGKTHFHFSAEDLLTCCSS-CGFGCNGGEPGAAWDY 173
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-----TPKCVRKCVKKNQL-WRN 119
+V G+V+ + C PY C H P TP+CV++C + + +
Sbjct: 174 WVSTGIVSGGSYNSHQGCQPYAIEP-CEHHVNGTRKPCGEGDTPRCVKRCEEGYDVPYGK 232
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
+H+ SAY + + I E+ NGP E + TVY+DF HY++GVY+H++G +GGHAV+
Sbjct: 233 DRHFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVYQHVSGGALGGHAVR 292
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
L+GWG +DG YW+LAN WN WG +GYF+I RG +ECGIE D+ GLP
Sbjct: 293 LLGWGV-EDGTPYWLLANSWNYDWGDNGYFRILRGQDECGIESDINGGLP 341
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 203 bits (517), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 105/236 (44%), Positives = 140/236 (59%), Gaps = 14/236 (5%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
++N +E++ Q CGSCWAF E +SDR CI ++S D+LACCG CGDG
Sbjct: 91 WSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGNSCGDG 150
Query: 61 CDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKK 113
C GGYPI A+R++ GVVT C PY + S P TP C C
Sbjct: 151 CKGGYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCISCP----EEKTPTCSLSCQFGY 206
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ + K + +SAY + + I EI NGPV +FT+YED YKSGVY+H G ++
Sbjct: 207 STAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLL 266
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHA+K+IGWGT +G YW++AN W +WG +G+ K++RG NECGIE VVAG+P
Sbjct: 267 GGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMP 321
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 203 bits (517), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 108/231 (46%), Positives = 143/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVE ++DR CIH N S +L++CC LCG GC+GG+P +A++Y
Sbjct: 101 QGSCGSCWAFGAVEVMTDRDCIHSNGTKNFHYSAENLVSCC-HLCGFGCNGGFPGAAFQY 159
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+VH G+V T+ C PY + C H P C TPKC + C + +
Sbjct: 160 WVHSGIVSGGAFNSTQGCQPY-EIAPCEHHVSGPRPKCAEGGSTPKCHKNCESNYVVDYE 218
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ H+ Y ++ D I +I NGPVE +FTVY DF HYKSGVY+H G +GGHA+
Sbjct: 219 SDLHHGSKHYSVDKDETQIKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAI 278
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG +DG YW+ AN WN WG +GYFKI RGS+ CGIE ++ AGLP
Sbjct: 279 RVLGWG-EEDGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEISAGLP 328
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 203 bits (516), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 107/233 (45%), Positives = 142/233 (60%), Gaps = 20/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N S DL++CC CG GC+GG+P +AW Y
Sbjct: 112 QGSCGSCWAFGAVEAMSDRVCIHSNGTVNFHFSAEDLVSCC-HTCGFGCNGGFPGAAWSY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ H G+V+ E C PY + C H P C+ TP C +C + +
Sbjct: 171 WTHKGIVSGGSYNSNEGCRPY-EIEPCEHHVNGTRPPCKNGR-TPSCKHQCESSYSVDYA 228
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y I +P +I EI NGPVE +FTVYED YKSGVYKH+ G +GGHA+
Sbjct: 229 KDKHFGSKSYSIRRNPREIQREIMTNGPVEGAFTVYEDLILYKSGVYKHVHGKELGGHAI 288
Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+++GWG D + YW++ N WN WG +G+F+I RG + CGIE + AGLP+
Sbjct: 289 RILGWGVWGDSKVPYWLIGNSWNTDWGDNGFFRIVRGEDHCGIESAISAGLPA 341
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 141/230 (61%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G + LS DL++CC CGDGC GG+P AW Y
Sbjct: 74 QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKD-CGDGCKGGFPGQAWDY 132
Query: 73 FVHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT EE C PY F T +P C Y TP+C + C K + +
Sbjct: 133 WVKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 192
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 193 QDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 252
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++IGWG + YW++AN WN WG G F+I RG +EC IE VVAGL
Sbjct: 253 RIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGL 301
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 108/224 (48%), Positives = 143/224 (63%), Gaps = 18/224 (8%)
Query: 23 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
AFGA EA+SDR CIH +S LS DLL+CC CG GC+GGYP +AW ++ G+V+
Sbjct: 25 AFGASEAMSDRICIHSNAKISVELSAEDLLSCC-ESCGMGCNGGYPSAAWDFWTKDGLVS 83
Query: 81 EE-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSIS 126
C PY S P C TP+CV +C ++ KHY +
Sbjct: 84 GGLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKT 143
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
+Y ++SD +DI EIYKNGPVE +FTVYEDF YK+GVY+H+TG +GGHA+K++GWG
Sbjct: 144 SYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWG-E 202
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
++G YW+ AN WN WG +G+FKI RGSN CGIE ++VAG+P+
Sbjct: 203 ENGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 106/229 (46%), Positives = 140/229 (61%), Gaps = 16/229 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A SDR CI G + +LS L CC + CG+GCDGG P SAW +
Sbjct: 113 QSNCGSCWAVSAASVFSDRLCIATGGAVARNLSAEQLNTCC-YRCGNGCDGGSPESAWYF 171
Query: 73 FVHHGVVT-------EECDPY-FDSTGCSHPGCEPAYP-TPKC-VRKCVKKN--QLWRNS 120
F+ HG+VT + C PY G C P TP C ++ C N + +R
Sbjct: 172 FMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDTPDCSIKTCTNSNYSKNYRAD 231
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
HY + Y ++ EDIM ++YKNGPV+ +F VY DF +YKSGVY + G + GGHA+K+
Sbjct: 232 LHYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKI 291
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GWG DDG YW+ AN W+RSWG +G F+I RG+NEC IE+ V+AG+P
Sbjct: 292 LGWGV-DDGTKYWLCANSWSRSWGENGLFRILRGNNECHIEDRVIAGMP 339
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 141/230 (61%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G S LS DL++CC CGDGC GG+P AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDY 170
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C Y TP+C +KC K + +
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYE 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K+Y Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA+
Sbjct: 231 QDKNYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAI 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++IGWG + G+ YW++AN WN WG +G F++ RG +EC IE VVAGL
Sbjct: 291 RIIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 105/230 (45%), Positives = 140/230 (60%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G + LS DL++CC CGDGC GG+P AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKD-CGDGCQGGFPGVAWDY 170
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C Y TP+C + C K + +
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +Y + ++ + I +I GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 231 QDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++IGWG + YW++AN WN WG G F++ RG +EC IE DVVAGL
Sbjct: 291 RIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAGL 339
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 114/245 (46%), Positives = 147/245 (60%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVI------QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGF 55
F +EH + V Q +CGSCWA AVEA+SDR+C G+ + +S ++LL+CC F
Sbjct: 102 FDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGVPDRRISTSNLLSCC-F 160
Query: 56 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
+CG GC GG P AW ++V G+ TE C PY CSH G YP TPKC
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPPCPNTIYDTPKCN 219
Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C K K+ ++Y + + E +M E+ NGP+EV+ VY DF YKSG YKH
Sbjct: 220 TTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYSDFVGYKSGGYKH 276
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGSNECGIE VAG
Sbjct: 277 VSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAG 335
Query: 228 LPSSK 232
P+ +
Sbjct: 336 TPAQE 340
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 202 bits (513), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 111/239 (46%), Positives = 146/239 (61%), Gaps = 16/239 (6%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCD 62
N ++++ Q +CGSCWAF A E +SDR CI +S D+L+CCG C +GC
Sbjct: 97 NCNSIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPIISPEDILSCCGSSCNNGCQ 156
Query: 63 GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC---VKK 113
GGY I A +Y+++ GVVT C PY CS C+ P C C K
Sbjct: 157 GGYTIEAMKYWMNSGVVTGGDYQGAGCIPY-SFRPCS--TCKEPKDAPSCKTTCQASYKA 213
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+R S +A N+ + I EIY NGPVEV++ VY+DF HYKSGVY H+ GD
Sbjct: 214 KSAYRLPTTTSSNAIVANA-VQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVYYHVYGDKP 272
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GHAVK+IGWGT + DYW++AN W+ ++G +G+FKI+RG+NECGIEE+VVAGLP SK
Sbjct: 273 SGHAVKIIGWGT-EKKVDYWLVANSWSTTFGENGFFKIRRGTNECGIEENVVAGLPKSK 330
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 109/246 (44%), Positives = 153/246 (62%), Gaps = 20/246 (8%)
Query: 1 MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCG 58
+ + N ++ + QG CGSCWAFGA EA+SDR CIH +S+ ++ DLL+CC CG
Sbjct: 87 LQWPNCPTLKEVRDQGSCGSCWAFGAAEAISDRVCIHSNAKVSVEISSEDLLSCCES-CG 145
Query: 59 DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAY-PTP 104
GC+GGYP +A ++ G+V+ C PY C H P C+ TP
Sbjct: 146 MGCNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPY-SIPPCEHHVNGTRPPCKGEEGDTP 204
Query: 105 KCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
+C +C ++ KH+ +Y + SD ++IM E+YKNGPVE +FTVYEDF YKSG
Sbjct: 205 QCTNQCEPGYTPGYKQDKHFGKRSYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSG 264
Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
VY+H++G +GGHA+K++GWG + G YW+ AN WN WG +G+FKI RG + CGIE +
Sbjct: 265 VYRHVSGSAVGGHAIKVLGWG-EEGGIPYWLAANSWNTDWGENGFFKIVRGEDHCGIESE 323
Query: 224 VVAGLP 229
+VAG+P
Sbjct: 324 MVAGIP 329
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/231 (45%), Positives = 140/231 (60%), Gaps = 20/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH + +S DLL CC CG GCDGG P + W++
Sbjct: 105 QGSCGSCWAFGAVEAMSDRVCIHSKGKVLFRVSAEDLLTCCTN-CGHGCDGGAPGAGWKH 163
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------------TPKCVRKCVKK-NQLWR 118
++ G+V+ P+ GC EP TPKC++KC+ N +
Sbjct: 164 WIEKGLVSG--GPFGSDQGCRPYTIEPCVHVENGAQSPCKDSITPKCIKKCLPGYNVPYA 221
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K + S Y I +D I EI+ NGPVE +FTV++DFA YK G+Y+H +G++ G HAV
Sbjct: 222 KDKSFGKSTYSIANDERQIRKEIFTNGPVEATFTVFDDFASYKHGIYQHTSGNLAGEHAV 281
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++G YW+ AN WN WG +GYFKI RGSN IE +VAGLP
Sbjct: 282 RILGWGV-ENGTKYWLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAGLP 331
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 112/267 (41%), Positives = 156/267 (58%), Gaps = 47/267 (17%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
N + ++++ Q +CG+CWAFGA E +SDR CI G +SV D+L+CCG CG+GC
Sbjct: 88 NCKSIKLIRNQAYCGACWAFGAAEIISDRICIQSGGAHQPIISVEDILSCCGSSCGEGCK 147
Query: 63 GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC------ 110
GGYP+ +++++ GVVT C PY CS CE + TP C +KC
Sbjct: 148 GGYPLEGLKFWMNSGVVTGGDYNGTGCQPY-TFPPCSS--CEASKSTPSCQKKCQTGYLE 204
Query: 111 --VKKNQLWRNSKH---------YSI--------SAYRINSDPED----------IMAEI 141
K ++ + N + Y + SAYR+++ I EI
Sbjct: 205 ATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEI 264
Query: 142 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 201
Y NGPVEVS+ V+EDF YKSGVY +++G + G HAVK+IGWGT ++ DYW++AN W
Sbjct: 265 YNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIGWGT-ENKVDYWLVANSWGT 323
Query: 202 SWGADGYFKIKRGSNECGIEEDVVAGL 228
+G G+FKI+RG+NECGIEE+VVAGL
Sbjct: 324 DFGEKGFFKIRRGTNECGIEENVVAGL 350
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 112/250 (44%), Positives = 144/250 (57%), Gaps = 32/250 (12%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACC--GFLCGDGCD 62
E ++ + Q +CGSCWAFG VEA+SDR CI G +S +LL+CC F CG GC+
Sbjct: 100 ESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACGMGCN 159
Query: 63 GGYPISAWRYFVHHGVVT------------EECDPYFDSTGCSH------PGCE--PAYP 102
GGY AW Y+V G+V+ EC PY CSH C P +
Sbjct: 160 GGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPY-SFPPCSHHVQGEYQACTDLPQFN 218
Query: 103 TPKCVRKCVKKNQLWRNSK----HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 158
TPKC +C +Q +NS H +S+Y + E I AEIY+ G SF VY DF
Sbjct: 219 TPKCYTEC--NSQYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFL 276
Query: 159 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 218
Y SGVY++ +G MGGHA+K++GWG ++G YW+ AN WN SWG +G+FKI RGSNEC
Sbjct: 277 TYSSGVYQNTSGSYMGGHAIKMLGWGV-ENGTPYWLCANSWNSSWGENGFFKILRGSNEC 335
Query: 219 GIEEDVVAGL 228
GIE +VAG
Sbjct: 336 GIESGMVAGF 345
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 113/232 (48%), Positives = 135/232 (58%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH N SLS DLL+CC CG GC GGYP AW Y
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKD-CGFGCRGGYPAVAWDY 166
Query: 73 FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
+ HG+VT D +GC P CE YPTP+CV++C + +
Sbjct: 167 WKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGY 224
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K + +Y I + IM EI GPVE FT+YEDF Y SGVY H G M GHA
Sbjct: 225 LEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHA 284
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V+++GWG + YW++AN WN WG +GY K RG NECGIE+DV AGLP
Sbjct: 285 VRILGWGELGN-VPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAGLP 335
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 111/244 (45%), Positives = 143/244 (58%), Gaps = 21/244 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
++N ++ + QG CGSCWAFGAVE++SDR CI N +S DL +CC CG+G
Sbjct: 124 WSNCPTIKEIRDQGSCGSCWAFGAVESMSDRICIKSNGQQNAHISAEDLTSCC-RSCGNG 182
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKC 106
C+GG+ AW Y+ G+VT + C PY C H P + TP C
Sbjct: 183 CNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPY-TVKACDHHVVGKLQPCSKKEEHTPVC 241
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+C N + KHY +AY + + IM EI NGPVE +FTVY DF YKSGVY
Sbjct: 242 KHECESGYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTNGPVEGAFTVYADFPQYKSGVY 300
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
KH TG +GGHA+K++GWGT + G+DYW++AN WN WG G FKI RG +ECGIE +
Sbjct: 301 KHTTGSPLGGHAIKIMGWGT-EGGDDYWLVANSWNPDWGNQGTFKILRGRDECGIESQIA 359
Query: 226 AGLP 229
AG P
Sbjct: 360 AGEP 363
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 107/235 (45%), Positives = 143/235 (60%), Gaps = 17/235 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH + +S DL++CCG+ CG GC GG+P AW +
Sbjct: 102 QSSCGSCWAFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPIAWDF 160
Query: 73 FVHHGVVT--EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRN 119
+ G+VT + +P + CSH G + Y TP CV+KC + +
Sbjct: 161 WQTEGIVTGGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYAT 220
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
K + Y + + IM EI NGPVE +F VYEDF YKSGVY H G ++GGHA++
Sbjct: 221 DKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIR 280
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
++GWG ++G YW++AN WN WG DG FK+ RG NECGIE++V AGLP ++
Sbjct: 281 ILGWG-EENGVAYWLIANSWNDGWGEDGCFKMLRGKNECGIEDEVTAGLPELSSI 334
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 110/234 (47%), Positives = 137/234 (58%), Gaps = 24/234 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVE++SDR CI + LS +DLL+CC CGDGCDGG +W Y
Sbjct: 101 QSACGSCWAFGAVESMSDRICIASNATKIVRLSASDLLSCC-TSCGDGCDGGQLGPSWDY 159
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVK--KNQ 115
+ + G+VT C PY D C+H P YP TPKC + CV
Sbjct: 160 YKNKGIVTGYLYNTTGYCKPY-DFPACAHHEASPDYPDCPSTDYSTPKCTKSCVAGYTAN 218
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ HY S+Y + I EI +GPVE +FTVY DF Y+SGVYKH +G V+GG
Sbjct: 219 TYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGVYKHTSGSVLGG 278
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HA+ ++GWGT + G YW++ N WN SWG G+FKI RG +CGI DVV GLP
Sbjct: 279 HAISIVGWGT-ESGSPYWLVKNSWNPSWGDGGFFKILRG--DCGINNDVVGGLP 329
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 104/236 (44%), Positives = 139/236 (58%), Gaps = 14/236 (5%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
++N +E++ Q CGSCWAF E +SDR CI ++S D+LACCG CGDG
Sbjct: 91 WSNCTSIEMIRDQAQCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGNSCGDG 150
Query: 61 CDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKK 113
C G YPI A+R++ GVVT C PY + S P TP C C
Sbjct: 151 CKGRYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCISCP----EEKTPTCSLSCQFGY 206
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ + K + +SAY + + I EI NGPV +FT+YED YKSGVY+H G ++
Sbjct: 207 STAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLL 266
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GGHA+K+IGWGT +G YW++AN W +WG +G+ K++RG NECGIE VVAG+P
Sbjct: 267 GGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMP 321
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 109/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
+V G+VT C PY +TG +P C E Y TPKC +KC K + +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K+Y +Y + ++ I EI +GPVE +FTV+ DF +YKSG+YK++TG +GGHA
Sbjct: 230 KKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V++IGWG + YW++AN WN WG GYF+I RG +ECGIE +V GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 200 bits (509), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 144/234 (61%), Gaps = 22/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++ S +DL++CC CG GC+GG+P +AW Y
Sbjct: 114 QGSCGSCWAFGAVEAMSDRLCIHSNATIHFHFSADDLVSCC-HTCGFGCNGGFPGAAWAY 172
Query: 73 FVHHGVVTEECDPYFDSTGC--------------SHPGCEPAY-PTPKCVRKCVKKNQL- 116
+ G+V+ PY S GC + P C+ + TP C +C K +
Sbjct: 173 WTRKGIVSG--GPYGSSQGCRPYEIAPCEHHVNGTRPPCDGEHGKTPSCRHECQKSYDVD 230
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KH+ +Y + + +DI EI +NGPVE +FTVYED YK GVY+H+ G +GGH
Sbjct: 231 YKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGGH 290
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
A++++GWG ++ YW++AN WN WG +G+FK+ RG + CGIE + AGLP
Sbjct: 291 AIRILGWGV-ENKTPYWLIANSWNTDWGNNGFFKMLRGEDHCGIESAIAAGLPK 343
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 200 bits (509), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 114/240 (47%), Positives = 157/240 (65%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVE++SDR CIH +N+ +S D+L CCG CG+GC+GGYP +AW +
Sbjct: 102 QGSCGSCWAFGAVESISDRICIHTNGHVNVEVSAEDMLTCCGGQCGEGCNGGYPSAAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTKKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYSSSYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY S+Y + ++IMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA+
Sbjct: 221 EDKHYGYSSYSVPGIEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWGT ++G YW++AN WN WG +G+FKI RG + CGIE ++VAG+P + +I
Sbjct: 281 RILGWGT-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPRTDQYWAKI 339
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 104/236 (44%), Positives = 140/236 (59%), Gaps = 17/236 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A+SDR CI +++S DL+ CC CG GCDGG+ I AW Y
Sbjct: 107 QANCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPTCGFGCDGGWSIKAWEY 166
Query: 73 FVHHGVVT------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLWR 118
F + G+V+ + C + C H G C TP C +KC +L+R
Sbjct: 167 FTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYR 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y A+++ E I E+ KNGPV SF VYEDF+ YKSG+Y+H G++ G HAV
Sbjct: 227 MDKRYGTDAFQLPKSVEAIQKELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAV 286
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
K+IGWGT ++ DYW++AN W+ WG +GYF+I RG N+CGIEE+V AGL ++L
Sbjct: 287 KMIGWGT-ENRTDYWLIANSWHDDWGENGYFRIIRGINDCGIEENVAAGLIDVESL 341
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 111/224 (49%), Positives = 138/224 (61%), Gaps = 15/224 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q +CGSCWA AVEA+SDR+C G+ + +S +LL+CC F+CG GC GG P AW ++
Sbjct: 120 QSNCGSCWAIAAVEAMSDRYCTMSGIPDRRISTTNLLSCC-FICGFGCYGGIPAMAWLWW 178
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSI 125
V GV TE C PY CSH G YP TPKC C N K+ +
Sbjct: 179 VWVGVTTELCQPY-PFGPCSHHGNSSKYPPCPNTIYNTPKCNTTC--DNVEMELVKYKGV 235
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
S+Y I + E + E+ NGP+EV+ VY DF YKSGVYKH++GD +GGHAVKL+GWG
Sbjct: 236 SSYSIKGERE-LDHELMNNGPLEVAMQVYADFVAYKSGVYKHVSGDHLGGHAVKLVGWGV 294
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
DG YW +AN WN WG GYF I+RG++ECGIE VAG P
Sbjct: 295 -KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 104/232 (44%), Positives = 138/232 (59%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY + + C H P C TPKC C + +
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGATPKCSHVCQSSYTVDYA 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y + + DI EI NGPVE +FTVYED YK GVY+H G +GGHA+
Sbjct: 227 KDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG D+ YW++ N WN WG G+F+I RG + CGIE + AGLP
Sbjct: 287 RILGWGVWGDEKIPYWLIGNSWNTDWGDQGFFRILRGQDHCGIESSISAGLP 338
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 109/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFTAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
+V G+VT C PY +TG +P C E Y TPKC +KC K + +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K+Y +Y + ++ I EI +GPVE +FTV+ DF +YKSG+YK++TG +GGHA
Sbjct: 230 KKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V++IGWG + YW++AN WN WG GYF+I RG +ECGIE +V GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 199 bits (506), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 105/232 (45%), Positives = 144/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N LS +DL++CC CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSQGKVNFHLSADDLVSCC-HTCGFGCNGGFPGAAWSY 168
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY + C H P C TP+C C ++ ++
Sbjct: 169 WTRKGIVSGGNFGSQQGCRPY-EIEPCEHHVNGTRPPCSSG-STPRCQHVCESSYKVDYK 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K++ +Y I ++ DI EI NGPVE +FTVYED YKSGVY+H+ G +GGHA+
Sbjct: 227 KDKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAFTVYEDLILYKSGVYEHVHGKELGGHAI 286
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG D+ YW++AN WN WG +G+F+I RG + CGIE + AGLP
Sbjct: 287 RILGWGVWGDEKIPYWLIANSWNTDWGDNGFFRIVRGKDHCGIESSISAGLP 338
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 199 bits (506), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 101/231 (43%), Positives = 141/231 (61%), Gaps = 20/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA + EA+SD C+ + + +S +D+L+CCG CG GC GG+PI A+++
Sbjct: 111 QSSCGSCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWPIEAYKW 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKK-NQL 116
GVVT + C PY C H +P Y PTPKC + C +K N+
Sbjct: 171 MQRDGVVTGGKYRQKKVCKPY-AFYPCGHHQNDPYYGPCPGGLWPTPKCRKTCQRKYNKS 229
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KH++ AY + ++ +I EIYKNGPV +F VY+DF++YK G+Y H G G H
Sbjct: 230 YQEDKHFATRAYYLPNNERNIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAH 289
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
AVK++GWG ++ DYW++AN WN WG GYF+I RG+NECGIE +V G
Sbjct: 290 AVKVVGWG-RENATDYWLIANSWNTDWGESGYFRIVRGTNECGIEAQMVGG 339
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/234 (45%), Positives = 138/234 (58%), Gaps = 28/234 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q +CGSCWAFG+ EA++DR CI N+ +S D+ CC CG GC+GGYP +AW ++V
Sbjct: 109 QANCGSCWAFGSAEAMTDRICIAGKGNIHISAEDINDCCKS-CGMGCNGGYPAAAWEWYV 167
Query: 75 HHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVK------KNQ 115
GVV+ E C PY +TG P C PTPKC +KC+ N
Sbjct: 168 DTGVVSGGQYGTNEGCMPYSLPHCDHHTTGKYQP-CPAVVPTPKCEKKCLTGYPKSYSND 226
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
R K Y + + IM E+ NGPV +F VY DF YK+GVY+H TG GG
Sbjct: 227 KTRGKKSYGVRGV------QSIMQELVDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGG 280
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK+IG+GT + G+DYW++AN WN WG G+FKI +G +ECGIE +VAG P
Sbjct: 281 HAVKIIGYGT-ESGQDYWLVANSWNEDWGDKGFFKIAKGKDECGIESSIVAGDP 333
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 109/232 (46%), Positives = 143/232 (61%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
+V G+VT C PY +TG +P C E Y TPKC +KC K + +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K+Y +Y + ++ I EI +GPVE +FTV+ DF +YKSG+YK++TG +GGHA
Sbjct: 230 GKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V++IGWG + YW++AN WN WG GYF+I RG +ECGIE +V GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRILRGKDECGIESEVTGGLP 340
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 198 bits (504), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 105/230 (45%), Positives = 139/230 (60%), Gaps = 18/230 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G S LS DL++CC CG GC GG+P AW Y
Sbjct: 49 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGQGCQGGFPGVAWDY 107
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C Y TP+C + C K + +
Sbjct: 108 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 167
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +Y + ++ + I +I GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 168 QDKHYGEESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 227
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++IGWG + YW++AN WN WG G F+I RG +EC IE +VVAGL
Sbjct: 228 RIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 276
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 198 bits (504), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/226 (47%), Positives = 137/226 (60%), Gaps = 13/226 (5%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q +CGSCWA AVEA+SDR+C G+ +L +S LL+CC F+CG GC GG P AW ++
Sbjct: 120 QSNCGSCWAIAAVEAMSDRYCTVAGITDLRVSTGHLLSCC-FVCGMGCQGGIPTMAWLWW 178
Query: 74 VHHGVVTEECDPY------FDSTGCSHPGCEPA-YPTPKCVRKCVKKNQLWRNSKHYSIS 126
V G+ +E C PY + G +P C Y TP C C + +KH
Sbjct: 179 VWVGLTSEVCQPYPFPPCGHHTDGGKYPACPSTIYDTPTCNSTCADSHTAL--TKHKGEK 236
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
+Y + + E M E+ GP EV+F VY DF YKSGVY H TG+ +GGHAVKL+GWG
Sbjct: 237 SYSLRGERE-YMIELMTYGPFEVAFDVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGV- 294
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+G YW +AN WN WG +GYF I+RG++ECGIE VAGLPS K
Sbjct: 295 QNGTPYWKIANSWNSDWGDNGYFLIRRGTDECGIESTGVAGLPSLK 340
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 198 bits (504), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 108/225 (48%), Positives = 136/225 (60%), Gaps = 23/225 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA AVEA+SDR CI + LS +DLL+CC CG GC GG P++AW+Y
Sbjct: 99 QSSCGSCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKY 157
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQ 115
+V G+VT Y + +GC P CE YPTPKC ++C K +
Sbjct: 158 WVLSGIVTGS--DYTNHSGCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTK 215
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ K+Y AY + +D E I EI GPVE SF VY DF HY SG+YKH+ G V GG
Sbjct: 216 SYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGG 275
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
HAVK++GWG D G YW+ AN WN WG DGYF+I RG++ECG+
Sbjct: 276 HAVKILGWGI-DQGVSYWLAANSWNNDWGEDGYFRILRGADECGM 319
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 143/230 (62%), Gaps = 20/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFG VEA +DR CI +N LS DL +CC CG+GC+GG+ AW Y
Sbjct: 108 QGACGSCWAFGCVEAATDRLCIQSKGIVNAHLSAEDLTSCC-RTCGNGCNGGFLEGAWNY 166
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
G+VT + C PY + C H C+ PTP+C ++C N +
Sbjct: 167 LKRDGIVTGGPYNSHQGCLPY-EIKACDHHVVGKLQPCKGDGPTPRCKKECESGYNNTYS 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+H++ + + + E IM EI NGPVE +FTVY DF YKSGVY+H +G +GGHA+
Sbjct: 226 KDEHHAKTVHAVEG-VEQIMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAI 284
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
K +GWG ++DG+DYW++AN WN WG +G+FKI RG +ECGIE ++VAG+
Sbjct: 285 KTLGWG-NEDGKDYWLVANSWNPDWGDNGFFKILRGRDECGIESNIVAGM 333
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 111/225 (49%), Positives = 137/225 (60%), Gaps = 18/225 (8%)
Query: 23 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
AFGAVE++SDR CIH +S LS +LL+CC CG GC GG P AW Y+ + G+VT
Sbjct: 45 AFGAVESMSDRICIHSKNKISVELSAINLLSCCT-RCGFGCRGGIPGMAWDYWKYEGIVT 103
Query: 81 -------EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSI 125
C PY S+ S+P CE Y PTP+C C + ++ K Y
Sbjct: 104 GGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGK 163
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
S+Y + S+ IM EI NGPVE F VYEDF +YKSGVYKHITG +GGHA+++IGWG
Sbjct: 164 SSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGI 223
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+ YW+ AN WN WG GYFKI RG+NECGIE V AGLP+
Sbjct: 224 QQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLPN 268
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/231 (45%), Positives = 142/231 (61%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR+C + + S DLL+CC +CG GC+GG P AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
+ H G+V+ + C PY + C H PG C TPKC + C + +
Sbjct: 168 WKHFGLVSGGSYNSGQGCRPY-EIPPCEHHVPGNRVPCNGDSKTPKCHKTCEASYSVDYH 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y Y ++S + I AE++KNGPVE +FTVY D +YK+GVYKH G+ +GGHA+
Sbjct: 227 KDKRYGKHVYSVSSKEDHIKAELFKNGPVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAI 286
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G Y ++AN WN WG +G+FKI RG + CGIE +VAG P
Sbjct: 287 KILGWGV-ENGNKYRLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAGEP 336
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 108/232 (46%), Positives = 144/232 (62%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF AVEA+SDR CI ++ LS DLL+CC CG GC GG+P +AW Y
Sbjct: 112 QSRCGSCWAFAAVEAMSDRICIESKGKKSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDY 170
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
+V G+VT C PY +TG +P C E Y TPKC +KC K + +
Sbjct: 171 WVEDGIVTGSSKENHTGCQPYPFPKCEHHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K+Y +Y + ++ I EI +GPVEV+FTV+ DF +YKSG+YK++TG +G HA
Sbjct: 230 KKDKYYGRMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V++IGWG + YW++AN WN WG GYF++ RG +ECGIE V +GLP
Sbjct: 290 VRIIGWGV-EKKTPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLP 340
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 197 bits (502), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/225 (48%), Positives = 138/225 (61%), Gaps = 18/225 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GG+P AW ++
Sbjct: 114 QSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGFPEVAWVFY 172
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYS 124
V HG+V+E C PY F S C+H C Y TPKC C KK L R ++S
Sbjct: 173 VVHGLVSEYCQPYPFPS--CAHHVNSSDLAPCSGDYKTPKCNSTCTEKKIPLIRYRGNHS 230
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
+ S E E+ NGP EV+F VY DF Y GVYKH+ GD++GGHAV+L+GWG
Sbjct: 231 Y----VLSGEEHFKRELLLNGPFEVAFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWG 286
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GE YW +AN WN WG +GYF I RG NECGIE + VAG P
Sbjct: 287 EL-NGEPYWKIANSWNHEWGMNGYFLIARGVNECGIESNGVAGTP 330
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 105/246 (42%), Positives = 142/246 (57%), Gaps = 27/246 (10%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q CGSCWA A E +SDR C+ ++ +S D+L+CCG CG GC+GG+P
Sbjct: 99 LNLIRDQAKCGSCWAVSAAETMSDRICVQSNCSIKACISDTDILSCCGLYCGYGCNGGFP 158
Query: 67 ISAWRYFVHHGVVT-------EECDPYF------------DSTGCSHPG----CEPAYPT 103
I AWR+F G T C PY D C + C T
Sbjct: 159 IEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDTYYGECVGMADT 218
Query: 104 PKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
P+C R+C+ + + + ++Y SAY + + I EI KNGPV SF VYEDF HYKS
Sbjct: 219 PRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKS 278
Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
G+YKH G++ G HAVK+IGWG ++ D+W++AN W++ WG GYF+I RG NECGIE
Sbjct: 279 GIYKHTAGELRGYHAVKIIGWG-KENNTDFWLIANSWHQDWGEKGYFRIVRGKNECGIET 337
Query: 223 DVVAGL 228
DVVAG+
Sbjct: 338 DVVAGI 343
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 105/232 (45%), Positives = 133/232 (57%), Gaps = 16/232 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCD 62
++N ++ + Q CGSCWAFGAVE++SDRFCIH G ++ LS DL+ C +GC
Sbjct: 80 WSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVLLSFQDLVTC--DQSDNGCQ 137
Query: 63 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQ 115
GG +A ++ G+V+ +C PY + P C PA TP+CV KC +
Sbjct: 138 GGDAYTAMKFIQKKGIVSNDCLPY------TIPTCAPAQQPCLNFVDTPQCVEKCSNASY 191
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ H+ Y +N I EI NGPVE F VYEDF YKSGVY+H TG +GG
Sbjct: 192 TYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYEDFLGYKSGVYQHTTGKDLGG 251
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
H VK+IGWGT ++ E YWI N W WG G F IK G NECGIE DVVA
Sbjct: 252 HCVKMIGWGTQNN-ELYWICNNSWTTYWGNQGVFWIKAGVNECGIESDVVAA 302
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 107/229 (46%), Positives = 136/229 (59%), Gaps = 18/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WAFGAVEA+SDR CI G N+ LS DLL+CC CGDG +GG+P AW Y
Sbjct: 89 QSRCGSSWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCEH-CGDGFEGGFPALAWDY 147
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C E Y TP C C K + +
Sbjct: 148 WVKEGIVTGSSKENHTSCQPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYA 207
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH S Y + +D + I EI K GPVE +F VYEDF +YKSG+YKHITG ++ HA+
Sbjct: 208 QDKHRGKSRYNVKNDEKAIQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLVSWHAI 267
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++IGWG ++ YW++ N WN WG +G F+I RG +EC IE +V AG
Sbjct: 268 RIIGWGV-ENNTPYWLIPNSWNEDWGENGNFRILRGRHECSIESEVTAG 315
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 103/222 (46%), Positives = 134/222 (60%), Gaps = 12/222 (5%)
Query: 17 HCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
CGSCWAF E +SDR CI ++S D+LACCG CGDGC+GGYPI A+R++
Sbjct: 60 QCGSCWAFSTAEVISDRICIATKGTQQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWWN 119
Query: 75 HHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISA 127
GVVT C PY + C+ C P TP C C + + K + +SA
Sbjct: 120 SRGVVTGGDFRGSGCRPYPFAP-CNSYKC-PEEKTPTCSLSCQFGYSTAYAKDKRFGVSA 177
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + + I EI NGPV +FT+YED YKSGVY+H G ++GGHA+K+IGWGT
Sbjct: 178 YAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-Q 236
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+G YW++AN W WG +G+ K++RG NECGIE VVAG+P
Sbjct: 237 NGIPYWLIANSWGADWGENGFLKMRRGVNECGIESAVVAGMP 278
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/62 (58%), Positives = 46/62 (74%), Gaps = 1/62 (1%)
Query: 144 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 203
NGPVE SFTVYEDF YK GVY++ G V+G HA+K++GWGT + G DYW++AN W
Sbjct: 3 NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGT-EHGTDYWLIANSWGAQC 61
Query: 204 GA 205
G+
Sbjct: 62 GS 63
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 138/232 (59%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 109 QGECGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY + C H P C TPKC C + +
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EIAPCEHHVNGTRPPCGHGGGTPKCSHVCESGYTVDYA 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y + + DI EI NGPVE +FTVYED YK GVY+H G +GGHA+
Sbjct: 227 KDKHFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHQHGKELGGHAI 286
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++ YW++ N WN WG +G+F+I RG + CGIE + AGLP
Sbjct: 287 RILGWGVWGEEKIPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLP 338
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 138/232 (59%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY + + C H P C TPKC C + +
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGGTPKCSHVCQSSYTVDYA 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y + + +I EI NGPVE +FTVYED YK GVY+H G +GGHA+
Sbjct: 227 KDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG D+ YW++ N WN WG G+F+I RG + CGIE + AGLP
Sbjct: 287 RILGWGVWGDEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 111/231 (48%), Positives = 133/231 (57%), Gaps = 21/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH N SLS DLL+CC CG GC GGYP AW Y
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCKD-CGFGCRGGYPAVAWDY 166
Query: 73 FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
+ HG+VT D +GC P CE YPTP+CV++C + +
Sbjct: 167 WKTHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGY 224
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K + +Y I + IM EI GPVE FT+YEDF Y SGVY H G M GHA
Sbjct: 225 LEDKTRANMSYNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHA 284
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
V+++GWG + YW++AN WN WG +GY K RG NECGIE+DV A L
Sbjct: 285 VRILGWGELGN-VPYWLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAVL 334
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 114/240 (47%), Positives = 157/240 (65%), Gaps = 18/240 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVE++SDR CIH ++S+ V+ DLL CCG CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVESISDRICIHTNGHVSVEVSAEDLLTCCGGQCGDGCNGGYPAEAWNF 161
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPACTGEGDTPKCSKTCEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ ++Y + ++ +IMAEIYKNGPVE +F+VY DF YKSGVY+H+TGD+MGGHA+
Sbjct: 221 EDKHFGYTSYSLPTNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+++GWG ++G YW++AN WN WG G+F+I RG + CGIE +VVAG+P + ++I
Sbjct: 281 RILGWG-EENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 91/120 (75%), Positives = 105/120 (87%)
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R +SDP IM E+YKNGPVEV+FTVYEDFAHYKSGVYKH+TGD +GGHAVKLIGWGTS+D
Sbjct: 2 RGSSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSED 61
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 248
GEDYW+LANQWNR WG DGYFKI+RG+NEC IE++VVAG+PS KNL E+ +D F DAS
Sbjct: 62 GEDYWLLANQWNRGWGDDGYFKIRRGTNECDIEDEVVAGMPSPKNLNMELDVSDAFLDAS 121
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 108/245 (44%), Positives = 138/245 (56%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGDG 60
+ N ++ + Q CGSCWAF A E SDR CI L S+S DLL CC CG+G
Sbjct: 97 WPNCNSIKTIRDQSTCGSCWAFAATETYSDRICIASNQELQTSISSEDLLECCA-TCGNG 155
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
C GGYP +AW+Y GV T C PY C H P C P PTPKCV
Sbjct: 156 CQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPP-CDHHVVGQYPPCGPIKPTPKCV 214
Query: 108 RKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
++C + + ++ H+ Y++ ++ E I EI +GPV+ SF V DF YKSGVY
Sbjct: 215 KQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIMAHGPVQASFRVASDFLTYKSGVY 274
Query: 166 -KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
+ GGH+VK+IGWG + G YW++AN WN WG +G FK+ RG NECGIE +V
Sbjct: 275 IRDPKLKYEGGHSVKIIGWGV-EQGTPYWLIANSWNEDWGENGLFKMLRGKNECGIEAEV 333
Query: 225 VAGLP 229
VAGLP
Sbjct: 334 VAGLP 338
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 105/231 (45%), Positives = 137/231 (59%), Gaps = 15/231 (6%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPI 67
++ + Q CGSCWA A A+SDRFC+ G+ +L +S DLL+CC CGDGCDGGYP
Sbjct: 107 IKRIADQSSCGSCWAVAAATAMSDRFCVTGGVRDLGISAGDLLSCC-TSCGDGCDGGYPD 165
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRN 119
AW YF G+V++ C PY C H G P TPKC C K
Sbjct: 166 EAWLYFTESGLVSDYCQPY-PFPPCKHSGGRSKNPSCHDMHFHTPKCNATCTDKRIP--V 222
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
++++ +Y + + ED E+Y GP EV+FTVYEDF Y+SGVYKH++G +GGHAV+
Sbjct: 223 VRYFASESYSLQGE-EDYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHAVR 281
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
++GWG +G YW +AN WN WG +GY RG +ECGIE AG PS
Sbjct: 282 VVGWG-ERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIESQGSAGTPS 331
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 138/232 (59%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY + + C H P C TPKC C + +
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCANGSGTPKCSHVCQSSYTVDYA 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y + + +I EI NGPVE +FTVYED YK GVY+H G +GGHA+
Sbjct: 227 KDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++ YW++ N WN WG G+F+I RG + CGIE + AGLP
Sbjct: 287 RILGWGVWGNEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 196 bits (497), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 105/233 (45%), Positives = 136/233 (58%), Gaps = 19/233 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 110 QGSCGSCWAFGAVEAMSDRVCIHSNGNVNFRFSADDLVSCC-HTCGFGCNGGFPGAAWSY 168
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
+ G+V+ C PY + C H C TPKC +C N +
Sbjct: 169 WTRKGIVSGGRYGSKTGCRPY-EIAPCEHHVNGTRAPCNHDSKTPKCQHQCEAGYNVEYS 227
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y + + DI EI NGPVE +FTVYED YKSGVY+H G +GGHA+
Sbjct: 228 KDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAI 287
Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+++GWG E YW++AN WN WG G+F+I RG + CGIE + AGLP
Sbjct: 288 RILGWGVWGKEEVPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAGLPK 340
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 196 bits (497), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 142/240 (59%), Gaps = 14/240 (5%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGC 61
+ N + + Q CGSCWA A A+SDRFC G+ ++ +S DLLACC CGDGC
Sbjct: 81 WPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGC 139
Query: 62 DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKK 113
+GG P AW YF G+V++ C PY H + YP TPKC C
Sbjct: 140 NGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDP 199
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
N + S ++Y + + +D M E++ GP EV+F VYEDF Y SGVY H++G +
Sbjct: 200 TIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYL 256
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
GGHAV+L+GWGTS +G YW +AN WN WG DGYF I+RGS+ECGIE+ AG+P + N
Sbjct: 257 GGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 315
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 196 bits (497), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 108/238 (45%), Positives = 141/238 (59%), Gaps = 14/238 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A A+SDRFC G+ ++ +S DLLACC CGDGC+G
Sbjct: 106 NCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGCNG 164
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
G P AW YF G+V++ C PY H + YP TPKC C
Sbjct: 165 GDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTI 224
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
N + S ++Y + + +D M E++ GP EV+F VYEDF Y SGVY H++G +GG
Sbjct: 225 PVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG 281
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
HAV+L+GWGTS +G YW +AN WN WG DGYF I+RGS+ECGIE+ AG+P + N
Sbjct: 282 HAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 338
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 196 bits (497), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 136/230 (59%), Gaps = 19/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA ALSDR CI + + +S D+L+CCG CG GC+GG+PI A+ Y
Sbjct: 112 QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 171
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
F G VT C PY C H G C TPKCVRKC + +
Sbjct: 172 FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 230
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ + AY + + + I EI KNGPV +FTVYEDF++YK G+YKH G GGHA
Sbjct: 231 KKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 290
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWG + G YW++AN W+ WG +GYF+I RGSN CGIEE+VVAG
Sbjct: 291 IKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 339
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 111/235 (47%), Positives = 139/235 (59%), Gaps = 26/235 (11%)
Query: 20 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA AVEA+SDR CI + LS +DLL+CC CG GC GG P++AW+Y+V G
Sbjct: 163 SCWAVAAVEAMSDRICITSKGKKQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSG 221
Query: 78 VVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKK-NQLWRNS 120
+VT Y + +GC P CE YPTPKC R+C K + ++
Sbjct: 222 IVTG--SDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKAD 279
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
K+Y AY + +D E I EI GPVE SF VY DF HY G+YKH+ G V GGHAVK+
Sbjct: 280 KYYGEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKI 339
Query: 181 IGWGTSDDGEDYWILANQWNRSWGAD---GYFKIKRGSNECGIEEDVVAGLPSSK 232
+GWG D G YW+ AN WN WG D GYF+I RG +ECGIE +VAG+P +
Sbjct: 340 LGWGI-DQGVSYWLAANSWNTDWGEDVFSGYFRILRGVDECGIESGIVAGIPRKE 393
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 142/240 (59%), Gaps = 14/240 (5%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGC 61
+ N + + Q CGSCWA A A+SDRFC G+ ++ +S DLLACC CGDGC
Sbjct: 82 WPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGC 140
Query: 62 DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKK 113
+GG P AW YF G+V++ C PY H + YP TPKC C
Sbjct: 141 NGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDP 200
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
N + S ++Y + + +D M E++ GP EV+F VYEDF Y SGVY H++G +
Sbjct: 201 TIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYL 257
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
GGHAV+L+GWGTS +G YW +AN WN WG DGYF I+RGS+ECGIE+ AG+P + N
Sbjct: 258 GGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 316
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 111/243 (45%), Positives = 158/243 (65%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLP 229
G+P
Sbjct: 328 GMP 330
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/238 (45%), Positives = 141/238 (59%), Gaps = 14/238 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A A+SDRFC G+ ++ +S DLLACC CGDGC+G
Sbjct: 106 NCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCS-DCGDGCNG 164
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
G P AW YF G+V++ C PY H + YP TPKC C
Sbjct: 165 GDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTI 224
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
N + S ++Y + + +D M E++ GP EV+F VYEDF Y SGVY H++G +GG
Sbjct: 225 PVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG 281
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
HAV+L+GWGTS +G YW +AN WN WG DGYF I+RGS+ECGIE+ AG+P + N
Sbjct: 282 HAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLAPN 338
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 195 bits (496), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 111/243 (45%), Positives = 158/243 (65%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 269 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 327
Query: 227 GLP 229
G+P
Sbjct: 328 GMP 330
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 195 bits (496), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 138/232 (59%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 109 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY + + C H P C TPKC C + +
Sbjct: 168 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYA 226
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y + + +I EI NGPVE +FTVYED YK GVY+H G +GGHA+
Sbjct: 227 KDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 286
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++ YW++ N WN WG G+F+I RG + CGIE + AGLP
Sbjct: 287 RILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 338
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 195 bits (496), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 138/232 (59%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH G +N S +DL++CC CG GC+GG+P +AW Y
Sbjct: 99 QGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFGCNGGFPGAAWSY 157
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ + C PY + + C H P C TPKC C + +
Sbjct: 158 WTRKGIVSGGPYGSNQGCRPY-EISPCEHHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYA 216
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ +Y + + +I EI NGPVE +FTVYED YK GVY+H G +GGHA+
Sbjct: 217 KDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAI 276
Query: 179 KLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG ++ YW++ N WN WG G+F+I RG + CGIE + AGLP
Sbjct: 277 RILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAGLP 328
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 195 bits (495), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 107/232 (46%), Positives = 138/232 (59%), Gaps = 22/232 (9%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
+ ++ Q +CGSCWAFGA E++SDR+CIH M+L +S +L+ CC CG+GC+GG+ +
Sbjct: 96 IGLIEDQSNCGSCWAFGATESMSDRYCIHMKMHLLISAANLMECCRN-CGNGCEGGFLGA 154
Query: 69 AWRYFVHHGVVT-----------EECDPYFDSTGCSH--PGCEPAYP-----TPKCVRKC 110
AW Y+ G+VT + C PY C H G +PA P TP+CV C
Sbjct: 155 AWNYWKQEGLVTGGLYNPSATESDTCQPY-PLPSCEHHINGSKPACPSKIAKTPECVHTC 213
Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
+ HY SAY + +I EI NGPVE +FTVY DF YKSGVYK +
Sbjct: 214 HAGYPTSYEQDLHYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYADFPAYKSGVYKRHS 273
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
+GGHAVK+IGWG +DG YW++AN WN WG GYFKI RG +ECGIE
Sbjct: 274 LRQLGGHAVKMIGWG-EEDGIPYWLIANSWNSDWGDHGYFKIVRGQDECGIE 324
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/223 (48%), Positives = 132/223 (59%), Gaps = 20/223 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAFGA E+ +DR CIH N+ LS D++ C +GC+GG SAW +
Sbjct: 101 QARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE--TDNGCEGGDAFSAWNWLR 158
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQL-WRNSKHYSIS 126
G V+EEC PY + P C PA TP C ++C + L + KH
Sbjct: 159 KQGAVSEECLPY------TIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIYSQDKHKMAK 212
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
Y +SD E IM EI NGPVE FTV+EDF YKSGVY H TG +GGH VKL+G+GT
Sbjct: 213 IYSFDSD-EAIMQEIVTNGPVEACFTVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTL 271
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+G DY+ NQW SWG +G F IKRG +CGI +DVVAGLP
Sbjct: 272 -NGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/244 (44%), Positives = 140/244 (57%), Gaps = 24/244 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA V A+SDR CIH M LS DL++CC + CG+GC GG P +AW Y
Sbjct: 98 QSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDY 156
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
+ +G+VT C PY C HPG YPTP C C ++
Sbjct: 157 WWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQLNPCPRYTYPTPSCYPYCQAGYDKT 215
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ K Y ++Y ++ IM EI KNGPVE F VY DFA YKSG+Y H++G G H
Sbjct: 216 YEKDKVYGKTSYNVDRHEYTIMEEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKH 275
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
A+++IGWG ++G YW+ AN WN WG +GYF+I RG++EC IE VVAG+P L K
Sbjct: 276 AIRIIGWGV-ENGVKYWLTANSWNVGWGENGYFRILRGTDECRIESIVVAGMP---RLQK 331
Query: 237 EITS 240
IT+
Sbjct: 332 NITN 335
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/239 (45%), Positives = 141/239 (58%), Gaps = 28/239 (11%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ QG CGSCWA A A++DR+CI S D+LACC CGDGC GGY
Sbjct: 103 LNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACC-HACGDGCKGGYL 161
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP------------TPKCVRKC---V 111
AW+++V GV + PY GC HP YP TPKC ++C
Sbjct: 162 GPAWQFWVEQGVSSG--GPYNSRQGC-HP-----YPIDVCDASGEEADTPKCSKRCQSGY 213
Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
+W++ + Y AY I +D + IM EIY NGPV+ +F Y+D YKSGVY+H+ G
Sbjct: 214 NVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGH 272
Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+ GGHAVKL+GWG ++G YW++AN W WG +G+FKI RG N CGIE+DV AGLPS
Sbjct: 273 MAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLPS 330
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 112/243 (46%), Positives = 158/243 (65%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 11 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 70
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 71 CNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 129
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 130 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 189
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG ++G YW++AN WN WG +G+FKI RG + CGIE ++VA
Sbjct: 190 HVSGEIMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVA 248
Query: 227 GLP 229
G+P
Sbjct: 249 GMP 251
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 105/245 (42%), Positives = 142/245 (57%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + Q C SCWA + A++DR CIH LS D+++CC + CG G
Sbjct: 96 WANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYG 154
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEP----AYPTPK 105
C+GG P +W Y+ GVVT C PY CSH PG P YPTPK
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPK 213
Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C +KC N+ + K S+Y + DIM EI KNGPV+ F ++EDF YKSG+
Sbjct: 214 CEKKCHAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGI 273
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
Y + TG ++GGHA+++IGWG ++G YW++AN WN WG GYF+++RG+NECGIE +
Sbjct: 274 YHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARI 332
Query: 225 VAGLP 229
AGLP
Sbjct: 333 NAGLP 337
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 111/243 (45%), Positives = 158/243 (65%), Gaps = 18/243 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 11 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 70
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GG+P AW ++ G+V+ C PY C H P C TPKC
Sbjct: 71 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 129
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 130 KTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 189
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G++MGGHA++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VA
Sbjct: 190 HVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVA 248
Query: 227 GLP 229
G+P
Sbjct: 249 GMP 251
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 137/230 (59%), Gaps = 19/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA ALSDR CI + + +S D+L+CCG CG GC+GG+PI A+ Y
Sbjct: 24 QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 83
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
F G VT C PY C H G C TPKCVRKC + +
Sbjct: 84 FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 142
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ + AY + + + I EI KNGPV +FTVYEDF++YK G+YKH G GGHA
Sbjct: 143 KKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 202
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWG ++G YW++AN W+ WG +GYF+I RGSN CGIEE+VVAG
Sbjct: 203 IKIIGWG-KENGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 251
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/239 (45%), Positives = 141/239 (58%), Gaps = 28/239 (11%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ QG CGSCWA A A++DR+CI S D+LACC CGDGC GGY
Sbjct: 103 LNVIRNQGCCGSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACC-HACGDGCKGGYL 161
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP------------TPKCVRKC---V 111
AW+++V GV + PY GC HP YP TPKC ++C
Sbjct: 162 GPAWQFWVEQGVSSG--GPYNSRQGC-HP-----YPIDVCDASGEEADTPKCSKRCQSGY 213
Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
+W++ + Y AY I +D + IM EIY NGPV+ +F Y+D YKSGVY+H+ G
Sbjct: 214 NVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVYRHVWGH 272
Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+ GGHAVKL+GWG ++G YW++AN W WG +G+FKI RG N CGIE+DV AGLPS
Sbjct: 273 MAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAGLPS 330
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/234 (44%), Positives = 139/234 (59%), Gaps = 13/234 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI-HFGMNLSL-SVNDLLACCGFLCGDGCD 62
N ++++ Q CGSCWAFGA E +SDR CI G + S DLL+CCG CG GC
Sbjct: 87 NCRSIKMIRNQATCGSCWAFGAAEVMSDRICIASMGTKQPIISPTDLLSCCGNFCGYGCK 146
Query: 63 GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQ 115
G P+ A+R++ GVVT C PY C+ C + TP+C C ++
Sbjct: 147 GASPLQAFRWWNKKGVVTGGDYRGSGCKPY-PFAPCTALPCTKS-ETPRCSLNCQPAYSK 204
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ K++ AY + D I EI NGPVE +F VY+DF HY+SGVY+H+ G ++GG
Sbjct: 205 AYSKDKYFGTPAYIVGMDVAAIQTEI-TNGPVEAAFIVYDDFNHYRSGVYRHVAGKLVGG 263
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK+IGWG +G YW++AN W WG +G+FK+ RG +ECGIE +VAG P
Sbjct: 264 HAVKIIGWGI-QNGAPYWLMANSWGPYWGENGFFKMLRGVDECGIESTIVAGKP 316
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 140/232 (60%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N LS +DL++CC +CG GC+GG+P +AW Y
Sbjct: 108 QGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLVSCC-HICGFGCNGGFPGAAWSY 166
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V T+ C PY + C H P C TP C KC + +
Sbjct: 167 WTRKGIVSGGPYGSTQGCRPY-EIAPCEHHVNGTRPPCSHG-STPSCQHKCQASYSVEYA 224
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K++ +Y + + +I EI NGPVE +FTVYED YKSGVY+H G +GGHA+
Sbjct: 225 KDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAI 284
Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+++GWG + + YW++ N WN WG +G+F+I RG + CGIE + AGLP
Sbjct: 285 RILGWGVWGESKVPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAGLP 336
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 107/234 (45%), Positives = 140/234 (59%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WAF AVE +SDR CI ++ LS DLL+CC CG GC GG+P SAW Y
Sbjct: 112 QSRCGSGWAFAAVEVMSDRICIQSKGEKSVELSAVDLLSCC-RECGLGCLGGFPGSAWDY 170
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
+V GVVT C PY ++TG +P C + Y TPKC +KC K + +
Sbjct: 171 WVEEGVVTGSSGENHTGCQPYPFPKCEHNTTG-KYPACGQKIYETPKCQKKCQKGYKTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ KHY AY + ++ + I EI +GPV FTVY DF +YKSG+YKH+ G +G H
Sbjct: 230 KKDKHYGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHT 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+++GWG + G YW++AN WN WG GYF+I RG +EC IE V+ GLP +
Sbjct: 290 VRIVGWGV-EKGTPYWLIANSWNEGWGEKGYFRILRGKDECDIESLVIGGLPRN 342
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 194 bits (493), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 105/232 (45%), Positives = 133/232 (57%), Gaps = 12/232 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
+ ++++ Q +CGSCWAF E +SDR CI + +S DLL CCG CG+GCDGG
Sbjct: 97 KSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCDGG 156
Query: 65 YPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLW 117
+P A++++ GVVT C PY C+ C TP C C +
Sbjct: 157 FPYRAFQWWARRGVVTGGDYLGTGCKPY-PIRPCNSDNCV-NLQTPPCRLSCQPGYRTTY 214
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
N K+Y SAY + I A+IY NGPV +F VYEDF YKSG+Y+HI G GGHA
Sbjct: 215 TNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKGGHA 274
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWGT + G YW+ N W WG G F+I RG +ECGIE +VAGLP
Sbjct: 275 VKLIGWGT-ERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAGLP 325
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 194 bits (493), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 96/197 (48%), Positives = 130/197 (65%), Gaps = 16/197 (8%)
Query: 18 CGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
CGSCWAFGAVEA+SDR CIH +++ +S DLL CCG +CGDGC+GGYP AW ++ G
Sbjct: 1 CGSCWAFGAVEAISDRICIHTNVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG 60
Query: 78 VVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHY 123
+V+ C PY C H P C TPKC + C + ++ KHY
Sbjct: 61 LVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHY 119
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
+Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GW
Sbjct: 120 GYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGW 179
Query: 184 GTSDDGEDYWILANQWN 200
G ++G YW++AN WN
Sbjct: 180 GV-ENGTPYWLVANSWN 195
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 194 bits (492), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 136/211 (64%), Gaps = 16/211 (7%)
Query: 42 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 94
+ +S DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY C H
Sbjct: 1 VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEH 59
Query: 95 ------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 147
P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPV
Sbjct: 60 HVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV 119
Query: 148 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 207
E +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 120 EGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNG 178
Query: 208 YFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 179 FFKILRGQDHCGIESEVVAGIPRTDQYWEKI 209
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 194 bits (492), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 113/232 (48%), Positives = 131/232 (56%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH N SLS DLL+CC CG GC GGYP AW Y
Sbjct: 108 QSGCGSCWAFGAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCEN-CGYGCSGGYPAVAWDY 166
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKKNQLW 117
+ HG+VT D +GC P CE YPTP+CV+ C +
Sbjct: 167 WGAHGIVTGGSKE--DPSGCRSYPFPKCEHHVQGHYPPCPHQYYPTPECVQHCDTPGIDY 224
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K + +Y I S IM EI GPVE FTVYEDF YK GVY H G + HA
Sbjct: 225 VKDKTRANMSYNIYSSEILIMKEIMLRGPVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHA 284
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++++GWG D YW++AN WN WG GY K RG NECGIE+DV AGLP
Sbjct: 285 IRILGWGEEGD-VPYWLIANSWNEDWGEKGYMKFLRGLNECGIEDDVTAGLP 335
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 194 bits (492), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 135/206 (65%), Gaps = 16/206 (7%)
Query: 40 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 92
+++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 1 VSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPC 59
Query: 93 SH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 145
H P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNG
Sbjct: 60 EHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNG 119
Query: 146 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 205
PVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG
Sbjct: 120 PVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGD 178
Query: 206 DGYFKIKRGSNECGIEEDVVAGLPSS 231
+G+FKI RG + CGIE +VVAG+P +
Sbjct: 179 NGFFKILRGQDHCGIESEVVAGIPRT 204
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 193 bits (491), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 136/230 (59%), Gaps = 19/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA ALSDR CI + + +S D+L+CCG CG GC+GG+PI A+ Y
Sbjct: 24 QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 83
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
F G VT C PY C H G C TPKCVRKC + +
Sbjct: 84 FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 142
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ + AY + + + I EI KNGPV +FTVYEDF++YK G+YKH G GGHA
Sbjct: 143 KKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 202
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWG + G YW++AN W+ WG +GYF+I RGSN CGIEE+VVAG
Sbjct: 203 IKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILRGSNHCGIEENVVAG 251
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 108/225 (48%), Positives = 135/225 (60%), Gaps = 18/225 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GGYP AW Y+
Sbjct: 114 QSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 172
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYS 124
HG+V+E C PY F S C+H C Y TP C C KK L + Y
Sbjct: 173 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIK----YR 226
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
+ I S E E+ NGP EVSF+VY DF Y GVYKH+TG +GGHAV+++GWG
Sbjct: 227 GNTSYILSGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG 286
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GE YW +AN WN WG +GYF I RG +ECGIE VAG+P
Sbjct: 287 -ELNGEPYWKIANSWNHEWGMNGYFLIARGVDECGIEGSGVAGIP 330
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 106/227 (46%), Positives = 133/227 (58%), Gaps = 22/227 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GGYP AW Y+
Sbjct: 114 QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 172
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
HG+V+E C PY F S C+H C Y TP C C K +R +
Sbjct: 173 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTS 230
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y +S E E+ NGP EVSF+VY DF Y GVYKH+ G +GGHAV+++G
Sbjct: 231 YLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVG 284
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
WG +GE YW +AN WNR WG +GYF I RG +ECGIE VAG P
Sbjct: 285 WG-ELNGEPYWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTP 330
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 193 bits (491), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 113/266 (42%), Positives = 149/266 (56%), Gaps = 38/266 (14%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
++++ Q CGSCWAFGA E +SDR CI +SV D+L+CCG CG GC GGY
Sbjct: 46 IKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYS 105
Query: 67 ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWR 118
I A R++ G VT C PY S C P TP C C K + ++
Sbjct: 106 IEALRFWASSGAVTGGDYGGHGCMPY--SFAPCTKNC-PESTTPSCKTTCQSSYKTEEYK 162
Query: 119 NSKHY----------------SISAYRINSDPE--DIMAEIYKNGPVEVSFTVYEDFAHY 160
KHY SAY++ + +I EIY GPVE S+ VYEDF HY
Sbjct: 163 KDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHY 222
Query: 161 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
KSGVY + +G ++GGHAVK+IGWG ++G DYW++AN W S+G G+FKI+RG+NEC I
Sbjct: 223 KSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQI 281
Query: 221 EEDVVAGLPSSKNLVKEITSADMFED 246
E +VVAG + K T ++ +ED
Sbjct: 282 EGNVVAG------IAKLGTHSETYED 301
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 193 bits (490), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 104/245 (42%), Positives = 141/245 (57%), Gaps = 21/245 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + Q C SCWA + A++DR CIH LS D+++CC + CG G
Sbjct: 96 WANCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYG 154
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEP----AYPTPK 105
C+GG P +W Y+ GVVT C PY CSH PG P YPTPK
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPK 213
Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C +KC N+ + K S+Y + D M EI KNGPV+ F ++EDF YKSG+
Sbjct: 214 CEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPVDGIFYMFEDFLVYKSGI 273
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
Y + TG ++GGHA+++IGWG ++G YW++AN WN WG GYF+++RG+NECGIE +
Sbjct: 274 YHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARI 332
Query: 225 VAGLP 229
AGLP
Sbjct: 333 NAGLP 337
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 192 bits (489), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 118/252 (46%), Positives = 160/252 (63%), Gaps = 18/252 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S D+L CCG CGDG
Sbjct: 90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNGHVNVEVSAEDMLTCCGGQCGDG 149
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GGYP AW ++ G+V+ C PY C H P C TP+C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPY-SIPPCEHHVNGSRPACTGEGDTPRCS 208
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KHY S+Y ++SD +I AEIYKNGPVE +FTVY DF YKSGVY+
Sbjct: 209 KTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPVEGAFTVYSDFLMYKSGVYQ 268
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H TGD+MGGHA++++GWG ++G YW++AN WN WG G+FKI RG + CGIE ++VA
Sbjct: 269 HTTGDIMGGHAIRILGWG-EENGVPYWLVANSWNTDWGDKGFFKILRGQDHCGIESEIVA 327
Query: 227 GLPSSKNLVKEI 238
G+P + ++I
Sbjct: 328 GIPRTDQYWRQI 339
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 106/227 (46%), Positives = 133/227 (58%), Gaps = 22/227 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GGYP AW Y+
Sbjct: 114 QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYY 172
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
HG+V+E C PY F S C+H C Y TP C C K +R +
Sbjct: 173 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTS 230
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y +S E E+ NGP EVSF+VY DF Y GVYKH+ G +GGHAV+++G
Sbjct: 231 YVLSG------EEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVG 284
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
WG +GE YW +AN WNR WG +GYF I RG +ECGIE VAG P
Sbjct: 285 WG-ELNGEPYWKIANSWNREWGMNGYFLIARGVDECGIEGSGVAGTP 330
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/234 (46%), Positives = 140/234 (59%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
Q CGSCWAF AV A+SDR CIH +N+ LS DLLACC CG GC GG+ AW
Sbjct: 108 QSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWD 166
Query: 72 YFVHHGVVT-------EECDPY-------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
Y+ +G+VT C PY + G +P C E Y TP+CV +C K
Sbjct: 167 YWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYAT 226
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ + K + ++Y + I EI+ GPVE + VY DFA+Y GVYKH TG+++GG
Sbjct: 227 KYEDDKIRASTSYNLYRSVTTIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGG 286
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HA++L+GWG +DG YW+ AN WN SWG G+F+I RGS+ CGIE DV AGLP
Sbjct: 287 HAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 192 bits (487), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 103/229 (44%), Positives = 136/229 (59%), Gaps = 18/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF + E +SDR CI H + LS +D+L+CC G GCDGG+P+SAW+Y
Sbjct: 88 QSQCGSCWAFSSAEVMSDRVCIASHGHKKVELSADDILSCC-TDGGYGCDGGWPVSAWQY 146
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
FV GVVT + C PY + C TP C C + +
Sbjct: 147 FVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNCTQEIDTPDCKTTCQAGYPISYD 206
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
+ K Y +AY +++ I EI GPV +FTVY+DF HYK+G+YKH++G GGHAV
Sbjct: 207 DDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAV 266
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+++GWG G YW++AN WN WG +GYF+I RGS+ECGIE+ VVAG
Sbjct: 267 RILGWG-QQGGVPYWLVANSWNTDWGENGYFRILRGSDECGIEDGVVAG 314
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 109/234 (46%), Positives = 140/234 (59%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
Q CGSCWAF AV A+SDR CIH +N+ LS DLLACC CG GC GG+ AW
Sbjct: 108 QSGCGSCWAFAAVTAMSDRVCIHSNQTLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWD 166
Query: 72 YFVHHGVVT-------EECDPY-------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL 116
Y+ +G+VT C PY + G +P C E Y TP+CV +C K
Sbjct: 167 YWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYAT 226
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ + K + ++Y + I EI+ GPVE + VY DFA+Y GVYKH TG+++GG
Sbjct: 227 KYEDDKIRASTSYNLYRSVTAIQKEIWMRGPVEATMNVYTDFANYAGGVYKHTTGELLGG 286
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HA++L+GWG +DG YW+ AN WN SWG G+F+I RGS+ CGIE DV AGLP
Sbjct: 287 HAIRLLGWGVEEDGTPYWLAANSWNPSWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 136/230 (59%), Gaps = 13/230 (5%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG--- 63
+ ++ Q CGSCWAF A E++SDR CIH + +++S DLLACC CG GCDG
Sbjct: 103 IRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLACC-HTCGHGCDGRCH 161
Query: 64 --GYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRN 119
I R V V TE+ C PY S P C PTPKC C K + +
Sbjct: 162 CSSVAILQGRRLVPEPVRTEDGCQPY--SLPPCVPNCTHPEPTPKCQHVCRKGYEKSYEE 219
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KH++ + YR+ + I +IYKNGPVE +F VY DF YKSGVY+ MG HA+K
Sbjct: 220 DKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQHMIKFMGVHAIK 279
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++GWGT +DG YW++AN WN WG GYFKI RG +ECGIEE + AG+P
Sbjct: 280 ILGWGT-EDGVPYWLVANSWNVGWGDKGYFKILRGKDECGIEEVIDAGIP 328
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 102/227 (44%), Positives = 139/227 (61%), Gaps = 17/227 (7%)
Query: 23 AFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
AFGAVEA+SDR CIH + +S DL++CCG+ CG GC GG+P +AW ++ G+VT
Sbjct: 1 AFGAVEAMSDRLCIHTNGTFTKRISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVT 59
Query: 81 --EECDPY----FDSTGCSHPGCEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISA 127
+ +P + CSH G + Y TP CV+KC + + K +
Sbjct: 60 GGSKENPTGCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANIT 119
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + + IM EI NGPVE +F VYEDF YKSGVY H G ++GGHA++++GWG +
Sbjct: 120 YNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EE 178
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
+G YW++AN WN WG DGYFK+ RG NECGIE++V AGLP ++
Sbjct: 179 NGVAYWLIANSWNDGWGEDGYFKMLRGKNECGIEDEVTAGLPELSSI 225
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 104/243 (42%), Positives = 132/243 (54%), Gaps = 20/243 (8%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGDGCD 62
N + ++++ Q CGSCWAF A E SDR CI L S+S DLL CC CG GC
Sbjct: 100 NCQSIKLIRDQSTCGSCWAFAATETFSDRICIASNQTLQTSISSEDLLECCADYCGMGCK 159
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRK 109
GGYP +AW Y GV T C PY TG P C P PTP+CV++
Sbjct: 160 GGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQP-CGPIQPTPQCVKE 218
Query: 110 CVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-K 166
C + + H++ Y I + + I EI +GPV+ SF V DF YKSGVY +
Sbjct: 219 CNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVYIR 278
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
+ GGH+VK+IGWG + YW++AN WN WG G F++ RG NECGIE +VA
Sbjct: 279 NPKLKYEGGHSVKIIGWG-KEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVA 337
Query: 227 GLP 229
GLP
Sbjct: 338 GLP 340
>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 218
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 83/98 (84%), Positives = 90/98 (91%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QGHCGSCWAFGAVE+LSDRFCIH+ +++SLSVNDLLACC FLCG GCDGGYPI+AWRYF
Sbjct: 120 QGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFK 179
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
GVVTEECDPYFD+TGCSHPGCEP YPTPKC RKCVK
Sbjct: 180 RSGVVTEECDPYFDTTGCSHPGCEPLYPTPKCHRKCVK 217
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 131/218 (60%), Gaps = 15/218 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGA EALSDR I + +N+ LS DL++C GCDGGYPI+AW Y
Sbjct: 101 QQQCGSCWAFGATEALSDRLAIASNNSINVVLSPQDLVSCDS--TDYGCDGGYPINAWHY 158
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
GVVT+ C PY G S TP C K + +AY++ +
Sbjct: 159 MQSLGVVTDTCYPYTSGNGDSGTCQITGKKTPACATATFYKAK----------TAYQVAN 208
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+ I +EI NGPVE +F+VY+DF Y SGVY H +G + GGHAVK++GWG D Y
Sbjct: 209 NMAAIQSEILANGPVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWGV-DGTTPY 267
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
WI+AN W SWG G+F IKRG++ECGIE+ +VAGL +
Sbjct: 268 WIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAGLAA 305
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 190 bits (482), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 98/231 (42%), Positives = 138/231 (59%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A +SDR C+ + L V+D +LACCG CGDGC GG+P AW +
Sbjct: 107 QSRCGSCWAVSAASVMSDRLCVQSNGKIKLHVSDTDILACCGEFCGDGCSGGWPFQAWEW 166
Query: 73 FVHHGVVTE-------ECDPYFDSTGCSHP-----GCEP--AYPTPKCVRKCVKKN-QLW 117
+GV T C PY +H G P ++PTP+C + C + + +
Sbjct: 167 VRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRCEKFCQRGYIKPY 226
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K Y+ +Y + +D ++I +I KNGPV+ +F VYEDF YK G+YKH G GGHA
Sbjct: 227 KKDKFYAKKSYWLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
VK+IGWG D+G DYW++AN W++ WG G+F++ RG N+C IE+ + AG+
Sbjct: 287 VKIIGWG-KDNGTDYWLIANSWSKDWGESGFFRMVRGENDCEIEDMITAGI 336
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 190 bits (482), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 102/229 (44%), Positives = 138/229 (60%), Gaps = 19/229 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA + A+SDR CI + +S D+++CC + CGDGC+GG+PISA+R+
Sbjct: 113 QANCGSCWAVSSAAAMSDRICIASKGAKQVLISAQDVVSCCTW-CGDGCEGGWPISAFRF 171
Query: 73 FVHHGVVTE-------ECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQLWR 118
GVVT C PY + C H G E Y TP+C R+C+
Sbjct: 172 HADEGVVTGGDYNTKGSCRPY-EIHPCGHHGNETYYGECVGMADTPRCKRRCLLGYPKSY 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
S Y AY++ + + I +I KNGPV ++TVYEDFAHY+SG+YKH G G HAV
Sbjct: 231 PSDRYYKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAV 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
K+IGWG + G YWI+AN W+ WG +G+F++ RGSN+CG EE + AG
Sbjct: 291 KVIGWG-EEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 338
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 190 bits (482), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 105/232 (45%), Positives = 135/232 (58%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGA EA +DR CI + LS DLL CC CG GCDGG+ AWR+
Sbjct: 91 QSVCGSCWAFGAAEAATDRLCIASKGKIQDRLSEQDLLTCCD-SCGFGCDGGWLDMAWRW 149
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
F GV T + C+ Y C H P C + TP+CV++C + + +
Sbjct: 150 FQSTGVTTGGEYGSKDWCNAY-SFPKCEHHAEGKYPPCGESQETPECVKQCQEGYPVEYE 208
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KH+ AY + + I E+ NGP+EVSF VYEDF YKSG+Y+H+ G +GGHAV
Sbjct: 209 KDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLTYKSGIYQHVAGKYLGGHAV 268
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
KL+GWG +DG +YW +AN WN WG +GYF+I G ECGIE + G+P
Sbjct: 269 KLVGWGV-EDGIEYWKIANSWNEDWGENGYFRIVAGKGECGIEVGPIGGIPK 319
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 189 bits (481), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 101/222 (45%), Positives = 136/222 (61%), Gaps = 19/222 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR CI+ + S DL++CC +CG GC+GG P AW Y
Sbjct: 65 QGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEY 123
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWR 118
+ H G+V+ + C PY + C H PG C TPKC + C N ++
Sbjct: 124 WKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCQKNCESSYNVPFK 182
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y Y ++ + I AE++KNGPVE +FTVY D YK+GVYKH G+ +GGHA+
Sbjct: 183 KDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAI 242
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
K+IGWG ++ + YW++AN WN WG +G+FKI RG + CGI
Sbjct: 243 KIIGWGVENNNK-YWLIANSWNSDWGDNGFFKILRGEDHCGI 283
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 189 bits (481), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 105/226 (46%), Positives = 131/226 (57%), Gaps = 16/226 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWA A E +DR+CIH S DLL+CC CGDGC GG AW++
Sbjct: 111 QGCCGSCWAISAAETFTDRWCIHSEDKDQFSFGAYDLLSCC-HSCGDGCQGGNLGPAWQF 169
Query: 73 FVHHGVVTEECDPYFDSTGCSHP-------GCEPAYPTPKCVRKCVKKNQLWRNS--KHY 123
+V GV + PY GC HP + TPKC RKC + S + +
Sbjct: 170 WVQRGVSSG--GPYNSRQGC-HPYPVDVCHSADEDADTPKCTRKCQSMYNVTNVSDDRRF 226
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
AY ++ D E I EI++NGPV+ SF VY DF YK+GVY+H+ G + GGHAVK+IGW
Sbjct: 227 GRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGW 286
Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G ++G YW+ +N W WG G+FKI RG N CGIE DV AGLP
Sbjct: 287 GV-ENGTKYWLCSNSWGEDWGERGFFKIVRGENHCGIESDVHAGLP 331
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 189 bits (480), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 104/217 (47%), Positives = 130/217 (59%), Gaps = 20/217 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGC-DGGYPISAWR 71
QGHCGSCWAF + E LSDR CI N+ LS DLL+C G GC DGG AWR
Sbjct: 65 QGHCGSCWAFASTEVLSDRLCIQTRGSTNIILSSEDLLSC--DKAGRGCSDGGRLSEAWR 122
Query: 72 YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
Y GVV C PY +TG P+C+ KC + ++ K Y + Y +
Sbjct: 123 YMQKKGVVANRCKPYTSGATGF----------IPECMSKCTGEGHAYQ--KFYGLYLYTV 170
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
+ + + I EI NGPVE +FTVY D HYKSGVY H +G +GGHAVK++GWG D+ E
Sbjct: 171 SGENQ-IKVEIMTNGPVEAAFTVYSDIVHYKSGVYHHTSGGKLGGHAVKVLGWGVEDE-E 228
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+YW++AN W WG G+FKIKRGS+ECGIE V+ G
Sbjct: 229 EYWLVANSWGPDWGDQGFFKIKRGSDECGIESRVLTG 265
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 189 bits (479), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 147/239 (61%), Gaps = 18/239 (7%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCD 62
N + ++++ Q +CGSCWAFGA E +SDR CI +S D+L CC GC
Sbjct: 92 NCKSIKMIRDQAYCGSCWAFGAAEVISDRICIQSNGTDQPIISPEDILTCC--TNSHGCQ 149
Query: 63 GGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK--N 114
GG+ + A +++ GVVT + C PY CS C A TPKC +C K
Sbjct: 150 GGFVLEAMKFWKSKGVVTGGDFQGDGCIPY-SYGSCSD--CHTAQTTPKCKNECQVKYTK 206
Query: 115 QLWRNSKHYSISAYRINSDP--EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
++ K+Y SAYR+++ I +EI +NGPVE ++ VYEDF +YKSGVY++I+G
Sbjct: 207 NEYKEDKYYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKSGVYEYISGRH 266
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
MGGHAVK+IGWG ++ +YW++AN W +G +G+FK++RG+NECGIE VVAG+ S
Sbjct: 267 MGGHAVKIIGWGV-EENVNYWLIANSWGTGFGENGFFKMRRGNNECGIENYVVAGMAKS 324
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 133/216 (61%), Gaps = 19/216 (8%)
Query: 31 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 81
SDR CIH + +++S DLL CC CG GC+GGYP +AW+++ G+VT +
Sbjct: 1 SDRICIHTKGKVQVNISAEDLLTCCD-SCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTED 59
Query: 82 ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDP 134
C PY+ C H P C PTP+C + C + + + KH+ Y I+SD
Sbjct: 60 GCQPYYFPP-CEHHTVGPLPNCTGIKPTPECAKTCREGYEKSYTRDKHFGKKVYSISSDE 118
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
I EI KNGPVE F VY DF YKSGVY+ + +++GGHA++++GWGT +DG YW+
Sbjct: 119 TQIKTEICKNGPVEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGT-EDGVPYWL 177
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+AN WN WG GYFKI+RG++ECGIE D+ AG+P
Sbjct: 178 VANSWNEDWGDKGYFKIRRGNDECGIENDINAGIPK 213
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 137/231 (59%), Gaps = 13/231 (5%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
E + + QG CGSCWAFGAVE +SDR CI + S DLLACC CG GC GG
Sbjct: 93 ESIRKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQDLLACCK-ECGHGCGGG 151
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPKCVRKCV--KKNQLWRN 119
Y AW+Y+V G+V+ + S GC HP A+ TP C C K + +
Sbjct: 152 YSSRAWQYWVTDGIVSG--GDFNTSQGC-HPYSVQAFRDSTTPNCSSFCTNPKYQKNYSE 208
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
K Y +YRI + E I AEI +GPV+ S+ VY+DF Y++GVY+H+ G+V G H+VK
Sbjct: 209 DKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVK 268
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAGLP 229
++GWG ++G DYW++AN W R WG G+FK RG N C IE +++ G P
Sbjct: 269 ILGWG-RENGTDYWLVANSWGRDWGRLGGFFKFLRGENHCDIESNILGGDP 318
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/223 (44%), Positives = 137/223 (61%), Gaps = 19/223 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA++DR CI+ + S DL++CC +CG GC+GG P AW Y
Sbjct: 66 QGSCGSCWAFGAVEAMTDRVCIYSNATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEY 124
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKKNQL-WR 118
+ H G+V+ + C PY + C H PG C TPKC + C + ++
Sbjct: 125 WKHVGLVSGGNYNSSQGCRPY-EIPPCEHHVPGNRMPCNGDTKTPKCEKTCESSYTVPFK 183
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y Y ++ ++I AE++KNGPVE +FTVY D YKSGVY+H G+ +GGHA+
Sbjct: 184 KDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAI 243
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
K++GWG ++G YW++AN WN WG +G+ KI RG + CGIE
Sbjct: 244 KILGWGV-ENGSKYWLIANSWNSDWGDNGFLKILRGEDHCGIE 285
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/235 (43%), Positives = 134/235 (57%), Gaps = 14/235 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A+SDR C G+ L +S LL+CC CGDGCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKD-CGDGCDG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
GYP SAW Y+V HG+ + C PY C H G + P TPKC C K
Sbjct: 161 GYPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
K+ +Y + +D E+Y NGP V+F VY DF YK+GVY+H++GD +GG
Sbjct: 220 PL--IKYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFLGG 277
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
HAV+++GWG +G YW +AN W+ WG +G+F I RG+NECGIE AGLP+
Sbjct: 278 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPA 331
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 147/243 (60%), Gaps = 20/243 (8%)
Query: 2 PFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD 59
PF S H + QG CGSCWA V +SDR CIH +NL L+ DL+ CC CG+
Sbjct: 102 PFCQSIHS--VRNQGTCGSCWAVATVSVMSDRLCIHSDGEVNLELATEDLMGCCK-DCGN 158
Query: 60 GCDGGY-PISAWRYFVHHGVVT-------EECDPY-FDSTGCSHP--GCEPAYPTPKCVR 108
GC+GG+ +A++Y+V G+V+ E C PY F+ CS+P GC PKC+
Sbjct: 159 GCNGGFLDGTAFQYWVDAGLVSGAPYNSSEGCKPYPFEP--CSYPFVGCHHEKKNPKCLH 216
Query: 109 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C+ ++ +R K + +AY+I +D I EI NGPV F V+EDF Y SGVYKH
Sbjct: 217 HCINGYDRKYRKDKFFGATAYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSGVYKH 276
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ G +G HA++++GWGT ++G YW++AN + +WG G+FK+ RGSN GIE V+AG
Sbjct: 277 VVGKKVGMHAIRIVGWGT-ENGTPYWLIANSYGDTWGDKGFFKMLRGSNHLGIESTVIAG 335
Query: 228 LPS 230
LP
Sbjct: 336 LPQ 338
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/222 (45%), Positives = 136/222 (61%), Gaps = 17/222 (7%)
Query: 23 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV- 79
AFGAVEA+SDR CIH + + +S DL+ CC CG GC GG +AW+Y+ G+V
Sbjct: 1 AFGAVEAMSDRVCIHSNGRVQVDISAEDLMDCCD-KCGSGCSGGVSAAAWQYWKDAGLVS 59
Query: 80 ------TEECDPYF-----DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISA 127
T+ C PY S+ S P C PTPKC R+C + + + + K+++ +
Sbjct: 60 GGLYNTTDGCKPYSLAPCEHSSQGSLPECVGTLPTPKCKRQCREGYERSYDDDKYFAKNV 119
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y IN + I EI++NGPVE FT Y DF YKSGVY+H + D++G HA++++GWG S+
Sbjct: 120 YSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWG-SE 178
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
D YW+LAN WN WG GYFK+ RG NEC IE V AG+P
Sbjct: 179 DNNPYWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAGIP 220
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/243 (42%), Positives = 142/243 (58%), Gaps = 21/243 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+++ + + ++ Q CGSC AFGA EA+SDR CIH + +++S DLL CC CG G
Sbjct: 35 WSHCDSIHLIRDQSTCGSCRAFGATEAMSDRICIHTKGRVQVNISAQDLLTCC-HQCGMG 93
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCV 107
C GGYP +AW Y+ G+VT + C PY+ C H P C PTPKC+
Sbjct: 94 CFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPP-CEHHTKGPLPNCTDTKPTPKCL 152
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C K + + K+++ + Y ++SD I EIYKNGPVE F+VY DF YKSGVY+
Sbjct: 153 QVCRKGYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKNGPVEADFSVYTDFLAYKSGVYQ 212
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
+ ++ L GW W++AN WN+ WG GYFKI+RG+NECGIE D+ A
Sbjct: 213 RHSYELWEARHQNL-GWALKR--RSVWLVANSWNQDWGDKGYFKIRRGNNECGIENDINA 269
Query: 227 GLP 229
G+P
Sbjct: 270 GIP 272
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 104/237 (43%), Positives = 138/237 (58%), Gaps = 17/237 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
+ N + + + QGHCGSCWA + E L DRFCI LS L +C G
Sbjct: 86 WPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHLTSCTPGC--SG 143
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR-KC----VKKNQ 115
C+GG+ +A+ + +G++ E+C PY C HPGC +PTPKC + KC K +
Sbjct: 144 CNGGWMSTAFGFMQSNGILGEDCIPY-QMGKCKHPGCS-TWPTPKCNKTKCYPNDTKSTE 201
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
LW ++ S+Y + S+ DI EIY+NGPV SF VYED + Y+SGVY+H+TG G
Sbjct: 202 LW-----HAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEGL 256
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
HA+K++GWG DG YW + N W WG DG I+RG +ECGIE DVVAG P K
Sbjct: 257 HAIKVVGWGIL-DGVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPKLK 312
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 103/240 (42%), Positives = 137/240 (57%), Gaps = 20/240 (8%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
N + + Q +CGSCWA ALSDR CI + +S D ++CC CG GCD
Sbjct: 106 NCTSIRHIRDQANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCE-SCGYGCD 164
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVR 108
GG+PI A+ ++ + G VT + C PY C H G C TPKC R
Sbjct: 165 GGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGNDTYYGECPKGAKTPKCRR 223
Query: 109 KCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
+C + + + K Y AY + + I EI KNGPV +FTVYEDF++YK G+YKH
Sbjct: 224 RCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKH 283
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
G GGHA+K+IGWG +D YW++AN W+ WG +GYF++ RG NECGIE++VVAG
Sbjct: 284 TAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVAG 342
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 132/221 (59%), Gaps = 11/221 (4%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIH--FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA + ++DR CI LS +L++CC +CG GCDGGYP A+ Y
Sbjct: 106 QANCGSCWAVSSASVMTDRICIESIAAKQPLLSEEELVSCCK-ICGYGCDGGYPDKAFIY 164
Query: 73 FVHHGVVTEECDPYFDSTGCSH----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISA 127
+ G+ T PY + GC E TP C R+C+ + +H+
Sbjct: 165 WATRGIPTG--GPYGSTKGCKPYSIGSNSEDEAETPLCTRQCINEYPYNLSQDRHFGEKP 222
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y +NS+ E IM E+YKNGPV V+F VYEDF +Y GVY+H G +GGHAVKLIGWG +
Sbjct: 223 YWVNSNEEQIMQELYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGI-E 281
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+ + YW+++N WN +WG +G+FKI RG N C IE VVAG+
Sbjct: 282 NSKKYWLISNSWNTTWGENGFFKIIRGKNCCAIESYVVAGM 322
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 98/218 (44%), Positives = 131/218 (60%), Gaps = 14/218 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA E LSDRF I + ++LS L+ C L GC GG+PI+AW Y
Sbjct: 103 QGQCGSCWAFGASEVLSDRFAIASKNQIYVTLSAQQLVDCD--LDNSGCSGGWPINAWNY 160
Query: 73 FVHHGVVTEEC-DPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
V G++TE+C PY+ C T C + K + + Y + A +
Sbjct: 161 MVKTGLLTEQCYGPYY----AKQYTCRLTANTTDCPWQPGVKARFYHAKSAYKLPAKNV- 215
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
E I +I NGPVE FT+++DF Y+SG+Y H TG +GGHA+K++GWGT D+ D
Sbjct: 216 ---EAIQTDIMNNGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHAIKILGWGTEDN-VD 271
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
YW+ AN W +WG GYFKI+RG++ECGIE+ + AGLP
Sbjct: 272 YWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLP 309
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 186 bits (472), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 106/234 (45%), Positives = 133/234 (56%), Gaps = 23/234 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+ DR CI + LS +D+L+CC CG GC+GG AW Y
Sbjct: 116 QSACGSGWAVAAVGAIMDRICIASEGKQQVILSADDILSCCT-ECGYGCEGGDTYKAWNY 174
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE-------------PAYPTPKCVRKCVKKNQL 116
+ G+VT Y +GC +P CE YPT C KC +
Sbjct: 175 WTTDGIVTGS--NYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTI 232
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+ KHY Y + D I EI +GPVEV+F VYEDF HY SG+YKH+ G+ +G
Sbjct: 233 SYDEDKHYGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGV 292
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK++GWGT ++G DYWI AN WN WG +G+F+I RG NECGIE +VVAG P
Sbjct: 293 HAVKMLGWGT-ENGVDYWICANSWNSDWGENGFFRILRGENECGIESNVVAGKP 345
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 186 bits (471), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 138/231 (59%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA + EA+SD+ C+ + +S D+L+CCG CG GC+ PI A+R+
Sbjct: 109 QSACGSCWAVSSAEAMSDQICVQSNRTTRVMISDTDILSCCGISCGYGCEV-LPIEAYRW 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKK-NQLW 117
VVT + C PY +H P +PTPKC + C +K N+ +
Sbjct: 168 MQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGPCPRGLWPTPKCRKACQRKYNKSY 227
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K+++ +Y + S+ I EIYKNGPV +F VY+DF++Y+ G+Y H G G HA
Sbjct: 228 NEDKYFATRSYYLPSNERSIREEIYKNGPVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHA 287
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
VK++GWG ++G DYW++AN WN WG +GYF+I RGSNECGIE +V+G+
Sbjct: 288 VKVVGWG-RENGTDYWLIANSWNTDWGENGYFRIARGSNECGIEGQMVSGV 337
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/230 (45%), Positives = 133/230 (57%), Gaps = 19/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA ALSDR CI + + +S D+L+CCG CG GC+GG+PI A+ Y
Sbjct: 24 QANCGSCWAVSTASALSDRICIASNGRKQVHVSATDILSCCGNQCGYGCNGGWPIQAFNY 83
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKC-VKKNQLW 117
F G VT C PY C H G C TPKCVRKC + +
Sbjct: 84 FSKQGAVTGGDYKATSGCRPY-PFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSY 142
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ + AY + + EI KNGPV +FTVYEDF++YK G+YKH G GGHA
Sbjct: 143 KKDRSIGKDAYEEPNAEKATQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHA 202
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWG + G YW++AN W+ WG +GYF+I GSN CGIEE+VVAG
Sbjct: 203 IKIIGWG-KEGGVPYWLIANSWHNDWGENGYFRILCGSNHCGIEENVVAG 251
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/230 (43%), Positives = 133/230 (57%), Gaps = 20/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA ALSDR CI + +S D ++CC C GCDGG+PI A+ +
Sbjct: 116 QANCGSCWAVSTASALSDRICIESNGETQMHISSIDFVSCCE-SCSYGCDGGWPILAFDF 174
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
+ + G VT + C PY C H G C TPKC R+C + + +
Sbjct: 175 YTYEGAVTGGDYGSKDGCRPY-PFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAY 233
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K Y AY + + I EI KNGPV +FTVYEDF++YK G+YKH G GGHA
Sbjct: 234 YMDKSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYEDFSYYKKGIYKHTAGQARGGHA 293
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+K+IGWG +D YW++AN W+ WG +GYF++ RG NECGIE++VVAG
Sbjct: 294 IKIIGWGVEND-VPYWLIANSWHNDWGEEGYFRMIRGINECGIEQEVVAG 342
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 105/225 (46%), Positives = 132/225 (58%), Gaps = 14/225 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWA A EA +DR+CIH + + S DL++CC CGDGC GG AW Y
Sbjct: 120 QGCCGSCWAISAAEAFTDRWCIHSPEHTTFSFGSFDLISCC-HSCGDGCQGGVLGPAWDY 178
Query: 73 FVHHGVVTEECDPYFDSTGC-SHPGCEPAYP-----TPKCVRKCVKKNQLWRNSK--HYS 124
+V GV + PY GC S+P P PKC RKC + SK +
Sbjct: 179 WVQKGVSSG--GPYNSKQGCHSYPFDTCHSPDEDDDAPKCSRKCQSSYSVQDVSKDRRFG 236
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
AY + +D IM EI+ NGPV+ +F VY DF YKSGVY+H+TG + GGHA+K++GWG
Sbjct: 237 RVAYSVVADEHRIMEEIFVNGPVQAAFQVYLDFKTYKSGVYRHVTGPLEGGHAIKILGWG 296
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++G YW+ +N W WG G+FKI RG N GIE DV AGLP
Sbjct: 297 V-ENGTKYWLCSNSWGEDWGDHGFFKIVRGENHLGIETDVHAGLP 340
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 135/235 (57%), Gaps = 14/235 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A+SDR C G+ L +S LL+CC CGDGCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCK-DCGDGCDG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
GYP +AWRY+V HG+ + C PY C H G + P TPKC C K
Sbjct: 161 GYPDAAWRYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ +Y + +D E+Y NGP V+F V+ DF YK+GVY+H++GD +GG
Sbjct: 220 PL--IEYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFLGG 277
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
HAV+++GWG +G YW +AN W+ WG +G+F RG+NECGIE + AGLP+
Sbjct: 278 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAGLPA 331
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 103/245 (42%), Positives = 138/245 (56%), Gaps = 23/245 (9%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + Q C SCWA G A++DR CIH LS DL++CC + CG G
Sbjct: 96 WPNCSSISEIPDQSSCSSCWAVGTASAMTDRICIHSNGEKKPRLSAVDLVSCCPY-CGYG 154
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEP----AYPTPK 105
C+GGYP AW Y+ HG+V+ C PY CSH PG P Y TPK
Sbjct: 155 CEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPY-PFPKCSHLEETPGLAPCPRELYATPK 213
Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C ++C ++ K S+Y + DIM EI NGPV + ++EDF YKSG+
Sbjct: 214 CEKQCQAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPVSTIYYIFEDFTVYKSGI 273
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
Y++ +G +MGGH + IGWG ++G YW+ AN WN WG +GYF+I+RG+NECGIE +
Sbjct: 274 YQYTSGSLMGGHGI--IGWGV-ENGVKYWLAANSWNEGWGENGYFRIRRGTNECGIESRI 330
Query: 225 VAGLP 229
AGLP
Sbjct: 331 NAGLP 335
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 135/235 (57%), Gaps = 14/235 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A+SDR+C G+ L +S L++CC CGDGC G
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLMSCCED-CGDGCKG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
G P SAW Y+V HG+ + C PY C H G + P TPKC C K
Sbjct: 161 GAPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
K+ ++Y + + +D E+Y NGP V F VY DF YK+GVY+H++GDV+GG
Sbjct: 220 PL--IKYRGNNSYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVLGG 277
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
HAV+++GWG +G YW +AN W+ WG +G+F I RG+NECGIE AGLP+
Sbjct: 278 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPA 331
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 112/253 (44%), Positives = 150/253 (59%), Gaps = 32/253 (12%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
++++ Q +CGSCWAFGA E +SDR CIH +S D+L CCG CG+GC GG
Sbjct: 86 IKMVRDQSNCGSCWAFGAAEVISDRICIHSNGKEQPVISAEDILTCCGKSCGNGCQGGQG 145
Query: 67 ISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL--WR 118
+ A +++ +G VT + C PY CS+ C + TP C KC + ++
Sbjct: 146 LEAMKFWTTYGAVTGGDYKGDGCKPY-SFAPCSN--CVESKTTPSCQSKCQSTYTVTNYK 202
Query: 119 NSKHYS---------------ISAYRINSDPED---IMAEIYKNGPVEVSFTVYEDFAHY 160
KHY SAYR+++ I EIY+NGPVEV++TVY+DF HY
Sbjct: 203 GDKHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPIIQNEIYQNGPVEVAYTVYDDFYHY 262
Query: 161 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
KSGVY H+TG GGHAVK+IGWGT + G DYW++ N W S+G G+FKI+RG+NECGI
Sbjct: 263 KSGVYHHVTGKDTGGHAVKIIGWGT-EKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGI 321
Query: 221 EEDVVAGLPSSKN 233
E +VVAG+ N
Sbjct: 322 ESNVVAGMAKVGN 334
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/236 (42%), Positives = 138/236 (58%), Gaps = 19/236 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q CGSCWAF E++SDR CI N + SV D+L CC CG GCDGG+P
Sbjct: 109 ISLIRDQADCGSCWAFAVGESISDRVCIATDANKTAEFSVEDILTCCD-ECGFGCDGGFP 167
Query: 67 ISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY------PTPKCVRKCVKK 113
+AW YFV GVVT C PY S +HP E Y TP C C K
Sbjct: 168 DAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPN-ETFYRNCTGVSTPSCKTSCQKG 226
Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ +++ K +Y + + I +I K+GP+ +F+VYEDF +YK G+Y++ G
Sbjct: 227 YPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVATFSVYEDFMYYKKGIYRYTHGGY 286
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GGHAV+++GWG ++ + YWI+AN WN WG DG+F++ RG N+CGIEE V AGL
Sbjct: 287 EGGHAVRILGWGVENNVK-YWIIANSWNTDWGEDGFFRMVRGINDCGIEESVSAGL 341
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 137/235 (58%), Gaps = 15/235 (6%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A+SDR+C G+ L +S LL+CC CG GCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLLSCCKD-CGYGCDG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
GYP +AW Y+V HG+ + C PY C H G + P TPKC C K
Sbjct: 161 GYPGTAWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
K+ +Y ++ + +D E+Y NGP V+F VY DF YK+GVY+H++GDV+GG
Sbjct: 220 PL--IKYRGNHSYGLDGE-DDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDVLGG 276
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
HAV+++GWG +G YW +AN W+ WG +G+F I RG +ECGIE + AGLP+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGKDECGIESEGYAGLPA 330
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 106/234 (45%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC CG GCDGG+P AW Y
Sbjct: 112 QSRCGSSWAVSAVGAISDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V HG+VT C PY C H P C + Y TP+C RKC K +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ KHY A + + I EI GPVE ++EDF +YKSG+YK+ TG +G H
Sbjct: 230 EHDKHYGGIAINVIKNELAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHY 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V++IGWG ++G YW+ AN WN WG GYF+I RG NEC IE VVAG S
Sbjct: 290 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 102/235 (43%), Positives = 135/235 (57%), Gaps = 15/235 (6%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A+SDR C G+ L +S LL+CC CG GCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKD-CGYGCDG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
GYP +AWRY+V HG+ + C PY C H G + P TPKC C K
Sbjct: 161 GYPDAAWRYYVSHGLASSYCQPY-PFPHCDHHGGKGKKPPCSKYDFHTPKCNTTCTDKAI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
K+ +Y ++ + ED E+Y NGP V+F VY DF YK+GVY+H++GDV+GG
Sbjct: 220 PL--IKYRGNHSYEVHGE-EDYKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLGG 276
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
HAV+++GWG +G YW +AN W+ WG +G+F I RG +ECGIE AG P+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFLILRGKDECGIEHQGYAGSPA 330
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 101/231 (43%), Positives = 134/231 (58%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+ WAF AV+A+SDR CI ++ LS DLL+CC CG GC G+P AW Y
Sbjct: 112 QSRCGAGWAFAAVQAMSDRICIESKGKKSVELSAVDLLSCC-IECGLGCQMGFPGIAWDY 170
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C E Y PKC +KC K + +
Sbjct: 171 WVQEGIVTGGSKENHTGCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYE 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K+Y +Y + + + I EI +GPVE SF V+ DF +YKSG+YKH+TG +G H V
Sbjct: 231 KDKYYGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVV 290
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++IGWG + YW++AN WN WG GYF++ RG +ECGIE V +GLP
Sbjct: 291 RIIGWGVEKE-TPYWLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSGLP 340
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 182 bits (463), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 103/229 (44%), Positives = 129/229 (56%), Gaps = 22/229 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAFG+VE ++DR CI S +DLLACC CG GCDGG P A+ Y
Sbjct: 104 QGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADDLLACCT-ACGKGCDGGAPYRAFEY 162
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHY 123
+V G+V+ E C PY S + TPKC KC+ K + KHY
Sbjct: 163 WVAKGIVSGGDYNSNEGCQPYEGSAFLNSV-------TPKCSTKCLNSKYTTPYAKDKHY 215
Query: 124 SIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y + + +I EI NGPV VYEDF YKSGVY+H++G+ MGGHAVK+IG
Sbjct: 216 GTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQHVSGNSMGGHAVKIIG 275
Query: 183 WGTSDDGEDYWILANQWNRSWG-ADGYFKIKRGSNECGIEEDVVAGLPS 230
WGT + G YW++AN W W DG++KI RG N C IE + G P
Sbjct: 276 WGT-EKGVPYWLIANSWGAKWADLDGFYKILRGKNHCKIETYIYGGTPQ 323
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 182 bits (463), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 104/234 (44%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC CG GCDGG+P AW Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V HG+VT C PY C H P C + Y TP+C RKC K +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ KHY + + + I EI GPVE ++EDF +YKSG+Y++ TG +G H
Sbjct: 230 EHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHY 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V++IGWG ++G YW+ AN WN WG GYF+I RG NEC IE VVAG S
Sbjct: 290 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 182 bits (463), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTSCRPY-PFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 182 bits (463), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 133/237 (56%), Gaps = 25/237 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCG--FLCGDGCDGGYPISAW 70
Q CGSCWAFGAVEA+SDR CIH + + +S DL +CC F CG GCDGGY W
Sbjct: 106 QSSCGSCWAFGAVEAMSDRICIHSDQSNQVYVSAEDLNSCCFGLFACGLGCDGGYVAEPW 165
Query: 71 RYFVHHGVVTEECDPYFDSTGCSHPGCEPA----------------YPTPKCVRKCVKKN 114
Y+ G+VT Y S GC EP + TP+CVR C + +
Sbjct: 166 DYWRTDGIVTG--GAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPECVRSCYESS 223
Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VM 173
+ S + ++ + + EI KNGP+E +FTVY DF YKSGVY+ D +
Sbjct: 224 LDYTESLTFGQQVSTFTNEKQ-MQLEILKNGPIEAAFTVYNDFLSYKSGVYQATAQDESV 282
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
GGHA+K++GWG ++G YW++AN WN WG +GYFK RG + CGIE + A LP+
Sbjct: 283 GGHAIKVLGWGV-EEGTKYWLIANSWNTDWGDNGYFKFLRGVDHCGIESETAASLPA 338
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/234 (44%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC CG GCDGG+P AW Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V HG+VT C PY C H P C + Y TP+C RKC K +
Sbjct: 171 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ KHY + + + I EI GPVE ++EDF +YKSG+Y++ TG +G H
Sbjct: 230 EHDKHYGGISINVIKNESAIQNEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHY 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V++IGWG ++G YW+ AN WN WG GYF+I RG NEC IE VVAG S
Sbjct: 290 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVVVAGRLKS 342
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/231 (44%), Positives = 138/231 (59%), Gaps = 18/231 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q HCGSCWA + E +SDR C+ + + LS D+LACC CG GC GG+ I AW Y
Sbjct: 112 QSHCGSCWAVSSAETMSDRLCVQSNGTIKVLLSDTDILACCPN-CGAGCGGGHTIRAWEY 170
Query: 73 FVHHGVVT-------EECDPY--FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSK 121
F + GV T + C PY + S+ C + ++PTPKC + C K ++ + + K
Sbjct: 171 FKNTGVCTGGLYGTKDSCKPYAFYPCKDESYGKCPKDSFPTPKCRKICQYKYSKKYADDK 230
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
+Y+ SAYRI + I EI +NGPV SF +Y DF Y+ GVY G +GGHA+K+I
Sbjct: 231 YYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPDFGFYEKGVYVTSGGRELGGHAIKII 290
Query: 182 GWGTSD-DGED--YWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAGL 228
GWGT +G D YW++AN W WG +GYF+I RG N C IE+ V+AG+
Sbjct: 291 GWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIEQKVIAGM 341
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/225 (44%), Positives = 132/225 (58%), Gaps = 20/225 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAF AVE +SDR CIH S DLL+CC CG C GGY ++A+ +
Sbjct: 103 QGSCGSCWAFAAVETMSDRICIHSSGAKKFFFSAEDLLSCCT-ACGS-CSGGYMMAAFDF 160
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYS 124
++ GVV+ E C PY T +H TP C + C K + + KHY
Sbjct: 161 YIKQGVVSGGDLNSNEGCRPY---TADAHDKGV----TPSCTKSCRKGYPTSYSSDKHYG 213
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
Y +++ +I EI NGP+ VSF VY+DF +Y SGVY H++G+ G H VK++GWG
Sbjct: 214 SKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWG 273
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
T + +DYW++AN W SWG G+FKI RG NECGIE + A LP
Sbjct: 274 TEKE-QDYWLIANSWGSSWGEHGFFKILRGKNECGIENNPYAVLP 317
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 134/231 (58%), Gaps = 21/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA ALSDR CI + +S D+L+CC CGDGCDGGY I A+++
Sbjct: 113 QADCGSCWAVSTASALSDRICIASKGAKQVYVSATDILSCC-HSCGDGCDGGYVIDAFKF 171
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKNQL-W 117
F G VT + C PY C H G E Y TP+CVRKC + + +
Sbjct: 172 FAEQGAVTGGDYGAKDCCRPY-PFHPCGHHGNETYYGECPEDGSTPECVRKCQEGYETEY 230
Query: 118 RNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ AYR+ + I EI +NGPV +F V++DF+ Y+ G+Y H+ G GGH
Sbjct: 231 HEDRVRGEDAYRLPIGSVKAIQKEIMRNGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGH 290
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
AVK+IGWGT + G YWI+AN W+ WG DGYF++ RG N+CGIE +VVAG
Sbjct: 291 AVKIIGWGT-EHGVPYWIIANSWHSDWGEDGYFRMVRGINDCGIETNVVAG 340
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/237 (43%), Positives = 129/237 (54%), Gaps = 21/237 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH N SLS DL++CC CG GC GGY AW
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDL 166
Query: 73 FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
+ HG+VT TGC P CE YPTP+C+++C K +
Sbjct: 167 WKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTKEIDY 224
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K + +Y + + +M EI GPV VYED YKSGVY H+ G +G H
Sbjct: 225 EKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHG 284
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
++++GWG +DG YW++AN WN WG GY ++ R NECGI + V AGLP N
Sbjct: 285 IRILGWG-EEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPDLSNF 340
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C S WA +V A+SDR CI G ++ LS DL++CC CG GCDGGY + +W Y
Sbjct: 112 QSRCASSWAVSSVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGYFLPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V HG+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVSHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 133/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 107/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWAFGAVEA++DR CI G S ++ L L C CG GC GG+P AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCKDCGGGCKGGFPGQAWDYW 171
Query: 74 VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
V G+VT EE C PY F T +P C Y TP+C + C K + +
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIR 291
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+IGWG + G+ YW++AN WN WG G F++ RG +EC IE VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 138/241 (57%), Gaps = 22/241 (9%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCD 62
N ++ + Q +CGSCWA LSDR CI + ++ D ++CC CG GC+
Sbjct: 106 NCTSIKHIRDQANCGSCWAVSTASVLSDRICIASKQKKQVHISSIDFVSCCD-SCGFGCE 164
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAY-------PTPKCVR 108
GG+PI A+ Y+ + GVVT C PY C H G E Y TP+CV+
Sbjct: 165 GGWPIDAFEYYSYQGVVTGGDYGSKTGCRPY-PFHPCGHHGNETYYGECPKEESTPECVK 223
Query: 109 KCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+C K KN +R K + Y + + + I EI ++GPV SFTVY+DF++Y G+YK
Sbjct: 224 QCQKGYKNS-YRRDKTWGEDYYEVENSVKAIQREIMRSGPVVSSFTVYDDFSYYVKGIYK 282
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H G G HA+K+IGWGT + YWI+AN W+ WG G+F++ RG+N CGIEEDVVA
Sbjct: 283 HTAGKARGSHAIKIIGWGT-EKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEEDVVA 341
Query: 227 G 227
G
Sbjct: 342 G 342
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 125/203 (61%), Gaps = 18/203 (8%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVE++SDR C+H G N+ +S DLL+CCGF CG G
Sbjct: 23 WPNCPTIREIRDQGSCGSCWAFGAVESMSDRVCVHSGGKQNVEVSAEDLLSCCGFECGMG 82
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCE-PAYPTPKC 106
C+GGYP AW+Y+ G+V+ C PY C H P C TPKC
Sbjct: 83 CNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPP-CEHHVNGSRPSCSGEGGDTPKC 141
Query: 107 VRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
V+KC + K Y SAY + S PE IM EIYK+GPVE +FTVYEDF YKSGVY
Sbjct: 142 VQKCDSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSGVY 201
Query: 166 KHITGDVMGGHAVKLIGWGTSDD 188
+H TG+ +GGHA+K++GWG ++
Sbjct: 202 QHHTGEAVGGHAIKILGWGIENN 224
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 179 bits (455), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 179 bits (455), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 132/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 179 bits (455), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC I+ ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAGLIKS 342
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 179 bits (454), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 79 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 137
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 138 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 195
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 196 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 255
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 256 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 179 bits (454), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C S WA AV A+SDR CI G ++ LS DL++CC CG GCDGG+P AW Y
Sbjct: 86 QSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDY 144
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQL-W 117
+V HG+VT C PY C H P C + Y TP+C RKC K + +
Sbjct: 145 WVSHGIVTGGSKENHTGCQPY-PFPKCEHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPY 203
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ KHY + + + I EI GPVE ++EDF +YKSG+Y++ TG +G H
Sbjct: 204 EHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHY 263
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V++IGWG ++G YW+ AN WN WG GYF+I RG NEC +E VVAG S
Sbjct: 264 VRIIGWGI-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSVESVVVAGRLKS 316
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 179 bits (454), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 101/235 (42%), Positives = 133/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 97/238 (40%), Positives = 133/238 (55%), Gaps = 19/238 (7%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDG 63
S+ + +V Q CGSCWA A A+SDR CI + + +S +LL+CC CG GC+G
Sbjct: 93 SDVISTVVDQSDCGSCWAVAAASAMSDRRCIASQGKLKVPVSAENLLSCCDS-CGYGCEG 151
Query: 64 GYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPA-YPTPKCVRK 109
GYP AW Y++ G+ T + C PY C H C Y TP C K
Sbjct: 152 GYPTMAWSYWIDTGITTGGLYGSKQGCQPY-SLQPCEHHTEGNKVQCSTLDYDTPSCKHK 210
Query: 110 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
C +++ + + R +I EI NGPVE +F VY DF +YKSGVY+H+
Sbjct: 211 CDDSALNYKSELTFGSGSVRNFYSVANIQKEILTNGPVEAAFDVYSDFVNYKSGVYQHVA 270
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
G+ +GGHAV+++GWG + G YW++AN WN WG G FKI+RG+NE G E+ +VA
Sbjct: 271 GEYLGGHAVRILGWG-EESGVPYWLVANSWNEDWGDKGLFKIRRGNNESGFEDSIVAA 327
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 101/232 (43%), Positives = 127/232 (54%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWAFG A +DR C+ N LS ++ CC CG GC GGYPI AW+Y
Sbjct: 110 QGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEEITFCC-HTCGFGCHGGYPIKAWKY 168
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT E C+PY D G + +P +C R C L N
Sbjct: 169 FSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKGNNTCAGKPIEKNHRCTRMCYGDQDLDYN 228
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H ++ Y + I ++ GP+E SF VY+DF YKSGVY K +GGHA
Sbjct: 229 DDHRFTRDFYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYEKTENASYLGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG ++G YW++ N WN WG G FKI+RG+NECGI+ AG+P
Sbjct: 287 VKLIGWGV-EEGTPYWLMVNSWNAQWGDKGLFKIRRGTNECGIDNSTTAGVP 337
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AG S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGRIKS 342
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 109/235 (46%), Positives = 134/235 (57%), Gaps = 26/235 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCG---DGCDGGYPIS 68
Q CGSCWAF EA SDR CI + LS ACC G GCDGG P S
Sbjct: 150 QSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPLSAGHTAACCSEAEGCFSFGCDGGQPDS 209
Query: 69 AWRYFVHHGVVTE---ECDPYFDSTGCSH----PGCEPAY---PTPKCVRKCVKKNQLWR 118
AWR+F HGVV+E C PY + CSH G EP P+P C C +N ++
Sbjct: 210 AWRWFSEHGVVSELDSGCWPY-NFPECSHHVETKGMEPCKGNSPSPVCSTTC--RNHHFK 266
Query: 119 NS----KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
S +H++ + ++I EI NGPV +FTVYEDF +YKSGVYKH+ G +G
Sbjct: 267 PSFESDRHFTEDEGYSLDEVDEIKKEIIDNGPVAAAFTVYEDFLYYKSGVYKHVNGSELG 326
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GHAVK+IGWGT D E YW++ N WN +WG G FKI G ECGI+ +V AG+P
Sbjct: 327 GHAVKIIGWGT-DQNEQYWLVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIP 378
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 104/232 (44%), Positives = 125/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWAFG A +DR CI N LS +L CC CG GC+GGYPI AW
Sbjct: 106 QGHCGSCWAFGTSSAFADRLCIATEGEFNELLSAEELTFCC-HKCGFGCNGGYPIRAWER 164
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +P +C R C L +
Sbjct: 165 FRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFN 224
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
N HY+ AY + I ++ GP+E SF VY+DF YKSGVY K +GGHA
Sbjct: 225 NDHHYTRDAYYLTYGT--IQNDVLTYGPIEASFEVYDDFPSYKSGVYVKTENASYLGGHA 282
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 283 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 333
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/228 (43%), Positives = 134/228 (58%), Gaps = 18/228 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWA A A++DR+C+ DLL+CC CG GC GG AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
+V G+ + + C PY C PG + TPKC KC +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 YGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
WG ++G YW++AN W R WG +G+FKI RG N CGIEE++ AGLP+
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKIVRGENHCGIEENIHAGLPN 368
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/232 (42%), Positives = 128/232 (55%), Gaps = 19/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A+SDR CI +++S D+L CC CG GC GG+ I AW Y
Sbjct: 106 QANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQCGFGCGGGWSIRAWEY 165
Query: 73 FVHHGVVTE-------ECDPYFDSTGCSHPG-------CEPAYPTPKCVRKCVKK-NQLW 117
FV+ GVV+ C PY C H G C TP C +KC +++
Sbjct: 166 FVYEGVVSGGEYLTKGVCRPY-PIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIF 224
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
R K AY + E I EI ++GPV SF VYEDF+ YK+GVYKH G + G HA
Sbjct: 225 RMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHA 284
Query: 178 VKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
VK++GWG S YW++AN W+ WG +GYF+ RG N+C IE+ V AG+
Sbjct: 285 VKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIRGINDCEIEDTVAAGI 336
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 122/211 (57%), Gaps = 16/211 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAFGA+E++SDRFCIH ++ LS DL+ C +GC+GG P +A++Y
Sbjct: 92 QAECGSCWAFGAIESISDRFCIHKNESVQLSFQDLITCDN--QDNGCEGGDPYTAYKYVQ 149
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYP-------TPKCVRKCVKKNQLWRNSKHYSISA 127
+GVVT C PY + P C PA TP C KC + ++ H+ +
Sbjct: 150 KNGVVTSNCQPY------TIPTCPPAQQPCMNFVNTPPCSAKCANSSVNFQQDLHHLKTV 203
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + + I EI NGPVE F VYEDF YKSGVY H +G +GGH +K++G+G S
Sbjct: 204 YAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVYTHKSGKDLGGHCIKIVGFGVS- 262
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNEC 218
+G YWI N W SWG +G F I+ G NEC
Sbjct: 263 NGTPYWICNNSWTTSWGNNGIFWIEAGKNEC 293
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 132/234 (56%), Gaps = 15/234 (6%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
++ + ++ Q CGSCWAF AVEA+SDR CIH L +S DLL C GC+G
Sbjct: 95 TQIIGMIRDQSRCGSCWAFAAVEAMSDRICIHSNATKKLLVSSQDLLTCG---TAGGCNG 151
Query: 64 GYPISAWRYFVH-------HGVVTEECDPYFDSTGCSHPG-CEPAYPTPKCVRKCVKKNQ 115
G+P AW + + +G + + C YF HP C TP CV +C + +
Sbjct: 152 GWPAVAWSDWTNGIVTGGLYGALEQGCKSYFLEGCDDHPNKCRNYVSTPACVEQCDEPSL 211
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
++ + Y + Y I + E I EI NGPVE + VY DFA Y+SG+Y+ T + GG
Sbjct: 212 YYKAQETYGQTPYEIQGE-EQIQYEIMTNGPVEATMDVYVDFAQYQSGIYQLTTDEYEGG 270
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK++GWG +DG YW++AN WN WG +G F+I RG +E GIE + A LP
Sbjct: 271 HAVKILGWGV-EDGVKYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALP 323
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 132/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +GP E +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/231 (44%), Positives = 130/231 (56%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWA + F H + + LS +L+ CCG CG GC GG P SAW Y
Sbjct: 103 QGSCGSCWALELLRLCLIVFVSHSNGKLQVHLSAENLVTCCGS-CGAGCFGGDPGSAWEY 161
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V+ E C PY C H P C T C ++C K + +
Sbjct: 162 WRDVGIVSGGNYGSKEGCQPY-SIAPCEHHIPGSRPPCRGEGHTADCRKQCEKGYSIPYD 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
HY+ Y D ++I EI KNGPVE +F VYED YK GVYKH+ G +GGHA+
Sbjct: 221 KDLHYAEFVYSTERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAI 280
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
K++GWG ++G YW++AN WN WG +G+FKI RGS+ECGIE DV AGLP
Sbjct: 281 KILGWGV-ENGTPYWLIANSWNTDWGNNGFFKILRGSDECGIEIDVSAGLP 330
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 101/242 (41%), Positives = 139/242 (57%), Gaps = 21/242 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWA A A++DR+C+ DLL+CC CG GC GG AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
+V G+ + + C PY C PG + TPKC KC +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 YGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
WG ++G YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP N ++ +A
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFHRQGEAAK 377
Query: 243 MF 244
F
Sbjct: 378 YF 379
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 128/232 (55%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CG+CWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 107 QGNCGTCWAFGTSSAFADRLCIATNGEFNELLSAEELAFCC-HKCGSGCHGGYPIKAWER 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY FD G + +PA +C R C L ++
Sbjct: 166 FRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYGNNTCRGKPAEKNHRCTRMCYGNQNLDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
Y+ AY +N + I ++ GP+E S+ VY+DF +YKSGVY K +GGHA
Sbjct: 226 EDHRYTRDAYYLNY--QIIQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/254 (42%), Positives = 135/254 (53%), Gaps = 42/254 (16%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI---------------HFGMNLSLSVNDLLACC-GFLCG 58
Q CGSCWAF + EA +DR CI L LS D ACC GF CG
Sbjct: 303 QSDCGSCWAFASTEAFNDRRCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCG 362
Query: 59 --DGCDGGYPISAWRYFVHHGVVT----------EECDPY--------FDSTGCSHPGC- 97
GC+GG P SAW++F GVVT C PY D +P C
Sbjct: 363 LSMGCNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACP 422
Query: 98 EPAYPTPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 154
+ YPTP+C+ +C + N + K + AY + + E+I ++ K G V +F+V+
Sbjct: 423 DGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL-AGIENIQRDMMKYGSVTAAFSVF 481
Query: 155 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKR 213
DF Y GVY H +G MGGHAVK+IGWGT + GEDYW++AN WN SWG G F+I R
Sbjct: 482 SDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLIANSWNPSWGEGGLFRILR 541
Query: 214 GSNECGIEEDVVAG 227
G NECGIE +VAG
Sbjct: 542 GVNECGIEGQIVAG 555
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/242 (41%), Positives = 139/242 (57%), Gaps = 21/242 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWA A A++DR+C+ DLL+CC CG GC GG AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
+V G+ + + C PY C PG + TPKC KC +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 YGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
WG ++G YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP N ++ +A
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFHRQGEAAK 377
Query: 243 MF 244
F
Sbjct: 378 YF 379
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/253 (41%), Positives = 136/253 (53%), Gaps = 24/253 (9%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + Q +CGSCWA A+SDR CI + S D+L CCG CG G
Sbjct: 99 WKNCSSFHTIRDQANCGSCWAVSTAAAISDRICIATKGKKQVYASDTDILTCCGARCGLG 158
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----HP-----------GCEPAYPTPK 105
C GG+PI AW++F + GVV+ PY CS HP C PTP
Sbjct: 159 CRGGWPIEAWKFFEYDGVVSG--GPYLGKGCCSPYPLHPCGRHGNDTFYGNCVGMAPTPP 216
Query: 106 CVRKCVKKNQ-LWRNSKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
C RKC + ++R K Y Y + I +I + G V F VYEDF+HY+S
Sbjct: 217 CKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGSVVAVFAVYEDFSHYQS 276
Query: 163 GVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
G+YKH G GG HAVK+IGWG D+G DYW++AN W+ WG +G+F++ RG N CGIE
Sbjct: 277 GIYKHTAGRFTGGYHAVKMIGWG-KDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIE 335
Query: 222 EDVVAGLPSSKNL 234
E V AG+ ++L
Sbjct: 336 EQVDAGIVDVESL 348
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 128/215 (59%), Gaps = 18/215 (8%)
Query: 30 LSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 80
++DR CI G + LS DL++CC CG GC GG+P AW Y+V G+VT
Sbjct: 1 MTDRICIQSGGGQSAELSALDLISCC-EDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENH 59
Query: 81 EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 133
C PY T +P C Y TP+C +KC K + ++ KHY +Y + S+
Sbjct: 60 TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISN 119
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
+ I EI NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKR-TPYW 178
Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++AN WN WG G F+I RG +EC IE +VVAGL
Sbjct: 179 LIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 213
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/236 (42%), Positives = 131/236 (55%), Gaps = 18/236 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q +CGSCWA A + +SDR CIH + LS D+LACCG CG GCDGGY
Sbjct: 112 LRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYN 171
Query: 67 ISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCE----PAYP--TPKCVRKC-VK 112
AW++ GVVT C PY +H G P++P TP C C
Sbjct: 172 ARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYG 231
Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ + N K + + Y + +D I EI K GPV +F +YEDF HY GVY H G +
Sbjct: 232 YGKRYENDKIKAKTWYWLPNDERTIQLEIMKKGPVHATFNIYEDFEHYNGGVYIHTAGAM 291
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD-GYFKIKRGSNECGIEEDVVAG 227
GGH++K+IGWG D G YW++AN W+ WG D GYF++ RG N C IE V+AG
Sbjct: 292 EGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 132/235 (56%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V G+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + I +I +GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 107/229 (46%), Positives = 141/229 (61%), Gaps = 16/229 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWAFGAVEA++DR CI G S ++ L L C CG GC GG+P AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171
Query: 74 VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
V G+VT EE C PY F T +P C Y TP+C + C K + ++
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYKQ 231
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY +Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA++
Sbjct: 232 DKHYGDESYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIR 291
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+IGWG + G+ YW++AN WN WG G F++ RG +EC IE VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/229 (43%), Positives = 137/229 (59%), Gaps = 17/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWR 71
QG CG+CWA AV +SDR CIH ++ L+ DL+ CC CG+GC+GG+ ++++
Sbjct: 107 QGLCGACWAVAAVSVMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQ 165
Query: 72 YFVHHGVV-------TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSK 121
Y+V G+V T+ C PY C +P GC P TP C C + + +R K
Sbjct: 166 YWVDVGLVSGAAYNSTDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDK 223
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
+Y +AY++ +D I EI NGPVE F+VY+D YK+GVY+H+ G +G HAV+LI
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283
Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
GWG + G YW++AN + WG GYFK RGSN GIE V+AGLP
Sbjct: 284 GWG-KERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 127/232 (54%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW +
Sbjct: 107 QGHCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELAFCC-HKCGFGCHGGYPIKAWEW 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C R C +L ++
Sbjct: 166 FKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQELDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H++ AY + I ++ GP+E SF VY+DF +YKSGVY K +GGHA
Sbjct: 226 EDHHWTRDAYYLTY--TTIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI RG+NECGI+ G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKILRGTNECGIDNSTTGGVP 334
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 127/220 (57%), Gaps = 20/220 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A E+LSDRFCI + +++ LS D+++C GCDGG +AW +
Sbjct: 106 QEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDMVSCD--YNDMGCDGGNLDNAWWW 163
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN-----QLWRNSKHYSISA 127
+ G+V + C PY G P C C N QL+ IS
Sbjct: 164 MKNKGIVPDSCMPYVSGGG----------NVPACPSNCNGTNIPISSQLYYAKSFSHISP 213
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
+ DI EIY NGPV+ F+VY+DF +YKSGVY H TG +GGHA+K+IGWG +
Sbjct: 214 WMFWERVADIQQEIYTNGPVQGGFSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGV-E 272
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
G DYW++AN W+ WG DG FKI RG NECGIE+DV AG
Sbjct: 273 GGVDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDVYAG 312
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 98/231 (42%), Positives = 128/231 (55%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAFG A +DR C+ N LS +L CC CG+GC+GGYPI AW+Y
Sbjct: 109 QGYCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELTFCC-HTCGNGCNGGYPIKAWKY 167
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT E C+PY + G S +P +C R C L N
Sbjct: 168 FSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYN 227
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAV 178
H Y + I ++ GP+E SF VY+DF YKSGVY+ +GGHAV
Sbjct: 228 DDHRFTRDYYYLT-YGSIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAV 286
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
KLIGWG ++G YW++ N W+ WG +G FKI+RG++ECGI+ AG+P
Sbjct: 287 KLIGWGV-EEGIPYWLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 336
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 98/231 (42%), Positives = 125/231 (54%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWA A +DR C+ N LS ++ CC CG GC+GGYPI AW+Y
Sbjct: 110 QGHCGSCWAMATSSAFADRLCVATNGDFNELLSAEEITFCC-HTCGFGCNGGYPIKAWKY 168
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT E C+PY D G S +P +C R C L N
Sbjct: 169 FSSHGIVTGGNYKSGEGCEPYRVPPCPQDEEGKSSCAGKPIEKNHRCTRMCYGNQDLDYN 228
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAV 178
H Y + I ++ GP+E SF VY+DF YKSGVY+ +GGHAV
Sbjct: 229 DDHRFTRDYYYLT-YGSIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAV 287
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
KLIGWG ++G YW++ N WN WG +G FKI+RG++ECGI+ AG+P
Sbjct: 288 KLIGWGV-EEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECGIDSAATAGVP 337
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA A+ A+SDR CI G ++ LS DL++CC CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAIGAMSDRICIQSGGKQSVKLSAVDLISCCEN-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 177 bits (449), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 127/215 (59%), Gaps = 18/215 (8%)
Query: 30 LSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 80
++DR CI G S LS DL++CC CGDGC GG+P AW Y+V G+VT
Sbjct: 1 MTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGQAWDYWVTQGIVTGGSKENH 59
Query: 81 EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSD 133
C PY T +P C Y TP+C + C K + + KHY +Y + S+
Sbjct: 60 TGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISN 119
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
+ I EI NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW
Sbjct: 120 EKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYW 178
Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++AN WN WG G F+I RG +EC IE VVAGL
Sbjct: 179 LIANSWNEDWGEKGLFRIVRGRDECSIESHVVAGL 213
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 177 bits (449), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 107 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSAEELAFCC-HKCGFGCSGGYPIRAWER 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C R C L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I +I GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 177 bits (449), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 125/200 (62%), Gaps = 18/200 (9%)
Query: 44 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 94
+S N+LLACC CGDGC+GGYP +AW F H GVVT + C PY + C H
Sbjct: 12 VSANELLACC-ESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA-CDHHV 69
Query: 95 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 149
C+ TP+C +KC N +++ KHY +Y ++S DIM E+ GPVE
Sbjct: 70 VGKLKPCKGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-VNDIMEELVTRGPVEA 128
Query: 150 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 209
+FTVY DF Y SGVY+H TG +GGHAVK++G+G ++G+ YW++AN WN WG G+F
Sbjct: 129 AFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGV-ENGDKYWLVANSWNPDWGDQGFF 187
Query: 210 KIKRGSNECGIEEDVVAGLP 229
KI RG +ECGIE +VAG P
Sbjct: 188 KILRGVDECGIEGQIVAGEP 207
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 99/236 (41%), Positives = 132/236 (55%), Gaps = 18/236 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q +CGSCWA A + +SDR CIH + LS D+LACCG CG GCDGGY
Sbjct: 112 LRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYN 171
Query: 67 ISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCE----PAYP--TPKCVRKC-VK 112
AW++ GVVT C PY +H G P++P TP C C
Sbjct: 172 ARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYG 231
Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ + N K + + Y + +D I EI + GPV +F +YEDF HY+ GVY H G +
Sbjct: 232 YGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEGGVYIHTAGAM 291
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD-GYFKIKRGSNECGIEEDVVAG 227
GGH++K+IGWG D G YW++AN W+ WG D GYF++ RG N C IE V+AG
Sbjct: 292 EGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 96/215 (44%), Positives = 126/215 (58%), Gaps = 19/215 (8%)
Query: 31 SDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 81
+DR C + + S DLL+CC +CG GC+GG P AW Y+ H G+V+ +
Sbjct: 1 TDRVCTYSNGTKHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHMGLVSGGNYNSSQ 59
Query: 82 ECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDP 134
C PY C H PG C TPKC + C N L++ K Y Y +
Sbjct: 60 GCSPYVIPP-CEHHVPGNRLPCNGDTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGGE 118
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
+ I AE++KNGPVE +FTVY D YKSGVYKH+ GD +GGHA+K+IGWG ++G YW+
Sbjct: 119 DHIKAELFKNGPVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGV-ENGNKYWL 177
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+AN WN WG +G+FKI RG + CGIE +VAG P
Sbjct: 178 IANSWNTDWGNNGFFKILRGEDHCGIESSIVAGEP 212
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 107 QGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIKAWER 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +P +C R C L ++
Sbjct: 166 FKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGNNTCSGKPTEKNHRCTRMCYGNQDLDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I ++ GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 177 bits (448), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 130/235 (55%), Gaps = 24/235 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C SCWA + +SDR CIH G + LS +LL+CC LCG GC GG+P AW +
Sbjct: 135 QGSCASCWAVAPTDVMSDRICIHSGSRHIVRLSAGNLLSCCK-LCGKGCKGGFPGGAWMH 193
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAY-PTPK--CVRKCVKK---------------N 114
+ HG+VT Y GC P Y P K KC K N
Sbjct: 194 WSKHGIVTG--GSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLECRETCRTSYN 251
Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ ++ +Y S YRI +D I EI +NGPV+ + +YEDF HYK GVY+H+ G +
Sbjct: 252 KSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQANLRIYEDFLHYKFGVYRHVHGQGLE 311
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVK+ GWGT + G YW+ AN W++ WG G+FKI RGSN IE+ V+AG+P
Sbjct: 312 YHAVKIFGWGT-EGGTPYWLAANPWSKRWGNGGFFKILRGSNHAEIEDHVMAGIP 365
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 132/241 (54%), Gaps = 16/241 (6%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCG--FLCG 58
++ E ++ + Q CGSCWA + +SDR CI L +S D++ CC
Sbjct: 91 WSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQKNQLRISAADMIECCESCTFSV 150
Query: 59 DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS-------HPGCEPAYPTPKCVRKCV 111
DGC GG P + + G V+ Y + GC +P C+ Y P C ++C
Sbjct: 151 DGCHGGIPSFTFTEWKDSGFVSG--GEYNSTNGCMSYPLPRCNPSCKTLYDAPTCKKECD 208
Query: 112 KKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI- 168
K + L + KHY+ AYRI S E I EI KNGPV SFTVY DF HY SGVYK
Sbjct: 209 KGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFTVYADFIHYLSGVYKFDG 268
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
++GGHAV++IGWG + YW+++N WN WG G FKI RG NECGIEE++ AGL
Sbjct: 269 ESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGL 328
Query: 229 P 229
P
Sbjct: 329 P 329
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 127/217 (58%), Gaps = 20/217 (9%)
Query: 15 QGHCGSCWA-----FGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPI 67
Q CGSCWA + E LSDRFCI G +N+ LS DL++C + GCDGG
Sbjct: 22 QEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLSPQDLVSCNWY--NAGCDGGILW 79
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
+AW Y H G+VT++C PY G + P C + C + + K+ +
Sbjct: 80 AAWIYLKHTGIVTDQCLPYSSGNGVA----------PSCPKYCNGTSTPIDSVKYKAKDW 129
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + S E IM EI NGPV+ F+VY+DF YKSGVY H TG +GGHA+K++GWG +
Sbjct: 130 YEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKSGVYTHQTGSFLGGHAIKIVGWGVEN 189
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
+ + YW++AN W WG +G FKIKRG NECGIE DV
Sbjct: 190 NVK-YWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 98/248 (39%), Positives = 133/248 (53%), Gaps = 21/248 (8%)
Query: 3 FTNSEH------VEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGF 55
F +EH + + Q C + WA A+SDR+C + G L +S DL+ACC
Sbjct: 95 FDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGKQLRISAADLMACCK- 153
Query: 56 LCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCV 107
CG GC+GGYP +AW Y+V HG+ + +C PY C H G + P TP+C
Sbjct: 154 DCGGGCEGGYPDAAWEYYVSHGITSSQCQPY-PFPRCEHRGAQGKKPPCSKYKFVTPQCN 212
Query: 108 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C K+ K+ +Y + + ED E+Y NGP V F V+ DF YKSGVY+H
Sbjct: 213 ATCTDKSVPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQH 269
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ G+ +GG AV+++GWG +G YW +AN W+ WG +GYF I RG NEC IE AG
Sbjct: 270 VAGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAG 328
Query: 228 LPSSKNLV 235
P L
Sbjct: 329 TPDPSQLA 336
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 176 bits (447), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 99/229 (43%), Positives = 136/229 (59%), Gaps = 17/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGY-PISAWR 71
QG CG+CWA V +SDR CIH ++ L+ DL+ CC CG+GC+GG+ ++++
Sbjct: 107 QGLCGACWAVATVSVMSDRLCIHSEGKFDVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQ 165
Query: 72 YFVHHGVV-------TEECDPYFDSTGCSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSK 121
Y+V G+V T+ C PY C +P GC P TP C C + + +R K
Sbjct: 166 YWVDVGLVSGAAYNNTDGCKPY-PFKPCLYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDK 223
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
+Y +AY++ +D I EI NGPVE F+VY+D YK+GVY+H+ G +G HAV+LI
Sbjct: 224 YYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTGVYQHVVGREVGKHAVRLI 283
Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
GWG + G YW++AN + WG GYFK RGSN GIE V+AGLP
Sbjct: 284 GWG-KERGVPYWLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLPK 331
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 176 bits (446), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 107/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWAFGAVEA++DR CI G S ++ L L C CG GC GG+P AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171
Query: 74 VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
V G+VT EE C PY F T +P C Y TP+C + C K + +
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIR 291
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+IGWG + G+ YW++AN WN WG G F++ RG +EC IE VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 176 bits (446), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 129/218 (59%), Gaps = 20/218 (9%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR CIH N S +L++CC + CG G
Sbjct: 38 WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLVSCC-WTCGFG 96
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
C+GG+P +AW Y+ G+V+ PY + GC + C+ TPKC
Sbjct: 97 CNGGFPGAAWHYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPKC 154
Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
V+KC ++ + H SAY +++D + I EIY NGPVE +FTVYEDF Y++GVY
Sbjct: 155 VKKCEDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 214
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 203
KH+ G +GGHA++++GWG + YW++AN WN W
Sbjct: 215 KHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDW 252
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 176 bits (446), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 110 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 168
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C R C L ++
Sbjct: 169 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 228
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I +I GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 229 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 287 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 176 bits (446), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 99/231 (42%), Positives = 126/231 (54%), Gaps = 20/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAFG A +DR C+ G N LS L CC + CG GC GG PI AW+Y
Sbjct: 108 QGNCGSCWAFGTTGAFADRLCVATGGGFNEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKY 166
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F HG+ T E C PY +D G +P KC R C + +
Sbjct: 167 FKRHGITTGGDYGSNEGCAPYKVPPCYDDQGEFLCQGKPTEHNHKCPRACYGNSTV---E 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 179
Y + + + + I +I K GPVE SF VY+DF YKSG+Y+ +GGH+VK
Sbjct: 224 NRYKVKSIYVLDSSKTIEQDIRKYGPVEASFDVYDDFITYKSGIYQKTPNAFYVGGHSVK 283
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
LIGWG +DG YW+L N W++ WG G F+I +G NECGIE AG+PS
Sbjct: 284 LIGWG-EEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIERSATAGVPS 333
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 176 bits (445), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 107 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C R C L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I +I GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 176 bits (445), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 110 QGKCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 168
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C R C L ++
Sbjct: 169 FKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 228
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I +I GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 229 EDHHYTRDAYYLTYGT--IQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 287 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 337
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 100/227 (44%), Positives = 127/227 (55%), Gaps = 17/227 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C S WAFGAVE++SDR CIH N SLS DLL+CC CG GC G+ AW +
Sbjct: 73 QSSCESFWAFGAVESMSDRLCIHSNGAFNKSLSATDLLSCCED-CGLGCGAGFHPMAWDF 131
Query: 73 FVHHGVVT----EE---CDPY-FDSTGCSHPGCEPA-----YPTPKCVRKCVKKNQLWRN 119
+ HG+VT EE C + F G G P YPTP+C+++C + +
Sbjct: 132 WKTHGIVTGGSKEEPSGCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQCDEPEVNYEK 191
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
K + +Y + IM EI NGPVE SF +Y DF Y GVY H G + HA++
Sbjct: 192 DKTRANISYNVYPSDISIMKEIMLNGPVEASFGIYADFLEYNGGVYFHCWGGPISRHAIR 251
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
++GWG DDG YW++AN WN WG GY + RG NECGIEE+V A
Sbjct: 252 ILGWG-EDDGVPYWLIANSWNEDWGEKGYVRFLRGHNECGIEEEVTA 297
Score = 163 bits (412), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 105/294 (35%), Positives = 133/294 (45%), Gaps = 78/294 (26%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH N SLS DL++CC CG GC GGY AW +
Sbjct: 661 QSSCGSCWAFGAVEAMSDRLCIHSKGAFNKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDF 719
Query: 73 FVHHGVVTEECDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLW 117
+ HG+VT TGC P CE YPTP+C+++C K +
Sbjct: 720 WKTHGIVTGGSKE--KPTGCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRCDTKEIDY 777
Query: 118 RNSK----------------------------------------HYSIS----------- 126
K H+SI
Sbjct: 778 EKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTLHLTCLNFMHHSIDLLSSRLEKAVL 837
Query: 127 ------AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
+Y + + +M EI GPV VYED YKSGVY H+ G +G H +++
Sbjct: 838 RSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRI 897
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
+GWG +DG YW++AN WN WG GY ++ R NECGI + V AGLP N
Sbjct: 898 LGWG-EEDGVPYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLPDLSNF 950
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWAFG A +DR CI N LS +L CC CG GC GG PI AW
Sbjct: 107 QGHCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGNPIKAWER 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C R C L ++
Sbjct: 166 FQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I ++ GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQYDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTGGVP 334
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 87/196 (44%), Positives = 125/196 (63%), Gaps = 14/196 (7%)
Query: 56 LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPT 103
+CGDGC+GGYP AW ++ G+V+ C PY S P C T
Sbjct: 1 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDT 60
Query: 104 PKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
PKC + C + ++ KHY ++Y +++ + IMAEIYKNGPVE +F+VY DF YKS
Sbjct: 61 PKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYKS 120
Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
GVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI RG + CGIE
Sbjct: 121 GVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIES 179
Query: 223 DVVAGLPSSKNLVKEI 238
+VVAG+P + ++I
Sbjct: 180 EVVAGIPRTDQYWEKI 195
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 100/242 (41%), Positives = 138/242 (57%), Gaps = 21/242 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWA A A++DR+C+ DLL+CC CG GC GG AW++
Sbjct: 147 QGCCGSCWAVSAASAMTDRWCVRSKGKEQFIFGSLDLLSCC-HSCGQGCRGGTLGPAWQF 205
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKH 122
+V G+ + + C PY C PG + TPKC KC +W++ +H
Sbjct: 206 WVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED--TPKCSNKCRSGYNVTDVWQD-RH 261
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
AY + +D IM EI+ NGPV+ +F Y D YKSG+Y+H+ G + GGHAVKL+G
Sbjct: 262 IGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSGIYRHVWGPLSGGHAVKLLG 321
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 242
WG ++G YW++AN W R WG +G+FK+ RG N CGIEE++ AGLP N ++ +A
Sbjct: 322 WGV-ENGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAGLP---NFHRQGEAAK 377
Query: 243 MF 244
F
Sbjct: 378 YF 379
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 125/212 (58%), Gaps = 16/212 (7%)
Query: 18 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
CGSCWA A SDR CI G + +LS L CC + CG+GCDGG P +AW +F+
Sbjct: 1 CGSCWAASAASVFSDRLCIATGGAVARNLSAEQLNTCC-YRCGNGCDGGSPEAAWYFFMR 59
Query: 76 HGVVT-------EECDPY-FDSTGCSHPGC-EPAYPTPKC-VRKCVKKN--QLWRNSKHY 123
HG+VT + C PY G C + TP C +R C N + +R HY
Sbjct: 60 HGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHY 119
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
+ Y ++ EDIM +IYKNGPV+ +F VY DF +YKSGVY + G + GGHA+K++GW
Sbjct: 120 VDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGW 179
Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGS 215
G DD YW+ AN W+RSWG +G F+I RG+
Sbjct: 180 GV-DDNTKYWLCANSWSRSWGENGLFRILRGN 210
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 90/234 (38%), Positives = 134/234 (57%), Gaps = 22/234 (9%)
Query: 17 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG--FLCGDGCDGGYPISAWRY 72
C + WAF A E++SDR CI+ G N LS +LL+CC F CG+GC+GG P AW+Y
Sbjct: 100 ECKTSWAFAAAESMSDRLCINSGGFKNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQY 159
Query: 73 FVHHGVVTEE-------CDPYF-----DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQL-- 116
HG+ T C PY + G ++P C PTP C +KC +
Sbjct: 160 IQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPI 219
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+HY +S ++ + +I +++ NGP++ +F VY+DF Y +G+Y H+TG+ G
Sbjct: 220 DIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNKQGH 279
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+V++IGWG G YW+ AN W R WG +G F++ RG+NECG+E + V+G+P
Sbjct: 280 LSVRIIGWGVW-QGVPYWLCANSWGRQWGENGTFRVLRGTNECGLESNCVSGMP 332
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWAFGAVEA++DR CI G S ++ L L C CG GC GG+P AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171
Query: 74 VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
V G+VT EE C PY F T +P C Y TP+C + C K + +
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIR 291
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+IGWG + G+ YW++AN WN WG +G F++ RG +EC IE VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/232 (43%), Positives = 127/232 (54%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA G A +DR CI N +S +L CC CG GC+GG P+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCIATNGDFNELISAEELTFCC-HRCGFGCNGGNPLKAWQY 164
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HGVVT + C PY D G + +P P KC R C
Sbjct: 165 FKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYK 224
Query: 120 SKHYSI-SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHA 177
HY +AY +N D + + GP+E SF VY+DF +Y+SGVY+ +GGHA
Sbjct: 225 KGHYKTKNAYYLNIDT--MQKDTIAYGPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHA 282
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VK+IGWG +DG YW++ N W WGA+G FKI RG+NECGIE AG+P
Sbjct: 283 VKMIGWG-EEDGTPYWLMVNSWGEQWGANGMFKILRGTNECGIEGSPTAGVP 333
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 106/229 (46%), Positives = 139/229 (60%), Gaps = 16/229 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDL-LACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWAFGAVEA++DR CI G S ++ L L C CG GC GG+P AW Y+
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCEDCGGGCKGGFPGQAWDYW 171
Query: 74 VHHGVVT---EE----CDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
V G+VT EE C PY F T +P C Y TP+C + C K + +
Sbjct: 172 VKRGIVTGGSEENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQ 231
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+ G ++GGHA++
Sbjct: 232 DKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIR 291
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+IGWG + G+ YW++AN WN WG +G F++ RG +EC IE VVAGL
Sbjct: 292 IIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/234 (43%), Positives = 132/234 (56%), Gaps = 20/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGS WA AV A+SDR CI G ++ LS DL++CC + CG GCDGG+ +W Y
Sbjct: 112 QSQCGSSWAVSAVGAMSDRICIQSGGKQSVELSAVDLISCCKY-CGSGCDGGFLGPSWDY 170
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLW 117
+V G+VT C PY C H C + Y TP+C + C K N +
Sbjct: 171 WVLRGIVTGGSKENHTGCRPY-PFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSY 229
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHA
Sbjct: 230 EQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHA 289
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
V+LIG G ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 290 VRLIGCGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/235 (41%), Positives = 132/235 (56%), Gaps = 15/235 (6%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A+SDR C G+ L +S L++CC CGDGCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCED-CGDGCDG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
GYP ++W Y+V HG+ + C PY C H G + P TPKC C K
Sbjct: 161 GYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
K+ +Y ++ + +D E+Y NGP V F VY DF YK+GVY+H++GD +GG
Sbjct: 220 PL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG 276
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
HAV+++GWG +G YW +AN W+ WG +G+ RG+NECGIE AG P+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPA 330
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/236 (41%), Positives = 130/236 (55%), Gaps = 18/236 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q +CGSCWA A + +SDR CIH + LS D+LACCG CG GCDGGY
Sbjct: 112 LRVIPDQSNCGSCWAVSAAQCMSDRLCIHSQGRKKVLLSATDILACCGKFCGYGCDGGYN 171
Query: 67 ISAWRYFVHHGVVTE-------ECDPYFDSTGCSHPGCE----PAYPTPKCVRKCVKK-- 113
AW++ GVVT C PY +H G P++P RK +
Sbjct: 172 ARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPARKPYCQYG 231
Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+ + N K + + Y + +D I EI + GPV +F +YEDF HY GVY H G +
Sbjct: 232 YGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYNGGVYIHTAGAM 291
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD-GYFKIKRGSNECGIEEDVVAG 227
GGH++K+IGWG D G YW++AN W+ WG D GYF++ RG N C IE V+AG
Sbjct: 292 EGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGEDGGYFRVVRGINNCDIEGGVLAG 346
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 119/189 (62%), Gaps = 17/189 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL+CC CG+GC+GGYP AW +
Sbjct: 17 QGSCGSCWAFGAVEAMSDRICIHSRGKVNVEVSAEDLLSCCKLECGNGCNGGYPSGAWEF 76
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
+ + G+V+ C PY S C H P C TP+C R+C + +
Sbjct: 77 WTNDGLVSGGLYYSHIGCRPYSISP-CEHHVNGSRPKCSGEIETPRCSRRCEAGYSPKYS 135
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +++Y I SD +IM EIYKNGPVE + V++DF YKSGVY+H TG +GGHA+
Sbjct: 136 EDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKSGVYQHKTGGSIGGHAI 195
Query: 179 KLIGWGTSD 187
K++GWG +
Sbjct: 196 KILGWGEEN 204
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 130/235 (55%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C S WA AV A+SDR CI G ++ LS DL++CC CG GCDGG +W Y
Sbjct: 79 QSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDY 137
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V HG+VT + TGC P C+ Y TP+C + C K N
Sbjct: 138 WVSHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 195
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + S I +I +G VE +YEDF +YKSG+Y++ TG + GH
Sbjct: 196 YEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGH 255
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 256 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAGLIKS 309
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 128/232 (55%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWAFG A +DR C+ N LS +L CC CG GC+GGYPI AW+Y
Sbjct: 109 QGHCGSCWAFGTSSAFADRLCVATDGDFNELLSAEELTFCC-HACGHGCNGGYPIKAWKY 167
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT + C+PY + G S +P +C R C L +
Sbjct: 168 FSTHGLVTGGNYKSGKGCEPYRVPPCPRNEDGKSSCAGKPKEKNHRCTRMCYGNQDLDYD 227
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
H ++ Y + I ++ GP+E SF VY+DF YKSGVY+ +GGHA
Sbjct: 228 DDHRFTRDFYYLTYG--SIQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHA 285
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG ++G YW++ N WN WG +G FKI+RG++EC I+ AG+P
Sbjct: 286 VKLIGWGV-EEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVP 336
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 100/232 (43%), Positives = 131/232 (56%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCW+F A +DR C+ G N LS +L CC CG+GC+GGYPI AWRY
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNELLSPEELAFCCK-DCGNGCEGGYPIKAWRY 165
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F GV T E C PY ++ G + G +P +C + C K
Sbjct: 166 FRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKNTCGGKPMERNHQCPKTCYGKTT--DQK 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
++ + S Y INS + I +I GPVE SF VY+DF+ YKSG+Y+ GH+VK
Sbjct: 224 RYKTKSEYVINS-IKTIEQDIKTYGPVEASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVK 282
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+IGWG ++G YW+ N W++ WG G FKI +G NECGIE V AG+PSS
Sbjct: 283 IIGWG-QENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAGIPSS 333
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 128/215 (59%), Gaps = 20/215 (9%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR CIH N S +L++CC + CG G
Sbjct: 36 WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSNGTKNFHFSAENLVSCC-WTCGFG 94
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
C+GG+P +AW Y+ G+V+ PY + GC + C+ TP C
Sbjct: 95 CNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTC 152
Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
V+KC + ++ + H+ SAY I +D + I EIY NGPVE +FTVYEDF Y++GVY
Sbjct: 153 VKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 212
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 200
KH+ G +GGHA++++GWG + YW++AN WN
Sbjct: 213 KHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 247
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 123/195 (63%), Gaps = 16/195 (8%)
Query: 18 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH 75
CGSCWAFGAVEA+SDR CIH +N+ +S DLL CCG CGDGC+GGYP AW ++
Sbjct: 1 CGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTK 60
Query: 76 HGVVTEE-------CDPYF-----DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKH 122
G+V+ C PY S P TP+C + C + ++ KH
Sbjct: 61 KGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDTPRCNKSCEAGYSPSYKEDKH 120
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
+ ++Y +++ ++IMAEIYKNGPVE +FTV+ DF YKSGVYKH GD+MGGHA++++G
Sbjct: 121 FGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILG 180
Query: 183 WGTSDDGEDYWILAN 197
WG ++G YW+ AN
Sbjct: 181 WGV-ENGVPYWLAAN 194
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 128/235 (54%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C S WA AV A+SDR CI G ++ LS DL++CC CG GCDGG +W Y
Sbjct: 112 QSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V HG+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + I EI GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG +EC IE +VAG S
Sbjct: 289 AVRLIGWGV-ENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 100/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 107 QGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C + C L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I ++ GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECG + G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/193 (48%), Positives = 121/193 (62%), Gaps = 18/193 (9%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
++ + QG CGSCWAFGAVEA+SDR CIH +N+ +S DLL+CCG CG GC+GGYP
Sbjct: 29 IQEIRDQGSCGSCWAFGAVEAISDRVCIHSKGKVNVEISAEDLLSCCGMECGFGCNGGYP 88
Query: 67 ISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAY-PTPKCVRKC-V 111
AW ++ G+V+ C PY C H P C TPKCV +C
Sbjct: 89 SGAWNFWTETGLVSGGLFKSHIGCRPYTIPP-CEHHVNGSRPSCTGEEGDTPKCVMQCEA 147
Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
+ KH+ ++Y ++S+ DI EIYKNGPVE +FTVYEDF YKSGVYKH+TGD
Sbjct: 148 GYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVYEDFLQYKSGVYKHVTGD 207
Query: 172 VMGGHAVKLIGWG 184
+GGHA++++GWG
Sbjct: 208 AVGGHAIRILGWG 220
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/187 (45%), Positives = 119/187 (63%), Gaps = 16/187 (8%)
Query: 57 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 103
CG GC+GGYP +AW+++ +VT + C PY+ C H P C PT
Sbjct: 3 CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPP-CEHHTVGPLPNCTGIKPT 61
Query: 104 PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
P+C + C + Q + KH+ Y I+SD I EIYKNGPVE F+VY DF YKS
Sbjct: 62 PECAKTCREGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKS 121
Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
GVY+ + +++GGHA++++GWGT +DG YW++AN WN WG GYFKI+RG++ECGIE+
Sbjct: 122 GVYQRHSEEMLGGHAIRILGWGT-EDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIED 180
Query: 223 DVVAGLP 229
D+ AG+P
Sbjct: 181 DINAGIP 187
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 100/232 (43%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 107 QGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCC-HKCGFGCSGGYPIRAWER 165
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C + C L ++
Sbjct: 166 FKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFK 225
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY+ AY + I ++ GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 226 EDHHYTRDAYYLTYGT--IQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATYLGGHA 283
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW+L N WN WG G FKI+RG+NECG + G+P
Sbjct: 284 VKLIGWG-EEYGVPYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 99/231 (42%), Positives = 123/231 (53%), Gaps = 21/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A +++DR+C IH L +S DLLACCG CG GC GG P AW YF
Sbjct: 112 QSSCGSCWAVAAATSMTDRYCTIHGVRGLRISAADLLACCGD-CGYGCLGGDPDMAWAYF 170
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKH 122
G+ + C PY CSH YP TP C C + +R K
Sbjct: 171 SSEGIASGRCQPY-PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKS 229
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
YS+S ED E+Y GP + F V+ D YK GVYKH+ G +G HAV+++G
Sbjct: 230 YSLSG------EEDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVG 283
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
WG + G YW +AN WN WG GYF + RG NECGIE+ AG+P+ N
Sbjct: 284 WG-NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAIPN 333
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 130/229 (56%), Gaps = 21/229 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA + EA+SD C+ + + +S D+L+CCG CG GC GG+PI A+R+
Sbjct: 110 QSACGSCWAVSSAEAMSDEICVQSNSTIKVMISDTDILSCCGLDCGYGCQGGWPIEAYRW 169
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKK-NQL 116
GVVT + C PY C P Y PTPKC + +K N+
Sbjct: 170 MQRDGVVTGGKYRQRDVCKPY-SFYPCGQHKDVPYYGPCPGGLWPTPKCRKSSQRKYNKT 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KH++ +Y + ++ I EIYKNGPV +F VYED++ G+Y H G G H
Sbjct: 229 YQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYEDYSS-TGGIYVHKWGIQTGAH 287
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
A K+IGWG ++G DYW++AN WN WG DGY++I R ++ C IE +V
Sbjct: 288 ADKVIGWG-RENGTDYWLIANSWNTDWGEDGYYRIVRETDNCEIERQMV 335
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 135/234 (57%), Gaps = 22/234 (9%)
Query: 17 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRY 72
C S WAF A E++SDR CI+ G +N LS +LL+CC G L CG+GC GG AW+Y
Sbjct: 52 ECKSSWAFAAAESMSDRLCINSGGTINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQY 111
Query: 73 FVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-- 116
+ HG+ T C PY + ++P C PTP C +KC KN
Sbjct: 112 WGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPV 171
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
+HY S ++ + +I +++ NGP+E +F VY+DF Y +G+Y H+TG+ G
Sbjct: 172 DIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGH 231
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+V+++GWG +G YW+LAN W + WG +G F+ RG+NECG+E + V+G+P
Sbjct: 232 LSVRILGWGMY-EGVPYWLLANSWGKEWGENGTFRALRGTNECGLEANCVSGMP 284
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 95/247 (38%), Positives = 132/247 (53%), Gaps = 19/247 (7%)
Query: 3 FTNSEH------VEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGF 55
F +EH + + Q C + WA A+SDR+C + G L +S DL+ACC
Sbjct: 95 FDAAEHWPHCPTIREIADQSACRASWAVATASAISDRYCTVGKGKQLRISAADLMACCK- 153
Query: 56 LCGDGCDGGYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVR 108
CG GC+GGYP +AW Y+V HG+ + +C PY + G P + + TP+C
Sbjct: 154 DCGGGCEGGYPDAAWEYYVSHGIASSQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQCNA 213
Query: 109 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
C K K+ +Y + + ED E+Y NGP V F V+ DF YK+GVY+H+
Sbjct: 214 TCTDKTIPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKNGVYQHV 270
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
G+ +GG AV+++GWG +G YW +AN W+ WG +GYF I RG NEC IE AG
Sbjct: 271 AGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAGT 329
Query: 229 PSSKNLV 235
P L
Sbjct: 330 PDPSQLT 336
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 173 bits (438), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 106/249 (42%), Positives = 131/249 (52%), Gaps = 33/249 (13%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLC----GDGCD 62
+E++ QG+CGSCWA A +SDR CI G +S DLL+CCG C GCD
Sbjct: 87 IELIPDQGNCGSCWAVSAASTMSDRLCIASGQTDKRQISAEDLLSCCGINCELDGNGGCD 146
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAY-----PT 103
GGYP AW+Y G+VT C PY CSH CE + T
Sbjct: 147 GGYPYGAWKYLRVDGIVTGGTYNDFSLCKPY-SFPPCSHGNDSGKYSKCENDFFMLTEVT 205
Query: 104 PKCVRKCVKKNQLWRNSKHYSISA----YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 159
P C +KC Q R I + Y++ D E I EIY NGPV+ FTV++DF +
Sbjct: 206 PSCTKKC--HPQFSRTYDVDKIRSRENPYKLIKDQEQIKNEIYLNGPVQAVFTVFDDFLN 263
Query: 160 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 219
YKSGVY+ TG G HAVK+IGWGT ++G YW N WN WG +G FKI RG N
Sbjct: 264 YKSGVYQQTTGQRRGKHAVKIIGWGT-ENGVPYWEAINSWNDGWGINGKFKILRGFNHLD 322
Query: 220 IEEDVVAGL 228
IE +V A +
Sbjct: 323 IEGEVYASI 331
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 173 bits (438), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 128/235 (54%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C S WA AV A+SDR CI G ++ LS DL++CC CG GCDGG +W Y
Sbjct: 112 QSRCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCKN-CGSGCDGGVTGYSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V HG+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + I EI GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGEFSYNVIGVESVIQKEIMMYGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG +EC IE +VAG S
Sbjct: 289 AVRLIGWGV-ENGTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 100/235 (42%), Positives = 128/235 (54%), Gaps = 22/235 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C S WA AV A+SDR CI G ++ LS DL++CC CG GCDGG +W Y
Sbjct: 112 QSRCASSWAVSAVAAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGVTGYSWDY 170
Query: 73 FVHHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQL 116
+V HG+VT + TGC P C+ Y TP+C + C K N
Sbjct: 171 WVKHGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTS 228
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ KHY +Y + I EI GPVE +YEDF +YKSG+Y++ TG + GH
Sbjct: 229 YEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGH 288
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
AV+LIGWG ++G YW+ AN WN WG GYF+I RG +EC IE +VAG S
Sbjct: 289 AVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 95/235 (40%), Positives = 139/235 (59%), Gaps = 20/235 (8%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
N + + + QG CGSCWAF ++E++SDR CIH S DLL+CC CGD C
Sbjct: 95 NCDSLNRIRDQGACGSCWAFASIESMSDRICIHSSGSAQFMFSPEDLLSCCT-SCGD-CG 152
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-N 114
GGY +SA ++++ G+V+ E C PY T +H + TP C + C +
Sbjct: 153 GGYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQGQ----TPACTKSCRNGYS 205
Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ KHY + Y ++S + I E+ NGP+ V+F V++DF +Y SGVY+H++G+ +G
Sbjct: 206 TSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGVYRHVSGESVG 265
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
H VK++GWG ++G YW++AN W SWG G+FK+ RG NECGIE A +P
Sbjct: 266 FHVVKIVGWGV-ENGVPYWLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAVMP 319
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 172 bits (437), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 115/187 (61%), Gaps = 16/187 (8%)
Query: 57 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 103
C C+GG+P SAW Y+ G+VT + C PY C H C+ PT
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPY-QIKSCDHHVNGTKGPCQGEGPT 227
Query: 104 PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 162
P+C KC + + KHY++S I+++PE EI NGPVE FTVYEDF YKS
Sbjct: 228 PECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKS 287
Query: 163 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
GVY+H TG V+GGHA+K++GWG ++G YW++AN WN WG +G+FKI RGSNECGIE
Sbjct: 288 GVYQHTTGGVLGGHAIKILGWGV-EEGTKYWLVANSWNNEWGDNGFFKILRGSNECGIES 346
Query: 223 DVVAGLP 229
D+ G+P
Sbjct: 347 DINFGIP 353
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 172 bits (437), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 98/232 (42%), Positives = 131/232 (56%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCW+F A +DR C+ G N LS +L CC CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAFCC-MDCGKGCGGGYPIKAWKY 165
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F GV T E C PY +D G + G +P +C + C K +
Sbjct: 166 FRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNTCGGKPMERNHQCPKTCYGKTTV--QD 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
++ + + Y INS E I ++ GPVE SF VY+DF+ YKSG+Y+ GGH++K
Sbjct: 224 RYKTKNEYVINS-IETIEQDLMTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGGHSIK 282
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+IGWG ++G YW+ N W++ WG G FKI +G NECGIE V AG+PS+
Sbjct: 283 IIGWG-EENGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPST 333
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 172 bits (437), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 89/186 (47%), Positives = 112/186 (60%), Gaps = 18/186 (9%)
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKC 106
C+GGYPI AW+++V HG+VT C PY + G + P C E PTPKC
Sbjct: 14 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73
Query: 107 VRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
V C N + KH+ +AY + E I EI +GP+EV+FTVYEDF Y +G
Sbjct: 74 VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTG 133
Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
VY H G +GGHAVK++GWG D+G YW++AN WN +WG GYF+I RG NECGIE
Sbjct: 134 VYVHTAGKSLGGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIEHS 192
Query: 224 VVAGLP 229
VAGLP
Sbjct: 193 AVAGLP 198
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/231 (42%), Positives = 122/231 (52%), Gaps = 21/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A +++DR+C IH L +S DLLACCG CG GC GG P AW YF
Sbjct: 112 QSSCGSCWAVAAATSMTDRYCTIHGVRGLRISAADLLACCGD-CGYGCLGGDPDMAWAYF 170
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKN---QLWRNSKH 122
G+ + C PY CSH YP TP C C + +R K
Sbjct: 171 SSEGIASGRCQPY-PFPRCSHYTNSTTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKS 229
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
YS S ED E+Y GP + F V+ D YK GVYKH+ G +G HAV+++G
Sbjct: 230 YSFSG------EEDFRRELYFRGPFQAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVG 283
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
WG + G YW +AN WN WG GYF + RG NECGIE+ AG+P+ N
Sbjct: 284 WG-NQSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGVPAIPN 333
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/232 (42%), Positives = 124/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 105 QGKCGSCWAFGTSSAFADRLCIATNGEFNELLSAEELTFCC-HKCGFGCHGGYPIKAWER 163
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WR 118
F HG+VT E C PY D G + +PA +C R C L ++
Sbjct: 164 FQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFK 223
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H++ AY + I ++ GP+E S+ VY+DF YKSGVY + +GGHA
Sbjct: 224 KDHHFTRDAYYLTFGI--IQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATYLGGHA 281
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW++ N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 282 VKLIGWG-EEYGVPYWLMVNSWNDQWGDKGLFKIRRGTNECGIDNSTTGGVP 332
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 98/238 (41%), Positives = 131/238 (55%), Gaps = 17/238 (7%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
N + + QG+CGSCWAF A +DR CI + N LS + +CC + CG GC
Sbjct: 96 NCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAEHVTSCC-YRCGLGCQ 154
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKN 114
GGYPI AWRY+ HG+VT E C PY + C + KC +KC
Sbjct: 155 GGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQSEKNHKCQKKCFGNT 214
Query: 115 QL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGD 171
+ +R + Y S Y + D ++ +I GP+E SF VY+DF YKSGVY K
Sbjct: 215 SISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT 272
Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GGH+VK IGWG + YW++ N WN +WG GYFKI+RG+NEC +E+ AG+P
Sbjct: 273 YLGGHSVKCIGWGV-ERNVSYWLMMNSWNSTWGDGGYFKIRRGTNECQVEDSSTAGVP 329
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 125/231 (54%), Gaps = 20/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAFG A +DR C+ G N LS L CC + CG GC GG PI AW+Y
Sbjct: 108 QGNCGSCWAFGTTGAFADRLCVATGGGFNEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKY 166
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F G+ T E C PY +D G +P KC R C + +
Sbjct: 167 FKRRGITTGGDYGSNEGCAPYKVPPCYDDQGEFLCQGKPTEHNHKCPRACYGNSTV---E 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVK 179
Y + + + + I +I GPVE SF VY+DF YKSG+Y+ + +GGH+VK
Sbjct: 224 NRYKVESIYVLDSFKTIEQDIRTYGPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVK 283
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
LIGWG +DG YW+L N W++ WG G F+I +G NECGIE AG+PS
Sbjct: 284 LIGWG-EEDGIPYWLLVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIPS 333
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 132/231 (57%), Gaps = 29/231 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH +N LS +DL++CC +CG GC+GG+P +AW Y
Sbjct: 108 QGSCGSCWAFGAVEAMSDRVCIHSEGKVNFHLSADDLVSCC-HICGFGCNGGFPGAAWSY 166
Query: 73 FVHHGVV-------TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WR 118
+ G+V T+ C PY + C H P C TP C KC + +
Sbjct: 167 WTRKGIVSGGPYGSTQGCRPY-EIAPCEHHVNGTRPPCSHG-STPSCQHKCQASYSVEYA 224
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K++ +Y + + +I EI NGPVE +FTVYED YKSGVY+H G +GGHA+
Sbjct: 225 KDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVYQHEHGKELGGHAI 284
Query: 179 KLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+++GWG + + YW++ N WN WG + + CGIE + AGL
Sbjct: 285 RILGWGVWGESKVPYWLIGNSWNTDWGDN---------DHCGIESSISAGL 326
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 123/232 (53%), Gaps = 22/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF-GMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q C SCWAFG VE +DR CI G N + LS D+L CC CG C GGY AW Y
Sbjct: 91 QSSCASCWAFGVVEVATDRICIESKGKNQVRLSAEDVLECCKD-CGFQCQGGYSAMAWEY 149
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLW 117
GVVT E C Y CSH G E YP PKC C + +
Sbjct: 150 LRRTGVVTGGQYNSTEWCKSY-PFPPCSH-GIEGQYPQCSTKPPVVPKCETTCQEGYPIE 207
Query: 118 RNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
Y S Y++ ++ + I EI +NGPV+ SF VYEDF YKSG+Y H+ G M H
Sbjct: 208 YEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYEDFMTYKSGIYHHVEGKFMNLH 267
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
VK+IGWG ++GE YW N WN WG +G F+I+ G+NEC IE V GL
Sbjct: 268 TVKIIGWG-EENGEAYWKAVNSWNSEWGENGLFRIRLGTNECTIESQVEGGL 318
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 130/231 (56%), Gaps = 7/231 (3%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
E + + +G CGSCWAF AVE +SDR C+ S ++++CC CG GC GG
Sbjct: 98 ESIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCT-ACGGGCRGG 156
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHY 123
+ ++Y+V +G+ + Y GC + TP+C + CV + W +
Sbjct: 157 FLNEPYKYWVTNGIPSG--GDYGSKLGCKPYTAAVSGETPQCQKACVSGYEKSWEKDLRH 214
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGW 183
+ SAY++N I EI NGPV VYEDF Y +G+Y+H +G +GGHAVK+IGW
Sbjct: 215 ATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTGIYQHTSGSFVGGHAVKIIGW 274
Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
G+ +D YWI AN W +G DG+F+I RGSN GIE +VAG P++ +
Sbjct: 275 GSEND-VPYWIAANSWGTGFGEDGFFRILRGSNCAGIESYIVAGYPNTSEV 324
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 124/213 (58%), Gaps = 18/213 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAF E+L DRF I LS DL++C G C+GGY ++W + +
Sbjct: 84 QEKCGSCWAFSIAESLGDRFGILGCGKGHLSPQDLISCDSNDLG--CNGGYQENSWTWVL 141
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
G+ TE C PY +G P C +CV + L RN+ I+ YR D
Sbjct: 142 TTGITTESCWPYRSGSG----------RIPSCPHRCVNGSVLQRNT----INNYR-RLDS 186
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
++ E+Y NGP++V++ VYEDF +Y G+YKH++G+ +GGHAV L+GWG +DG YW+
Sbjct: 187 SELQDELYNNGPIQVTYVVYEDFFYYSKGIYKHLSGNKVGGHAVVLMGWGI-EDGVKYWL 245
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ N W WG GYF+I RGSNECGIE AG
Sbjct: 246 VQNSWGYEWGEQGYFRILRGSNECGIESSAYAG 278
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/235 (40%), Positives = 131/235 (55%), Gaps = 15/235 (6%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q CGSCWA A+SDR C G+ L +S L++CC CG GCDG
Sbjct: 102 NCPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCED-CGYGCDG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQ 115
GYP ++W Y+V HG+ + C PY C H G + P TPKC C K
Sbjct: 161 GYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDKAI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
K+ +Y ++ + +D E+Y NGP V F VY DF YK+GVY+H++GD +GG
Sbjct: 220 PL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLGG 276
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
HAV+++GWG +G YW +AN W+ WG +G+ RG+NECGIE AG P+
Sbjct: 277 HAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPA 330
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 123/221 (55%), Gaps = 11/221 (4%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSC+ A++DR+CIH G + D LACC CDGGY W+Y
Sbjct: 103 QGCCGSCYVVSTAAAITDRYCIHSGGQKQFTFGATDYLACCTDCFK--CDGGYVGKTWQY 160
Query: 73 FVHHGVVTEECDPYFDSTGC-SHPGCEPAY--PTPKCVRKCVKKNQL-WRNSKHYSISAY 128
+V G+ +E PY GC S+P P P C R C L + Y SAY
Sbjct: 161 WVDSGLTSE--GPYKSGQGCNSYPFGSYCVNDPLPTCSRTCQAGYPLTYSQDLKYGGSAY 218
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
R+ + IM EIY+NGPV V F V+ DF YKSGVY+H+TG G HAV++IGWG ++
Sbjct: 219 RVMWNENAIMTEIYQNGPVVVQFEVFADFYQYKSGVYRHVTGATEGWHAVRVIGWGV-EN 277
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G YW++AN W WG G+FK RG N GIE+ V AGLP
Sbjct: 278 GVKYWLVANSWGVRWGDKGFFKFVRGENHLGIEDFVYAGLP 318
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 96/233 (41%), Positives = 127/233 (54%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA G A +DR C+ N +S +L CC CG GC+GGYP+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCC-HRCGFGCNGGYPLKAWQY 164
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HGVVT + C PY D G + +P KC +KC + +
Sbjct: 165 FKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYK 224
Query: 120 SKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY AY + + +Y GP+E SF VY+DF +Y+SGVY+ +GGHA
Sbjct: 225 KNHYKTKDAYYLKNTTMQKDTMVY--GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHA 282
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
VK+IGWG ++G YW++ N W WG G FKI RG++ECGIE AG+PS
Sbjct: 283 VKMIGWGV-EEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVPS 334
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 131/232 (56%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCW+F A +DR C+ G N LS +L CC CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELAFCCK-DCGQGCGGGYPIKAWKY 165
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F GV T E C PY ++ G + G +P +C + C K + +
Sbjct: 166 FRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYGKTTV--QN 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
++ + S Y INS + I ++ GPVE SF VY+DF+ YKSG+Y+ G H++K
Sbjct: 224 RYKTKSEYSINS-IKTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIK 282
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+IGWG ++G YW+ N W++ WG G FKI +G NECGIE V AG+PSS
Sbjct: 283 IIGWG-QENGTTYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 137/232 (59%), Gaps = 17/232 (7%)
Query: 11 ILVIQGH--CGSCWAFGAVEALSDRFCIHF-GMNLS-LSVNDLLACCGFLCGDGCDGGYP 66
I +I+ H CGSCWA A +SDR CI G N LS D+LACCG CG GC+GGYP
Sbjct: 104 IGLIRDHSACGSCWAVSAASVMSDRLCIQTNGTNQKILSSADILACCGEDCGSGCEGGYP 163
Query: 67 ISAWRYFVHHGVVT-------EECDPY-FDSTGCSHPGC--EPAYPTPKCVRKCVKKNQL 116
I A+ Y + GV + C PY F ++ C E A+ TPKC + C + +
Sbjct: 164 IQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNYGPCPKEGAFDTPKCRKICQFRYPV 223
Query: 117 -WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ K + +++ + D E I EI+ NGPV +F V+EDF HYK G+YK G +G
Sbjct: 224 PYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEGIYKQTYGKWIG 283
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
HA+KLIGWGT ++G DYW++AN +N WG +G F+I RG+N C IE V+A
Sbjct: 284 VHAIKLIGWGT-ENGTDYWLVANSYNYDWGENGTFRILRGTNHCLIESQVIA 334
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/215 (41%), Positives = 125/215 (58%), Gaps = 20/215 (9%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR CIH N S +L++CC + CG G
Sbjct: 34 WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGAKNFHFSAENLVSCC-WTCGFG 92
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
C+GG+P +AW Y+ G+V+ PY GC + C+ TP C
Sbjct: 93 CNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAPCEHHVNGTRGPCKEGGKTPAC 150
Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
V+KC ++ + H SAY + +D + I EIY NGPVE +FTVYEDF Y++GVY
Sbjct: 151 VKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 210
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 200
KH+ G +GGHA++++GWG + YW++AN WN
Sbjct: 211 KHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 245
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 102/246 (41%), Positives = 135/246 (54%), Gaps = 35/246 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG---FLCGDGCDGGYPISA 69
Q CGSCWAFG VEA + R CI G +N LS D+LACC F GC GG PI++
Sbjct: 123 QSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAADMLACCNIGHFCLSFGCSGGNPITS 182
Query: 70 WRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCVR 108
W + +G+V+ + C PY + C+H P + Y TP C
Sbjct: 183 WTFLHTNGIVSGGGFVPEKNMKAADGCWPY-NFPKCAHHQKESDYKPCAKEIYDTPSCSS 241
Query: 109 KC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C K + +HY+ S + R S I EI NGP +F+VYEDF YKSGV
Sbjct: 242 SCPNAKYGTAFDKDRHYTESLFPSRFGST-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGV 300
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
YKH +G +GGHAV++IGWGT + G DYW++ N WN WG G FKI +G +CGI++ +
Sbjct: 301 YKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDMI 357
Query: 225 VAGLPS 230
+AG P+
Sbjct: 358 LAGTPA 363
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 105/230 (45%), Positives = 121/230 (52%), Gaps = 32/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
Q CGSCWAF A E LSDRF I + +N LS DL++C GD GC GGY AW
Sbjct: 114 QARCGSCWAFAASEVLSDRFAIASNGTVNKILSPEDLVSCDK---GDMGCQGGYLDKAWD 170
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y +G+VTE C PY G + P C CV K Y S Y
Sbjct: 171 YLKTNGIVTESCFPYAAQKGVA----------PSCRISCVDGEPY----KKYKASDYYQL 216
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-GGHAVKLIGWGTS---- 186
+ EDIM EIY NGPVE F VY F YKSGVY H D+M GGHA+K++GWG
Sbjct: 217 TTEEDIMKEIYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKR 276
Query: 187 --DDGEDYWILANQWNRSWGADGYFKIKRGSN-----ECGIEEDVVAGLP 229
YWI AN W WG +G+FKI+RG N ECGIE+ V AG P
Sbjct: 277 FWQKPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFAGHP 326
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 97/242 (40%), Positives = 131/242 (54%), Gaps = 21/242 (8%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
N + + + QG+CGSCWA A +DR C+ + N LS +L CC CG GC+
Sbjct: 100 NCKTIGAIRDQGNCGSCWALATSSAFADRLCVVSNEDFNQLLSAEELTFCC-HKCGFGCN 158
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRK 109
GGYPI AW +F HG+VT E C+PY +D +G + +P +C R
Sbjct: 159 GGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPYDESGNNTCAGKPMEANHRCTRM 218
Query: 110 CVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KH 167
C L + H Y+ +Y + I ++ GPVE SF VY+DF YKSGVY +
Sbjct: 219 CYGDQDLDFDEDHRYTRDSYYLTYG--SIQKDVLTYGPVEASFDVYDDFPSYKSGVYIRS 276
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+GGHA KLIGWG + G YW++ N WN WG +G FKI+RG+NECGI+ G
Sbjct: 277 ENASYLGGHAAKLIGWG-EEYGVPYWLMVNSWNADWGDNGLFKIQRGTNECGIDNSTTGG 335
Query: 228 LP 229
+P
Sbjct: 336 VP 337
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 170 bits (430), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 123/230 (53%), Gaps = 14/230 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q C + WA A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
V +G+ + C PY C H G + + TPKC C K+ K+
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKSVPL--IKYRGN 228
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
+ Y + ED E+Y NGP F VY D YKSGVY+++ GD +GG AVK++GWG
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGK 288
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
+G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 289 L-NGTPYWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 337
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 170 bits (430), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 126/225 (56%), Gaps = 54/225 (24%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCWAFGAVEA+SDR C + VN
Sbjct: 102 QGSCGSCWAFGAVEAISDRIC--------IHVNG-------------------------- 127
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 133
S P C TPKC + C + ++ KHY ++Y +++
Sbjct: 128 ------------------SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNS 169
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
+DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW
Sbjct: 170 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYW 228
Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 229 LVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 273
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 170 bits (430), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 98/240 (40%), Positives = 129/240 (53%), Gaps = 19/240 (7%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCD 62
N + + Q +CGSCWA A +SDR CI + S D+L+CC + CG GCD
Sbjct: 101 NCTSIRTIRDQSNCGSCWAVSAASVMSDRLCIQSNGTIQSWASDTDILSCC-WNCGMGCD 159
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-------PGCEPAYPTPKCVR 108
GG P +A+ + + +GV T C PY H P + +PTPKC +
Sbjct: 160 GGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYFGPCPKELWPTPKCRK 219
Query: 109 KC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C +K N +++ K Y AY + ++ IM EI+ NGPV SF+V+ DFA YK GVY
Sbjct: 220 MCQLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKGVYVS 279
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
G HAVK+IGWG D G YW++AN WN WG +GY + RG N CGIE VV G
Sbjct: 280 NGIQQNGAHAVKIIGWGVQD-GLKYWLIANSWNNDWGDEGYVRFLRGDNHCGIESRVVTG 338
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 170 bits (430), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 122/192 (63%), Gaps = 16/192 (8%)
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 107
C+GGYP AW ++ G+V+ C PY C H P C TPKC
Sbjct: 87 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 145
Query: 108 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 146 KICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 205
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVA
Sbjct: 206 HVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA 264
Query: 227 GLPSSKNLVKEI 238
G+P + ++I
Sbjct: 265 GIPRTDQYWEKI 276
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 169 bits (429), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 95/230 (41%), Positives = 121/230 (52%), Gaps = 14/230 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q C + WA A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
V +G+ + C PY C H G + + TPKC C K K+
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGN 228
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
+ Y + ED E+Y NGP F VY D YKSGVY+H+ GD +GG AVK++GWG
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGK 288
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
+G YW LAN W+ WG GY I RG+NEC IE AG P + L
Sbjct: 289 L-NGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 169 bits (429), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 95/230 (41%), Positives = 121/230 (52%), Gaps = 14/230 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q C + WA A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
V +G+ + C PY C H G + + TPKC C K K+
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGN 228
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
+ Y + ED E+Y NGP F VY D YKSGVY+H+ GD +GG AVK++GWG
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGK 288
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
+G YW LAN W+ WG GY I RG+NEC IE AG P + L
Sbjct: 289 L-NGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 169 bits (429), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 95/230 (41%), Positives = 121/230 (52%), Gaps = 14/230 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q C + WA A+SDR+C + G L +S LL+CC CGDGC GG+P AWRY+
Sbjct: 113 QSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYY 171
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQLWRNSKHYSI 125
V +G+ + C PY C H G + + TPKC C K K+
Sbjct: 172 VEYGITSSSCQPY-PFPRCEHQGAQGNKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGN 228
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
+ Y + ED E+Y NGP F VY D YKSGVY+H+ GD +GG AVK++GWG
Sbjct: 229 ATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGK 288
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
+G YW LAN W+ WG GY I RG+NEC IE AG P + L
Sbjct: 289 L-NGTPYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPEASQLT 337
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 169 bits (429), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 125/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFG A +DR CI N LS +L CC CG GC GGYPI AW
Sbjct: 105 QGKCGSCWAFGTSSAFADRLCIATDGDFNELLSAEELTFCC-HTCGYGCHGGYPIKAWER 163
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCV-KKNQLWR 118
F HG+VT E C PY D G + +PA +C R C +++ ++
Sbjct: 164 FKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYGNNTCRGKPAEKNHRCTRMCYGDQDRDFK 223
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
++ AY + I ++ GP+E S+ VY+DF YKSGVY + +GGHA
Sbjct: 224 EDHRFTRDAYYLTYGT--IQKDVMTYGPIEASYEVYDDFPSYKSGVYVRTENATYLGGHA 281
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW++ N WN WG G FKI+RG+NECGI+ G+P
Sbjct: 282 VKLIGWG-EEYGVPYWLMVNSWNDQWGDRGLFKIRRGTNECGIDNSTTGGVP 332
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 169 bits (428), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 99/238 (41%), Positives = 129/238 (54%), Gaps = 23/238 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR CI + N +S +L++CC + CG GC+GG+P +AW +
Sbjct: 114 QGNCGSCWAVSVAAAFADRLCIASNAKWNGHISSRELMSCCSY-CGFGCEGGFPDAAWVF 172
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCE--PAYPTPKCVRKCVKKNQL- 116
HG+VT + C PY C H P C P PTP C C + L
Sbjct: 173 IKRHGLVTGGDYHSHDGCQPY-PIAPCEHHMEGSKPNCSASPTEPTPACETTCTHGSSLA 231
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HITGDVMGG 175
++ + SAY + + EI+KNGP+ +F VYEDF YKSGVYK H G
Sbjct: 232 YQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVYKRHPESPFRGR 291
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
HAVK+IGWG +G YW++ N W+ WG G FKI RG NEC E+ + AGLP K
Sbjct: 292 HAVKVIGWG-EQNGLPYWLVQNSWDYDWGDKGLFKIARG-NECDFEKSMTAGLPKYKK 347
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 169 bits (428), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 125/230 (54%), Gaps = 20/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAFG A +DR C+ G N LS D+ CC CG GC+GGYPI AW+Y
Sbjct: 107 QGNCGSCWAFGTTGAFADRLCVSTGGKFNELLSPEDVAFCCQ-NCGKGCEGGYPIKAWQY 165
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F GV T E C PY FD G + +P +C + C +
Sbjct: 166 FRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNTCAGKPLERNHQCPKTCYGSTTV---Q 222
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVK 179
K Y + + + P + ++ K GP+E SF +++D + YKSG+Y K + GH++K
Sbjct: 223 KRYKVKNEYVLNSPNTMEQDLIKYGPIEASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIK 282
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+IGWG ++G YW+ N W++ WG G F+I +G NECGIE AG+P
Sbjct: 283 IIGWG-KENGVPYWLAVNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 331
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 169 bits (428), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 97/238 (40%), Positives = 130/238 (54%), Gaps = 17/238 (7%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCD 62
N + + QG+CGSCWAF A +DR CI + N LS + +CC + CG GC
Sbjct: 96 NCTTIGTIRDQGNCGSCWAFSTSGAFADRLCIASNGSFNQLLSAEHVTSCC-YRCGLGCQ 154
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKN 114
GGYPI AWRY+ HG+VT E C PY + C + KC +KC
Sbjct: 155 GGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTGNNSCSGQSEKNHKCQKKCFGNT 214
Query: 115 QL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGD 171
+ +R + Y S Y + D ++ +I GP+E SF VY+DF YKSGVY K
Sbjct: 215 SISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT 272
Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GGH+VK IGWG + YW++ N WN +WG G FKI+RG+NEC +E+ AG+P
Sbjct: 273 YLGGHSVKCIGWGV-ERNVSYWLMMNSWNNTWGDGGNFKIRRGTNECQVEDSSTAGMP 329
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 135/237 (56%), Gaps = 15/237 (6%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
N E + + QG CGSCWA A +SDR CIH +N++L+ DL+ CC CG+GC+
Sbjct: 99 NCESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLMGCC-VDCGNGCN 157
Query: 63 GGY-PISAWRYFVHHGVV-------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK 113
GG+ ++++Y+V G+V T+ C PY C +P + +PKC C
Sbjct: 158 GGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPY-PFKPCEYPFNDCHVEISPKCTHHCRDG 216
Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
++ + K + AY + D I EI NGPVE F VYED YKSGVY+H+ G+
Sbjct: 217 VDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSGVYRHVYGEQ 276
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+G HAV++IGWG D G YW++AN + WG GYFK RGSN GIE ++ GLP
Sbjct: 277 IGKHAVRIIGWG-RDGGIPYWLIANSYGDDWGDHGYFKFVRGSNHLGIESKIITGLP 332
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 135/232 (58%), Gaps = 20/232 (8%)
Query: 17 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRY 72
C S WAF A E++SDR CI+ G ++ LS +LL+CC G L CG+GC GG P+ AW+Y
Sbjct: 109 ECKSSWAFAAAESMSDRLCINSGGMIDTILSAQELLSCCTGVLSCGEGCAGGNPLKAWQY 168
Query: 73 FVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
+ HG+ T C PY + ++P C PTP C +KC +
Sbjct: 169 WQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTCEKKCKPGYPVDL 228
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+HY +S ++ + +I +++ NGPVE + +Y+DF Y +G+Y H+ G+ G +
Sbjct: 229 DKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGIYVHLAGNKQGHLS 288
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V+++GWG +G YW+LAN W + WG +G F++ RG NECG+E + ++G+P
Sbjct: 289 VRILGWGMF-EGVPYWLLANSWGKEWGENGTFRVLRGVNECGLEANCISGMP 339
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/235 (38%), Positives = 135/235 (57%), Gaps = 23/235 (9%)
Query: 17 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRY 72
C S WAF A E++SDR CI+ G +N LS +LL+CC G L CG+GC GG AW+Y
Sbjct: 96 ECKSSWAFAAAESMSDRLCINSGGMINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQY 155
Query: 73 FVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKCVRKCVKKNQL-- 116
+ HG+ T C PY + ++P C PTP C +KC KN
Sbjct: 156 WGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPV 215
Query: 117 -WRNSKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+HY S+ ++ + +I +++ NGP+E +F VY+DF Y +G+Y H+TG+ G
Sbjct: 216 DIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQG 275
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+V+++GWG +G YW+LAN W + WG +G F+ RG+NECG+E + V+ +P
Sbjct: 276 HLSVRILGWGMY-EGVPYWLLANSWGKEWGENGTFRALRGTNECGLEANCVSAMP 329
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/235 (38%), Positives = 124/235 (52%), Gaps = 12/235 (5%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPI 67
+ + Q C + WA A+SDR+C + G L +S DL+ACC CGDGC GG+P
Sbjct: 106 IREIADQSECRASWAVSTASAISDRYCTVGGGKQLRISAADLMACCK-QCGDGCKGGFPG 164
Query: 68 SAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
AW Y+V +G+ + +C PY + G P + + TPKC C K+
Sbjct: 165 FAWLYYVEYGITSSQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--V 222
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
K+ + Y + ED E+Y NGP F VY D YKSGVY+++ GD +GG AV++
Sbjct: 223 KYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRI 282
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
+GWG +G YW +AN W+ WG +GY I RG+NEC IE G P L
Sbjct: 283 VGWGKL-NGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFPDPSQLT 336
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 93/232 (40%), Positives = 128/232 (55%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 110 QGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCCS-SCGYGCNGGYPIKAWES 168
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F + G+VT E C+PY +D+ G + +P +C R C L N
Sbjct: 169 FNNRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYN 228
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H ++ +Y + I ++ + GP+E SF +Y+DF YKSGVY + +GGHA
Sbjct: 229 DDHRFTRDSYYLTY--SSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW++ N WN WG +G FKI+RG+NECGI+ G+P
Sbjct: 287 VKLIGWG-EEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGIDNSTTGGVP 337
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/233 (40%), Positives = 126/233 (54%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA G A +DR C+ N +S +L CC C GC+GGYP+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCVATNGEFNELISAEELTFCC-HRCVFGCNGGYPLKAWQY 164
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HGVVT + C PY D G + +P KC +KC + +
Sbjct: 165 FKRHGVVTGGDYDTTDGCQPYRVPPCVKDDEGHNSCSGQPTERNHKCSKKCYGDDTIDYK 224
Query: 120 SKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY AY + + +Y GP+E SF VY+DF +Y+SGVY+ +GGHA
Sbjct: 225 KNHYKTKDAYYLKNTTMQKDTMVY--GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHA 282
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
VK+IGWG ++G YW++ N W WG G FKI RG++ECGIE AG+PS
Sbjct: 283 VKMIGWGV-EEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGIESSCTAGVPS 334
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/232 (41%), Positives = 127/232 (54%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 110 QGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAWER 168
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F G+VT E C+PY +D+ G + +P +C R C L +
Sbjct: 169 FKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFD 228
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H Y+ +Y + I ++ GP+E SF VY+DF YKSGVY K +GGHA
Sbjct: 229 EDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW++ N WN WG +G FKI+RG+NECGI+ AG+P
Sbjct: 287 VKLIGWG-EEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 95/233 (40%), Positives = 125/233 (53%), Gaps = 21/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA G A +DR CI N +S +L CC CG GC+GG P+ AW+Y
Sbjct: 106 QGNCGSCWAHGTTGAFADRLCIATDGEFNELISAEELTFCC-HTCGFGCNGGNPLKAWKY 164
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HGVVT + C PY D G + +P KC +KC +
Sbjct: 165 FKRHGVVTGGNYNTTDGCQPYRVPPCVRDDEGHNSCSGQPTERNHKCSKKCYGDETINYK 224
Query: 120 SKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHA 177
HY AY +++ +Y GP+E SF VY+DF Y+SGVY+ +GGHA
Sbjct: 225 KNHYKTKDAYYLSNTTMQKDTMVY--GPIEASFDVYDDFTSYESGVYQKTENASYLGGHA 282
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
VK+IGWG ++G YW++ N W WG G FKI RG++ECG+E AG+PS
Sbjct: 283 VKMIGWGV-EEGTPYWLMVNSWGEQWGDKGMFKILRGTDECGVESSCTAGVPS 334
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 101/246 (41%), Positives = 134/246 (54%), Gaps = 35/246 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG---FLCGDGCDGGYPISA 69
Q CGSCWAFG VEA + R CI G +N LS ++LACC F GC GG PI++
Sbjct: 2 QSACGSCWAFGTVEAFNARVCIKSGGKLNQLLSAANMLACCNIGHFCLSFGCSGGNPITS 61
Query: 70 WRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCVR 108
W + +G+V+ + C PY C+H P + Y TP C
Sbjct: 62 WTFLHTNGIVSGGGFVPEKNMKAADGCWPY-SFPKCAHHQDGSDYKPCAKEIYDTPSCSS 120
Query: 109 KC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C K + +HY+ S + R S I EI NGP +F+VYEDF YKSGV
Sbjct: 121 SCPNAKYGTAFDKDRHYTESLFPSRFGST-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGV 179
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
YKH +G +GGHAV++IGWGT + G DYW++ N WN WG G FKI +G +CGI++ +
Sbjct: 180 YKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDTI 236
Query: 225 VAGLPS 230
+AG P+
Sbjct: 237 LAGTPA 242
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 127/216 (58%), Gaps = 19/216 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A E LSDRFCI G +++ LS +++C GCDGGY +AW +
Sbjct: 33 QEQCGSCWAFSASEVLSDRFCIASGGKVDVVLSPQYMVSCDS--TDYGCDGGYLNNAWAF 90
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+ +++C PY G V C K Q + K Y + +
Sbjct: 91 LAGTGIPSDKCAPYTSQNGD--------------VAACPSKCQDGSSVKLYKAKNPQQLN 136
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGED 191
D IM ++ +NGPV+ +F+VY DF YKSGVY H++G ++GGHA+K++GWG S +
Sbjct: 137 DIPSIMEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIKMVGWGVDSATNKP 196
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
YWI+AN W SWG +G+F I RGS+ECGIE++V +G
Sbjct: 197 YWIIANSWGPSWGLNGFFWILRGSDECGIEDNVWSG 232
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/232 (41%), Positives = 127/232 (54%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 110 QGNCGSCWAMATSSAFADRLCVATNADFNELLSAEEITFCC-HSCGFGCNGGYPIKAWER 168
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F G+VT E C+PY +D+ G + +P +C R C L +
Sbjct: 169 FKKRGLVTGGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFD 228
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H Y+ +Y + I ++ GP+E SF VY+DF YKSGVY K +GGHA
Sbjct: 229 EDHRYTRDSYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENATYLGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW++ N WN WG +G FKI+RG+NECGI+ AG+P
Sbjct: 287 VKLIGWG-EEYGVPYWLMVNSWNADWGDNGLFKIRRGTNECGIDNSTTAGVP 337
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 103/245 (42%), Positives = 134/245 (54%), Gaps = 31/245 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAW 70
QG CGSCWAF + EAL+DRFCI G +LS +CC L GC GG P AW
Sbjct: 260 QGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAW 319
Query: 71 RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC--- 110
R+F + GVVT + C PY + C H P CE P PKC + C
Sbjct: 320 RWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEA 378
Query: 111 --VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
K + +++ H++ SAY + + I E+ +NG + +F VYEDF YK GVY H+
Sbjct: 379 EYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV 437
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
TG MGGHAVK+IG+G ++DG DYW+ N WN WG G FKI+ G E GI+++ G
Sbjct: 438 TGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGE 494
Query: 229 PSSKN 233
P N
Sbjct: 495 PKVPN 499
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 103/245 (42%), Positives = 134/245 (54%), Gaps = 31/245 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGDGCDGGYPISAW 70
QG CGSCWAF + EAL+DRFCI G +LS +CC L GC GG P AW
Sbjct: 260 QGDCGSCWAFASTEALNDRFCIKSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAW 319
Query: 71 RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAYP-TPKCVRKC--- 110
R+F + GVVT + C PY + C H P CE P PKC + C
Sbjct: 320 RWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEA 378
Query: 111 --VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
K + +++ H++ SAY + + I E+ +NG + +F VYEDF YK GVY H+
Sbjct: 379 EYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV 437
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
TG MGGHAVK+IG+G ++DG DYW+ N WN WG G FKI+ G E GI+++ G
Sbjct: 438 TGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGE 494
Query: 229 PSSKN 233
P N
Sbjct: 495 PKVPN 499
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 121/222 (54%), Gaps = 13/222 (5%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFG EA +DR CI + LS ++ AC F GC GG P SAW +
Sbjct: 165 QSACGSCWAFGVTEAFNDRLCIKSNGAFTELLSAGEMNACTLFF---GCGGGDPYSAWSW 221
Query: 73 FVHHGVVTEE-CDPYFDSTGCSHP--GCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISA 127
G+ T E P S + P + YPTP CV +C K R+ +H+ + +
Sbjct: 222 VHDKGIATGEGSRPKRVSESEAIPVIAYQDIYPTPNCVEQCRNPKYTTTLRDDRHFMLES 281
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
+ D I +GPV SFTVYEDF YKSGVYKH +G +GGHAVK+IGWG
Sbjct: 282 SPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EK 340
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G+ YW+ N WN WG G FKI G+ CGI++D++ G P
Sbjct: 341 SGQAYWLAVNSWNEDWGDKGLFKIALGN--CGIDDDLLGGTP 380
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 92/230 (40%), Positives = 127/230 (55%), Gaps = 25/230 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C S +A AV ++DR+C+H + D+L+CC CG GCDGG P + W Y
Sbjct: 153 QGCCDSSYAVAAVSTMTDRWCVHSEGKAQFNFGAYDVLSCC-HRCGFGCDGGVPSAVWHY 211
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP------------TPKCVRKCVKK-NQLWRN 119
+V +G+ + SH GC+ +YP TP+C+R C N +
Sbjct: 212 WVENGITS-------GGAFGSHEGCQ-SYPFDVCKKSGDSNDTPRCLRFCQPGYNVTYPE 263
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY AY + D E IM E++ GP + +FT+Y DF YKSGVY+H G +G H+VK
Sbjct: 264 DKHYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSGVYRHTFGVRVGTHSVK 323
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++GWG +D + YW+ AN W WG G+FKI RG + E +VVAGLP
Sbjct: 324 VMGWGVENDVK-YWLCANSWGAQWGDGGFFKIVRGEDHLSFETNVVAGLP 372
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 97/234 (41%), Positives = 130/234 (55%), Gaps = 25/234 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW
Sbjct: 110 QGNCGSCWAIATSSAFADRLCVATNADFNQLLSAEEITFCC-HKCGYGCNGGYPIKAWER 168
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT E C+PY +D +G + +P +C R C L +
Sbjct: 169 FKKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCSGKPMEQNHRCTRMCYGDQDLDFD 228
Query: 120 SKH-YSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGG 175
H ++ +Y I S +D+M GP+E SF VY+DF YKSGVY + +GG
Sbjct: 229 DDHRHTRDSYYLTIGSIQKDVMTY----GPIEASFDVYDDFLSYKSGVYVRSENASYLGG 284
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAVKLIGWG + G YW++ N WN WG +G FKI+RG+NECG++ AG+P
Sbjct: 285 HAVKLIGWG-EEYGTPYWLMMNSWNADWGDEGLFKIRRGTNECGVDNSTTAGVP 337
>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
Length = 206
Score = 167 bits (422), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 73/86 (84%), Positives = 80/86 (93%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHFG+++ LSVNDLLACCGFLCG GCDGGYPISAW+
Sbjct: 121 ILDQGHCGSCWAFGAVESLSDRFCIHFGVDVPLSVNDLLACCGFLCGSGCDGGYPISAWK 180
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGC 97
YF HHGVVTEECDPYFD GCSHPGC
Sbjct: 181 YFAHHGVVTEECDPYFDQIGCSHPGC 206
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 166 bits (421), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 93/222 (41%), Positives = 127/222 (57%), Gaps = 14/222 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAF A E +SDR C+ + + S DL+ CC CG C GGY AW+Y
Sbjct: 96 QGKCGSCWAFAAAEVMSDRLCVATNGSVKFEFSPEDLINCCE-TCGKKCKGGYSYYAWKY 154
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKCVRKC--VKKNQLWRNSKHYSISA 127
+ G+V+ Y S GC P + + +P+C + C K + N +H+
Sbjct: 155 YTSTGLVSG--GDYNTSRGC-QPYSKSNFNDGVSPECSKTCQNTKYPTSYLNDRHFGDGT 211
Query: 128 YRINSDPEDIMAEIY-KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
Y I + I EI + GPV F VYEDF Y+ GVY H +G ++G HAVK+IGWGT
Sbjct: 212 YYILKNVTTIQQEILLRGGPVMAGFDVYEDFKLYREGVYVHTSGALLGSHAVKIIGWGT- 270
Query: 187 DDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
++G YW++AN W + WGA G FKI+RG+NEC IE+ ++ G
Sbjct: 271 ENGWAYWLVANSWGKDWGALGGVFKIRRGTNECKIEQSIITG 312
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 166 bits (421), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 96/233 (41%), Positives = 123/233 (52%), Gaps = 31/233 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGS WA AV A+SDR CI G S CG GCDGG+ +W Y+V
Sbjct: 112 QSQCGSSWAVSAVGAISDRICIQSGGKQSY------------CGSGCDGGFLGPSWDYWV 159
Query: 75 HHGVVTEECDPYFDSTGCS---HPGCE------------PAYPTPKCVRKCVKK-NQLWR 118
G+VT + TGC P C+ Y TP+C + C K N +
Sbjct: 160 LRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYE 217
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +Y + S I +I +GPVE +YEDF +YKSG+Y++ TG + GHAV
Sbjct: 218 QDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAV 277
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+LIGWG ++G YW+ AN WN WG GYF+I RG NEC IE ++ AGL S
Sbjct: 278 RLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAGLIKS 329
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 166 bits (421), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 126/232 (54%), Gaps = 18/232 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR C+ N LS ++ CC CG GC GGYPI AW+
Sbjct: 110 QGNCGSCWAVATSSAFADRLCVATTGDFNELLSAEEITFCC-HTCGFGCHGGYPIKAWKR 168
Query: 73 FVHHGVVT-------EECDPYF---DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
F HG+VT E C+PY + G S +P C R C + N H
Sbjct: 169 FSTHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHCYGNQSIDFNDDH 228
Query: 123 -YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKL 180
Y+ Y + I ++ GP+E SF VY+DF YKSGVY K +GGHAVKL
Sbjct: 229 RYTRDYYYLTYG--SIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKL 286
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
IGWG +DG YW++ N WN WG +G+FKI+RG+NECG++ AG+P +
Sbjct: 287 IGWG-EEDGTPYWLMVNSWNTQWGDNGFFKIRRGTNECGVDNSTTAGVPVTN 337
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 166 bits (420), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 116/208 (55%), Gaps = 17/208 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH N SLS DLL+CC CG GC GGYP AW Y
Sbjct: 108 QSSCGSCWAFGAVEAMSDRLCIHSNGSFNKSLSAVDLLSCCK-DCGFGCRGGYPAVAWDY 166
Query: 73 FVHHGVVT--EECDPY----FDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQLWRN 119
+ HG+VT + DP + C H P C YPTP+CV+ C +
Sbjct: 167 WRTHGIVTGGSKEDPSGCRSYPFPKCDHHVQGHYPPCPRQIYPTPECVQDCDTPELGYLE 226
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
K + +Y I + IM EI GPVE FTVYEDF YKS VY H G M GHA++
Sbjct: 227 DKTRANISYNIYASEISIMKEIMLRGPVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIR 286
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADG 207
++GWG D YW++AN WN WG G
Sbjct: 287 ILGWGEEGD-VPYWLIANSWNEDWGEKG 313
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 166 bits (419), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 95/232 (40%), Positives = 125/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR C+ + N LS ++ CC CG GC+GGYPI AW+
Sbjct: 108 QGNCGSCWAVATSSAFADRLCVATNADFNELLSAEEITFCC-HTCGFGCNGGYPIKAWKR 166
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F G+VT E C+PY D G + +P +C R C L +
Sbjct: 167 FSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNNTCAGKPMESNHRCTRMCYGDQDLDFD 226
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H Y+ Y + I ++ GP+E SF VY+DF YKSGVY K +GGHA
Sbjct: 227 EDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDVYDDFPSYKSGVYVKSENASYLGGHA 284
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW++ N WN WG G+FKI+RG+NECG++ AG+P
Sbjct: 285 VKLIGWG-EEYGVPYWLMVNSWNEDWGDHGFFKIQRGTNECGVDNSTTAGVP 335
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 124/219 (56%), Gaps = 17/219 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWAF A L+DRFCI G +N+ LS +++C G +GC+GG+ + WR+
Sbjct: 145 QKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQFMVSCSG--QNNGCNGGFFDATWRF 202
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
V G V+E C PY S G + P C V+ C Q S Y + R
Sbjct: 203 LVSVGTVSEACVPYV-SFGGAVPACN--------VKSCGVPGQ---KSPFYRAGSARKLE 250
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG-TSDDGED 191
DIMA++ NGP++V+ VY DF YKSGVY H++G +GGHAVK++GWG S
Sbjct: 251 GMLDIMADLKANGPIQVAMGVYRDFYSYKSGVYHHVSGRYVGGHAVKIVGWGYDSASKLP 310
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
YWI AN W WG GYF I RG ECGI + V +G P+
Sbjct: 311 YWICANSWGEDWGIKGYFWILRGRGECGIGKMVWSGKPA 349
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/245 (38%), Positives = 133/245 (54%), Gaps = 18/245 (7%)
Query: 1 MPFTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG 58
+ +T+ + + QG CGSCWA +SDR CI MN LS D+L+CC +CG
Sbjct: 97 LRWTSCPTISEIREQGSCGSCWAIATTSVMSDRLCIGSNGVMNFRLSGLDMLSCCA-ICG 155
Query: 59 DGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----DSTGCSHPGCEPAYPTPKC 106
C GGYP +AW Y+ G+V+ + C PY S S P C +C
Sbjct: 156 FACQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEPCDHSGNGSRPVCTVGGGV-RC 214
Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
C ++ ++ K+++ Y I++D +I EI NGPV+ TVYEDF YK+GVY
Sbjct: 215 QHLCEPSYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNGPVQAILTVYEDFLSYKTGVY 274
Query: 166 KHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
H+ G+ +G HAV+++GWG YW++AN W WG +G+F I RG N C IE +
Sbjct: 275 YHLEGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYI 334
Query: 225 VAGLP 229
+AGLP
Sbjct: 335 MAGLP 339
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/231 (42%), Positives = 128/231 (55%), Gaps = 23/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA ALSDR CI L +S D+++CC LCG GCDGG+PI A+ Y
Sbjct: 116 QANCGSCWAVSTASALSDRICIASKGETQLHISSIDIVSCCK-LCGYGCDGGWPIEAFDY 174
Query: 73 FVHHGVVTEE------CDPY---------FDSTGCSHPG-CEPAYPTPKCVRKCVKKNQL 116
F G VT E C PY D+ G G C+ + + V++ V +N
Sbjct: 175 FSRQGAVTGETTSKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKR-VTRNHT 233
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
R + RI + + NGPV FTVYEDF++YK G+Y HI G G H
Sbjct: 234 RRTG--LTARRLRITEFCQSHSEGDHGNGPVVAVFTVYEDFSYYKKGIYVHIAGKARGAH 291
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
A+K+IGWG ++G YW++AN W+ WG G F+I RG NECGIE++VVAG
Sbjct: 292 AIKIIGWGV-ENGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEVVAG 341
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 128/225 (56%), Gaps = 21/225 (9%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
V ++ QG CGSCWAF A E+LSDR CI +N++LS L++C GC+GG P
Sbjct: 95 VHAVLNQGQCGSCWAFAASESLSDRLCIASQGAINVTLSPQALVSC-DIEFNQGCNGGIP 153
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV--KKNQLWRNSKHYS 124
AW Y HG+ T+ C PY G + P C ++C K QL++ K ++
Sbjct: 154 QMAWEYLELHGIPTDSCFPYTSGNGTA----------PDCQKECSDGSKYQLYKG-KTFT 202
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGW 183
+ + S I A ++ GP+E + VY+DF Y SGVY G ++GGHA+K++GW
Sbjct: 203 L---KTCSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVYVMTPGSKLLGGHAIKIVGW 259
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GT S G DYWI+ N W WG +G+F I+RG+N CGI+ D AG
Sbjct: 260 GTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTNMCGIDRDASAG 304
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 123/231 (53%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR CI ++ N LS +L CC LCG C GGYPI AW Y
Sbjct: 112 QGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELTFCC-HLCGFACHGGYPIKAWSY 170
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT E C PY + G + +P +C R C ++ +
Sbjct: 171 FRRHGIVTGGDYQSGEGCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYD 230
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAV 178
H Y + I ++ GP+E S VY+DF YKSGVY K +GGHAV
Sbjct: 231 DDHRFTRDYYYLT-YASIQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAV 289
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
KLIGWG +DG YW++ N W+ WG G FKI+RG+NEC ++ + AG+P
Sbjct: 290 KLIGWG-EEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/212 (42%), Positives = 119/212 (56%), Gaps = 18/212 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAF E + DR I +S DL++C GC+GGY AW +
Sbjct: 83 QASCGSCWAFSVAETMGDRLSIKGCDFGDMSPQDLVSC--DTTDMGCNGGYMDHAWAWTK 140
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
HG+ TE+C PY +G P C KCV + + RN S+S ++N+
Sbjct: 141 SHGITTEKCMPYQSGSG----------RVPACPAKCVNGSAIVRNK---SVSYKKLNA-- 185
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
+ +M E+Y+NGP+ V+FTVY DF +YKSGVY H TG + GGHAV +GWG +D YW+
Sbjct: 186 QQMMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGV-EDNTPYWL 244
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
N W +WG G+FKI RGSN CGIE A
Sbjct: 245 CQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 123/231 (53%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR CI ++ N LS +L CC LCG C GGYPI AW Y
Sbjct: 112 QGNCGSCWALATSSAFADRLCIATNYEFNELLSAEELTFCC-HLCGFACHGGYPIKAWSY 170
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT E C PY + G + +P +C R C ++ +
Sbjct: 171 FRRHGIVTGGGYQSGEGCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYD 230
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAV 178
H Y + I ++ GP+E S VY+DF YKSGVY K +GGHAV
Sbjct: 231 DDHRFTRDYYYLTYAS-IQKDVMTYGPIEASMEVYDDFPSYKSGVYEKSENATYLGGHAV 289
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
KLIGWG +DG YW++ N W+ WG G FKI+RG+NEC ++ + AG+P
Sbjct: 290 KLIGWG-EEDGVPYWLMVNSWSEMWGDKGLFKIRRGTNECSVDNSMTAGVP 339
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 134/241 (55%), Gaps = 31/241 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD-GCDGGYPISAWR 71
Q +CGSCWAFG+ EA++DR CI ++ LS D+ +C GD GC+GG P S +
Sbjct: 10 QANCGSCWAFGSTEAMTDRMCIASNGTVTTHLSAQDVTSCDKL--GDMGCNGGIPSSVYS 67
Query: 72 YFVHHGVVTEECDPYFDSTGC---------------SHPGCEPAYPTPKCVRKCVKKNQL 116
Y+ G+V + Y D +GC +P C PKC RKC +++
Sbjct: 68 YWALSGIV--DGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCARKCESEDKD 125
Query: 117 WRNSKHYSISAYRINSDPE-------DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HI 168
W +K Y + E + A+IY+NGP+ F V +DF YKSGVY+ +
Sbjct: 126 WTKAKVKGEKGYSVCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAYKSGVYEPKL 185
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+GGHA+K++G+GT +DG+DYW++AN WN WG DGYFKI RG N C IE+ V+ G
Sbjct: 186 LSPPLGGHAIKIMGFGT-EDGKDYWLVANSWNEDWGDDGYFKIIRGKNACQIEDPVINGG 244
Query: 229 P 229
P
Sbjct: 245 P 245
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/212 (42%), Positives = 119/212 (56%), Gaps = 18/212 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAF E + DR I ++ DL++C GC+GGY AW +
Sbjct: 83 QASCGSCWAFSVAETMGDRLSIKGCDYGDMAPQDLVSC--DTTDMGCNGGYMDHAWAWTK 140
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
HGV TE+C PY +G P C KCV + + RN S+S ++N+
Sbjct: 141 SHGVTTEKCMPYQSGSG----------RVPACPAKCVNGSAIVRNK---SVSYKKLNA-- 185
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
+ +M E+Y+NGP+ V+FTVY DF +YKSGVY H TG + GGHAV +GWG +D YW+
Sbjct: 186 QQMMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVLCVGWGV-EDNTPYWL 244
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
N W +WG G+FKI RGSN CGIE A
Sbjct: 245 CQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 121/229 (52%), Gaps = 12/229 (5%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q C + WA A+SDR+C + G L +S DLL+CC CGDGC GG+P AW Y+
Sbjct: 112 QSACRASWAVSTASAISDRYCTVGGGKQLRISAADLLSCCK-QCGDGCKGGFPGFAWLYY 170
Query: 74 VHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
V +G+ + C PY + G P + + TPKC C K+ K+ +
Sbjct: 171 VEYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNA 228
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
Y + ED E+Y NGP F VY D YKSGVY+++ GD +GG AV+++GWG
Sbjct: 229 TYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL 288
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
+G YW +AN W+ WG +GY I RG+NEC IE G P L
Sbjct: 289 -NGTPYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFPDPSQLT 336
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 87/213 (40%), Positives = 116/213 (54%), Gaps = 18/213 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAF E +R I +S DL++C GC+GG P+ +W +
Sbjct: 83 QEQCGSCWAFAVAETTGNRLNILGCGRGDMSPQDLVSC--DKVDHGCNGGSPLFSWEWVK 140
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
H G+ TEEC PY G P C +KC + + R +K S+ +
Sbjct: 141 HSGITTEECIPYVSGGG----------RVPSCPKKCTNGSAIVR-TKAKSVGLVK----G 185
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
+ + E+Y GP E +F+VYEDF YKSGVY HITG ++GGHAV ++GWG +DG YW+
Sbjct: 186 DKMQNELYSRGPFEAAFSVYEDFKSYKSGVYHHITGKMLGGHAVMVVGWGV-EDGTPYWL 244
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ N W +WG G+FKI RG NECGIE G
Sbjct: 245 IQNSWGTTWGEQGFFKILRGKNECGIETTCFQG 277
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 128/231 (55%), Gaps = 20/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA + +SDR CI + LS +LL+CC CG GC+GGYP ++Y
Sbjct: 312 QANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAELLSCCT-SCGYGCNGGYPQRTFKY 370
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCE--PAYPTPKCVRKCVKKNQLWRNS-KH 122
+V+ G+ T + C PY P C TPKC + C+ L N +H
Sbjct: 371 WVYSGMPTGGPYGSNDTCKPY------PIPPCSNCSETRTPKCSKSCISTYPLSLNEDRH 424
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y + Y+ + +M +I GP+ +VYEDF HYK GVY +G +GGHAV++IG
Sbjct: 425 YGSTYYQFWLGEKSMMKDISLYGPIVAGMSVYEDFLHYKEGVYTQESGIFLGGHAVRIIG 484
Query: 183 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
WG D+ YW++AN WN ++G DG FKI+RG +ECGIE V AG K
Sbjct: 485 WGEQDN-IPYWLVANSWNTTFGEDGLFKIRRGFDECGIESYVSAGRAKCKQ 534
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/234 (40%), Positives = 125/234 (53%), Gaps = 21/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGS WA A SDR C+ + N LS ++ CC CGDGC GGYPI AW+
Sbjct: 47 QGHCGSDWAMSTSSAFSDRLCVATNGDFNQLLSAEEITFCC-HTCGDGCSGGYPIRAWKR 105
Query: 73 FVHHGVVT-------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
+ HG+VT E C+PY D G + +P +C R C L +
Sbjct: 106 YKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNNTCSGQPMEKNHRCTRMCYGDQDLDFD 165
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H Y+ Y + I ++ GP+E SF VY+DF YKSG+Y K +GGH+
Sbjct: 166 EDHRYTRDHYYLTY--RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHS 223
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
VKLIGWG + G YW++ N WN WG G FKI+RG+NECG++ G+P++
Sbjct: 224 VKLIGWG-EEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVPAT 276
>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
Length = 166
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 73/86 (84%), Positives = 80/86 (93%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
++ QGHCGSCWAFGAVE+LSDRFCIHFG+++ LSVNDLLACCGFLCG GCDGGYPISAW+
Sbjct: 81 ILDQGHCGSCWAFGAVESLSDRFCIHFGVDVPLSVNDLLACCGFLCGSGCDGGYPISAWK 140
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGC 97
YF HHGVVTEECDPYFD GCSHPGC
Sbjct: 141 YFAHHGVVTEECDPYFDQIGCSHPGC 166
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 99/243 (40%), Positives = 135/243 (55%), Gaps = 22/243 (9%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N ++ + Q C S WA +V A+SDR CI + + LS +L++CC C G
Sbjct: 94 WKNCPSIKRIYDQSQCYSSWAMASVAAISDRICIQTNGTVKVELSAIELVSCCS-KCAVG 152
Query: 61 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAYPT--------PK 105
C+ GY SAW Y+V +G+VT E C PY C H G +YP P
Sbjct: 153 CNFGYSESAWYYWVENGLVTGESNGNNSGCLPY-PFPKCDH-GSSDSYPMCGYVVYTPPV 210
Query: 106 CVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C C + + + KH+ SAY++ + DI EI GPVE S +Y+DF YKSGV
Sbjct: 211 CNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGV 270
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
YKH+TG ++ +V++IGWG ++G YW+ AN WN WG +G+FKI RGSNEC IE V
Sbjct: 271 YKHLTGRLITIQSVRIIGWGI-ENGIPYWLCANSWNEEWGLNGFFKILRGSNECEIEAFV 329
Query: 225 VAG 227
AG
Sbjct: 330 NAG 332
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 164 bits (414), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/234 (39%), Positives = 127/234 (54%), Gaps = 21/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGS WA A +DR C+ N LS ++ CC CG+GC+GGYPI AW+
Sbjct: 108 QGNCGSDWALSTSSAFADRLCVATNGDFNQLLSAEEITFCC-HKCGNGCNGGYPIRAWKR 166
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F +HG+VT E C+PY +D G + +P KC +KC + N
Sbjct: 167 FKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQPMESNHKCSKKCYGDEDIDFN 226
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H Y+ Y + I ++ GP+E SF VY+DF +YKSG+Y K +GGH+
Sbjct: 227 KDHRYTRDDYYLTY--RGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENASYLGGHS 284
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
VKLIGWG + G YW++ N WN WG G FKI+RG+NEC ++ G+P +
Sbjct: 285 VKLIGWG-EEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECRVDNSTTGGVPDT 337
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 163 bits (413), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 88/222 (39%), Positives = 125/222 (56%), Gaps = 23/222 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA + + DR CI + +S D+L+C GC+GGYP A+ +
Sbjct: 131 QSNCGSCWAVSSASVIQDRICIASNGEQKVHISAQDILSCATDR-SQGCNGGYPDEAFEH 189
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEP---------AYPTPKCVRKC--VKKNQLWRNSK 121
+ GVVT S ++ GC+P Y TP+C +KC + + ++ K
Sbjct: 190 YAQSGVVT-------GSGNSANQGCKPYPFLPHTTVEYSTPECSKKCENYQYKKAYKQDK 242
Query: 122 HYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
H+ +S Y + SDP DI EI NGPVE + VY DF YKSGVY+ + +GGHAV++
Sbjct: 243 HFGMSVYNVQFSDPVDIQYEIMNNGPVEANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRI 302
Query: 181 IGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIE 221
+GWG + YW++AN WN WG DGYF+I+RG++E IE
Sbjct: 303 VGWGVDGPTKVPYWLVANSWNTDWGEDGYFRIRRGTDESYIE 344
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 163 bits (413), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 125/232 (53%), Gaps = 21/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A +DR C+ + + LS +L CC CG GC+GGYPI AW
Sbjct: 110 QGNCGSCWALATSSAFADRLCVATDADFNEFLSPEELTFCC-HTCGYGCNGGYPIKAWER 168
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F HG+VT E C+PY + G + +P +C R C L +
Sbjct: 169 FKSHGLVTGGDYKSGEGCEPYRVPPCRHHAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFD 228
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H Y+ +Y + I ++ GP+E SF VY+DF YKSGVY + +GGHA
Sbjct: 229 DDHRYTRDSYYLTYG--SIQKDVMNYGPIEASFDVYDDFPSYKSGVYIRSDNASYLGGHA 286
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VKLIGWG + G YW++ N WN WG G FKI+RG+NECG++ AG+P
Sbjct: 287 VKLIGWG-EESGVPYWLMVNSWNTDWGDKGLFKIQRGTNECGVDNSTTAGVP 337
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 163 bits (413), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 102/253 (40%), Positives = 137/253 (54%), Gaps = 49/253 (19%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI------HFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
QG CGSCWA AV A++DR CI HF S+ D+L+CCG+ CG+GC+GG
Sbjct: 106 QGGCGSCWAVAAVSAMTDRMCILSKGKEHF----YFSIKDVLSCCGY-CGNGCEGGVLTR 160
Query: 69 AWRYFVHHGVVT-------EECDPYFDSTGCSH---------------PGCE--PAYP-- 102
AW Y+ G+V+ + C PY C+H P C+ P P
Sbjct: 161 AWIYYKKIGIVSGGGYKSKQGCQPY-TIPPCNHLVWGEIEQCKNIPMTPKCKNIPVIPEQ 219
Query: 103 ------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 155
TP+C +KC K ++ + KH S YR+ +I EIY+ GPV FTVYE
Sbjct: 220 CKYIPITPECEKKCNKNYKVCYSKDKHRGKSVYRVKKS--EIFKEIYEYGPVTSYFTVYE 277
Query: 156 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR-G 214
DF +YK G+Y + +G +G H+VK+IGWG + G YW+ AN +N WG G+FKI R G
Sbjct: 278 DFLNYKEGIYNYTSGQKLGLHSVKIIGWG-EERGIKYWLAANSFNTDWGDKGFFKIIREG 336
Query: 215 SNECGIEEDVVAG 227
CGI ++VVAG
Sbjct: 337 VGSCGISDNVVAG 349
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 163 bits (413), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 93/217 (42%), Positives = 123/217 (56%), Gaps = 19/217 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A E+LSDRFCI +NL LS D+++C GC GGY AW+Y
Sbjct: 98 QAQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMVSC--DTSNFGCFGGYLDQAWQY 155
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
GV ++ C+PY S G +P+ PT + +KK + S + A
Sbjct: 156 LEQQGVSSDSCEPYK-----SGNGDQPSCPTKCSNGQAIKKYKCKAGSTKQAKGA----- 205
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
E + I ++GPVE FTVY+DF +Y SGVY H+TGD GGHAVK++GWG E+Y
Sbjct: 206 --EATKSLIQESGPVETGFTVYQDFYNYNSGVYHHVTGDAEGGHAVKILGWG-KQGLENY 262
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
WI+AN W WG GYF I++G + GI+E +P
Sbjct: 263 WIVANSWGEDWGEKGYFNIRQG--DSGIDEATFGCIP 297
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 120/220 (54%), Gaps = 25/220 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLAC-CGFLCGDGCDGGYPISAWR 71
Q CGSCWAF AVE+LSDRFCI +NL LS D+L+C C C GGY +AW+
Sbjct: 98 QAKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDMLSCDASNFC---CFGGYLDTAWQ 154
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA--YR 129
Y GV ++ C+PY G P C KC + K Y A +
Sbjct: 155 YLEQQGVGSDSCEPYKSGNG----------DQPSCPSKCSNGQAI----KKYKCKAGSTK 200
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
E + I ++GPVE FT+YEDF +Y SG+Y H+TG MGGHAVK++GWG
Sbjct: 201 QAKGAEATKSLIQQSGPVETGFTIYEDFLNYNSGIYHHVTGGNMGGHAVKILGWGKQGL- 259
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
E+YWI+AN W WG GYF I++G + GI+E +P
Sbjct: 260 ENYWIVANSWGEDWGEKGYFNIRQG--DSGIDEATFGCIP 297
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 162 bits (411), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 93/216 (43%), Positives = 125/216 (57%), Gaps = 18/216 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAFGA EA SDRF I+ G ++ LS DL++C GC+GGY AW Y
Sbjct: 96 QQQCGSCWAFGATEAFSDRFAIN-GKDVILSPEDLVSC--DTNDYGCNGGYMDVAWEYLA 152
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
HG T+ C PY +G + P C KC + + R + ++ R +
Sbjct: 153 DHGAATDSCFPYSAGSGFA----------PACSDKCADGSAMQRFK--CAPNSVRQSKGV 200
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
I +EI +GPVE +FTVY DF +Y+SGVY T DV GGHA+K++G+G ++G YW+
Sbjct: 201 AQIQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGV-ENGTPYWL 259
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
AN W +WG G+FKIK+G ECGIE+ V + P
Sbjct: 260 CANSWGPAWGMSGFFKIKQG--ECGIEDQVFSCDPQ 293
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 96/214 (44%), Positives = 120/214 (56%), Gaps = 20/214 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYF 73
Q CGSCWA A EA+ +RF I LSV DL++C GD GC+GG + ++
Sbjct: 83 QASCGSCWAHAASEAIGNRFSIKGCGKGMLSVQDLVSCDK---GDSGCNGGSGPLSSKWL 139
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
V +GV TEEC PY G P C KC +Q+ R K+ Y +
Sbjct: 140 VSNGVTTEECLPYVSGNG----------RVPACAAKCSNGSQIIR-YKYEKAETYTV--- 185
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
++I E+ KNGPV FTVY DF +YKSGVY+H +G GGHAV LIGWG +DG YW
Sbjct: 186 -QNIQEELMKNGPVYFRFTVYSDFMNYKSGVYQHKSGYQEGGHAVLLIGWGV-EDGVPYW 243
Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+L N W +WG G+FKI RG NECG E+ AG
Sbjct: 244 LLQNSWGPAWGEKGHFKIIRGKNECGCEQGFYAG 277
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/238 (42%), Positives = 135/238 (56%), Gaps = 26/238 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGA EA+SDR CIH +S+ ++ DLL CC CG GC+GGYP +AW +
Sbjct: 101 QGSCGSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLTCCDS-CGMGCNGGYPSAAWDF 159
Query: 73 FVHHGVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY G P TP+C+ +C ++
Sbjct: 160 WTDVGLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYK 219
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY S+Y + SD E I +EIYKNGPVE +FTVYEDF YK+GVY+H+TG +GGHA+
Sbjct: 220 ADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGHAI 279
Query: 179 KLIGWGTSDDGEDYWILAN--QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
K S GE+ L + WG D GS+ CGIE ++VAG+P +++
Sbjct: 280 K------SWLGEEVCSLLALCHSDTDWG-DMVSLSSAGSDHCGIESEIVAGIPITQSF 330
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/241 (41%), Positives = 133/241 (55%), Gaps = 33/241 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
QG CGSCWAF + EA +DR CI + LS +CC + C GC+GG P AW
Sbjct: 297 QGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAW 356
Query: 71 RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAY---PTPKCVRKCV 111
R+F GVVT C PY + C+H P C+ TPKC + C
Sbjct: 357 RWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCE 415
Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
++ + H + SAY + S +D+ ++ +GPV +F VYEDF YKSGVYK
Sbjct: 416 EQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYEDFLSYKSGVYK 474
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G +GGHA+K+IGWGT ++GE+YW N WN WG G FKI G +CGI+ ++VA
Sbjct: 475 HVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQFKIAMG--QCGIDGEMVA 531
Query: 227 G 227
G
Sbjct: 532 G 532
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/241 (41%), Positives = 133/241 (55%), Gaps = 33/241 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
QG CGSCWAF + EA +DR CI + LS +CC + C GC+GG P AW
Sbjct: 297 QGDCGSCWAFASTEAFNDRLCIRSQGKRLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAW 356
Query: 71 RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAY---PTPKCVRKCV 111
R+F GVVT C PY + C+H P C+ TPKC + C
Sbjct: 357 RWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCE 415
Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
++ + H + SAY + S +D+ ++ +GPV +F VYEDF YKSGVYK
Sbjct: 416 EQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYEDFLSYKSGVYK 474
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G +GGHA+K+IGWGT ++GE+YW N WN WG G FKI G +CGI+ ++VA
Sbjct: 475 HVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQFKIAMG--QCGIDGEMVA 531
Query: 227 G 227
G
Sbjct: 532 G 532
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 123/221 (55%), Gaps = 19/221 (8%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V ++ DR C G++ + S +++C GD CDGG+
Sbjct: 91 VVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDMACDGGWLP 146
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
S WR+ G T+EC PY G A T C KC + L K
Sbjct: 147 SVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHLYKATKAVD 197
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV ++G+GT D
Sbjct: 198 YGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDMVGYGTDD 255
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DG DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 256 DGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/241 (41%), Positives = 133/241 (55%), Gaps = 33/241 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
QG CGSCWAF + EA +DR CI + LS +CC + C GC+GG P AW
Sbjct: 300 QGDCGSCWAFASTEAFNDRLCIRSQGKGLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAW 359
Query: 71 RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEPAY---PTPKCVRKCV 111
R+F GVVT C PY + C+H P C+ TPKC + C
Sbjct: 360 RWFERKGVVTGGDFDALGKGTTCWPY-EVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCE 418
Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
++ + H + SAY + S +D+ ++ +GPV +F VYEDF YKSGVYK
Sbjct: 419 EQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYEDFLSYKSGVYK 477
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H++G +GGHA+K+IGWGT ++GE+YW N WN WG G FKI G +CGI+ ++VA
Sbjct: 478 HVSGLPVGGHAIKIIGWGT-ENGEEYWHAVNSWNTYWGDGGQFKIAMG--QCGIDGEMVA 534
Query: 227 G 227
G
Sbjct: 535 G 535
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/198 (45%), Positives = 120/198 (60%), Gaps = 19/198 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CI LS +L++CC CG GC+GG+P SAW Y
Sbjct: 117 QSSCGSCWAFGAVEAMSDRICIESKGKYKPFLSAENLVSCCSS-CGMGCNGGFPHSAWLY 175
Query: 73 FVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWR 118
+ + G+VT + C PY + C H P C+ TP C R C N +
Sbjct: 176 WKNQGIVTGDLYNTTNGCQPY-EFPPCEHHTLGPLPVCDGDVETPPCKRTCQAGYNVSYE 234
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
N K Y YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++GGHAV
Sbjct: 235 NDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHAV 294
Query: 179 KLIGWGTSDDGEDYWILA 196
+L+GWG ++ YW++A
Sbjct: 295 RLLGWG-EENNVPYWLIA 311
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/240 (40%), Positives = 128/240 (53%), Gaps = 32/240 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFG EA +DR C+ + LS ++ AC GCDGGYP SAW +
Sbjct: 83 QSACGSCWAFGVTEAFNDRLCVKSNGTFTELLSAGEMNACAPSY---GCDGGYPDSAWSW 139
Query: 73 FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC- 110
G+ T + C PY D C+H P C + +Y TP CV +C
Sbjct: 140 VHDEGIATGGDYVARGNLTKGDGCWPY-DFPPCAHHINDTKYPKCPKGSYETPNCVEQCH 198
Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
K + +N +HY + + + I +GPV S+ VYEDF YKSGVYKH +
Sbjct: 199 NPKYSTSLKNDRHYMLESSPYQYSVNNAKNAIRTDGPVSASYLVYEDFLAYKSGVYKHTS 258
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G +GGHAVK+IGWG ++GE YW++ N WN WG G FKI G+ C I++D++ G P
Sbjct: 259 GSYLGGHAVKIIGWG-EENGEAYWLVVNSWNEDWGDHGLFKIALGN--CQIDDDLLGGTP 315
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 134/237 (56%), Gaps = 15/237 (6%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
N + + + QG CGSCWA A +SDR CIH N++++ DL+ CC CG+GC+
Sbjct: 104 NCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAEDLMGCCA-DCGNGCE 162
Query: 63 GGY-PISAWRYFVHHGVV-------TEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKK 113
GG+ ++++Y+V G+V TE C PY C +P + +PKC C
Sbjct: 163 GGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPY-PFKPCLYPFTDCHREESPKCKHHCQHG 221
Query: 114 -NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
++ + K + AY + D I EI NGPVE F VYED YKSGVY+H+ G+
Sbjct: 222 VDKRYARDKVFGSVAYSVPRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYRHVYGEH 281
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+G HAV++IGWG + G YW+++N + WG GYFKI RG N GIE V+ GLP
Sbjct: 282 VGKHAVRIIGWG-REGGIPYWLISNSYGEDWGDHGYFKIVRGINHLGIESKVITGLP 337
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 125/232 (53%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFC--IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A A+SDR C + +N LS ++L+CC CG GC GGYP A+ Y
Sbjct: 116 QSSCGSCWAVAAASAMSDRVCALTNGRINRILSDTEVLSCCFGSCGFGCKGGYPARAFGY 175
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKKNQL- 116
+G+ T + C PY C + EP Y PTP C R C +
Sbjct: 176 AWRYGLSTGGPYGEKDACQPY-AFYPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIP 234
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ K ++ Y I + +I EI GPV ++ VY DF +YK GVY H G+V G H
Sbjct: 235 FEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRDFDYYKKGVYIHREGEVTGLH 294
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
AVK+IGWG +D YW++AN WN WG +GYF+I RG++ C IE +V G+
Sbjct: 295 AVKIIGWGKGND-VPYWLVANSWNTDWGDNGYFRIVRGTDNCEIERQMVGGI 345
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 123/217 (56%), Gaps = 19/217 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A E+LSDRFCI +N+ LS D+++C GCDGGY AW+Y
Sbjct: 98 QAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQY 155
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
GV ++ C+PY ++G + P C KC Q + K + S + N
Sbjct: 156 LEKKGVASDSCEPYKSASGTA----------PSCPSKCAN-GQAIKKYKCQAGSTKQANG 204
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+ I ++GPVE FTVY DF +YKSG+Y H++G GGHAVK++GWG E+Y
Sbjct: 205 AAA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWG-KQGSENY 262
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
WI+AN W SWG G+F I++G + GI++ +P
Sbjct: 263 WIVANSWGESWGEKGFFNIRQG--DSGIDQATFGCIP 297
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 161 bits (407), Expect = 3e-37, Method: Composition-based stats.
Identities = 94/214 (43%), Positives = 122/214 (57%), Gaps = 22/214 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI-HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWAF A E LSDR I H LS DL++C GC+GG +AW Y
Sbjct: 360 QQQCGSCWAFSAAEVLSDRNAIQHNKAEPVLSPEDLVSCD--RVDQGCNGGNLGTAWTYL 417
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
+ G+VT+ C PY G PKC C K W +K+ + SAY +N
Sbjct: 418 KNTGIVTDACFPYTAGGG----------DAPKCETSC-KDGSSW--TKYKAASAYAVNG- 463
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--GGHAVKLIGWGTSDDGED 191
E++ EI +GP++V+F VY+ F YKSGVY ++M GGHAVK++GWGT + G+D
Sbjct: 464 VENMQKEIMTHGPIQVAFNVYKSFMSYKSGVYAKKWYELMPEGGHAVKIVGWGT-EGGKD 522
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
YW++AN WN SWG +GYFKI G+ I DVV
Sbjct: 523 YWLVANSWNTSWGDEGYFKIAVGAES--ISLDVV 554
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/238 (39%), Positives = 128/238 (53%), Gaps = 27/238 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG CWAFG EA +DR CI + LS ++ AC L GC GG+P SAW +
Sbjct: 163 QSACGDCWAFGVTEAFNDRLCIKSNGTFTKLLSAGEMNACAPSLKDPGCRGGFPYSAWSW 222
Query: 73 FVHHGVVT-------------EECDPYFDSTGCSHPGCEPAYPT-PKCVR---KCVKKNQ 115
G+ T + C PY D C+H +P YP PK R +CV K +
Sbjct: 223 VHDEGIATGGDYVPRDNMTEDDGCWPY-DFPPCAHFFKDPKYPACPKFARVNLRCVSKLR 281
Query: 116 ----LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
++ + +++ + + + +D I +GPV +F VYEDF YKSGVYKH +G
Sbjct: 282 HMMVVYFSDRYFMVESVPYHFSADDAKNAIRTDGPVSATFYVYEDFLAYKSGVYKHTSGS 341
Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++G HAVK+IGWG D GE YW++ N WN WG G FKI G +CGI+ +++ G P
Sbjct: 342 LLGAHAVKIIGWG-EDGGEAYWLVVNSWNEGWGDHGLFKIALG--DCGIDNELLGGTP 396
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/217 (41%), Positives = 123/217 (56%), Gaps = 19/217 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A E+LSDRFCI +N+ LS D+++C GCDGGY AW+Y
Sbjct: 98 QAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSC--DTNNYGCDGGYLNLAWQY 155
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
GV ++ C+PY ++G + P C KC Q + K + S + N
Sbjct: 156 LEKKGVASDSCEPYKSASGTA----------PSCPSKC-SNGQAIKKYKCKAGSTKQANG 204
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+ I ++GPVE FTVY DF +YKSG+Y H++G GGHAVK++GWG E+Y
Sbjct: 205 AAA-TKSLIQQSGPVETGFTVYADFFNYKSGIYHHVSGGAEGGHAVKILGWG-KQGSENY 262
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
WI+AN W SWG G+F I++G + GI++ +P
Sbjct: 263 WIVANSWGESWGEKGFFNIRQG--DSGIDQATFGCIP 297
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 126/219 (57%), Gaps = 23/219 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+CWAFGA EALSDRF I + +++ S DL++C GC+GGY AW +
Sbjct: 96 QQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVSC--DTNDYGCNGGYMDMAWEF 153
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRI 130
HGVV + C PY +G + P C KC + K YS + R
Sbjct: 154 LDQHGVVADSCFPYSAGSGFA----------PACASKCADGSA----EKKYSCVHGSIRQ 199
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
+ E I +EI +GPVE +FTVY DF +Y+SGVY T DV GGHA+K++G+G ++G
Sbjct: 200 SQGVEQIKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHAIKILGFGV-ENGT 258
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
YW+ AN W SWG G+FKIK+G ECGIE+ V + P
Sbjct: 259 PYWLCANSWGPSWGMQGFFKIKQG--ECGIEDQVFSCDP 295
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 127/234 (54%), Gaps = 19/234 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWA A ++DR CI ++ S ++ ACC CG+ C GG +A+ +
Sbjct: 98 QGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCT-ECGNACYGGDEDTAFTH 156
Query: 73 FVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+V G V+ E C PY C H P CE P C C ++ + +
Sbjct: 157 WVTKGFVSGGRHNSNEGCQPY-SVEECEHHIEGPRPPCEGDMPELVCSETCHEEYGKTYE 215
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
Y + AY + D I EI NGPV +F VY+DF YKSGVY+H TG + G HAV
Sbjct: 216 EDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAV 275
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
++IGWG ++G YW++AN WN WG +G FKI RGS+EC E D+ A SSK
Sbjct: 276 RVIGWG-EEEGTPYWLVANSWNTDWGDNGLFKILRGSDECEFEGDMAAATYSSK 328
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 96/232 (41%), Positives = 126/232 (54%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCW+F A +DR C+ G N LS +L CC CG GC GG P+ AW Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEELTFCCK-DCGQGCGGGNPMKAWEY 165
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F GV T E C PY + G + +P +C + C K + +
Sbjct: 166 FRTQGVTTGGDYNTKEGCMPYKVPPCRNKQGENICDEQPMERNHQCPKTCYGKTTV--QN 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVK 179
++ + S Y INS + I +I GPVE SF Y+D + YKSG+Y K GGH++K
Sbjct: 224 RYKTKSEYYINS-IKTIEQDIKTYGPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIK 282
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+IGWG +DG YW+ N W++ WG G FKI +G NECGIE V AG+PSS
Sbjct: 283 IIGWG-QEDGTPYWLAVNSWSKFWGDHGTFKIIKGRNECGIERAVTAGIPSS 333
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/233 (42%), Positives = 134/233 (57%), Gaps = 23/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A SDR CI + G+N LS + +CC CG+GC+GG+P AW+Y
Sbjct: 108 QSNCGSCWALSTASAFSDRLCITSNMGVNKVLSGEYINSCCNGKCGNGCNGGHPEKAWKY 167
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVR-KCVKKN--QL 116
+G+ T E C PY ++ CS + TP+C + +C N
Sbjct: 168 IKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENED----TPQCYKDQCTNNNYETP 223
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ +Y+ Y + PE IM+E++KNGPV + VY+DF YK G+Y++ TG + G H
Sbjct: 224 LVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIYQYTTGGLKGDH 283
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
AVK++GWG DDG DYW+ AN W SWG G FKI+RG NECGIE + GLP
Sbjct: 284 AVKIMGWG-EDDGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRITGGLP 335
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 126/221 (57%), Gaps = 13/221 (5%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C S WAF A E +SDR CI + + LS DL+ CC + CG+ C GGY AW Y
Sbjct: 95 QGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHY-CGNQCKGGYTYYAWNY 153
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAY 128
F+ G+V+ Y STGC P E Y TP C C K + + KH+ S Y
Sbjct: 154 FMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIY 210
Query: 129 RINSDPEDIMAEIYKNG-PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
I + I EI G PV +F VY DF Y+ GVY + +G + G AVK+IGWGT +
Sbjct: 211 YIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKIIGWGT-E 269
Query: 188 DGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
+G YW+ AN W + WGA G+FKI+RG+NECG EE ++AG
Sbjct: 270 NGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIAG 310
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 96/240 (40%), Positives = 127/240 (52%), Gaps = 32/240 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFG EA +DR CI H LS ++ AC GC+GG+P SAW +
Sbjct: 44 QSACGSCWAFGVTEAFNDRLCIKSHGTFTELLSAGEMNACAP---SHGCNGGFPNSAWSW 100
Query: 73 FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC- 110
G+ T + C PY D C+H P C + +Y TP C +C
Sbjct: 101 VHDKGIATGGDYVAEDDMTKDDGCWPY-DFPPCAHHVNDSKYPKCPKDSYETPNCAEQCH 159
Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
K R+ +H+ + + D I +GPV SFTVYEDF YKSGVYKH +
Sbjct: 160 NPKYTTTLRDDRHFMVESSPYQYSVNDAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTS 219
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G+ +GGHAVK+IGWG + G+ YW++ N WN WG G FKI G+ CGI++ ++ G P
Sbjct: 220 GEYLGGHAVKIIGWG-EESGQAYWLVVNSWNEDWGDHGLFKIALGN--CGIDDYLLGGTP 276
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 134/242 (55%), Gaps = 23/242 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF-------GMNLSLSVNDLLACCGFLCGDGCDGGYPI 67
QG G CWA GA+EA+SD CIH G ++ +S D L C LCGDGC+GG P
Sbjct: 114 QGSYGFCWALGALEAISDWICIHPNVGGAQGGNHVEVSAEDKLTC---LCGDGCNGGXPN 170
Query: 68 SAWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY----PTPKCVRKCVKKNQL 116
W ++ G+V+ C + C H Y +PKC C + Q
Sbjct: 171 EGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKHHIHGXPYVXTGDSPKCSMTC-EPGQT 229
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KHY S+Y I+ +DIM IYKN VE +F+VY DF YK Y+ +TG++ GGH
Sbjct: 230 YKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFKEYQGVTGEMXGGH 289
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
A+ ++G ++ YW++AN WNR WG +G+FKI RG + GIE +VVA +P ++ +
Sbjct: 290 AICILGCKV-ENSTSYWLVANXWNRDWGDNGFFKILRGQDHYGIESEVVAEIPHTEQYWE 348
Query: 237 EI 238
+I
Sbjct: 349 KI 350
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 160 bits (404), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 85/202 (42%), Positives = 121/202 (59%), Gaps = 20/202 (9%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + QG CGSCWAFGAVEA+SDR CIH N S +L++CC + CG G
Sbjct: 38 WPNCPTIREVRDQGSCGSCWAFGAVEAMSDRVCIHSKGTKNFHFSAENLVSCC-WTCGFG 96
Query: 61 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGC--------------SHPGCEPAYPTPKC 106
C+GG+P +AW Y+ G+V+ PY + GC + C+ TPKC
Sbjct: 97 CNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEVAPCEHHVNGTRGPCKEGGKTPKC 154
Query: 107 VRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
V+KC ++ + H+ SAY +++D + I EIY NGPVE +FTVYEDF Y++GVY
Sbjct: 155 VKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY 214
Query: 166 KHITGDVMGGHAVKLIGWGTSD 187
KH+ G +GGHA++++GWG +
Sbjct: 215 KHVAGKALGGHAIRILGWGVQN 236
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 159 bits (402), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 90/197 (45%), Positives = 117/197 (59%), Gaps = 20/197 (10%)
Query: 20 SCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA + A+SDR CI + LS D+LACC + CG GC+GG+P+ AW+YF G
Sbjct: 1 SCWAVSSAAAMSDRVCIASXGAKQVLLSDQDMLACCSW-CGYGCEGGWPMKAWQYFXLEG 59
Query: 78 VVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN-QLWRNSKH 122
VVT C PY + C G EP Y TPKC + C + + ++ KH
Sbjct: 60 VVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDSAKTPKCQKTCQRGYLKPYKEDKH 118
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G + GGHAVK+IG
Sbjct: 119 FGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIG 178
Query: 183 WGTSDDGEDYWILANQW 199
WG + G YW++AN W
Sbjct: 179 WG-KEXGTPYWLIANSW 194
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 96/222 (43%), Positives = 121/222 (54%), Gaps = 41/222 (18%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGD------GCDGGYP 66
Q CGSC AFGAVEA+S+R CI G N+ LS DL G + G GC+ YP
Sbjct: 111 QSRCGSCCAFGAVEAMSERSCIQSGGKQNVELSAVDLE---GIVTGSSKENNTGCEP-YP 166
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSI 125
+F T +P C Y TP+C C K R Y+
Sbjct: 167 FPKCEHF----------------TKGQYPPCGSKIYKTPRCKTTCQK-----RYKTSYAQ 205
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
+R I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG
Sbjct: 206 DKHRA------IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV 259
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++ YW++AN WN WG +GYF+I RG +EC IE +V AG
Sbjct: 260 -ENKTPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/228 (38%), Positives = 122/228 (53%), Gaps = 17/228 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
+ N + + Q +CGSCWA A E +SDR C+ + ++D +LACCG CG G
Sbjct: 105 WKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164
Query: 61 CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCV 107
C+GG AW Y GVVT +E C PY +H G + ++ TP C
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACK 224
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + + K Y S Y ++ D + I E+ KNGPV+ +F YEDF+ Y G+Y
Sbjct: 225 KYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYV 284
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
H G G HAVK++GWG ++G YW +AN W+ WG DGYF+I RG
Sbjct: 285 HTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGEDGYFRILRG 331
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 116/218 (53%), Gaps = 18/218 (8%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
E + + Q CGSCWAF E + DR I +S DL++C GC+GGY
Sbjct: 75 EQILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGHMSPQDLVSC--DTTDMGCNGGYM 132
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
AW + HGV EEC PY G P C KCV + + R +K S +
Sbjct: 133 DKAWAWTKSHGVTNEECMPYQSGGG----------RVPACPAKCVNGSTIVR-TKSQSFT 181
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
+ + + E+Y+NGP+ V+FTVY DF +YKSGVY H TG V GGHAV IGWG
Sbjct: 182 HFTAS----QMQQELYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVE 237
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
D+ YW+ N W +WG G+FKI RGSN CGIE V
Sbjct: 238 DN-TPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQV 274
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/215 (41%), Positives = 116/215 (53%), Gaps = 18/215 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCWAF E + DR + ++ DL++C F DGCDGG+ AW +
Sbjct: 83 QGECGSCWAFSIAETIGDRLGVLGCSRGDIAPEDLVSCDIF--DDGCDGGFIDMAWDWCQ 140
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
+G+ TEEC PY G P C C + ++R I +YR D
Sbjct: 141 ENGLTTEECIPYKAGEGVPSP----------CPETCEDGSAIYRTP----IESYRY-IDA 185
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
+DI EIY+ GPV + F VY DF YKSGVY H G + GGHAV ++GWG D+ YW+
Sbjct: 186 DDIQGEIYEYGPVSMGFIVYSDFMSYKSGVYVHQAGYIEGGHAVLIVGWGVEDE-VPYWL 244
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+ N W WG +G+FKI RGS+ C E +V AG P
Sbjct: 245 VQNSWGTDWGENGFFKILRGSDHCECESNVTAGYP 279
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 91/242 (37%), Positives = 130/242 (53%), Gaps = 30/242 (12%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP 66
+ ++ QG+C S +A A+SDR CIH + LS +L+CC +LCGDGC GG
Sbjct: 69 IGMVYDQGNCKSSYAISVASAVSDRICIHSNGTVKPKLSAQQILSCC-YLCGDGCSGGQH 127
Query: 67 ISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
+W ++ HG+V+ E C PY T + TP+C +C
Sbjct: 128 FESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTETAVENACSNKTLFTPECKVQCYNP 187
Query: 114 NQLWRNSK------HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
+ R K HY + AY M EIY+NGP+ SF +Y+DF +Y+SGVY +
Sbjct: 188 DYGTRYVKDNHQGTHYRVPAYTA-------MKEIYENGPITASFYMYQDFVNYQSGVYAY 240
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+G + AVK++GWG ++G YW+ AN +N WG +G+ KI RG+NEC IEE + AG
Sbjct: 241 NSGKYVTTQAVKILGWG-EENGTPYWLAANSFNTYWGDNGFVKILRGANECYIEEFMYAG 299
Query: 228 LP 229
LP
Sbjct: 300 LP 301
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/221 (41%), Positives = 126/221 (57%), Gaps = 20/221 (9%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V +L DR C G++ ++ S +++C GD CDGG+
Sbjct: 91 VVDQGSCGSCWAFSSVASLGDRRCFA-GLDKKAVTYSPQYVVSCDH---GDMACDGGWLQ 146
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
S WR+ G T EC PY T + C PT KC +L S + A
Sbjct: 147 SVWRFLTKTGTTTNECVPYQSGTTGARGTC----PT-----KCADGGEL---STVKAKKA 194
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
D + IM + GP++ +FTVY DF +Y+ GVY+H++G V GGHAV+++G+GT +
Sbjct: 195 VDYGLDCDLIMKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVEMVGYGTDE 254
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DYWI+ N W WG DGYF+I R +NECGIEE V+ G+
Sbjct: 255 YDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVMGGI 295
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 122/228 (53%), Gaps = 17/228 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
+ N + + Q +CGSCWA A E +SDR C+ + ++D +LACCG CG G
Sbjct: 105 WKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164
Query: 61 CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCV 107
C+GG AW Y GVVT +E C PY +H G + ++ TP C
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACK 224
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + + K Y S Y ++ D + I E+ KNGPV+ +F YEDF+ Y G+Y
Sbjct: 225 KYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYV 284
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
H G G HAVK++GWG ++G YW +AN W+ WG +GYF+I RG
Sbjct: 285 HTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGENGYFRILRG 331
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 122/228 (53%), Gaps = 17/228 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
+ N + + Q +CGSCWA A E +SDR C+ + ++D +LACCG CG G
Sbjct: 105 WKNCSSITYIRDQSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164
Query: 61 CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCV 107
C+GG AW Y GVVT +E C PY +H G + ++ TP C
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACK 224
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ C + + K Y S Y ++ D + I E+ KNGPV+ + YEDF+ Y+ G+Y
Sbjct: 225 KYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAASITYEDFSFYRRGIYV 284
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
H G G HAVK++GWG ++G YW +AN W+ WG DGYF+I RG
Sbjct: 285 HTRGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDWGEDGYFRILRG 331
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 132/204 (64%), Gaps = 16/204 (7%)
Query: 40 MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 92
+N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 2 VNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPC 60
Query: 93 SH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 145
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNG
Sbjct: 61 EHHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNG 120
Query: 146 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 205
PVE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++ N WN WG
Sbjct: 121 PVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGD 179
Query: 206 DGYFKIKRGSNECGIEEDVVAGLP 229
+G+FKI RG + CGIE ++VAG+P
Sbjct: 180 NGFFKILRGQDHCGIESEIVAGMP 203
>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
Length = 247
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 94/217 (43%), Positives = 117/217 (53%), Gaps = 22/217 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGA E LSDR CI ++ LS DL+AC G+ GC+GG AW Y
Sbjct: 49 QAQCGSCWAFGASETLSDRICIASDKKTDVILSPEDLVACDGW--NMGCNGGILPWAWSY 106
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+ G V + C PY G P C +KC + K S + S
Sbjct: 107 LTNTGAVEDSCFPYSSDKGA----------VPTCAKKCQNDKDSFTKYKCKKNSVVQA-S 155
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+ I AEI KNGP+E FTVYEDF +Y+SGVY H TG+ +GGHAVK++G+ G+ Y
Sbjct: 156 GVDKIKAEISKNGPMETGFTVYEDFMNYESGVYHHTTGNQLGGHAVKIVGY-----GDGY 210
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
WI AN W+ WG G+F I G ECGI+ A P
Sbjct: 211 WICANSWSEKWGEKGFFNI--GFGECGIDSAAYACTP 245
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 156 bits (395), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 86/216 (39%), Positives = 118/216 (54%), Gaps = 17/216 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA A E +SDR C+ + ++D +LACCG CG GC+GG AW Y
Sbjct: 117 QSNCGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGSECGRGCNGGMDHKAWEY 176
Query: 73 FVHHGVVT----EE---CDPYFDSTGCSHPGC------EPAYPTPKCVRKC-VKKNQLWR 118
GVVT +E C PY +H G + ++ TP C + C + +
Sbjct: 177 VKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYE 236
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
K Y S Y ++ D + I E+ KNGPV+ +F YEDF+ Y G+Y H G G HAV
Sbjct: 237 KDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAV 296
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
K++GWG ++G YW +AN W+ WG +GYF+I RG
Sbjct: 297 KVVGWGV-ENGTKYWNVANSWSTDWGENGYFRILRG 331
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 156 bits (395), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 111/183 (60%), Gaps = 17/183 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G S LS DL++CC CGDGC GG+P AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDY 170
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C Y TP+C +KC K + +
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYE 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY +Y + S+ + I EI NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+
Sbjct: 231 QDKHYGEESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAI 290
Query: 179 KLI 181
++I
Sbjct: 291 RII 293
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 156 bits (395), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 86/201 (42%), Positives = 121/201 (60%), Gaps = 18/201 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF A E LSDRFCI + +++ LS +L C GCDGGY +AW +
Sbjct: 103 QQQCGSCWAFSASEVLSDRFCIASNGSVDVVLSPEYMLQCDS--TDYGCDGGYLNNAWAF 160
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+ +++CDPY ++G G P T K K +K S++ S
Sbjct: 161 LAGTGIPSDKCDPY--TSGNGDVGSCPTSCTDGSAIKLYK-------AKSSSVAQL---S 208
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED- 191
+DI +I NGPV+ +F+VY+DF YKSGVY+H++G + GGHA+K++GWG + DG+D
Sbjct: 209 SIDDIQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDT 268
Query: 192 -YWILANQWNRSWGADGYFKI 211
YWI+AN WN +WG +G+F I
Sbjct: 269 PYWIVANSWNTNWGQEGFFWI 289
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 91/196 (46%), Positives = 115/196 (58%), Gaps = 18/196 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GGYP AW Y+
Sbjct: 19 QSSCGSCWAVAAASAMSDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 77
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYS 124
HG+V+E C PY F S C+H C Y TP C C KK L + + S
Sbjct: 78 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTS 135
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
I S E E+ NGP EVSF+VY DF Y GVYKH+TG +GGHAV+++GWG
Sbjct: 136 C----ILSGEESFKRELLLNGPFEVSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG 191
Query: 185 TSDDGEDYWILANQWN 200
+GE YW +AN WN
Sbjct: 192 EL-NGEPYWKIANSWN 206
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 94/241 (39%), Positives = 127/241 (52%), Gaps = 27/241 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCG---DGCDGGYPISA 69
Q C SCWA VEA + R CI G N LS +++ACC GC GG ++A
Sbjct: 82 QSACASCWAIAPVEAFNARLCIKSGGKFNQLLSAGEMIACCNSTHSWQPRGCKGGMILNA 141
Query: 70 WRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCV--K 112
W + HG+ TE C PY + C+H P + Y TP C+ +C K
Sbjct: 142 WSFLKTHGIATEGSMSAADGCWPY-NFPKCAHHQKKSKYEPCSKKLYDTPSCLDRCPNEK 200
Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
+H++ + + ++I EI NGP +F+VYEDF YKSGVYKH G +
Sbjct: 201 YGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTSATFSVYEDFVSYKSGVYKHTNGTL 260
Query: 173 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
MG H+V++IGWGT + G DYW++ N WN WG G FKI +G +CGI +D V G P +
Sbjct: 261 MGIHSVEIIGWGT-EKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGI-DDAVLGSPPAM 316
Query: 233 N 233
N
Sbjct: 317 N 317
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 97/231 (41%), Positives = 125/231 (54%), Gaps = 14/231 (6%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
+ QG CGSC A++DR+CIH + DLL+CC G GG P
Sbjct: 81 IRTQGCCGSCAYVSGASAMTDRWCIHSKGKKQFTFGAFDLLSCCYECGGGCTGGGIPGPI 140
Query: 70 WRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-TPKCVRKCVKKNQLWRN-- 119
W Y+V GV + + C PY C P E YP P C +C + +
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPS-EGDYPDEPNCSTRCNAGYNVTEDLR 199
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
+ + AY I +D IM +I+ NGPV+ F YED +Y GVY+H +G + GGHAVK
Sbjct: 200 DRRFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVK 259
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
LIGWG +DG YW++AN W R WG DG+FK+ RG N CGIEE+V AGLPS
Sbjct: 260 LIGWGV-EDGTKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLPS 309
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 99/249 (39%), Positives = 131/249 (52%), Gaps = 43/249 (17%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF EA +DR CI N + LS ++ AC GC GG + AW++
Sbjct: 162 QSACGSCWAFAPTEAFNDRLCIKSAGNFTSLLSPGNVAACSK---TSGCHGGSSLDAWQW 218
Query: 73 FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKCV 111
GVVT + C PY D C+H P C + Y P C C
Sbjct: 219 LHTTGVVTGGDYSAEKDMTESDGCWPY-DIPPCAHYTNSTLYPKCPKTKYDFPTCQESCP 277
Query: 112 KK--NQLWRNSKHY----SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
K + +H+ S+SA R + I EI NGPV S+ VY+DF YKSGVY
Sbjct: 278 NKKYDTPMEKDRHFVEEESLSALR---SIDAIKKEIMTNGPVSASYLVYDDFLTYKSGVY 334
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
K + + +GGHAVK+IGW GEDYW++ N WN++WG +G FKI G +CGIE++V+
Sbjct: 335 KRTSHNALGGHAVKIIGW-----GEDYWLVVNSWNKNWGDNGMFKI--GCGQCGIEDNVL 387
Query: 226 AGLPSSKNL 234
AG P + +L
Sbjct: 388 AGTPMTSSL 396
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 125/221 (56%), Gaps = 20/221 (9%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V ++ DR C+ G++ + S +++C GD CDGG+
Sbjct: 91 VVDQGGCGSCWAFSSVASVGDRRCVA-GLDKKAVRYSPQYVVSCDR---GDMACDGGWLP 146
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
S WR+ V G T+EC PY G A T C KC ++L + + A
Sbjct: 147 SVWRFLVKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSEL---PIYKATKA 194
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
D + IM + GP++ +FTVY DF +Y+ GVY+H+ G GGHAV+++G+GT +
Sbjct: 195 VDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEMVGYGTDE 254
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 255 YDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 104/147 (70%), Gaps = 2/147 (1%)
Query: 93 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 151
S P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F
Sbjct: 8 SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 67
Query: 152 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 211
+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI
Sbjct: 68 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 126
Query: 212 KRGSNECGIEEDVVAGLPSSKNLVKEI 238
RG + CGIE +VVAG+P + ++I
Sbjct: 127 LRGQDHCGIESEVVAGIPRTDQYWEKI 153
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/198 (44%), Positives = 113/198 (57%), Gaps = 22/198 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GGYP AW Y+
Sbjct: 19 QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 77
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
HG+V+E C PY F S C+H C Y TP C C K +R +
Sbjct: 78 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTS 135
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y +S E E+ NGP EVSF+VY DF Y GVYKH+ G +GGHAV+++G
Sbjct: 136 YLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVG 189
Query: 183 WGTSDDGEDYWILANQWN 200
WG +GE YW +AN WN
Sbjct: 190 WGEL-NGEPYWKIANSWN 206
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 122/240 (50%), Gaps = 61/240 (25%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAFGAVEA+SDR CIH ++S+ V+ DLL CCG +CGDGC+GGYP AW +
Sbjct: 102 QGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGIMCGDGCNGGYPAGAWNF 161
Query: 73 FVHHGVVTE-------ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWR 118
+ G+V+ C PY C H P C TPKC + C + ++
Sbjct: 162 WTRKGLVSGGLYDSHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
KHY ++Y +++ +DIMAEIYKN
Sbjct: 221 QDKHYGYNSYSVSNSEKDIMAEIYKN---------------------------------- 246
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
G YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 247 ----------GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 296
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/241 (39%), Positives = 132/241 (54%), Gaps = 33/241 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL-CGD-GCDGGYPISAW 70
QG CGSCWAF + EA +DR CI + LS +CC + C GC+GG P AW
Sbjct: 191 QGDCGSCWAFASTEAFNDRLCIRSQGKGVMPLSTQHTTSCCNAIHCASFGCNGGQPGMAW 250
Query: 71 RYFVHHGVVT----------EECDPYFDSTGCSH------PGCEP---AYPTPKCVRKCV 111
R+F GVVT C PY + C+H P C+ TPKC + C
Sbjct: 251 RWFERKGVVTGGDFDTLGKGTTCWPY-EIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCE 309
Query: 112 KKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
+ + H + S+Y + S + + ++ +G V +F VYEDF +YKSGVYK
Sbjct: 310 EAAYSEHVLPFDKDVHKASSSYSLRSR-DAVKRDMMAHGTVTGAFMVYEDFLNYKSGVYK 368
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+ G +GGHA+K+IGWGT +DGE+YW N WN WG G+FKI+ G +CG++ ++VA
Sbjct: 369 HVYGGPLGGHAIKIIGWGT-EDGEEYWHAVNSWNTYWGDSGHFKIEMG--QCGVDNEMVA 425
Query: 227 G 227
G
Sbjct: 426 G 426
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 88/198 (44%), Positives = 113/198 (57%), Gaps = 22/198 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GGYP AW Y+
Sbjct: 19 QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYY 77
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
HG+V+E C PY F S C+H C Y TP C C K +R +
Sbjct: 78 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTS 135
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y +S E E+ NGP EVSF+VY DF Y GVYKH+ G +GGHAV+++G
Sbjct: 136 YLLSG------EESFKRELLLNGPFEVSFSVYADFLAYTGGVYKHVAGIFLGGHAVRIVG 189
Query: 183 WGTSDDGEDYWILANQWN 200
WG +GE YW +AN WN
Sbjct: 190 WGEL-NGEPYWKIANSWN 206
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 92/236 (38%), Positives = 128/236 (54%), Gaps = 15/236 (6%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ I+ Q C S WA + ++SDR CI M + LS +L++C G C G+
Sbjct: 100 INIIHDQSKCDSGWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG--CQIGFS 157
Query: 67 ISAWRYFVHHGVVTEE---CDPYF-----DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL- 116
+W Y++ +G+VT + C PY + S+P C Y P C + C +
Sbjct: 158 EFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIP 217
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KHY Y + + DI EI NGPVE V+ DF +YKSGVY+HITG ++ H
Sbjct: 218 YKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIH 277
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+V++IGWG +D YW+ AN WN WG +GYFKI RGSNEC IE V AG +K
Sbjct: 278 SVRIIGWGIENDIP-YWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAGKVDNK 332
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 93/240 (38%), Positives = 121/240 (50%), Gaps = 32/240 (13%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CGS W SDRF I + LS ++L+C GC+G
Sbjct: 198 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSAQNILSCTRRQ--QGCEG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPY-----------FDSTGCSHPGCEPAYPTPKCVRKCVK 112
G+ +AWRY GV+ E+C PY +S GC+PAY V
Sbjct: 256 GHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKIQRHNSRSLKANGCQPAYG--------VN 307
Query: 113 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--- 169
++ L+ YS+S DIMAEIY +GPV+ + +Y DF Y G+Y+
Sbjct: 308 RDSLYTVGPAYSLSR------EADIMAEIYHSGPVQATMRIYRDFFSYSGGIYRQTAANR 361
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G G H+VKL+GWG DG YWI AN W WG GYF+I RGSNECGIEE V+A P
Sbjct: 362 GAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWP 421
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 92/238 (38%), Positives = 123/238 (51%), Gaps = 15/238 (6%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
+ ++ ++ QG CGS W SDRF I + LS ++L+C GC+G
Sbjct: 153 ASYISDVLDQGWCGSSWVISTASVASDRFAIQSRGKEVIQLSPQNILSCTRR--QQGCNG 210
Query: 64 GYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSK 121
G+ +AWRY GVV E C PY C P + C V +++L+
Sbjct: 211 GHLDAAWRYLHKQGVVDESCYPYVGYRDACKIPHNSRSLRNNGCRSYSGVDRDELYTVGP 270
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAV 178
YS++ + DIMAEI+ +GPV+ + TVY DF Y G+Y+H G +G H+V
Sbjct: 271 AYSLN------NETDIMAEIFMSGPVQATLTVYRDFFSYSGGIYRHTAASRGSPVGFHSV 324
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
KLIGWG DG YWI N W WG G F+I RGSNECGIEE V+A P+ N K
Sbjct: 325 KLIGWGEEHDGNKYWIATNSWGTWWGEHGNFRILRGSNECGIEEYVLAAWPNVYNYFK 382
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/198 (44%), Positives = 113/198 (57%), Gaps = 22/198 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q CGSCWA A A+SDR+C G+ +L +S DL++CC +CG GC+GGYP AW Y+
Sbjct: 19 QSSCGSCWAVAAASAISDRYCTLGGVRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYY 77
Query: 74 VHHGVVTEECDPY-FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQ---LWRNSKH 122
HG+V+E C PY F S C+H C Y TP C C K +R +
Sbjct: 78 AVHGIVSEYCQPYPFPS--CAHHVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTS 135
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y +S E E+ NGP EVSF+VY DF Y GVYKH+ G +GGHAV+++G
Sbjct: 136 YVLSG------EEPFKRELILNGPFEVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVG 189
Query: 183 WGTSDDGEDYWILANQWN 200
WG +GE YW +AN WN
Sbjct: 190 WGEL-NGEPYWKIANSWN 206
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/232 (39%), Positives = 130/232 (56%), Gaps = 27/232 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C S +A AV ++DR+CIH S D+L+CC CG GCDGG P + W Y
Sbjct: 157 QGCCASSYAVAAVATITDRWCIHSEGKSQFSFGAYDVLSCC-HRCGFGCDGGVPSAVWHY 215
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP----TPK----------CVRKCVKK-NQLW 117
+V +G+ + SH GC+ +YP P+ C+R+C N +
Sbjct: 216 WVENGITS-------GGAYESHEGCQ-SYPFGVCKPQEIFAPHVDLICLRQCQPGYNTTY 267
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
KH+ AY + D + I+ E++ GPV+ SFTVY DF YKSGVY+H G +G H+
Sbjct: 268 LEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGVYRHTYGVRVGDHS 327
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VK++GWG ++G +W+ AN W WG +G+FKI RG + +E +VVAGLP
Sbjct: 328 VKIVGWGV-ENGTKFWLCANSWGAEWGENGFFKIIRGEDHLSVESNVVAGLP 378
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 153 bits (386), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 122/215 (56%), Gaps = 19/215 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCW+F ++ S R+C + + S + L+AC GC GG ++AWRY
Sbjct: 88 QGKCGSCWSFAVSKSFSHRYCRKYNKPVLFSQSHLVACDRR--NSGCGGGIEVNAWRYID 145
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW--RNSKHYSISAYRINS 132
G+ + C PY G Y C +KC +++ + + ++++S++ Y +
Sbjct: 146 LRGLPLDSCQPY--------DGNITKY---NCSKKCTNESETYEAQFTEYWSVARY---A 191
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
E++ I GPV S VY D +YKSG+Y H G+ +G HAV++IGWGT +G DY
Sbjct: 192 SIEEMQIGIMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHAVEIIGWGTK-NGIDY 250
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
WI++N WN +WG +G F IKRG NEC IE+ V AG
Sbjct: 251 WIISNSWNTTWGMNGLFLIKRGVNECHIEDYVCAG 285
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 153 bits (386), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 85/236 (36%), Positives = 119/236 (50%), Gaps = 19/236 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ ++ Q G CWA + E ++DR CI + +S D+L+CCG CG GC G P
Sbjct: 110 IGLIRDQSAGGGCWAVSSAEVMTDRICIQSNGTKQVYVSETDILSCCGQRCGSGCTSGVP 169
Query: 67 ISAWRYFVHHGV-------VTEECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCV 111
A+ Y + GV C PY C + P Y PTP C + C
Sbjct: 170 RQAFNYAIRKGVCSGGPYGTKGVCKPY-PFYPCGYHAHLPYYGPCPDGMWPTPTCEKACQ 228
Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD 171
+ N S + + E I EI+ NGP+ ++TVYEDFA+YK+G+Y G
Sbjct: 229 SDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNNGPLVATYTVYEDFAYYKNGIYMTGLGR 288
Query: 172 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
G HAVK+IGWG ++G YW++AN WN WG +G+F++ RG+N C IE G
Sbjct: 289 ATGAHAVKIIGWG-EENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDIELSATGG 343
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 153 bits (386), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 117/235 (49%), Gaps = 32/235 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ W SDRF I + LS ++L+C GCDGG+ +AWRY
Sbjct: 207 QGWCGASWVLSTTSVASDRFAIQSQGKEVVQLSAQNILSCTRR--QQGCDGGHLDAAWRY 264
Query: 73 FVHHGVVTEECDPYFDST-----------GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK 121
+GV+ C PY GC+PA+ V ++ +
Sbjct: 265 MHKNGVLDANCYPYIQQRDTCKVQRHRGRSLKAYGCQPAHG--------VNRDNFYTVGP 316
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAV 178
YS+S DIMAEIY +GPV+ + TVY DF Y SGVY+H G G H+V
Sbjct: 317 AYSLSR------EADIMAEIYHSGPVQATMTVYRDFFSYSSGVYQHTAANRGAATGFHSV 370
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
KL+GWG +G YWI AN W WG GYF+I RGSNECGIEE V+A P N
Sbjct: 371 KLVGWGEEHNGVKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWPHVYN 425
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 153 bits (386), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 95/243 (39%), Positives = 120/243 (49%), Gaps = 31/243 (12%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CGS W SDRF I + LS ++L+C GC+G
Sbjct: 198 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNILSCTRRQ--QGCEG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
G+ +AWRY GVV E C PY +S GC PAY V +
Sbjct: 256 GHLDAAWRYLHKKGVVDETCYPYTQRRDSCKIRHNSRSLKANGCRPAYG--------VNR 307
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---G 170
+ L+ YS+ DIMAEIY +GPV+ + VY DF Y GVY+ G
Sbjct: 308 DSLYTVGPAYSLKG------ETDIMAEIYHSGPVQATMRVYRDFFSYSGGVYRQTAANRG 361
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
G H+VK++GWG DG YWI AN W WG GYF+I RGSNECGIEE V+A P+
Sbjct: 362 APTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWPN 421
Query: 231 SKN 233
N
Sbjct: 422 VYN 424
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/249 (35%), Positives = 124/249 (49%), Gaps = 19/249 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG 60
+ N + + Q CGSCWA A +SDR C+ L LS D+L+CCG +CGDG
Sbjct: 104 WKNCSSITYVRDQSRCGSCWAVSAASTMSDRICVQTKGKLQTILSDTDILSCCGRMCGDG 163
Query: 61 CDGGYPISAWRYFVHHGVVTE-------ECDPY-FDSTGCSHPGC-----EPAYPTPKCV 107
C+GGY AW + GVVT C PY F G H + ++ TP C
Sbjct: 164 CEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACK 223
Query: 108 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
C + + K + S Y +++D + I E+ KNGPV+ +F YEDF+ YK G+Y
Sbjct: 224 PYCQFGYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYV 283
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+ G G HAVKLIGWG ++G YW +AN W+ WG + S + +V
Sbjct: 284 HVKGRERGAHAVKLIGWGV-ENGTKYWTVANSWHDDWGGKRFLPYSTWSESLRVR--IVC 340
Query: 227 GLPSSKNLV 235
+NL+
Sbjct: 341 RFRRIQNLI 349
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/230 (42%), Positives = 126/230 (54%), Gaps = 22/230 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C S WAF A E +SDR CI + + LS DL+ CC + CG+ C GGY AW Y
Sbjct: 95 QGKCSSSWAFAAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHY-CGNQCKGGYTYYAWNY 153
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAY 128
F+ G+V+ Y STGC P E Y TP C C K + + KH+ S Y
Sbjct: 154 FMLTGLVSG--GDYNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIY 210
Query: 129 RINSDPEDIMAEIYKNG-PVEVSFTVYEDFAHYK---------SGVYKHITGDVMGGHAV 178
I + I EI G PV +F VY DF Y+ GVY + +G + G AV
Sbjct: 211 YIPQNETAIQNEILSGGGPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGRTAV 270
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
K+IGWGT ++G YW+ AN W + WGA G+FKI+RG+NECG EE ++AG
Sbjct: 271 KIIGWGT-ENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIAG 319
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 119/229 (51%), Gaps = 12/229 (5%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CG+ W SDRF I N+ LS ++L+C GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTRRQ--QGCEG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 123
G+ +AWRY GVV E C PY C+ + C K + R+S +
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPYTQH----RDTCKIRHSRSLKANGCQKPVNVDRDSLYT 311
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKL 180
AY +N + DIMAEI+ +GPV+ + V DF Y GVY+ + G H+VKL
Sbjct: 312 VGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKL 370
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+GWG +GE YWI AN W WG GYF+I RGSNECGIEE V+A P
Sbjct: 371 VGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWP 419
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 122/221 (55%), Gaps = 20/221 (9%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V ++ DR C G++ + S +++C GD CDGG+
Sbjct: 91 VVDQGGCGSCWAFSSVASVGDRRCFA-GLDKKAVKYSPQYVVSCDR---GDMACDGGWLP 146
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
S WR+ G T+EC PY G A T C KC + L + + A
Sbjct: 147 SVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDL---PIYKATKA 194
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
D + IM + GP++ +FTVY DF +Y+ GVY+H G V GGHAV+++G+GT +
Sbjct: 195 VDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEMVGYGTDE 254
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 255 YDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 122/236 (51%), Gaps = 38/236 (16%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS----LSVNDLLACCGFLCGDGCDGGYPISAW 70
Q CGSC+AF + + R + NL+ S D++ C + GCDGG+P
Sbjct: 210 QEQCGSCYAFSSSDMFGSR--VRIPSNLTQVPVYSPQDIVDCSAY--SQGCDGGFPFLVG 265
Query: 71 RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYR 129
+Y + +G+ E CDPY + KC +C V + Q +S +Y + Y
Sbjct: 266 KYAMDYGLTVESCDPY------------QGHDLGKCSNQCPVNRQQRLHSSNYYFVGGYY 313
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-------------- 175
NS +M EIY+NGP+ + F VY D +YK GVYKH+T + +
Sbjct: 314 GNSHELSMMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELKAQGLSEDEMIPHFEV 373
Query: 176 --HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAV ++GWG ++G YW + N W+ +WG +GYFKI RGS+ECG+E D AG+P
Sbjct: 374 VNHAVLMVGWGV-ENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVESDAEAGIP 428
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/247 (41%), Positives = 133/247 (53%), Gaps = 20/247 (8%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM--NLSLSVNDLLACCGFLCGDGCD 62
N + + QG CG+CWAF A EA+SDR CIH + S +LL+CC C GC
Sbjct: 63 NCPTIREIRDQGSCGACWAFAAAEAMSDRVCIHSSQTKHFHFSALNLLSCCD-SCEKGCL 121
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPTPKCVRK 109
G AW ++V HG+V+ E C PY C H C PTP C R
Sbjct: 122 GCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYH-LPPCEHHRAGPRRNCTKYGPTPSCARV 180
Query: 110 CVKKNQL-WRNSKHYSISAYRINSDPEDIM-AEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C ++ + + H+ Y + E I+ EI+ NGPVE + YEDF Y+SG+Y H
Sbjct: 181 CQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHNGPVEATMAAYEDFYTYESGIYHH 240
Query: 168 ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
I G + HAVK+IGWGT YW++AN +N WG G+FKIKRG NECGIE + A
Sbjct: 241 IEGTFVCDHAVKIIGWGTDKKTNTPYWLVANSFNTDWGEYGFFKIKRGVNECGIENKITA 300
Query: 227 GLPSSKN 233
G+P+ KN
Sbjct: 301 GIPAYKN 307
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/209 (43%), Positives = 120/209 (57%), Gaps = 19/209 (9%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD 62
N + + Q CGSCWAFGAVE++SDR CIH +++ LS +LL+CC CG GC+
Sbjct: 104 NCSSIRTIRDQSSCGSCWAFGAVESMSDRICIHSKGRISIELSAVNLLSCCS-RCGFGCN 162
Query: 63 GGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCE-PAYPTPKCVR 108
GG P AW Y+ G+VT C PY ST +H CE Y TP+C +
Sbjct: 163 GGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQ 222
Query: 109 KCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 167
C + + N K+Y S+Y + SD IM EI NGPVE +F VY+DF +YK+GVYK+
Sbjct: 223 TCQPDYAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVYDDFLNYKTGVYKY 282
Query: 168 ITGDVMGGHAVKLIGWGTSDDGEDYWILA 196
+TG ++GGHA++ I W E Y IL
Sbjct: 283 VTGSLLGGHAIR-ITWLGCIHIESYTILV 310
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 98/232 (42%), Positives = 130/232 (56%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCW+F A +DR C+ G N LS +L A C CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEEL-AFCCKDCGKGCGGGYPIKAWKY 165
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F GV T E C PY ++ G + G +P +C + C K + +
Sbjct: 166 FRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYGKTTV--QN 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
++ + S Y INS + I +I GPVE SF VY+D + YKSG+Y+ GGH++K
Sbjct: 224 RYKTKSEYVINSI-KTIERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIK 282
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+IGWG +G YW+ N W++ WG G FKI +G NECGIE V AG+PSS
Sbjct: 283 IIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/197 (42%), Positives = 108/197 (54%), Gaps = 21/197 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA A A+SDR CIH M L+ D L+CC + CG GC GGYP AW Y
Sbjct: 85 QASCGSCWATAAASAMSDRVCIHSNGQMRPRLAAADPLSCCTY-CGQGCRGGYPPKAWDY 143
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEP--------AYPTPKCVRKC-VKKNQL 116
++ G+VT C P+ T C H G YP P C R C N+
Sbjct: 144 WMREGIVTGGTWENRTGCQPWM-FTKCDHVGDSRKYSRCPHYTYPKPPCARACQTGYNKT 202
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ K Y S+Y + IM EI KNGPVEV+F +++DF Y+SG+Y H+ G +G H
Sbjct: 203 YEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGIYHHVAGKFIGRH 262
Query: 177 AVKLIGWGTSDDGEDYW 193
AV++IGWG ++G +YW
Sbjct: 263 AVRMIGWGV-ENGVNYW 278
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 121/239 (50%), Gaps = 31/239 (12%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CGS W SDRF I + LS ++L+C GC+G
Sbjct: 200 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 257
Query: 64 GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
G+ +AWRY GVV E C PY +S GC P+
Sbjct: 258 GHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGCRPS------------- 304
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---G 170
+ R+S + AY +N + DIMAEIY +GPV+ + VY DF Y SGVY+ G
Sbjct: 305 ANVDRDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRG 363
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G H+VKL+GWG +G+ YWI AN W WG GYF+I RGSNECGIE+ V+A P
Sbjct: 364 APTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 121/239 (50%), Gaps = 31/239 (12%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CGS W SDRF I + LS ++L+C GC+G
Sbjct: 200 SSYISEVPDQGWCGSSWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 257
Query: 64 GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
G+ +AWRY GVV E C PY +S GC P+
Sbjct: 258 GHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLKANGCRPS------------- 304
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---G 170
+ R+S + AY +N + DIMAEIY +GPV+ + VY DF Y SGVY+ G
Sbjct: 305 ANVDRDSFYTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRG 363
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G H+VKL+GWG +G+ YWI AN W WG GYF+I RGSNECGIE+ V+A P
Sbjct: 364 APTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 131/232 (56%), Gaps = 20/232 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCW+F A +DR C+ G N LS +L A C CG GC GGYPI AW+Y
Sbjct: 107 QGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSPEEL-AFCCKDCGKGCGGGYPIKAWKY 165
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
F GV T E C PY ++ G + G +P +C + C K + +
Sbjct: 166 FRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYGKTTV--QN 223
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVK 179
++ + S Y +NS + I ++ GPVE SF VY+DF+ YKSG+Y+ GGH++K
Sbjct: 224 RYKTKSEYVMNSI-KTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAKYQGGHSIK 282
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+IGWG +G YW+ N W++ WG G FKI +G NECGIE V AG+PSS
Sbjct: 283 IIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 128/230 (55%), Gaps = 15/230 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
Q C + WA A+SDR+C + G L +S DL+ACC CG GC+GGYP +AW Y+
Sbjct: 113 QSACRASWAVATASAISDRYCTVGNGKQLRISAADLMACCT-GCGGGCEGGYPDAAWEYY 171
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSI 125
V +G+ + +C PY C H G + P TP C C K+ K+
Sbjct: 172 VSNGITSSQCQPY-PFPRCEHRGAQGKKPPCSKYNFDTPTCNATCTDKSVPL--IKYRGN 228
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
+Y + + ED E+Y NGP V F V+ DF YKSGVY+H+ G+ +GG AV+++GWG
Sbjct: 229 HSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGK 287
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
+G YW +AN W+ WG +GYF I RG+NEC IE AG P + L
Sbjct: 288 M-NGTPYWKVANSWDTDWGMNGYFLILRGNNECNIEHLGFAGTPDTSQLT 336
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 81/183 (44%), Positives = 104/183 (56%), Gaps = 19/183 (10%)
Query: 20 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWAFGA EA+SDR CI +++S +D+L+CCG CG+GC+GGYPI AW+Y+V G
Sbjct: 1 SCWAFGAAEAMSDRICIASQGKTQVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTG 60
Query: 78 VVT-------EECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKKNQL-WRNSK 121
+ T C PY C H P Y TP C KC+ + + + K
Sbjct: 61 ICTGGSYESQSGCKPY-PIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDK 119
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
HY SAY + I EI NGPVE ++TVYEDF Y GVY H G +GGHAV+++
Sbjct: 120 HYGTSAYNVAKTVAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRIL 179
Query: 182 GWG 184
GWG
Sbjct: 180 GWG 182
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 123/217 (56%), Gaps = 20/217 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSC++F + E +SDRFCI + +N+ LS DL+ C + GC+GG P + Y
Sbjct: 22 QGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTCSWY--SFGCNGGIPGLVFDY 79
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V++ C PY G +H C P + C K + +++ KH++ Y +
Sbjct: 80 IHKDGLVSDACFPYLSYDGNTHVKC-PDF----CYNN---KTKSFKSDKHFADKVYHVGE 131
Query: 133 DPED-------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
ED I EI +GPV F VY DF YKSGVY+H TG G HAVK+IGWGT
Sbjct: 132 FLEDKAKRVLEIQKEILTHGPVNADFMVYSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGT 191
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
++G DYW++AN W ++G G+FKI RG +EE
Sbjct: 192 -ENGVDYWLIANSWGTTFGLQGFFKIVRGGKFIHLEE 227
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 103/165 (62%), Gaps = 1/165 (0%)
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 125
+S Y + G + E P + C+ TP CV+KC + ++ + H+
Sbjct: 18 VSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGK 77
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
SAY I +D + I EIY NGPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG
Sbjct: 78 SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGV 137
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
+ YW++AN WN WG+DG+FKI RGS+ECGIE + AGLP+
Sbjct: 138 QNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 182
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 87/220 (39%), Positives = 120/220 (54%), Gaps = 26/220 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACC--GFLCGDGCDGGYPISAW 70
Q CGSCWAF + LSDRFCIH +N LS DL++C F GC GG +
Sbjct: 145 QQLCGSCWAFASSAFLSDRFCIHSEGQINEDLSPQDLVSCSYENF----GCSGGQLTESV 200
Query: 71 RYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
+ ++ G+V+E+C PY + T C P K C +K+ L
Sbjct: 201 DFLIYEGIVSEKCKPYMNQDTYCKFKCQNDKQPYTKYF--CEQKSML------------- 245
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
I SD E+I E+ NGP+ V +VYED +YK GVY++ TG+ +GGHA+K+IGWG ++ G
Sbjct: 246 ILSDIEEIQLELMTNGPMMVGLSVYEDLMNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKG 305
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
E +W NQW + WG GY IK G E G++ V+ +P
Sbjct: 306 ELFWKCQNQWGKDWGMGGYINIKAG--ELGMDTMVLGCMP 343
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 89/228 (39%), Positives = 124/228 (54%), Gaps = 15/228 (6%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
L+ + H WA + ++SDR CI M + LS +L++C G C G+ +
Sbjct: 20 LLPREHYTELWAVASAASISDRTCIQTNGTMKVQLSAIELISCSKNKLG--CQIGFSEFS 77
Query: 70 WRYFVHHGVVTEE---CDPYF-----DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRN 119
W Y++ +G+VT + C PY + S+P C Y P C + C + ++
Sbjct: 78 WDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKA 137
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
KHY Y + + DI EI NGPVE V+ DF +YKSGVY+HITG ++ H+V+
Sbjct: 138 DKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGVYRHITGQLVTIHSVR 197
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+IGWG +D YW+ AN WN WG +GYFKI RGSNEC IE V AG
Sbjct: 198 IIGWGIEND-IPYWLCANSWNEDWGLNGYFKILRGSNECEIESFVNAG 244
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 120/231 (51%), Gaps = 15/231 (6%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CG+ W SDRF I N+ LS ++L+C GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTRRQ--QGCEG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSK 121
G+ +AWRY GVV E C PY H + +R C K + R+S
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPYT-----QHRDTCKIRHNSRSLRANGCQKPVNVDRDSL 310
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAV 178
+ AY +N + DIMAEI+ +GPV+ + V DF Y GVY+ + G H+V
Sbjct: 311 YTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSV 369
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
KL+GWG +GE YWI AN W WG GYF+I RGSNECGIEE V+A P
Sbjct: 370 KLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWP 420
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 89/223 (39%), Positives = 120/223 (53%), Gaps = 13/223 (5%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSC+A ++DR+CIH G L+CC CDGGY + Y
Sbjct: 69 QGSCGSCYAVSTAAVITDRYCIHSGGERQFYFGSTGYLSCCTDCYK--CDGGYVHKTFDY 126
Query: 73 FVHHGVVTEECDPYFDSTGCS-HP---GCEPAYPTPKCVRKCVKKNQLW--RNSKHYSIS 126
+V +G+ + PY GC +P + KC R+C L ++ KH + S
Sbjct: 127 WVKYGLTSG--GPYHSGQGCKPYPFGGATQDVNIVLKCDRQCQAGYPLTYSQDLKHGASS 184
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
D + AEIY+NGP+ SF VY DF Y+SGVY+H+TG G HAV++IGWG
Sbjct: 185 YILPWGDENAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKGSHAVRVIGWGV- 243
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++G YW+ AN WN WG +G+FKI RG N G+E+ AGLP
Sbjct: 244 ENGVKYWLCANSWNERWGENGFFKIVRGENHVGVEDISYAGLP 286
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 92/238 (38%), Positives = 121/238 (50%), Gaps = 30/238 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG----CDGGYPIS 68
QG+C S WA +DR CI + LS +L++C GDG CDGG
Sbjct: 87 QGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNLMSC-----GDGEKMGCDGGSAFK 141
Query: 69 AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
AW ++ G+VT E C PY + C H G C T C +KCV K
Sbjct: 142 AWELTMNKGIVTGGNFDSNEGCQPY-KNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNK 200
Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
N + + H + Y + ++ + I EI +GPV VYE+F YK G+YK TG
Sbjct: 201 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTHGPVTAFMYVYENFMGYKEGIYKSTTG 260
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+++G H VKLIGWG DG +YW+ N WN +WG DG FKI RG N C IE V+AG+
Sbjct: 261 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELLVMAGI 318
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 122/221 (55%), Gaps = 19/221 (8%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLAC-CGFLCGDGCDGGYPI 67
+V QG CGSCWAF +V DR CI G++ + S +++C G + C+GG+
Sbjct: 92 VVDQGGCGSCWAFSSVATFGDRRCIA-GLDKKPVKYSPQYVVSCDHGNM---ACNGGWLP 147
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
+AW++ G T+EC PY + C PT KC + + S
Sbjct: 148 NAWKFLTKTGTTTDECVPYQSGSTTLRGTC----PT-----KCADGSSKVHLTTATSYKD 198
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + D +M + GP++V+F VY DF +Y+SGVY+H G + GGHAV+++G+GT D
Sbjct: 199 YGL--DIPAMMKALSTTGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDD 256
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 257 DGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 117/221 (52%), Gaps = 18/221 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
E + QG CGSCWA A E + R I +S DL++C GC+GGY
Sbjct: 69 AEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGVMSPQDLVSCESN--NMGCNGGYADR 126
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
W + G+ TE+C PY +G P C KC + + R+ + S
Sbjct: 127 VWNWIQKKGITTEQCIPYVSGSG----------RVPTCPSKCKNGSNIVRS---FVSSWG 173
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
NS + +M E+ NGPV F V+EDF +Y+SGVY+H TG G H V L+GWGT ++
Sbjct: 174 SFNS--KTVMDEVANNGPVYACFEVFEDFYNYRSGVYQHKTGRSQGWHHVMLMGWGT-EN 230
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G YW+L N W WG G+F+I+RG+N+C I+E +GLP
Sbjct: 231 GVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYSGLP 271
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 88/267 (32%), Positives = 131/267 (49%), Gaps = 56/267 (20%)
Query: 18 CGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCG--FLCGDG------------- 60
C S WAF A E++SDR CI+ G +N LS +LL+CC F CG+G
Sbjct: 106 CKSSWAFSAAESMSDRLCINSGGMINTVLSAQELLSCCTGVFSCGEGDSEHWQFRNSKFR 165
Query: 61 -----------------------CDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 90
C GG AW+Y+ HG+ T C PY S
Sbjct: 166 KPRCQKFNKEILEARRNLETREKCAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISP 225
Query: 91 ------GCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 142
+ PGC TP C +KC + +HY +S ++ + +I +++
Sbjct: 226 CDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDRHYGVSVDQLPNRQIEIQSDVM 285
Query: 143 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 202
NGP+ + VY+DF Y +G+Y H+TG+ G +V+++GWG +G YW+LAN W +
Sbjct: 286 LNGPISATMEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSWGKQ 344
Query: 203 WGADGYFKIKRGSNECGIEEDVVAGLP 229
WG +G F++ RG NECG+E + V+G+P
Sbjct: 345 WGENGTFRVLRGVNECGLEANCVSGMP 371
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/197 (43%), Positives = 108/197 (54%), Gaps = 18/197 (9%)
Query: 20 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA A E +SDR C+ LS D+LACCG CG GC+GGY AW Y + G
Sbjct: 1 SCWAVSAAETMSDRLCVQTNGRKKTLLSDTDILACCGDFCGYGCNGGYSARAWLYARNSG 60
Query: 78 VVT----EE---CDPY------FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSKH 122
V + +E C PY + + C + Y TP C + C + + K
Sbjct: 61 VCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKI 120
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y+ AYR++SD I AEI+ GPV+ SF YEDFAHYKSG+Y H G GGHAVK+IG
Sbjct: 121 YAXDAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIG 180
Query: 183 WGTSDDGEDYWILANQW 199
WG ++G WI+AN W
Sbjct: 181 WGV-ENGTKXWIVANSW 196
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 92/238 (38%), Positives = 120/238 (50%), Gaps = 30/238 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG----CDGGYPIS 68
QG+C S WA +DR CI + LS +L++C GDG CDGG
Sbjct: 87 QGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNLMSC-----GDGEKMGCDGGSAFK 141
Query: 69 AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
AW ++ G+VT E C PY + C H G C T C +KCV K
Sbjct: 142 AWELTMNKGIVTGGNFDSNEGCQPY-KNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNK 200
Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
N + + H + Y + ++ + I EI GPV VYE+F YK G+YK TG
Sbjct: 201 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTG 260
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+++G H VKLIGWG DG +YW+ N WN +WG DG FKI RG N C IE V+AG+
Sbjct: 261 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELLVMAGI 318
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/223 (36%), Positives = 123/223 (55%), Gaps = 26/223 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAF E ++DRFCI +N +S +++C +GC+GG +A+++
Sbjct: 145 QEQCGSCWAFSISEMVADRFCIGTRGKINTIMSPQWMVSCD--TADNGCNGGEFPTAFQF 202
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----WRNSKHYSISA 127
G+V++ C PY G P C C + +NS+++ ++
Sbjct: 203 VETTGLVSDGCVPYQSGNGF----------VPPCPNSCANGEDINVRYRTKNSRNFDVN- 251
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
D + + A I NGPV F VY DF +Y+SG YKH+ G ++GGHA+K++GWG +
Sbjct: 252 -----DMKSVQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLVGGHAIKVVGWGVTQ 305
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
YWI+AN W+ WG +GYF I RG+NEC IEE++ +P+
Sbjct: 306 SNVPYWIVANSWSDEWGMNGYFWILRGTNECSIEENMWETIPA 348
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 121/244 (49%), Gaps = 31/244 (12%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CG+ W SDRF I + LS ++L+C GCDG
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKETVQLSAQNILSCTRRQ--QGCDG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
G+ +AWRY GVV E C PY +S GCE TP V
Sbjct: 256 GHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNSRSLRANGCE----TPVNVD----- 306
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV- 172
R++ + AY +N + DIMAEI+ +GPV+ + V DF Y GVY+ +
Sbjct: 307 ----RDTFYTVGPAYSLNREA-DIMAEIFNSGPVQATMRVNRDFFSYSRGVYRQTAANRE 361
Query: 173 --MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
G H+VKL+GWG +GE YWI AN W WG GYF+I RGSNECGIEE V+A P
Sbjct: 362 APTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEKGYFRILRGSNECGIEEYVLASWPY 421
Query: 231 SKNL 234
N
Sbjct: 422 VYNF 425
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 122/221 (55%), Gaps = 19/221 (8%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V DR C+ G++ + S +++C GD C+GG+
Sbjct: 92 VVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDMACNGGWLP 147
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
+ W++ G T+EC PY + C PT KC + + S
Sbjct: 148 NVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLATATSYKD 198
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + D +M + +GP++V+F VY DF +Y+SGVY+H G + GGHAV+++G+GT D
Sbjct: 199 YGL--DIPAMMKALSTSGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDD 256
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 257 DGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 92/239 (38%), Positives = 121/239 (50%), Gaps = 24/239 (10%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CGS W SDRF I + LS ++L+C GC+G
Sbjct: 200 SRYISEVPDQGWCGSSWVLSTTSVASDRFAIQSQGKEVVQLSPQNILSCTRRQ--QGCEG 257
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGCEPAY---PTPKCVRKCVKKNQLW 117
G+ +AWRY GV+ E C PY S G H G A+ P P V ++ L+
Sbjct: 258 GHLDAAWRYLHKKGVLDESCYPYTQSRGTCKVRHSGSLKAHGCRPAPG-----VDRDSLY 312
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMG 174
YS+S DI AEI+ +GPV+ + VY DF Y G+Y+ G G
Sbjct: 313 TVGPAYSLSR------EADIKAEIFHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPTG 366
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
H+VKL+GWG +G+ YWI AN W WG GYF+I RGSNECGIE+ V+A P N
Sbjct: 367 FHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYVYN 425
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 93/255 (36%), Positives = 125/255 (49%), Gaps = 47/255 (18%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFG EA +DR CI + LS ++ AC GCDGG P AW +
Sbjct: 165 QSDCGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAPSF---GCDGGIPSLAWSW 221
Query: 73 FVHHGVVT-------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC- 110
+ G+ T + C PY D C+H P C + +Y TP C +C
Sbjct: 222 VHNKGIATGGDYLAEDDMTKDDGCWPY-DFPPCAHHVNDSKYPKCPKDSYETPNCAEQCH 280
Query: 111 -VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV---------------EVSFTVY 154
K R+ +H+ + + D I +GPV SF VY
Sbjct: 281 NPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDGPVGPIYFCDPSVNFDQVSASFIVY 340
Query: 155 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
EDF Y+SGVYKH +G +GGHAVK+IGWG + G+ YW++ N WN WG +G FKI G
Sbjct: 341 EDFLAYRSGVYKHTSGKELGGHAVKIIGWG-EETGQAYWLVVNSWNEDWGDNGLFKIALG 399
Query: 215 SNECGIEEDVVAGLP 229
+ C I++D++ G P
Sbjct: 400 N--CEIDDDLLGGTP 412
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 124/240 (51%), Gaps = 27/240 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C + WA A++DR CI N++ S L++CC CG+GC GGY +AWRY
Sbjct: 118 QGNCAADWAISVTSAMNDRICIASQGNITALYSPQKLVSCCED-CGNGCSGGYTAAAWRY 176
Query: 73 FVHHGVVT-------EECDPYF-----DSTGCSHP----------GCEPAYPTPKCVRKC 110
+ G+VT E C P+ ST + P G +PA TPKC C
Sbjct: 177 ILKKGIVTGGDYGSNEGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPA-TTPKCDLSC 235
Query: 111 VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
+ + D + K+GP V+ VYEDF YKSGVY H+TG
Sbjct: 236 YNARHEGKYLDDIIKAKKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTG 295
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
D +G +V++IGWG + G+ +W+LAN W SWG G+FKI+R NEC IE AG+P+
Sbjct: 296 DYLGLLSVRMIGWGL-EGGQAFWLLANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPN 354
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 112/206 (54%), Gaps = 15/206 (7%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGM-NLSLSVNDLLACCGFLCGDGC 61
+ N + + Q CGSCWA A A+SDR+C G+ +L +S DLL+CC CG GC
Sbjct: 7 WPNCPTITEIRDQSGCGSCWAVAARSAMSDRYCTRGGVRDLRISAGDLLSCCN-ACGLGC 65
Query: 62 DGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKN 114
+GG P AW Y+V G+V+E C PY C+H C Y TP C C
Sbjct: 66 NGGDPDWAWLYYVETGIVSEFCQPY-PFPPCAHHVNSTHYTPCSVEYDTPFCNITCTNTI 124
Query: 115 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ S S S ED E++ GP EV+FTVYEDF Y GVYKH +G+ +G
Sbjct: 125 PPIKYKGRISYSL----SGEEDYKRELFLYGPFEVAFTVYEDFVAYSDGVYKHFSGNALG 180
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWN 200
GHAV+L+GWG +G YW +AN WN
Sbjct: 181 GHAVRLVGWGNL-NGTPYWKIANSWN 205
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 116/221 (52%), Gaps = 18/221 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPIS 68
E + Q CGSCWA A E + R I +S DL++C GC+GGY
Sbjct: 71 AEPVRNQASCGSCWAHAASETMGFRMGIRGCYKGVMSPQDLVSCESN--NMGCEGGYADR 128
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
W + G+ TE+C PY +G P C KC + + R+ + S
Sbjct: 129 VWNWIQKKGITTEQCLPYVSGSG----------RVPTCPSKCKNGSNIVRS---FVSSWG 175
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
NS + +M E+ NGPV F V+EDF +YKSG+Y+H TG G H V L+GWGT ++
Sbjct: 176 SFNS--KTVMDEVANNGPVYACFEVFEDFLNYKSGIYQHKTGKSKGWHHVMLMGWGT-EN 232
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G YW+L N W WG G+F+I+RG+N+C I+E +GLP
Sbjct: 233 GVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYSGLP 273
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 119/239 (49%), Gaps = 31/239 (12%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CG+ W SDRF I + LS ++L+C GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKK 113
G+ +AWRY GVV E C PY +S GC+ Y
Sbjct: 256 GHLDAAWRYLHKKGVVDESCYPYTQQRDTCKIRHNSRSLRANGCQTPYNVD--------- 306
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
R++ + AY +N + DIMAEI+ +GPV+ + V DF Y GVY+ + M
Sbjct: 307 ----RDTFYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRM 361
Query: 174 ---GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G H+VKL+GWG +GE YWI AN W WG GYF+I RGSNECGIEE V+A P
Sbjct: 362 APTGFHSVKLVGWGEEHNGEKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 420
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 116/230 (50%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS W SDRF I + LS ++L+C GCDGG+ +AWR+
Sbjct: 206 QGWCGSSWVLSTTSVASDRFAIQSKGKEAVRLSAQNILSCTRRQ--QGCDGGHLDAAWRF 263
Query: 73 FVHHGVVTEECDPY----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
GVV + C PY +S GC P+ + R+S +
Sbjct: 264 LHKKGVVDDSCYPYTQQRDTCKIRHNSRSLKANGCRPS-------------PNVDRDSFY 310
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVK 179
AY +N + DIMAEIY +GPV+ + VY DF Y G+Y+ G G H+VK
Sbjct: 311 TVGPAYTLNRE-GDIMAEIYHSGPVQATMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVK 369
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
L+GWG +G+ YWI AN W WG GYF+I RGSNECGIEE V+A P
Sbjct: 370 LVGWGEEHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 419
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 89/235 (37%), Positives = 124/235 (52%), Gaps = 24/235 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C + WA A++DR CI N++ S +L+CC CGDGC+GGY +AW+Y
Sbjct: 109 QGNCAADWAISVTSAINDRICIKSKKNITAFYSPQKMLSCCDD-CGDGCNGGYSGAAWQY 167
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP----------TPKCVRKCVKKNQ 115
++ G+VT E C P+ C+H + P TP+C C N
Sbjct: 168 WMKRGLVTGGDYGSNEGCQPWLIPP-CNHTVMDERSPSYMCGKYKSETPQCTLNCYNPNY 226
Query: 116 LWRNSKHYSISAYRINSDPEDIMA-EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
K S RI+ ++ E+ K+GP VYEDF YKSG+Y+H+TG ++G
Sbjct: 227 SKPFLKDIS-KGIRIDWHCSGMIRNELKKHGPATAIMRVYEDFLTYKSGIYQHVTGKLLG 285
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
VK+IGWG G YW+ AN W SWG G+FKI+RG NEC E+ ++G P
Sbjct: 286 QITVKVIGWGVY-RGVQYWLAANSWGTSWGDKGFFKIRRGYNECLFEDYFISGRP 339
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 126/234 (53%), Gaps = 26/234 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCD---------G 63
Q CGSCWAFGAVEA++DR CI G + LS DL++CC G G
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCEDCGGGCKGGFPGQAWDMG 171
Query: 64 GYPISAWRYFV--HHGVVTEECDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQ 115
S WR+ H G C PY F T +P C Y TP+C + C K +
Sbjct: 172 KTRDSHWRFRKKNHTG-----CQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYK 226
Query: 116 L-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
+ K + + + ++ + +I GPVE +F VYEDF + KSG+ +H+TG ++G
Sbjct: 227 TPFEQDKPFGEGSSNVQNNEKVFQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVG 286
Query: 175 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GH +++IGWG + G YW++AN WN WG +G F++ RG +EC IE VVAGL
Sbjct: 287 GHPIRIIGWGV-EKGNPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 124/235 (52%), Gaps = 23/235 (9%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDGCDGG 64
E + + Q +CGSCWA A ++DR CI + ++D +LAC G
Sbjct: 294 EWIRFIRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQILAC-----------G 342
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP------AYPTPKCVRKCVKKNQL-W 117
S + Y+ G+ T PY D + C P TP C C +
Sbjct: 343 MIPSPFNYWKKMGIATG--GPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYDIPI 400
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K Y+ Y ++S+ +IM EIY +GPV F VYEDF +Y SG+Y+ T MGGHA
Sbjct: 401 SDDKFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHA 460
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+++IGWG ++G YW++AN WN ++G G+F+I+RG+NEC IE +V G+P +
Sbjct: 461 IRIIGWG-EENGIPYWLIANSWNTTFGEKGFFRIRRGTNECRIESEVYTGIPKLR 514
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 11/131 (8%)
Query: 60 GCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 113
GC G +A+ Y+ G+VT + C + + C+ C P PKC R C
Sbjct: 69 GCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTM--CRPYMLAPKCQRTCQAS 126
Query: 114 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 172
L + K+Y S Y +N D DIM EIY+ GPV F VY DF +Y SG + I G+
Sbjct: 127 YNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQF--ICGNK 184
Query: 173 MGGHAVKLIGW 183
L W
Sbjct: 185 RCEEEENLTSW 195
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 84/197 (42%), Positives = 109/197 (55%), Gaps = 21/197 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWA V A+SDR CIH M LS DL++CC + CG+GC GG P +AW Y
Sbjct: 85 QSSCGSCWAVAGVGAMSDRVCIHSNGMMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDY 143
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCEPA--------YPTPKCVRKC-VKKNQL 116
+ +G+VT C PY C HPG YPTP C C ++
Sbjct: 144 WWRNGIVTGGTLENPTGCLPY-PFPQCRHPGSRSQLNPCPGYIYPTPSCYPYCQAGYDKT 202
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
+ K Y ++Y ++ IM EI KNGPVE F VY DFA YKSG+Y H++G G H
Sbjct: 203 YEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKH 262
Query: 177 AVKLIGWGTSDDGEDYW 193
A+++IGWG ++G +YW
Sbjct: 263 AIRIIGWGV-ENGVNYW 278
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 106/206 (51%), Gaps = 20/206 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWA ALSDR CI +++S D+L CC + CG GC GG+PI AW Y
Sbjct: 116 QANCGSCWAVSTAAALSDRICISTNGTKQVNISATDILTCC-YKCGYGCQGGWPIEAWEY 174
Query: 73 FVHHGVVT------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVK--KNQLW 117
G VT + C C H G E Y TPKC C KN +
Sbjct: 175 VAREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRARTPKCRTSCTPGYKNS-Y 233
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ K AY + + + I EI KNGPV +FTVY DF++YK G+YKH G G HA
Sbjct: 234 SDDKIRGKDAYELPNSVKAIQREIMKNGPVVAAFTVYADFSYYKKGIYKHTAGRARGSHA 293
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSW 203
VK+IGWG D YWI+ N W+ W
Sbjct: 294 VKVIGWGEEGD-VPYWIVKNSWHNDW 318
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 122/221 (55%), Gaps = 19/221 (8%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGD-GCDGGYPI 67
+V QG CGSCWAF +V DR C+ G++ + S +++C GD C+GG+
Sbjct: 92 VVDQGGCGSCWAFSSVATFGDRRCVA-GLDKKPVKYSPQYVVSCDH---GDMACNGGWLP 147
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
+ W++ G T+EC PY + C PT KC + + S
Sbjct: 148 NVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLATATSYKD 198
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV+++G+GT D
Sbjct: 199 YGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDD 256
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 257 DGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
Length = 351
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 84/220 (38%), Positives = 118/220 (53%), Gaps = 26/220 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACC--GFLCGDGCDGGYPISAW 70
Q +CG+CWAF L+DR CI + +N LS D++ C F GC+GGY ++A
Sbjct: 140 QANCGACWAFTGSGMLADRICILTNGTINEELSPQDMVDCSHDNF----GCEGGYLMNAL 195
Query: 71 RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY-SISAYR 129
Y ++ GV E C PY D T KC C K + + KHY R
Sbjct: 196 DYLMNEGVTKESCTPYKDKTN-------------KCQYTCQNKTEEFH--KHYCKPGTLR 240
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 189
+ ++ E I ++ +NGP+ V TVYEDF +Y +G YK + G+++GGHAVKL+GW T+ G
Sbjct: 241 VLTNEEQIKRDLMQNGPLMVGLTVYEDFINYATGDYKFVAGEIVGGHAVKLMGWRTTQKG 300
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+ W++ NQWN WG G+ I NE GI+ V P
Sbjct: 301 QTSWLIQNQWNDDWGEQGFGYIL--ENEVGIDSIGVGCTP 338
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 91/227 (40%), Positives = 123/227 (54%), Gaps = 17/227 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ +V QG CGS WA SDRF I N+ LS LL+C GC GG+
Sbjct: 204 ISPIVDQGWCGSDWAVSLAGVASDRFAIQSNGAENMVLSPQTLLSC-NVRAQQGCHGGHI 262
Query: 67 ISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRK-CVKKNQLWRNSKHYS 124
AW + HG+V E+C PY S T C P P ++ C+ + R + Y
Sbjct: 263 DVAWNFARGHGLVDEKCFPYKASVTRC------PFRPRGNLIQDGCMP--LVKRRTSRYK 314
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK---HITGDVMGGHAVKLI 181
+ S +DIM +I ++GPV+ TVY+DF HY+ GVY+ H ++ G H+V++I
Sbjct: 315 LGPPAKLSHEKDIMYDIMESGPVQAVMTVYQDFFHYRDGVYRRSYHGNNELKGFHSVRII 374
Query: 182 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GWG D G+ YW++AN W R WG +GYF+I RGSNE IE VV GL
Sbjct: 375 GWG-EDRGDRYWVVANSWGRQWGENGYFRIARGSNEADIESFVVTGL 420
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/238 (37%), Positives = 120/238 (50%), Gaps = 15/238 (6%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CG+ W SDRF I + LS ++L+C GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSK 121
G+ +AWRY GVV E C PY H + +R C + R++
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPY-----TQHRDTCKIRHNSRSLRANGCQTPVNVDRDTL 310
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAV 178
+ AY +N + DIMAEI+ +GPV+ + V DF Y GVY+ + G H+V
Sbjct: 311 YTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSV 369
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 236
KL+GWG +GE YWI AN W WG GYF+I RGSNECGIEE V+A P N K
Sbjct: 370 KLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLASWPYVYNYYK 427
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/232 (38%), Positives = 123/232 (53%), Gaps = 29/232 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CG+ WAF +DR IH ++ LSV +L++C GC+GG SAWRY
Sbjct: 242 QRNCGASWAFSTASVAADRIAIHSEGQITDNLSVQNLISC-DTRNQHGCNGGNIDSAWRY 300
Query: 73 FVHHGVVTEECDPYF-----DSTGCSHPGCEPAY-------PTPKCVRKCVKKNQLWRNS 120
HGVV+ C P F + +G +H Y P P + K N+L+R +
Sbjct: 301 LKTHGVVSYACYPSFWKKHLEPSGENHCYVSSEYGKNYTNGPCPNALEK---SNRLYRCA 357
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAV 178
HY R++S +IM EI GPV+ VYEDF YK G+Y+H G H+V
Sbjct: 358 SHY-----RVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYRHSQKAGSKWKTHSV 412
Query: 179 KLIGWGTSDDG----EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
KL+GWG D + +WI AN W +SWG +GYF+I RG NEC IE+ ++A
Sbjct: 413 KLLGWGALADKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILA 464
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 95/249 (38%), Positives = 122/249 (48%), Gaps = 36/249 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CG CWAF EA SDR CI G + + LS D+ C DGCDGG I+ W Y
Sbjct: 47 QSNCGCCWAFAGAEAASDRQCIATGGAVAVPLSAQDV---CFNANVDGCDGGQIITPWTY 103
Query: 73 FVHHGVVTEE------------CDPYFDSTGCSHPGCE-------------PAYPTPKCV 107
G VT C +F + C H G P+ +P+
Sbjct: 104 VAKAGAVTGGQYNGTGPFGAGLCADWF-APHCHHHGPRGDDPYPAEGDAGCPSEKSPEGP 162
Query: 108 RKC----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 163
+ C + + KH + S IMA I + GPVE +FTVYEDF +Y G
Sbjct: 163 KACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVETAFTVYEDFENYAGG 222
Query: 164 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
+Y H+TG+ GGHAVK +GWG ++G YW +AN WN WG GYF+I RGSNE GIE+
Sbjct: 223 IYHHVTGEEAGGHAVKFVGWGV-ENGTKYWKVANSWNPYWGEAGYFRILRGSNEGGIEDQ 281
Query: 224 VVAGLPSSK 232
V +K
Sbjct: 282 VTGSHADAK 290
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 68/137 (49%), Positives = 93/137 (67%), Gaps = 13/137 (9%)
Query: 97 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 156
CE Y T ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ D
Sbjct: 2 CEAGYSTS------------YKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 49
Query: 157 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 216
F YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N
Sbjct: 50 FLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGEN 108
Query: 217 ECGIEEDVVAGLPSSKN 233
CGIE ++VAG+P ++
Sbjct: 109 HCGIESEIVAGIPRTQQ 125
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 94/264 (35%), Positives = 127/264 (48%), Gaps = 51/264 (19%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--CGD-GCDGGYPISA 69
Q CGSCWA VEA + R CI G N LS ++LACC + C GC GG +A
Sbjct: 82 QSACGSCWAIAPVEAFNARLCIKSGGKFNQLLSAGEMLACCNSVHSCNSHGCQGGIARAA 141
Query: 70 WRYFVHHGVVT-------------EECDPY------FDSTGCSHPGC------------- 97
W + HG+VT + C PY D + C
Sbjct: 142 WSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQEDSKYEPCPEVRVPPLGERHQ 201
Query: 98 --------EPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKNGP 146
+ Y TP C+ +C K +H++ A + ++I EI NGP
Sbjct: 202 RGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEGTDNIKKEIMTNGP 261
Query: 147 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 206
SF+ YEDF+ YKSGVYKH +G +G H+V++IGWGT + G DYW++ N WN WG
Sbjct: 262 TSASFSTYEDFSSYKSGVYKHTSGGYLGDHSVEIIGWGT-EKGVDYWLVMNSWNEGWGDH 320
Query: 207 GYFKIKRGSNECGIEEDVVAGLPS 230
G FKI +G +CGI++ V LP+
Sbjct: 321 GTFKIAQG--DCGIDDAVQGSLPA 342
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/276 (34%), Positives = 128/276 (46%), Gaps = 68/276 (24%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFG EA +DR CI + LS ++ AC GC+GG+P SAW +
Sbjct: 560 QSACGSCWAFGVTEAFNDRLCIKSNGTFTELLSAGEMNACAP---SHGCNGGFPNSAWSW 616
Query: 73 FVHHGVVT-------------EECDPYFDSTGCSH-------PGC--------------- 97
G+ T + C PY D C+H P C
Sbjct: 617 VHDKGIATGGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYPECPKVSCSGESPPATAE 675
Query: 98 -------EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV- 147
+ +Y TP C +C K R+ +H+ + + D I +GPV
Sbjct: 676 TATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDAKNAIRTDGPVG 735
Query: 148 --------------EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
SF+VYEDF YKSGVYKH +G+ +GGHAVK+IGWG + G+ YW
Sbjct: 736 PIYFCDPNVNFDQVSASFSVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWG-EESGQAYW 794
Query: 194 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
I+ N WN WG G FKI G+ CGI+++++ G P
Sbjct: 795 IVVNSWNEDWGDHGLFKIALGN--CGIDDNLLGGTP 828
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)
Query: 69 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 114
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 115 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GGHAVK+IGWG + G YW++AN W+ WG G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/193 (44%), Positives = 111/193 (57%), Gaps = 22/193 (11%)
Query: 20 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWAFGAVEA+SDR CI ++LS DLL+CC CG GC+GG P+SAW+++V G
Sbjct: 1 SCWAFGAVEAISDRICIASKGKTQVTLSAADLLSCC-RSCGFGCNGGDPLSAWKFWVKEG 59
Query: 78 VVTEE-------CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKK--NQLWRNS 120
+VT C PY C H P +PTPKC + C + ++
Sbjct: 60 IVTGSNHSTNAGCKPY-PFPACEHHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKED 118
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 180
K++ SAY + + E I EI GPVEV+F VYEDF +Y G+Y H G + GGHAVK+
Sbjct: 119 KYFGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKM 178
Query: 181 IGWGTSDDGEDYW 193
IGWG D+G YW
Sbjct: 179 IGWGI-DNGVPYW 190
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 119/231 (51%), Gaps = 15/231 (6%)
Query: 6 SEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDG 63
S ++ + QG CG+ W SDRF I + LS ++L+C GC+G
Sbjct: 198 SSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKEAVQLSAQNILSCTRRQ--QGCEG 255
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSK 121
G+ +AWRY GVV E C PY H + +R C + R++
Sbjct: 256 GHLDAAWRYLHKKGVVDENCYPYT-----QHRDTCKIRHNSRSLRANGCQTPVNVDRDTL 310
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAV 178
+ AY +N + DIMAEI+ +GPV+ + V DF Y GVY+ + + G H+V
Sbjct: 311 YTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSV 369
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
KL+GWG +GE YWI AN W WG GYF+I RGSNECGIE+ V+A P
Sbjct: 370 KLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEDYVLASWP 420
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 94/241 (39%), Positives = 117/241 (48%), Gaps = 30/241 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
Q CGS AVE SDR CI N LS D L+CC L CGDG CDG +P
Sbjct: 113 QSDCGSAAHLVAVEMASDRTCISSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 172
Query: 68 SAWRYFVHHGVVT---------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
+++ HG+ T CD + + S P P Y TP C C
Sbjct: 173 DILKWWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYPNGTTSVPC--PGYHTPPCEDHCTS 230
Query: 113 KNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
N W + KH+ + Y + DI EI NGPV SF +YEDF YKSG+Y H
Sbjct: 231 -NITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIYVHT 289
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GD GG K+IGWG D+G YW+ +QW +G +G+ +I RG NE IE V+A L
Sbjct: 290 AGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIEHQVLAAL 348
Query: 229 P 229
P
Sbjct: 349 P 349
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 115/209 (55%), Gaps = 21/209 (10%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDG 60
+ N + + Q C SCWA + A++DR CIH LS D+++CC + CG G
Sbjct: 73 WPNCPSISEIRDQSSCSSCWAVSSASAITDRICIHSNGQKKPRLSAIDIVSCCAY-CGYG 131
Query: 61 CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH----PGCEPA----YPTPK 105
C+GG P +W Y+ GVVT C PY CSH PG P YPTPK
Sbjct: 132 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGVVTPGLPPCPRDIYPTPK 190
Query: 106 CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C +KC N+ + K S+Y + DIM EI KNGPV+ F ++EDF YKSG+
Sbjct: 191 CEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGI 250
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYW 193
Y + TG ++GGHA+++IGWG ++G +YW
Sbjct: 251 YHYTTGRLVGGHAIRVIGWGV-ENGVNYW 278
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/230 (41%), Positives = 120/230 (52%), Gaps = 26/230 (11%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
++ QG+C S WAF V SDR I ++LS LL+C GC GG+ A
Sbjct: 196 ILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLSPQHLLSC-NTRGQRGCSGGHIDRA 254
Query: 70 WRYFVHHGVVTEECDPYF----DSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
W + GVV+ +C PY D G C PG P+ C + N+L H+S
Sbjct: 255 WWFMRKRGVVSNDCYPYTSGDQDKKGVCMMPGKLPS----DCPTGRERNNEL-----HHS 305
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHA---- 177
YRI ++ +I EI +NGPV+ SF V EDF Y SGVY+H + D HA
Sbjct: 306 TPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWH 365
Query: 178 -VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VKL+GWG ++G YW+ AN W WG DGYFKI RG NEC IE VVA
Sbjct: 366 SVKLLGWGV-ENGIKYWLGANSWGTKWGEDGYFKILRGENECNIESYVVA 414
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 225 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 283
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 284 LRKRGLVSHACYPLFKDQHATNSGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 340
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YRI+S+ +IM EI +NGPV+ V+EDF HYKSG+Y+H+ +
Sbjct: 341 --PPYRISSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHVASTHGESENYRKLRT 398
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL+GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 399 HAVKLLGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 454
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 88/224 (39%), Positives = 124/224 (55%), Gaps = 20/224 (8%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
V ++ Q CGSCWAF + EALSDR CI +N++LS L+A C + GC+GG P
Sbjct: 109 VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSPQALVA-CDDIGNQGCNGGVP 167
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 125
AW Y G+ T EC PY G C R+C + + + +K +S+
Sbjct: 168 QLAWEYMEWKGLPTFECYPYTAGNGTDG----------TCQRQCADGSAMTYYRAKPFSM 217
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWG 184
+ + I EI GPV + VY+DF Y SGVY + T +++GGHA++++GWG
Sbjct: 218 TTC---NSVACIQNEIITYGPVVGTMMVYQDFMSYSSGVYVYDGTAELLGGHAIEIVGWG 274
Query: 185 TSDDGE-DYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVA 226
T + DYWI+ N W+ +WG DGYF I+RG+N CGI+ D A
Sbjct: 275 TDATSKLDYWIVKNSWSAAWGGLDGYFWIQRGTNMCGIDHDASA 318
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 143 bits (361), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S +IM EI +NGPV+ V+EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFQKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/215 (38%), Positives = 113/215 (52%), Gaps = 18/215 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWA EA+ D I ++SV DL++C C+GG A Y V
Sbjct: 83 QASCGSCWAHSVAEAMGDAQNIAGCPRGAMSVQDLVSCDK--TDSACNGGDMKKAQEYLV 140
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
G+ TE C Y +G P C KC +Q+ R Y + +++ + +P
Sbjct: 141 KTGITTEACVKYVSGSG----------RVPACPSKCDNGSQIIR----YKLQSWK-SVEP 185
Query: 135 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 194
+IM + + GP+ F VY DF +Y+SGVY+H +G GGHAV L GWG ++G YW+
Sbjct: 186 SEIMQALMEYGPLSCGFMVYSDFMNYRSGVYQHKSGYFEGGHAVLLCGWGV-ENGLPYWL 244
Query: 195 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+ N W +WG G+FKI RGSN C IE V G+P
Sbjct: 245 VQNSWGPAWGEKGFFKILRGSNHCEIESYVTLGVP 279
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 94/242 (38%), Positives = 118/242 (48%), Gaps = 38/242 (15%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD----GCDGGYPIS 68
QG+C S WA +DR CI + LS +L++C GD GCDGG
Sbjct: 87 QGNCASSWAVAVASTFTDRLCIASNGKFTDNLSAQNLMSC-----GDDEKLGCDGGSAYK 141
Query: 69 AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
AW + + G+VT E C PY + C H G C T C KCV K
Sbjct: 142 AWEFTMGKGIVTGGPYDSNEGCQPY-KNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNK 200
Query: 114 N-------QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 166
N L++ S Y S ++ + I EI GPV VYE+F YK GVYK
Sbjct: 201 NYKVKYEDDLYKTSVVYMTSW----TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVYK 256
Query: 167 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G+++G H VKLIGWG + G +YW+ N WN +WG DG FKI RG N C IE V+A
Sbjct: 257 STAGELIGYHHVKLIGWGVDEAGIEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELLVMA 316
Query: 227 GL 228
GL
Sbjct: 317 GL 318
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 119/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 129 QKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 187
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 188 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNFEKSNRIYQCS--- 244
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+T +
Sbjct: 245 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRT 302
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 303 HAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 358
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNSGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S +IM EI +NGPV+ V+EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIYRHVTSTNKESEKFLKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 84/200 (42%), Positives = 111/200 (55%), Gaps = 22/200 (11%)
Query: 20 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA A+SDR CI + +S D+++CC + CG GC+GG+PI AW+Y V G
Sbjct: 1 SCWAVSTAAAMSDRICIASKGATQVLISAQDIVSCCTW-CGAGCEGGWPIEAWKYGVTEG 59
Query: 78 VVT------EECDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVK--KNQLWRNSKH 122
VVT +EC ++ C + G EP Y TP C ++C KN + K
Sbjct: 60 VVTGGNFGRKECCRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRCRPGYKNSYMMD-KR 118
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y SAY + + I +I +NGPV F VYEDF +YKSG+Y+H G GGHAVK+IG
Sbjct: 119 YGTSAYELPNSVXAIQRDIMENGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIG 178
Query: 183 WG---TSDDGEDYWILANQW 199
WG T + YWI+AN W
Sbjct: 179 WGEEXTENGTIPYWIIANSW 198
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 93/243 (38%), Positives = 120/243 (49%), Gaps = 28/243 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
Q CGS AVE SDR CI + N LS D L+CC L CGDG CDG +P
Sbjct: 114 QSDCGSAAHLVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 173
Query: 68 SAWRYFVHHGVVTEE-------CDPYFD-------STGCSHPGCEPAYPTPKCVRKCVKK 113
+++ HG+ T C PY + G + C P Y TP C C
Sbjct: 174 DILKWWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVPC-PGYHTPTCEEHCTS- 231
Query: 114 NQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 169
N W + KH+ + Y + DI EI NGPV SF +Y+DF YK+G+Y H
Sbjct: 232 NITWPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTA 291
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GD GG K+IGWG D+G YW+ +QW +G +G+ + RG NE IE V+A LP
Sbjct: 292 GDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350
Query: 230 SSK 232
S+
Sbjct: 351 DSE 353
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 120/237 (50%), Gaps = 30/237 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 234 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCP-KNRHGCNSGSIDRAWWF 292
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F + ++ GC A + T C K N++++ S
Sbjct: 293 LRKRGLVSHACYPLFKNQNATNHGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 349
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---------MG 174
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+HIT +
Sbjct: 350 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITKKANEESGKYRKLQ 407
Query: 175 GHAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 408 THAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464
>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
Length = 345
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 88/216 (40%), Positives = 123/216 (56%), Gaps = 24/216 (11%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISA 69
++ Q +CGSCWA AV L +RFCI G +N+ S D+++C L C+GGY S+
Sbjct: 137 ILDQANCGSCWAHAAVTMLQNRFCIKSGGSINMQFSRQDMVSCD--LGNAACNGGYLSSS 194
Query: 70 WRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY--SISA 127
+Y GVV+E+C Y + G S P+C +C K+ + K Y ++
Sbjct: 195 VQYLQTEGVVSEQCLAYASADGNS---------VPRCNYRCDDKSLEY---KKYGCKYNS 242
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--GGHAVKLIGWGT 185
+I + EDI EIY NGPV V F VY+DF+ Y +G+Y+ +T D + GGHAV L GWG
Sbjct: 243 MKILTTYEDIKEEIYTNGPVMVGFVVYDDFSSYSTGIYE-VTPDSVEEGGHAVTLNGWGY 301
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
D+G YWI NQW +WG G+F+I G E GI+
Sbjct: 302 -DNGRLYWIGQNQWQNTWGESGFFRIYAG--EAGID 334
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/194 (42%), Positives = 109/194 (56%), Gaps = 22/194 (11%)
Query: 20 SCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA + A+SDR CI + +S D+++CC + CG GC GG+ I AW YF G
Sbjct: 1 SCWAVSSASAMSDRVCIATQGAKQVLISDQDIVSCCTW-CGYGCQGGWSIRAWYYFAEQG 59
Query: 78 VVTE-------ECDPYFDSTGCSHPGCEPAY-------PTPKCVRKC-VKKNQLWRNSKH 122
VVT C PY + C + EP Y TP+C R+C + + + + KH
Sbjct: 60 VVTGGNYNTKGSCRPY-EIHPCGYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKH 118
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y +AY++ E I EI +NGPV FTVYEDFAHYK G+YKH +G GGHAVK+IG
Sbjct: 119 YGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIG 178
Query: 183 WGTSDDGED---YW 193
WG+ G + YW
Sbjct: 179 WGSEQKGSEKIPYW 192
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 124/239 (51%), Gaps = 12/239 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q C + WA +SDR+C G+ L +S LL+CC CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCK-QCGGGCKG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
G+P AWRY+V +G+ + C PY + G P + + TPKC C K+
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIP 220
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
K+ + Y + ED E+Y NGP F VY D YKSGVY+H+ GD +GG
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGT 278
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
AVK++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDHNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 89/251 (35%), Positives = 120/251 (47%), Gaps = 29/251 (11%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCI---HFGMNLSLSVNDLLACCGFLC----GD----- 59
++ QG CGSCWAF L+ R CI G L+ L++C +C GD
Sbjct: 111 ILQQGSCGSCWAFATTGVLAQRMCIKSEQIGQGYELAPQALVSCTDQICYTKAGDRCSSP 170
Query: 60 --------GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV 111
GCDGGYP A+R+ G+ E C Y G C V +C
Sbjct: 171 SSTCYCSLGCDGGYPDGAFRFMQDEGITPELCVKYVSKDGTDPLECSDVQTM---VSECT 227
Query: 112 KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK--HIT 169
+ N Y +SD E I +I ++GPV S+ V+EDF Y SGVY
Sbjct: 228 ATSNATVNGDR---CYYHSSSDIETIQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDG 284
Query: 170 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
D +G HAV ++GWG +D YW++ N W +G DGYFKI RG+NEC IE +V L
Sbjct: 285 SDSIGWHAVIIVGWGV-EDNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSLV 343
Query: 230 SSKNLVKEITS 240
+++ +V TS
Sbjct: 344 NTEGVVFASTS 354
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 118/231 (51%), Gaps = 23/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CG+ WAF +DR IH ++ LS +L++C GC+GG AWRY
Sbjct: 242 QRNCGASWAFSTASVAADRIAIHSKGQITDNLSAQNLISC-DTRNQHGCNGGSIDGAWRY 300
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK---------CVRKCVKKNQLWRNSKHY 123
HGVV+ C P F + Y + + C K N+L+R + HY
Sbjct: 301 LKTHGVVSYACYPSFWNKHLGPSAENQCYVSNEYGKNHTNGPCPNAFEKSNRLYRCASHY 360
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLI 181
R++S DIM EI GPV+ VYEDF YK G+Y+H G H+VKL+
Sbjct: 361 -----RVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLL 415
Query: 182 GWGTSDDG----EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GWG D + +WI AN W +SWG +GYF+I RG NEC IE+ ++A L
Sbjct: 416 GWGALPDKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 118/230 (51%), Gaps = 20/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR I M SLS +LL+C GC GG AW Y
Sbjct: 241 QGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWY 299
Query: 73 FVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISA 127
GVV+E C P+ ++ G S P + + R+ NQ + +++ Y S A
Sbjct: 300 LRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPA 359
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVK 179
YR+ S +DIM E+Y+NGPV+ V+EDF YKSG+Y+ G H+VK
Sbjct: 360 YRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVK 419
Query: 180 LIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+ GWG DG+ YW+ AN W R WG DGYF+I RG NEC IE +V
Sbjct: 420 ITGWGEERGRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 469
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/217 (38%), Positives = 103/217 (47%), Gaps = 18/217 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+CWAF A L+ R CI N+ LS + C C GGY AW +
Sbjct: 152 QQTCGACWAFSATYVLAHRLCIATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYAWSF 209
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G + C PY G PA KC Q + Y R S
Sbjct: 210 LERTGTTVDSCIPYASGRATFSSGTCPA--------KCKVSTQ---SMTMYKAKNSRYIS 258
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+I A I G V+ FT+Y DF Y+SGVYKH++ +GGHAV LIGWG + G +Y
Sbjct: 259 GVNNIKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVALIGWGV-ESGTNY 317
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
W+ N W +WG GYFKI +G ECGIE V AG P
Sbjct: 318 WLAVNSWGSNWGMSGYFKIAQG--ECGIENQVYAGEP 352
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 88/220 (40%), Positives = 113/220 (51%), Gaps = 22/220 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV---NDLLACCGFLCGDGCDGGYPISAWR 71
QGHCGSCWAF A A DR C+ G++ S V C +L GC GG S W
Sbjct: 98 QGHCGSCWAFSATSAFGDRRCMQ-GLD-SAGVPYSQQYTISCDYL-DLGCAGGLSFSVWT 154
Query: 72 YFVHHGVVTEECDPYFDSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
+ HG T EC PY D+ S P C C +++ R K Y
Sbjct: 155 FLTEHGTTTLECVPYTDANKDISSP----------CPDACADGSEI-RLVKADGCLDYSG 203
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
N IM + +GPV+ S VY DF +Y+SGVY+H+ G + HAV++IG+G +DD +
Sbjct: 204 NVTA--IMQALANDGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHAVEIIGYGAADDED 261
Query: 191 D--YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
YWI+ N WG +GYF I RGSNEC IE V +GL
Sbjct: 262 STPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 116/230 (50%), Gaps = 20/230 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M SLS +LL+C GC GG AW Y
Sbjct: 222 QGNCAGSWAFSTAAVASDRISIHSMGHMTPSLSPQNLLSC-DTRNQRGCSGGRLDGAWWY 280
Query: 73 FVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISA 127
GVVT+EC P+ DS + P + T + R+ + Q N + S A
Sbjct: 281 LRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQATARCPNPQTHANDIYQSTPA 340
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVK 179
YR+ ++IM E+ +NGPV+ V+EDF YKSG+Y+H G H+VK
Sbjct: 341 YRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKSGIYRHTAVAEGKGPKHQQHGTHSVK 400
Query: 180 LIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+ GWG DG+ YW AN W R+WG DG+F+I RG NEC +E VV
Sbjct: 401 ITGWGEEQLPDGQVQKYWTAANSWGRAWGEDGHFRIARGVNECEVESFVV 450
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 116/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCS-KNRPGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F + GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATSNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSANKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/239 (38%), Positives = 125/239 (52%), Gaps = 12/239 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN-LSLSVNDLLACCGFLCGDGCDG 63
N + + Q C + WA +SDR+C G+ L +S LL+CC CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCK-QCGGGCKG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
G+P AWRY+V +G+ + C PY + G P + + TPKC C K+
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIP 220
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
K+ + Y + ED E+Y NGP F VY D YKSGVY+++ GD++GG
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQ 278
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
AV+++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 279 AVRIVGWGKL-NGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 59/113 (52%), Positives = 89/113 (78%), Gaps = 1/113 (0%)
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGH
Sbjct: 6 YKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGH 65
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
A++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 66 AIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 117/241 (48%), Gaps = 30/241 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
Q CGS A E SDR CI N LS D L+CC L CGDG CDG +P
Sbjct: 116 QSDCGSAAHLVAAEIASDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 175
Query: 68 SAWRYFVHHGVVT---------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
+++ HG+ T CD + + S P P Y TP C +C
Sbjct: 176 DILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTTSVPC--PGYHTPVCEERCTS 233
Query: 113 KNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
N W + KH+ + Y + DI EI +NGPV SF +Y+DF YKSG+Y H
Sbjct: 234 -NITWPISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIYVHT 292
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GD GG K+IGWG D+G YW+ +QW +G +G+ +I RG NE IE V+A
Sbjct: 293 AGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGENGFVRILRGVNEVNIEHQVLAAQ 351
Query: 229 P 229
P
Sbjct: 352 P 352
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 89/139 (64%), Gaps = 3/139 (2%)
Query: 94 HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSF 151
+P C+ Y P C ++C K + L + KHY+ AYRI S E I EI KNGPV SF
Sbjct: 121 NPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASF 180
Query: 152 TVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 210
TVY DF HY SGVYK ++GGHAV++IGWG + YW+++N WN WG G FK
Sbjct: 181 TVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFK 240
Query: 211 IKRGSNECGIEEDVVAGLP 229
I RG NECGIEE++ AGLP
Sbjct: 241 IWRGKNECGIEEEITAGLP 259
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 92/241 (38%), Positives = 115/241 (47%), Gaps = 30/241 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL---CGDG--CDGGYPI 67
Q CGS AVE SDR CI N LS D L+CC L CGDG CDG +P
Sbjct: 116 QSDCGSAAHLVAVELASDRTCIFSNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPK 175
Query: 68 SAWRYFVHHGVVT---------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVK 112
+++ HG+ T CD + + S P P Y TP C C
Sbjct: 176 DILKWWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTTSVPC--PGYHTPTCEEHCTS 233
Query: 113 KNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 168
N W + KH+ + Y + DI EI NGPV SF +Y+DF YKSG+Y H
Sbjct: 234 -NITWPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIYVHT 292
Query: 169 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GD GG K+IGWG D G YW+ +QW +G +G+ + RG NE IE V+A L
Sbjct: 293 AGDQEGGMDTKIIGWGV-DSGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAAL 351
Query: 229 P 229
P
Sbjct: 352 P 352
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 120/231 (51%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISA-Y 128
G+V+ C P F ++ GC A + ++ K N + ++++ Y S Y
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRDATKPCPNNVEKSNRIYQCSPPY 355
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKL 180
R++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T + HAVKL
Sbjct: 356 RVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 415
Query: 181 IGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GWGT E +WI AN W +SWG +GYF+I RG NE IE+ V+A
Sbjct: 416 TGWGTLRGAQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 466
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/234 (34%), Positives = 119/234 (50%), Gaps = 29/234 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q +C + WAF +DR + NLS +L++CC GC+ G AW +
Sbjct: 237 QKNCAASWAFSTASVAADRIXGRYTANLS--PQNLISCCA-KNRHGCNSGSIDRAWWFLR 293
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHYSI 125
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 294 KRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS----- 348
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHA 177
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+T + HA
Sbjct: 349 PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKYRKLQTHA 408
Query: 178 VKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+KL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 409 IKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 462
>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
(Silurana) tropicalis]
Length = 494
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 115/226 (50%), Gaps = 17/226 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR I M SLS +LL+C GC GG AW Y
Sbjct: 256 QGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWY 314
Query: 73 FVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISA 127
GVV+E C P+ ++ G S P + + R+ NQ + +++ Y S A
Sbjct: 315 LRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPA 374
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVK 179
YR+ S +DIM E+Y+NGPV+ V+EDF YKSG+Y+H G H+VK
Sbjct: 375 YRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVK 434
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+ G G YW+ AN W R WG DGYF+I RG NEC IE +V
Sbjct: 435 ITG-GRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 479
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 87/251 (34%), Positives = 124/251 (49%), Gaps = 36/251 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKK-RHGCNSGSVDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+HIT
Sbjct: 353 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW--- 467
Query: 232 KNLVKEITSAD 242
++TSAD
Sbjct: 468 ----GQLTSAD 474
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/236 (34%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNDGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+T
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEEASKYRKFQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLKGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+HIT +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESDIEKLIIAA 466
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 120/235 (51%), Gaps = 28/235 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I ++LS +L++CC GC GG AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSNGRYTVNLSPQNLISCC-LKHRYGCSGGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFD----STGCSHP----GCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
G+V+ C P F + GC+ G + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNSTNGCAMASRSDGRGKRHATTPCPNNIEKSNRIYQCS---- 351
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGH 176
YR++S+ IM EI KNGPV+ V+EDF +YK+G+Y+H+T + + H
Sbjct: 352 -PPYRVSSNETQIMKEIMKNGPVQAIMQVHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTH 410
Query: 177 AVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
AVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 AVKLTGWGTLRGAKGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 86/231 (37%), Positives = 116/231 (50%), Gaps = 23/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CG+ WAF +DR IH ++ LSV +L++C GC GG AWRY
Sbjct: 242 QRNCGASWAFSTASVAADRIAIHSDGQITDNLSVQNLISC-DTKNQHGCGGGNIEGAWRY 300
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK---------CVRKCVKKNQLWRNSKHY 123
HGVV+ C P F P Y + + C N+L+R + HY
Sbjct: 301 LKTHGVVSYACYPSFWKHSLDSPSENHCYVSSEYGKNHTNGPCPNALEDSNRLYRCASHY 360
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLI 181
RI+S DIM EI GPV+ VYEDF YK G+Y+H G H+VKL+
Sbjct: 361 -----RISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLL 415
Query: 182 GWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GWG+ + + +WI AN W + WG +GYF+I RG NEC IE+ ++ L
Sbjct: 416 GWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 116/231 (50%), Gaps = 23/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CG+ WAF +DR IH ++ LSV +L++C GC+GG AWRY
Sbjct: 242 QRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISC-DTGNQRGCNGGSIDGAWRY 300
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPK---------CVRKCVKKNQLWRNSKHY 123
HGVV+ C P F P Y + + C N+L+R HY
Sbjct: 301 LTTHGVVSYACYPSFWKHHLDSPSENQCYVSSEYGKNHTNGPCPNALEDSNRLYRCGSHY 360
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLI 181
R++S DIM EI GPV+ VYEDF YK G+Y+H G H+VKL+
Sbjct: 361 -----RVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLL 415
Query: 182 GWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
GWG+ + + +WI AN W + WG +GYF+I RG NEC IE+ ++ L
Sbjct: 416 GWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 82/236 (34%), Positives = 116/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC G AW Y
Sbjct: 186 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCSSGSIDRAWWY 244
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P+ ++ C A + T C K N++++ S
Sbjct: 245 LRKRGLVSHACYPFLKDQNTTNNACAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 301
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S+ +IM EI NGPV+ V+EDF HYKSG+Y+H+T +
Sbjct: 302 --PPYRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKSEKYQKLQT 359
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI+AN W SWG +GYF+I RG NE IE+ ++A
Sbjct: 360 HAVKLTGWGTLRGAQGRKEKFWIVANSWGNSWGENGYFRILRGVNESDIEKLIIAA 415
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 87/251 (34%), Positives = 124/251 (49%), Gaps = 36/251 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKK-RHGCNSGSVDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+HIT
Sbjct: 353 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAHGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW--- 467
Query: 232 KNLVKEITSAD 242
++TSAD
Sbjct: 468 ----GQLTSAD 474
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 101/171 (59%), Gaps = 15/171 (8%)
Query: 72 YFVHHGVVT-------EECDPY-FDS----TGCSHPGC-EPAYPTPKCVRKCVKKNQL-W 117
Y V G+VT C PY F T +P C Y TP+C +KC K + +
Sbjct: 9 YLVKRGIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTPY 68
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
K+Y Y + S+ + I EI NGPVE +F VYEDF +YKSG+Y+H+TG ++GGHA
Sbjct: 69 EQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHA 128
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
+++IGWG + YW++AN WN WG G F+I RG +EC IE +VVAGL
Sbjct: 129 IRIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 178
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 113/210 (53%), Gaps = 21/210 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGS WA A +DR C+ + N LS ++ CC CG+GC+GGYPI AW+
Sbjct: 50 QGNCGSDWALSTSSAFADRLCVATNGDFNQLLSAEEITFCC-HKCGNGCNGGYPIRAWKR 108
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
F +HG+VT E C+PY +D G + +P P KC +KC + N
Sbjct: 109 FKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQPMEPNHKCSKKCYGDEDIDFN 168
Query: 120 SKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHA 177
H Y+ Y + I ++ GP+E SF VY+DF +YKSG+Y K +GGH+
Sbjct: 169 KDHRYTRDDYYLTY--RGIQKDVINYGPIEASFDVYDDFPNYKSGIYVKSENASYLGGHS 226
Query: 178 VKLIGWGTSDDGEDYWILANQWNRSWGADG 207
VKLIGWG + G YW++ N WN WG G
Sbjct: 227 VKLIGWG-EEYGVLYWLMVNSWNADWGDKG 255
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 114/222 (51%), Gaps = 28/222 (12%)
Query: 3 FTNSEHVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVND--LLACCGFLCGDG 60
+ N + + Q + GSCWA A E +SDR C+ + ++D +LACCG CG G
Sbjct: 105 WKNCSSITYIRDQSNSGSCWAVSAAETMSDRICVQSKGRVQKMISDVDILACCGRECGRG 164
Query: 61 CDGGYPISAWRYFVHHGVVT----EE---CDPYFDSTGCSHPGCEP-----------AYP 102
C+GG AW Y GVVT +E C PY HP CE ++
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYH-----LHP-CEITGKFWSCPRDHSFR 218
Query: 103 TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 161
TP C + C + + K Y S Y ++ D + I E+ KNGPV+ +FT YEDF+ Y+
Sbjct: 219 TPACKKYCQYGYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYEDFSFYR 278
Query: 162 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 203
G+Y H G G HAVK++GWG ++G YW +AN W+ W
Sbjct: 279 KGIYVHSYGRQRGAHAVKVVGWGV-ENGTKYWNVANSWSTDW 319
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I + LS +L++CC GC GG AW Y
Sbjct: 229 QKNCAASWAFSTASVAADRIAIQSKGRYTDNLSPQNLISCC-VKNRHGCKGGSIDRAWWY 287
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC+ A + T C K N++++ S
Sbjct: 288 LRKRGLVSHACYPLFKDQIFNNNGCDMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 344
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S+ +IM EI +NGPV+ V+EDF HYKSG+Y+HI +
Sbjct: 345 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINNLKDESEKYRNLRT 402
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWG E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 403 HAVKLTGWGVLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 458
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 139 bits (351), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 108/199 (54%), Gaps = 19/199 (9%)
Query: 48 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSH 94
+L CC CG GC GGYPI AW+ F +HG+VT E C+PY +D G +
Sbjct: 1 ELTFCC-HTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNNT 59
Query: 95 PGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTV 153
+P +C R C +L + H Y+ Y + I ++ GP+E SF V
Sbjct: 60 CAGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDV 117
Query: 154 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 212
Y DF YKSG+Y+ +GGHAVKLIGWG G YW++ N WN WG +G FKI+
Sbjct: 118 YSDFPSYKSGIYERTENATYLGGHAVKLIGWG-EQYGIPYWLMVNSWNEDWGDNGLFKIR 176
Query: 213 RGSNECGIEEDVVAGLPSS 231
RG+NECG++ AG+P +
Sbjct: 177 RGTNECGVDNSTTAGVPVT 195
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 241 QKNCAASWAFSTASVAADRIAIQSNGRFTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 299
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ C A + T C K N++++ S
Sbjct: 300 LRKRGLVSHACYPLFKDQNATNNDCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 356
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V++DF HYK G+Y+H+T +
Sbjct: 357 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIYRHVTSTHEEPEKYRKLRT 414
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HA+KL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 415 HAIKLAGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 470
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 86/243 (35%), Positives = 118/243 (48%), Gaps = 43/243 (17%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C + WAF SDR I M LS +L++C G GC GG AW Y
Sbjct: 247 QGNCAASWAFSTAAVASDRISIQSMGHMTPRLSPQNLISCDTRNQG-GCAGGRIDGAWWY 305
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN-----------------Q 115
GVVTE+C PY +P + TP V +C+ ++ Q
Sbjct: 306 LRRRGVVTEDCYPY-----------QPPHQTPAEVGRCMMQSRSVGRGKRQATQRCPNTQ 354
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
+ N + S YR++S+ ++IM EI NGPV+ V+EDF YK+G+YKH
Sbjct: 355 NYHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKP 414
Query: 174 ------GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G H+V++ GWG + YWI AN W ++WG +GYF+I RG NEC IE
Sbjct: 415 PQYRKHGTHSVRITGWGEDRNVDGTSRKYWIAANSWGKNWGENGYFRIVRGENECEIETF 474
Query: 224 VVA 226
V+
Sbjct: 475 VIG 477
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 96/249 (38%), Positives = 122/249 (48%), Gaps = 36/249 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFL--C-GDGCDGGYPISA 69
Q C SCWA V+A S R CI G N LS +LLACC C GC GG A
Sbjct: 106 QSACASCWAIAPVQAFSARLCIKSGGKFNQLLSAGELLACCNLAHSCEARGCKGGVARDA 165
Query: 70 WRYFVHHGVVT-------------EECDPYFDSTGCSH--------PGCEPAYPTPKCVR 108
W + HG+ T + C PY + C+H P + +Y TP C+
Sbjct: 166 WVFLNKHGIATGGDFVPKSSMEAVDGCWPY-NFPRCAHYQKKSKYGPCPKKSYETPSCLD 224
Query: 109 KCV--KKNQLWRNSKHYSISA--YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
+C K +H++ A Y N I EI K+GP SF YEDF YKSGV
Sbjct: 225 RCPNEKYGTPLDKDRHFTARAVPYWFNGI-RSIKKEIMKHGPTSASFFTYEDFFSYKSGV 283
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
YK+ +G + H V+LIGWGT + G DYW+ N WN W G FKI +G +CGI D+
Sbjct: 284 YKYTSGAYVEFHTVELIGWGT-EKGVDYWLAKNDWNEEWADLGTFKIAQG--DCGI-NDL 339
Query: 225 VAGLPSSKN 233
V G P++ N
Sbjct: 340 VLGAPAALN 348
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 88/233 (37%), Positives = 120/233 (51%), Gaps = 24/233 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M +LS +LL+C GC+GG AW +
Sbjct: 274 QGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSC-NTRHQQGCNGGRIDGAWWF 332
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA-----YPTPKCVRKCVKKNQLWR---NSKHYS 124
GVVT+EC P F + +H PA T + R+ + + R N + S
Sbjct: 333 LRRRGVVTDECYP-FSNQETNHSPNAPACMMHSRSTGRGKRQAIARCPNPRSHANEIYQS 391
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGH 176
AYR++S+ ++IM E+ +NGPV+ V+EDF Y++G+Y+H G H
Sbjct: 392 TPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFMYRTGIYRHTAVAAGKPEQYRRHGTH 451
Query: 177 AVKLIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
+VK+ GWG DG + YWI AN W + WG GYF+I RG NEC IE VV
Sbjct: 452 SVKITGWGEEQMPDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIETFVV 504
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/236 (34%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLSKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +W+ AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWVAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 466
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/226 (36%), Positives = 114/226 (50%), Gaps = 17/226 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIH-FGMN-LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C + WAF SDR I G++ + LS DL++C C GG+P WR+
Sbjct: 220 QGDCANSWAFSTAAVASDRLSIQSRGVDKVELSPQDLMSCLNGGRRVVCQGGHPDRGWRF 279
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCE-PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+++G V+EEC PY ++ C P P +C KH+S YR+
Sbjct: 280 LLNYGGVSEECYPYEGVHSSANATCRIPRRRDPIEDARCPTGRT---EQKHFSTPPYRVP 336
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--------TGDVMGGHAVKLIGW 183
++ EDIM EIY NGPV+ V EDF Y+SGVY+H G H+V+++GW
Sbjct: 337 ANEEDIMQEIYANGPVQALILVKEDFFLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGW 396
Query: 184 GTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G YW+ AN W WG +GYF+I RG +E IE V+A
Sbjct: 397 GVDRSQYRPIKYWLCANSWGHGWGENGYFRIVRGEDESQIESFVLA 442
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/119 (54%), Positives = 85/119 (71%), Gaps = 1/119 (0%)
Query: 114 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 173
N + N K Y YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++
Sbjct: 3 NVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALL 62
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GGHAV+L+GWG ++ YW++AN WN WG +GYFKI RG NECGIE DV AG+P K
Sbjct: 63 GGHAVRLLGWGEENN-VPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 120
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 115/216 (53%), Gaps = 19/216 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QG CGSCWAF A+ DR C G++ +S S L++C L GCDGG W
Sbjct: 99 QGSCGSCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 155
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G T EC Y D G A P P QL++ + +S
Sbjct: 156 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 204
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++++G+GT+DDG
Sbjct: 205 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 263
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 264 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/251 (34%), Positives = 123/251 (49%), Gaps = 36/251 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKK-RRGCNSESVDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIEKSNRIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+HIT
Sbjct: 353 --PPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRT 410
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAW--- 467
Query: 232 KNLVKEITSAD 242
++TSAD
Sbjct: 468 ----GQLTSAD 474
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/198 (36%), Positives = 110/198 (55%), Gaps = 19/198 (9%)
Query: 20 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY----- 72
SCWA + EA+SD C+ + + +S +D+L+CCG CG GC GG+ I A+++
Sbjct: 1 SCWAVSSAEAMSDEICVQSNSTIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRER 60
Query: 73 --FVHHGVVTEECDPYFDSTGCSHPGCEPAY--------PTPKCVRKCVKKN-QLWRNSK 121
+ C P S + +P Y PTPKC + C +K + ++ K
Sbjct: 61 CCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQEDK 120
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
H++ AY + ++ I EIYKNGPV +F VY+DF++YK G+Y H G G HAVK++
Sbjct: 121 HFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVV 180
Query: 182 GWGTSDDGEDYWILANQW 199
GWG ++ DYW++AN W
Sbjct: 181 GWG-RENATDYWLIANSW 197
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 113/232 (48%), Gaps = 24/232 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR IH + LS L++C GC GG AW Y
Sbjct: 244 QRNCAASWAFSTASVAADRIAIHSKGRFTDNLSPQHLISC-DTRNQYGCKGGSITGAWSY 302
Query: 73 FVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA- 127
+G+V+ C P F T C A + ++ C + W S H
Sbjct: 303 LKKYGLVSHACYPLFWNNLHQTSCEMSSVFDAEGKRQAIQPCPNR---WEPSNHIYQCGL 359
Query: 128 -YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--------TGDVMGGHAV 178
YRI+S DIM EI +NGPV+ VY+DF YKSG+YKHI H++
Sbjct: 360 PYRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEGKTQNRHQKKPHSI 419
Query: 179 KLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
K++GWGT D E +WI AN W SWG +GYF+I RG NEC IE+ V+A
Sbjct: 420 KIVGWGTLRDAEGQRQKFWIAANSWGNSWGENGYFRILRGQNECDIEKTVIA 471
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 225 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 283
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 284 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 339
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 340 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 397
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 398 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 453
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC GG AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSSGRYTANLSPQNLISCCARK-RHGCGGGSVDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNATN-GCAMASRSDGRGKRHATTPCPNHIEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S+ IM EI +NGPV+ V+EDF YK+G+Y+H+T +
Sbjct: 352 --PPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIYRHVTSTSEDSEKYQKLRT 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYFKI RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLKGARGKKEKFWIAANSWGKSWGENGYFKILRGVNESDIEKLIIAA 465
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 81/236 (34%), Positives = 116/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCT-KNRHGCNSGSVDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N +++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNANNNGCAMASRSDGRGKRHATKPCPNNIEKSNVIYQCS--- 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+ +
Sbjct: 353 --PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVIRTSEESEKYQKLRT 410
Query: 176 HAVKLIGWGTSDDG----EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWG E +W+ AN W +SWG DGYF+I RG NE IE+ ++A
Sbjct: 411 HAVKLTGWGMMKGAKGRKEKFWVAANSWGKSWGEDGYFRILRGVNESDIEKLIIAA 466
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 86/242 (35%), Positives = 116/242 (47%), Gaps = 43/242 (17%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C + WAF SDR I M LS +L++C G GC GG AW Y
Sbjct: 28 QGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRLDGAWWY 86
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---------------- 116
GVVTE+C PY P TP + +C+ +++
Sbjct: 87 LRRRGVVTEDCYPY-----------RPPQQTPAELSRCMMQSRSVGRGKRQATQRCPNTN 135
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
++N + S YR+++ ++IM EI NGPV+ V+EDF Y SG+YKH
Sbjct: 136 NYQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKP 195
Query: 174 ------GGHAVKLIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G H+VK+ GWG DG YWI AN W ++WG +GYF+I RG NEC IE
Sbjct: 196 PHYRKHGTHSVKITGWGEERNFDGTTRKYWIAANSWGKNWGENGYFRIARGENECEIEAF 255
Query: 224 VV 225
V+
Sbjct: 256 VI 257
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 117/224 (52%), Gaps = 21/224 (9%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGY 65
V ++ QG CG CWAF A+ DR C+ G++ + S L++C GCDGG
Sbjct: 93 VTPVMDQGSCGGCWAFSAIGVFGDRRCVA-GIDKEGVPYSQQYLISCS--TENHGCDGGD 149
Query: 66 PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI 125
W + G T EC Y D P C C +Q+ + Y
Sbjct: 150 FWPTWSFLTLTGATTAECVKYIDY---------PNIVASPCPAVCDDGSQI----QLYKA 196
Query: 126 SAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGW 183
Y +++ + + IM + GPV+ VY D ++Y+SGVYKH G + +G HA++++G+
Sbjct: 197 HGYGQVSKNVQAIMHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHALEMVGY 256
Query: 184 GTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
GT+DDG DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 257 GTTDDGTDYWIIRNSWGADWGENGYFRIVRGVNECRIEDEIYAA 300
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 111/224 (49%), Gaps = 21/224 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAF SDR CI N+ +S L+ C C GGY +W++
Sbjct: 108 QGQCGSCWAFATTGVFSDRLCITTNNVSNVVISPEFLIEC--DKTSFACQGGYGYYSWKF 165
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPKCVRKCVKKNQLWRNSKHYSISAYR 129
F++ G+ E C PY + Y +C C + L + + SAY
Sbjct: 166 FMNTGIPLESCVPYTKDS--------LVYGNTTNAQCRSTCTDGSPL---KLYKAASAYY 214
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDD 188
I S + EI NGPVE F VY DF YKSG+Y+ G +GGHAVK++GW + +
Sbjct: 215 IYSPITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGHAVKVLGWASDSN 274
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSN--ECGIEEDVVAGLPS 230
G YWI NQW SWG GYF I RG++ C + ++AG S
Sbjct: 275 GTPYWIAQNQWGTSWGMGGYFYIYRGNSTLNCKFDNYMIAGTVS 318
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 352 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 86/227 (37%), Positives = 115/227 (50%), Gaps = 14/227 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS WA SDRF I ++LS LL+C C+GGY AW Y
Sbjct: 99 QGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSY 157
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V E+C PY ++ C C + R SK+ AYR+ +
Sbjct: 158 IRKIGLVDEQCFPY----SATNEKCRIPRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGN 213
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHAVKLIGWGT--SD 187
+ DIM EI +GPV+ + VY DF YK G+Y+H T D G H+V+++GWG S
Sbjct: 214 ET-DIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSP 272
Query: 188 DG-EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
+G + YW +AN W WG +GYF+I RGSNEC IE V+ +N
Sbjct: 273 EGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGTWAEVEN 319
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 80/234 (34%), Positives = 115/234 (49%), Gaps = 29/234 (12%)
Query: 17 HCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
+C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 NCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWYLR 295
Query: 75 HHGVVTEECDPYFDSTGCSHPGCE---------PAYPTPKCVRKCVKKNQLWRNSKHYSI 125
G+V+ C P F S+ C + T C K N++++ S
Sbjct: 296 KRGLVSHACYPLFKDQNISNNTCAMTSKADGRGKRHATRPCPNNIEKSNRIYQCS----- 350
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHA 177
YR++S+ +IM EI +NGPV+ V+EDF HYK+G+Y+H+ + HA
Sbjct: 351 PPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHA 410
Query: 178 VKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
VKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 411 VKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 464
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 352 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 117/236 (49%), Gaps = 30/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW Y
Sbjct: 237 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWY 295
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ GC A + T C K N++++ S
Sbjct: 296 LRKRGLVSHACYPLFKDQNANN-GCAMASRSDGRGKRHATKPCPNNIEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S +IM EI +NGPV+ V EDF HYK+G+Y+H+T +
Sbjct: 352 --PPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 83/216 (38%), Positives = 114/216 (52%), Gaps = 19/216 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QG CG CWAF A+ DR C G++ +S S L++C L GCDGG W
Sbjct: 99 QGSCGECWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 155
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G T EC Y D G A P P QL++ + +S
Sbjct: 156 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 204
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++++G+GT+DDG
Sbjct: 205 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 263
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 264 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 83/216 (38%), Positives = 114/216 (52%), Gaps = 19/216 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QG CG CWAF A+ DR C G++ +S S L++C L GCDGG W
Sbjct: 99 QGSCGGCWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 155
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G T EC Y D G A P P QL++ + +S
Sbjct: 156 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 204
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++++G+GT+DDG
Sbjct: 205 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 263
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 264 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 136 bits (343), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 83/216 (38%), Positives = 114/216 (52%), Gaps = 19/216 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QG CG CWAF A+ DR C G++ +S S L++C L GCDGG W
Sbjct: 65 QGSCGECWAFSAIGVFGDRRC-AMGIDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWS 121
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G T EC Y D G A P P QL++ + +S
Sbjct: 122 FLTFTGATTAECVKYVDY------GHTVASPCPAVCDDG-SPIQLYKAHGYGQVS----K 170
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGE 190
S P IM + GP++ VY D ++Y+SGVYKH G + +G HA++++G+GT+DDG
Sbjct: 171 SVPA-IMGMLVAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGT 229
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 230 DYWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 265
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 102/172 (59%), Gaps = 17/172 (9%)
Query: 19 GSCWAFGAVEALSDRFCIHFGMNLSLSVN--DLLACCGFLCGDGCDGGYPISAWRYFVHH 76
GSCWAFGA EA+SDR CIH +S+ ++ DLLACC CG GC+GGYP +AW ++
Sbjct: 1 GSCWAFGAAEAISDRLCIHSNGKVSVEISSEDLLACCDS-CGMGCNGGYPSAAWDFWTDV 59
Query: 77 GVVTEE-------CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKH 122
G+V+ C PY G P TP+C+ +C ++ KH
Sbjct: 60 GLVSGGLYDSHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKH 119
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 174
Y S+Y + SD E I +EIYKNGPVE +FTVYEDF YK+GVY+H+TG +G
Sbjct: 120 YGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 80/178 (44%), Positives = 103/178 (57%), Gaps = 16/178 (8%)
Query: 25 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
GAVEA+SDR CIH N SLS DLL+CC CG GCDGG+P AW ++ HG+VT
Sbjct: 1 GAVEAMSDRLCIHSSGAFNKSLSAVDLLSCCK-DCGYGCDGGFPPMAWDFWKTHGIVTGG 59
Query: 81 --EE---CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
EE C PY S G P YPTPKCV+ C ++ K + ++Y
Sbjct: 60 SKEEPAGCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYN 119
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
++ IM EI NGPVE +F V+EDF YKSG+Y H G +GGHA++++GWG +
Sbjct: 120 VHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEEN 177
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/243 (34%), Positives = 117/243 (48%), Gaps = 43/243 (17%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C + WAF SDR I M LS +L++C G GC GG AW +
Sbjct: 225 QGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCTGGRIDGAWWF 283
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---------------- 116
GVVTE+C PY P TP + +C+ +++
Sbjct: 284 LRRRGVVTEDCYPY-----------RPPQQTPAELGRCMMQSRSVGRGKRQATQRCPNTN 332
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
++N + S YR++++ ++IM EI NGPV+ V+EDF YKSG+YKH
Sbjct: 333 NYQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKP 392
Query: 174 ------GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G H+VK+ GWG + YWI AN W ++WG +GYF+I RG NEC IE
Sbjct: 393 PQYRKHGTHSVKITGWGEERNVDGAKRKYWIAANSWGKNWGEEGYFRIARGENECEIEAF 452
Query: 224 VVA 226
V+
Sbjct: 453 VIG 455
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/258 (36%), Positives = 122/258 (47%), Gaps = 21/258 (8%)
Query: 3 FTNSEHVEILVI----QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFL 56
F SEH LV QG CGS WAF SDRF I + L+ +LAC
Sbjct: 191 FDASEHWTGLVAEARDQGWCGSSWAFSTATMASDRFAILSKGREMVQLAPQQMLACVRR- 249
Query: 57 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
GC GG+ +AW+Y GVV EEC PY + + T C VK N
Sbjct: 250 -QQGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCKISNDDTLITANCELP-VKVN-- 305
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-----D 171
R + A+ +N++ DIMAEI G V+ VY DF Y+SG+Y+H +
Sbjct: 306 -RTLMYKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFSYRSGIYRHSAAATPAEE 363
Query: 172 VMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
H+V+LIGWG G D YWI N W + WG +G F+I RGSNEC IE V+A
Sbjct: 364 RSAYHSVRLIGWGEERVGYDVVKYWIAINSWGQWWGENGRFRILRGSNECDIESYVLASN 423
Query: 229 PSSKNLVKEITSADMFED 246
P V+ I ++
Sbjct: 424 PYVHEHVQAIRKVGELQE 441
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/227 (37%), Positives = 112/227 (49%), Gaps = 14/227 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS WA SDRF I ++LS LL+C C+GGY AW Y
Sbjct: 225 QGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSY 283
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V E+C PY ++ C C + R SK+ AYR+ +
Sbjct: 284 IRKIGLVDEQCFPY----SATNEKCRIPRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGN 339
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHAVKLIGWGTSDDG 189
+ DIM EI +GPV+ + VY DF YK G+Y+H T D G H+V+++GWG
Sbjct: 340 E-TDIMYEILHSGPVQATMKVYHDFFTYKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSP 398
Query: 190 E---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
E YW +AN W WG +GYF+I RGSNEC IE V+ +N
Sbjct: 399 EGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGTWAEVEN 445
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 121/240 (50%), Gaps = 14/240 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDG 63
N + + Q C + WA A+SDR+C + G L +S LL+CC CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQ 115
G+P AWRY+V +G+ + C PY C H G + + TP+C C K
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPY-PFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTI 219
Query: 116 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 175
K+ AY + E+ E+Y NGP VY D YKSGVY+++ G MG
Sbjct: 220 PL--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGV 277
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
AVK++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 278 TAVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 115/234 (49%), Gaps = 17/234 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS WA SDRF I + L+ +++C GC GG+ +AW Y
Sbjct: 205 QGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQIVSCVRR--SQGCSGGHLDTAWSY 262
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G V EEC PY + H C+ C ++ R + + A+ +N+
Sbjct: 263 LRKVGTVNEECYPYISA----HNVCKIRPSDTLITANCELPMKVDRTNMYKMGPAFSLNN 318
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-----MGGHAVKLIGWGTSD 187
+ DIM EI K+GPV+ V+ DF YKSG+Y+H G H+V+LIGWG
Sbjct: 319 E-TDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSADQRAGYHSVRLIGWGEER 377
Query: 188 DGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 238
G + YWI N W WG +G F+I RGSNEC IE V+A LP VK++
Sbjct: 378 HGYEVTKYWIAVNSWGTWWGENGRFRILRGSNECEIESYVLASLPYVHQQVKDL 431
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 116/231 (50%), Gaps = 19/231 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WAF +DR I + LS+ +LLAC GC+GG+ AW Y
Sbjct: 205 QGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLLAC-NNRGQQGCNGGHLDRAWNY 263
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA----YPTPKCV------RKCVKKNQLWRNSKH 122
GVV EEC PY C+ T KC RK + ++ R
Sbjct: 264 MRRFGVVNEECYPYISGRTGQVEKCKVPRRGNLATMKCQLVNAAERKSDRSDKPPRKGLF 323
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHAVK 179
S AYRI +DIM EI ++GPV+ + V+ DF Y+ GVY++ + G H+V+
Sbjct: 324 RSPPAYRIAPFEDDIMNEILQHGPVQATMRVHPDFFLYRGGVYRYSGTNSQQRSGYHSVR 383
Query: 180 LIGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
++GWG + YW++AN W R WG DGYF+I RG NE IE+ V+A
Sbjct: 384 IVGWGVDSSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIEKFVLAA 434
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 120/239 (50%), Gaps = 12/239 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDG 63
N + + Q C + WA A+SDR+C + G L +S LL+CC CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
G+P AWRY+V +G+ + C PY + G P + TP+C C K
Sbjct: 161 GFPGFAWRYYVEYGIASSYCQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIP 220
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
K+ AY + E+ E+Y NGP VY D YKSGVY+++ G MG
Sbjct: 221 L--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
AVK++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 118/242 (48%), Gaps = 17/242 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS WA SDRF I + L+ +++C GC GG+ +AW Y
Sbjct: 206 QGWCGSSWALSTASVASDRFAILSKGREIVQLAPQQIISCVRR--SQGCSGGHLDTAWNY 263
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G V +EC PY + C+ C ++ R + + A+ +N+
Sbjct: 264 VRKVGTVNDECYPYISAQN----ACKIRPSDTLITANCDLPTKVDRTNMYKMGPAFSLNN 319
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT----GDVMGG-HAVKLIGWGTSD 187
+ DIM EI K+GPV+ V+ DF YKSG+Y+H GD G H+V+LIGWG
Sbjct: 320 E-TDIMIEIKKHGPVQAILRVHRDFFSYKSGIYRHSAASSAGDERAGYHSVRLIGWGEER 378
Query: 188 DGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMF 244
+G + YW+ N W R WG +G F+I RG NEC IE V+A LP VK +
Sbjct: 379 NGYETTKYWVAVNSWGRWWGENGRFRIVRGQNECEIESYVLASLPYVHQQVKPMRQVGEL 438
Query: 245 ED 246
++
Sbjct: 439 QE 440
>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
Length = 301
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 112/213 (52%), Gaps = 20/213 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN---DLLACCGFLCGDG-CDGGYPISAW 70
Q CGSCWAF AV +DR C +G++ S V+ + C F GDG C+GG+ + W
Sbjct: 97 QASCGSCWAFSAVATFADRRCA-YGLD-SKQVHYSEQYVVSCDF--GDGACNGGWLSNVW 152
Query: 71 RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
++ GV +C YF C+ C + + + I+
Sbjct: 153 KFLTKTGVPKLDCLKYFSGMTGDRE---------SCITHCTDGSPVELYQASHVIN---Y 200
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
D + +M + +GP++V+F VY DF +Y SGVY+H+ G + GGHAV+++G+G + G
Sbjct: 201 GMDLDRMMEALVYDGPLQVAFVVYSDFGYYSSGVYQHVNGMMEGGHAVEMVGYGIDESGL 260
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
YWI+ N W WG GYF+I R NECGIEE
Sbjct: 261 KYWIIRNSWGPDWGEGGYFRIIRRVNECGIEEQ 293
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 294
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ C A + T C K N++++ S
Sbjct: 295 LRKRGLVSHACYPLFKEQSTNNNSCAMASRSDGRGKRHATRPCPNSFEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YRI+S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+ +
Sbjct: 352 --PPYRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRT 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|321476473|gb|EFX87434.1| hypothetical protein DAPPUDRAFT_221708 [Daphnia pulex]
Length = 464
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 86/236 (36%), Positives = 121/236 (51%), Gaps = 31/236 (13%)
Query: 8 HVEILVIQGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGY 65
+V ++ QG CGSC+AF ++ L R + + ++LS D+++C + GC+GG+
Sbjct: 244 YVPVVKNQGSCGSCYAFSSMGMLESRLRVATKNQVQVNLSPQDIVSCSAY--SQGCEGGF 301
Query: 66 P-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
P + A +Y HGVV EEC PY TG C A KC R V +K+
Sbjct: 302 PYLIAGKYAQDHGVVAEECYPY---TG-RDSACSAA---KKCQRSYV--------AKYRY 346
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----------MG 174
+ Y + E + + ++GP+ VSF VY DF HY GVY G +
Sbjct: 347 VGGYYGACNEELMKMSLVESGPLSVSFEVYSDFMHYAGGVYHRTDGLFNKINEFNPFELT 406
Query: 175 GHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAV L+G+GT S E YWI+ N W WG DG+F+I+RG +ECGIE V P
Sbjct: 407 NHAVLLVGYGTDSQTKEKYWIVKNSWGTKWGEDGFFRIRRGVDECGIESIAVEVTP 462
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 118/221 (53%), Gaps = 22/221 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSCWAF + E ++DR CI S +LL CC C C GGY AW Y
Sbjct: 99 QGTCGSCWAFASTEVMTDRLCIGTKGETKFVFSPENLLTCCED-CRLECVGGYTAKAWDY 157
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP---TPKCVRKC--VKKNQLWRNSKHYSISA 127
+++ G+V+ Y S GC P + ++ KCV+ C K + + + KHY S
Sbjct: 158 YINEGIVSG--GDYNSSEGC-QPYSKASFQYAVASKCVKACQNDKYDVKYDDDKHYGDSF 214
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + ++ I EI NGPV +F V+ED +YKSG+ V ++ WGT +
Sbjct: 215 YTLETNVTQIQTEILTNGPVMATFNVFEDIIYYKSGIQL---------SNVSILRWGT-E 264
Query: 188 DGEDYWILANQWNRSWG-ADGYFKIKRGSNECGIEEDVVAG 227
+G YW++AN W WG G+ KIKRG+NEC IE+++ AG
Sbjct: 265 EGVPYWLIANSWGTWWGDLGGFIKIKRGTNECAIEQEMAAG 305
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 60/128 (46%), Positives = 83/128 (64%), Gaps = 2/128 (1%)
Query: 103 TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 161
TPKC++ C + + K Y +Y + I EI NGPVE +FTVYED YK
Sbjct: 41 TPKCIKHCQASYTVAYEQDKSYGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYK 100
Query: 162 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
GVY+H+TG ++GGHA++++GWG +D YW++AN WN WG +G+FKI RGS+ CGIE
Sbjct: 101 DGVYQHVTGKMLGGHAIRILGWGVEND-VPYWLIANSWNTDWGNNGFFKILRGSDHCGIE 159
Query: 222 EDVVAGLP 229
+ AG+P
Sbjct: 160 SQISAGIP 167
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSRGRYTANLSPQNLISCCAKK-RHGCNSGSIDRAWWF 294
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ C A + T C K N++++ S
Sbjct: 295 LRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGG 175
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+ +
Sbjct: 352 --PPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEESEKYRKLRS 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 83/243 (34%), Positives = 117/243 (48%), Gaps = 43/243 (17%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C + WAF SDR I M LS +L++C G GC GG AW Y
Sbjct: 222 QGNCAASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRIDGAWWY 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---------------- 116
GVVTE C PY +P P V +C+ +++
Sbjct: 281 LRRRGVVTENCYPY-----------QPPQQAPAEVGRCMMQSRAVGRGKRQATQRCPNTY 329
Query: 117 -WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-- 173
+ N + S Y+++S+ ++IM EI +NGPV+ V+EDF YK+G+YKH
Sbjct: 330 NYHNDIYQSTPPYKLSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNGIYKHTDVSSTKP 389
Query: 174 ------GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G H+V++ GWG D YWI AN W ++WG +G+F+I RG+NEC IE
Sbjct: 390 PQYRKHGTHSVRITGWGEDKDYDGTPRKYWIAANSWGKNWGENGFFRIARGANECEIEAF 449
Query: 224 VVA 226
V+
Sbjct: 450 VIG 452
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 294
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ C A + T C K N++++ S
Sbjct: 295 LRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+ +
Sbjct: 352 --PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRT 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 117/236 (49%), Gaps = 29/236 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C + WAF +DR I +LS +L++CC GC+ G AW +
Sbjct: 236 QKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWF 294
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPA---------YPTPKCVRKCVKKNQLWRNSKHY 123
G+V+ C P F ++ C A + T C K N++++ S
Sbjct: 295 LRKRGLVSHACYPLFKDQNTTNNICAMASRSDGRGKRHATKPCPNSFEKSNRIYQCS--- 351
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG--------DVMGG 175
YR++S+ +IM EI +NGPV+ V+EDF +YK+G+Y+H+ +
Sbjct: 352 --PPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYKKLRT 409
Query: 176 HAVKLIGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
HAVKL GWGT E +WI AN W +SWG +GYF+I RG NE IE+ ++A
Sbjct: 410 HAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 112/236 (47%), Gaps = 26/236 (11%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYP 66
++ ++ QG CGS WA SDR I +N LS LL+C GC GGY
Sbjct: 211 IDEVIDQGKCGSSWAISTASVASDRLAIQSMGEINPRLSEQHLLSC-NIRGQRGCSGGYL 269
Query: 67 ISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
AW + G V+ C PY + T C AY + +C + V + +
Sbjct: 270 DRAWYHLRRAGAVSRACYPYHSGLDEDTIMQKLRCRVAYGSSQCPERGVTSD------LY 323
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---------GDVM 173
S YRI + DIM EIY+NGPV+ +F V DF Y GVY+++ D
Sbjct: 324 LSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTASQSDSDQA 383
Query: 174 GGHAVKLIGWGTSD----DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
G H+VK++GWG + YW+ N W R+WG G F+I RG NEC IE V+
Sbjct: 384 GWHSVKIVGWGIDRSDWYNPIKYWLCTNSWGRNWGEQGMFRIVRGVNECEIESFVL 439
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 81/201 (40%), Positives = 108/201 (53%), Gaps = 24/201 (11%)
Query: 20 SCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA A E +SDR CI LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G
Sbjct: 1 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKG 60
Query: 78 VVTEECDPYFDSTGCS---HPGCE-----------PA--YPTPKCVRKCVKKN--QLWRN 119
VT Y D TGC +P CE P+ YPT + K + +
Sbjct: 61 YVTG--GSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHK 118
Query: 120 SKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
H+ +I + + I I +G + TV+EDF HY GVY H G +GGHAV
Sbjct: 119 DLHFRTILHTPASKEAAGIPKGIKTHGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAV 178
Query: 179 KLIGWGTSDDGEDYWILANQW 199
K++GWG D+G YW++AN W
Sbjct: 179 KMLGWGV-DNGTPYWLIANSW 198
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 133 bits (334), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 86/241 (35%), Positives = 116/241 (48%), Gaps = 38/241 (15%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M+ +LS +LL+C GC GG AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRISIHSMGHMSPALSPQNLLSC-NTHNQHGCRGGRLDGAWWF 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP---------------TPKCVRKCVKKNQLW 117
G+V+ C P+ + H G PA P T C N ++
Sbjct: 281 LRRRGLVSNNCYPFSEG---DHNGAAPAAPCMMHSRHMGRGKRQATAHCPNSRTHANHIY 337
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----- 172
+ + YR++S +DIM E+ +NGPV+ V+EDF YKSG+YKH +
Sbjct: 338 Q-----ATPPYRLSSHEKDIMKELMENGPVQALLEVHEDFFLYKSGIYKHTPASLGKPER 392
Query: 173 ---MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
G H+VK+ GWG DG+ YW AN W +WG +GYF+I RG+NEC IE VV
Sbjct: 393 YRQHGTHSVKITGWGEEIQPDGQKVKYWTAANSWGPTWGENGYFRIVRGANECDIESFVV 452
Query: 226 A 226
Sbjct: 453 G 453
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 133 bits (334), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 121/239 (50%), Gaps = 12/239 (5%)
Query: 5 NSEHVEILVIQGHCGSCWAFGAVEALSDRFC-IHFGMNLSLSVNDLLACCGFLCGDGCDG 63
N + + Q C + WA A+SDR+C + G L +S LL+CC CG GC G
Sbjct: 102 NCPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCK-QCGGGCKG 160
Query: 64 GYPISAWRYFVHHGVVTEECDPY-------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 116
G+P AW Y+V +G+ + C PY + G P + + TPKC C K+
Sbjct: 161 GFPGFAWLYYVEYGIASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIP 220
Query: 117 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 176
K+ + Y + ED E+Y NGP F VY D YKSGVY+++ GD +GG
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQ 278
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 235
AV+++GWG +G YW +AN W+ WG +GY I G+NEC IE G P L
Sbjct: 279 AVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFTGFPDPSQLT 336
>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
Length = 257
Score = 133 bits (334), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 109/221 (49%), Gaps = 16/221 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVN-DLLACCGFLCGDGCDGGYPI 67
+ ++ Q CGSCWAF A E LSDR CI + ++ L C GC+GG P
Sbjct: 45 IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQALVSCDIFGNQGCNGGIPQ 104
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
AW Y HG+ T C PY G CV+ N+ + + ++
Sbjct: 105 LAWEYMELHGIPTYGCFPYTSGNGTDG----------SCVKNSCVDNEQYTLYRAKPLT- 153
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-DVMGGHAVKLIGWGTS 186
+ + E I +I K GP++ + VY DF Y SGVY G ++GGHA+K++GWG
Sbjct: 154 LKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGSSLLGGHAIKIVGWGFD 213
Query: 187 D-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
++YWI+AN W SWG DG+F I ++CGI D A
Sbjct: 214 QASNQNYWIVANSWGPSWGIDGFFWIAF--DQCGINSDACA 252
>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
Length = 568
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 79/232 (34%), Positives = 124/232 (53%), Gaps = 28/232 (12%)
Query: 8 HVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY 65
+V + Q +CGSC+AF ++ L R I + LS ++++C + GC+GG+
Sbjct: 351 YVSPVRNQANCGSCYAFASLGMLESRIRIKTNNSQVPVLSPQEIVSCSEY--SQGCEGGF 408
Query: 66 P-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
P + +Y G+V EEC PY AY +P +KC + + S+++
Sbjct: 409 PYLIGGKYAQDFGLVEEECFPY------------QAYDSPCTPKKCSR----YYTSEYHY 452
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAV 178
+ + + + E+ +NGP+ V+F VY+DF HY++G+Y H + HAV
Sbjct: 453 VGGFYGGCNEALMKHELIQNGPLTVAFEVYDDFIHYRTGIYHHTGLRDNFNPFELTNHAV 512
Query: 179 KLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
L+G+GT + GEDYWI+ N W SWG +GYF+I RG++EC IE VA P
Sbjct: 513 LLVGYGTDEKTGEDYWIVKNSWGTSWGENGYFRILRGTDECAIESIAVAATP 564
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/234 (35%), Positives = 113/234 (48%), Gaps = 30/234 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFL-------CGDGCDGGY 65
Q CGSCWA L+DR CI N+ LS L+ C G C +GC GG+
Sbjct: 66 QQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDCDGSCVSDGVSGCNNGCKGGF 125
Query: 66 PISAWRYFVHHGVVTEECDPYFDSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
A ++ G+V++EC Y S S P C+ P N+ Y
Sbjct: 126 VGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSPI--------------SNTTIYK 171
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
++ R +D EI NGPV +F +Y DF +K VY + + HAV+++GWG
Sbjct: 172 ATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWG 231
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV------AGLPSSK 232
T+ DG DYWI AN W WG GYFKI+RGS+E EE + A +P+S+
Sbjct: 232 TTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTADTASVPTSQ 285
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 83/216 (38%), Positives = 121/216 (56%), Gaps = 23/216 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CGSCWAF + E ++DR CI + S +LL CC GGY +AW Y
Sbjct: 99 QGNCGSCWAFASTEVMTDRLCISSKGKIKFVFSPENLLTCCKDCGCGC-KGGYIKNAWDY 157
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+++ G+ + Y S GC P E ++ + +CVK Y + +
Sbjct: 158 YINEGIAS--GGDYNSSEGC-QPYSESSFQYAE-ASECVK--------------FYTLET 199
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+ I EI NGPV + V+EDFA +KSGVY + +G +G H+VK+IGWGT ++G Y
Sbjct: 200 NVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWGT-EEGIPY 258
Query: 193 WILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 227
W++AN W WG G+FK++RG+NEC IE+++ AG
Sbjct: 259 WLIANSWGSEWGELGGFFKMRRGTNECWIEQEMTAG 294
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 111/224 (49%), Gaps = 30/224 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIH--------FGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
QG+CG+CWAF A A DR C+ + ++S +DL GC GG
Sbjct: 124 QGYCGACWAFSATGAFGDRRCMQWLDPVGVPYSQQYTVSCDDLDL--------GCAGGTS 175
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
+ W + HG T EC Y D+ C PA + VK + S + +
Sbjct: 176 FNVWTFLTEHGTTTLECVRYTDADKDLSSPC-PALCDDGSEIQLVKADGCLDYSGNVTA- 233
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 186
IM + +GPV+ +VY DF +Y+ GVYKH+ G + HAV++IG+GT+
Sbjct: 234 ----------IMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISSHAVEIIGYGTT 283
Query: 187 DDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
DD E YWI+ N +WG +GYF I RGSNEC IE V +GL
Sbjct: 284 DDEERIPYWIVKNSLGPNWGEEGYFNIVRGSNECDIESAVYSGL 327
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 115/222 (51%), Gaps = 15/222 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C S WA +DR + N++LS L+C GC+GGY AW Y
Sbjct: 100 QGDCASSWAQSTAATSADRLALITEGRQNVALSAQQFLSCNQHR-QKGCEGGYLDRAWWY 158
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRIN 131
GVV+EEC PY T C R+C + NS+ Y + +YR++
Sbjct: 159 IRKFGVVSEECYPYISGTTRKPEICYMQKSKHANGRQCPSGHP---NSRVYRTTPSYRVS 215
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG---DVMGGHAVKLIGWG---T 185
S +DIM+EI NGPV+ +F V+ DF + +GVYKH+ ++ G H+V+L+GWG +
Sbjct: 216 SREQDIMSEILTNGPVQATFRVHGDF--FIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYS 273
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ YWI AN W +WG +G F+I RG N C IE V+
Sbjct: 274 TGIPVKYWIAANSWGTNWGENGTFRILRGENHCEIESFVIGA 315
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/211 (38%), Positives = 106/211 (50%), Gaps = 27/211 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFG EA +DR CI + LS ++ AC F GC GG P SAW +
Sbjct: 2 QSACGSCWAFGVTEAFNDRLCIKSDGAFTELLSAGEMNACTLFF---GCGGGDPYSAWSW 58
Query: 73 FVHHGVVT-------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 119
G+ T + C PY D C+H + YP KC + +
Sbjct: 59 VHDKGIATGGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYP--KCPKVSCSGDD---- 111
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
+H+ + + + D I +GPV SFTVYEDF Y+SGVYKH +G +GGHAVK
Sbjct: 112 -RHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVK 170
Query: 180 LIGWGTSDDGEDYWILANQWNRSWGADGYFK 210
+IGWG G+ YW+ N WN WG G F+
Sbjct: 171 IIGWGEK-SGQAYWLAVNSWNEDWGDHGLFR 200
>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 305
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/214 (36%), Positives = 117/214 (54%), Gaps = 19/214 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
Q C C+AF + ALS R CI +SLSV +++C G+ GC GG S+W
Sbjct: 101 QKECSCCYAFATIGALSTRRCIAKLDSQAVSLSVQHMVSCDN---GEAGCLGGEFESSWA 157
Query: 72 YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
+ GVV +C PY TG S +C C + L ++ HY ++
Sbjct: 158 FLETEGVVKSDCLPYTSGETGNSG----------ECPMMC-QDGTLVEDAFHYKAASASP 206
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
++ +IM + +GPV+ F V+EDF +Y G+Y + G +GGHAV ++G+G+ +D
Sbjct: 207 LNNYNEIMVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGSSLGGHAVLIVGYGSMND-H 265
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
DYWI+ N W WG +GYF+I RG+NECGIE++
Sbjct: 266 DYWIVRNSWGPDWGENGYFRILRGTNECGIEKNA 299
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 109/226 (48%), Gaps = 31/226 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG----DGCDGGYPIS 68
QG C +CWA AV +DR CI G ++ LS+ L +CC G +GC G
Sbjct: 168 QGECNNCWASAAVGMFNDRVCIKSGGRITDILSLGYLTSCCNRANGCPKSNGCMFGSVPE 227
Query: 69 AWRYFVHHGVVT-------EE------CDPYFDSTGCSH-PGCEPAYPT-------PKCV 107
+ +HG+VT EE C PY C+H PG E YP P C
Sbjct: 228 GLNFMKNHGLVTGGEYKPPEELGNDDGCWPY-PFPKCNHVPGLESKYPRCAQVRDLPACA 286
Query: 108 RKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
C K + H + S R+ PE I EI+ NGPV T+YEDF YKSGVY
Sbjct: 287 TTCPNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEIFDNGPVAAMMTLYEDFRFYKSGVY 346
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 211
H TG ++ H +KLIGWG + G++YW+ N WN WG G K+
Sbjct: 347 VHKTGQMLAAHTLKLIGWGV-ESGQEYWLAVNAWNEEWGDHGMIKL 391
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 112/239 (46%), Gaps = 40/239 (16%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG C S W+ +DR I +N+ LS LL+C GC+GGY AW Y
Sbjct: 204 QGDCASSWSHSTTATSADRLSIITDGRVNIPLSAQQLLSCNQHR-QRGCEGGYLDRAWWY 262
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH---------- 122
GVV+E C PY +S PG +C +R H
Sbjct: 263 IRKLGVVSELCYPY-ESGATQQPG------------ECRIPKSAYRTGAHIDCPSGAADP 309
Query: 123 --YSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI--------TGD 171
Y ++ YR++S +DIM EI NGPV+ +F VYEDF Y GVY+H+
Sbjct: 310 SVYRMTPPYRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGGVYQHLDLHEHKEEERK 369
Query: 172 VMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
V G H+V++IGWG ++ YW+ AN W WG DG F+I RG N C IE V+
Sbjct: 370 VQGYHSVRIIGWGEDYSTGPQVKYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGA 428
>gi|290984292|ref|XP_002674861.1| cathepsin C [Naegleria gruberi]
gi|284088454|gb|EFC42117.1| cathepsin C [Naegleria gruberi]
Length = 569
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 117/233 (50%), Gaps = 32/233 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSC+AF AV A+ R I N+ L+V D+++C + C GG P + R+
Sbjct: 343 QMACGSCYAFAAVTAIESRIRIQSRNNVREPLAVQDIVSCSPY--AQKCHGGIPYAVGRH 400
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+V E C PY S + C KC + + +K+ +S Y S
Sbjct: 401 LRDFNLVPESCFPYKGSENVA------------CSSKCKNPEYIVKVTKYRYVSDYYGGS 448
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH-----------ITGDVMG----GHA 177
+ ++M EIY++GP+ S+ +Y DF +Y G+YKH I ++ G H+
Sbjct: 449 NYANMMKEIYEHGPISASYLIYPDFKYYSKGIYKHSGKGYPMKTDRINREMNGWEPTTHS 508
Query: 178 VKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V + GWG GE YW + N W+ SWG +G F+IKRG++EC IE + VA P
Sbjct: 509 VVITGWGEDPKTGEKYWNVLNSWSESWGENGRFRIKRGNDECAIEAEGVAFYP 561
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/179 (41%), Positives = 101/179 (56%), Gaps = 7/179 (3%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QGHCGSCWA + E L DRFCIH + LS D+ +C GC+GG+ +A+ Y
Sbjct: 92 QGHCGSCWAMCSFEVLQDRFCIHSNGSEKPWLSGQDITSCDSR--SHGCNGGWTETAFEY 149
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRIN 131
GV TEEC PY C HPGC ++ TP C ++C + +S ++Y+ +Y I
Sbjct: 150 AKKAGVPTEECVPYLMGK-CHHPGCS-SWQTPTCKKECSSLSNYNYSSNRYYASKSYSIQ 207
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
+ E I E+ +NGPV FT Y+D A Y GVY H+ G G HA+K++GWG + E
Sbjct: 208 RNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYNHVMGSEQGLHAIKIVGWGVWRESE 266
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 28/46 (60%)
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
++G YWI+ N W +G DG IKRG NECGIE DV G+P
Sbjct: 319 NKEEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGIPK 364
>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
Length = 804
Score = 130 bits (328), Expect = 3e-28, Method: Composition-based stats.
Identities = 86/234 (36%), Positives = 127/234 (54%), Gaps = 19/234 (8%)
Query: 3 FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
FT H I +I QG CG C+A AVE ++ R C+ + +S+ DL+ C +L
Sbjct: 64 FTYRGHRCIQIIDQGSCGCCYAAAAVEMVTARRCLQLNDSRLVSLEDLVTCDHTKYLNIQ 123
Query: 58 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
+GC GG P+++ ++ G+V + C+ Y++ T +P YPT C C K
Sbjct: 124 NNGCRGGNPLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKHITGDVMGG- 175
R K+ + YR+ S + +M +IY+NGP+ VS + DF + K G+Y +GG
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGG 232
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAV ++GWG ++G YW AN + +WG GYFKIKRGSNE IE + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 82/224 (36%), Positives = 113/224 (50%), Gaps = 11/224 (4%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHF-GMNLSLSVNDLLACCGFLCGDGCDGGYPI 67
+ ++ QG CGS WA SDRF I G + +L C GC GG+
Sbjct: 200 ISPVLDQGWCGSDWAVTIATVASDRFAIQSNGAERMVLSPQVLLSCNIRRQQGCRGGHID 259
Query: 68 SAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
AW + HG+V EEC PY +T P P + + R S+ Y +
Sbjct: 260 VAWNFARGHGLVDEECFPYKAATTSC-----PFRPKANLIEDGCRPPVRQRTSR-YKVGP 313
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GD--VMGGHAVKLIGWG 184
+ DIM +I ++GPV TV++DF HY G+Y+ GD + G H+V+++GWG
Sbjct: 314 PGKLATENDIMYDIMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWG 373
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
D G+ YW++AN W WG +GYF+I RGSNE GIE VV L
Sbjct: 374 -EDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVL 416
>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
Length = 804
Score = 130 bits (328), Expect = 4e-28, Method: Composition-based stats.
Identities = 86/234 (36%), Positives = 128/234 (54%), Gaps = 19/234 (8%)
Query: 3 FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
FT H I +I QG CG C+A AVE ++ R C+ F + +S+ DL+ C +L
Sbjct: 64 FTYRGHRCIQIINQGSCGCCYAAAAVEMVTARRCLQFNDSKLVSLEDLVTCDHTKYLNIQ 123
Query: 58 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
+GC GG +++ ++ G+V + C+ Y++ T +P YPT C C K+
Sbjct: 124 NNGCRGGNSLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKHPKD 175
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKH-ITGDVMGG 175
R K+ + YR+ S + +M +IY+NGP+ VS + DF + K G+Y + GG
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGG 232
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAV ++GWG ++G YW AN + +WG GYFKIKRGSNE IE + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 98/177 (55%), Gaps = 19/177 (10%)
Query: 20 SCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA + A+SDR CI + +S D+++CC + CG GCDGG+PI AW++F G
Sbjct: 1 SCWAVSSAAAMSDRICIASKGVKQVLISAQDMVSCCSY-CGYGCDGGWPIKAWQFFAREG 59
Query: 78 VVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKK-NQLWRNSKH 122
VVT C PY + T C H G EP Y TP+C RKC ++ K
Sbjct: 60 VVTGGNYGRQGCCRPY-EITPCGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKR 118
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 179
Y AY++ + + I EI +GPV +TVYEDF++Y G+YKH G GGHAVK
Sbjct: 119 YGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVK 175
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 116/232 (50%), Gaps = 22/232 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M +LS +LL+C GC GG AW +
Sbjct: 221 QGNCAGSWAFSTAAVASDRISIHSMGHMTPALSPQNLLSC-DTHNQKGCRGGRLDGAWWF 279
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCV---KKNQLWRNSKHYSI 125
G+V+ C P+ D+T + P + + R+ ++ N + +
Sbjct: 280 LRRRGLVSNHCYPFSAGNRDATAPAAPCMMHSRSMGRGKRQATAHCPNSRAHANHIYQAT 339
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR++SD +DIM E+ +NGPV+ V+EDF YKSG+YKH + G H+
Sbjct: 340 PPYRLSSDEKDIMKELMENGPVQALMEVHEDFFLYKSGIYKHTPASLGKPARYRQHGTHS 399
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
VK+ GWG DG+ YW AN W +WG G+F+I RG+NEC IE VV
Sbjct: 400 VKITGWGEERQPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIESFVV 451
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 80/177 (45%), Positives = 97/177 (54%), Gaps = 20/177 (11%)
Query: 25 GAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE 82
GAVEA+SDR CIH N SLS DLL+CC CG GC GGYP AW Y+ HG+VT
Sbjct: 1 GAVEAMSDRLCIHSNGAFNKSLSAVDLLSCCEN-CGFGCRGGYPAVAWDYWKTHGIVTGG 59
Query: 83 CDPYFDSTGCSH---PGCE------------PAYPTPKCVRKCVKKNQLWRNSKHYSISA 127
D +GC P CE YPTP+CV++C + + K + +
Sbjct: 60 SKE--DPSGCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMS 117
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
Y I + IM EI GPVE FT+YEDF Y SGVY H G M GHAV+++GWG
Sbjct: 118 YNIYASEISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWG 174
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 111/233 (47%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M SLS +LL+C GC+GG AW +
Sbjct: 77 QGNCAGSWAFSTAAVASDRISIHSKGHMTPSLSPQNLLSC-NTRHQQGCNGGRLDRAWSF 135
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV-------KKNQLWRNSKHYSI 125
G+V+++C P + P + P + R+ + + N + S
Sbjct: 136 LRRRGLVSDKCYPLASQNSIAEPCRMYSRPMGRGKRQATGPCPNNFHHSNDYSNDIYQST 195
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHA 177
YR++S+ +DIM EI +NGPV+ V+EDF YK G+Y+H G H+
Sbjct: 196 PPYRLSSNEKDIMKEIMENGPVQALMEVHEDFFLYKDGIYRHTPASNGKPPQFRRQGTHS 255
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG +G +W AN W +WG G F+I RG NEC IE VV
Sbjct: 256 VKITGWGEELQPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDIESFVVG 308
>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 309
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/223 (35%), Positives = 114/223 (51%), Gaps = 22/223 (9%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDG--GYP 66
++ G C S WA VEA R C++ G++ S +L+C +GC G
Sbjct: 92 VIDMGTCSSSWAHSPVEAFGHRRCMN-GVDQEATRYSAQYILSCA---TTNGCLAFPGQG 147
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
+ +W + G+ E C Y D + E +YP P C + L Y
Sbjct: 148 VVSWDFIATTGIPLESCVKYTD-----YDKTESSYPCPSL---CNDNSSL----VLYKSD 195
Query: 127 AYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
Y + +PE + I GP++ FTVYEDFA+Y G+Y H+ G G +V+++G+GT
Sbjct: 196 GYEGVGFNPEKLRRAIALRGPMQAMFTVYEDFAYYLEGIYSHVYGGTAGYLSVEIVGYGT 255
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
SD+G+DYWI+ N W +WG DGYF+I RG NEC IEE V +
Sbjct: 256 SDEGQDYWIVKNYWGSNWGEDGYFRIVRGQNECQIEEAVYGAI 298
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 88/237 (37%), Positives = 112/237 (47%), Gaps = 30/237 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQNLLSCDTHH-QQGCQGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP-----------TPKCVRKCVKKNQLWRNSK 121
GVV++ C P+ +G PA P + R+C + N
Sbjct: 282 LRRRGVVSDHCYPF---SGHEQAEAGPATPCMMHSRAMGRGKRQATRRCPNSHDD-ANEI 337
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------M 173
+ AYR+ SD ++IM E+ +NGPV+ VYEDF YKSG+Y H +
Sbjct: 338 YQVTPAYRLGSDEKEIMKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRH 397
Query: 174 GGHAVKLIGWGTS--DDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG DG YW AN W SWG GYF+I RGSNEC IE V+
Sbjct: 398 GTHSVKITGWGEEMLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLG 454
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 82/221 (37%), Positives = 115/221 (52%), Gaps = 15/221 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA + SDRF I ++ LS LL+C GC GGY AW +
Sbjct: 223 QGWCGASWAVSTADVASDRFAIMSKGAEDVELSAQHLLSC-NNRGQQGCRGGYLDRAWLF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V +EC P+ TG + C + V C K R + AYR+ +
Sbjct: 282 MRKFGLVDKECYPW---TG-RNDQCRLRKRSNLNVAGCRKPPNPLRQELYKVGPAYRLGN 337
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
+ DIM EI +GPV+ + VY+DF YK+GVY+H + G H++++IGWG
Sbjct: 338 E-TDIMQEILTSGPVQATMRVYQDFFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSY 396
Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
YW++AN W R WG +G F+I+RG+NEC IE V+A
Sbjct: 397 RGPPLKYWLVANSWGRHWGENGLFRIQRGTNECEIESYVLA 437
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 114/237 (48%), Gaps = 30/237 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C L GC GG+ AW +
Sbjct: 224 QGNCAGSWAFSTAAVASDRVSIHSMGHMTPLLSPQNLLSC-DTLHQQGCRGGHLDGAWWF 282
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYP-----------TPKCVRKCVKKNQLWRNSK 121
GVV++ C P+ +G PA P + R+C + N
Sbjct: 283 LRRRGVVSDHCYPF---SGREQAEAGPAPPCMMHSRAMGRGKRQATRRCPNSHTD-ANDI 338
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-------- 173
+ AYR+ SD ++IM E+ +NGPV+ V+EDF YK G+Y H +
Sbjct: 339 YQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPLSMARPEQYRRH 398
Query: 174 GGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W SWG G+F+I RGSNEC IE V+
Sbjct: 399 GTHSVKITGWGEETLPDGRTLKYWTAANSWGPSWGERGHFRILRGSNECDIESFVLG 455
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/176 (42%), Positives = 103/176 (58%), Gaps = 18/176 (10%)
Query: 25 GAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
GAVEA++DR CIH + +S DLL+CC CG GC GG+P AW +++ +G+VT
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISATDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGG 59
Query: 81 -----EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
C Y CSH P C + + TP CV C K + + K ++ S+Y
Sbjct: 60 SKENPSGCRSY-PFPRCSHHGKGKYPPCPKTIFDTPNCVDHCDKPDIDYAADKTHAKSSY 118
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
+ S+ IM EI +NGPVE +F VYEDF YKSG+Y H G ++GGHA++++GWG
Sbjct: 119 NVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWG 174
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 117/239 (48%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG+ SAW +
Sbjct: 102 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWF 160
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P F G + G P P+C+ R+ + +Q+ N
Sbjct: 161 LRRRGVVSDHCYP-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHAN 214
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ S ++IM E+ +NGPV+ V+EDF Y++G+Y H +
Sbjct: 215 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYR 274
Query: 173 -MGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 275 RHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 333
>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
Length = 183
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/185 (41%), Positives = 106/185 (57%), Gaps = 24/185 (12%)
Query: 26 AVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
AV ++SDR CIH N + LS DLL+CC CG GC GG+ AW Y+ +G+VT
Sbjct: 1 AVTSMSDRVCIHSNQNKTNVQLSARDLLSCC-TSCGFGCVGGWIGDAWDYWRDNGIVTGG 59
Query: 81 -----EECDPY-------FDSTGCS---HPGCEPAYPTPKCVRKCVKKNQ-LWRNSKHYS 124
C PY S G +P + YPTP CV KC + + K ++
Sbjct: 60 DYQDKSTCLPYPFPPSHHLVSKGTPFEIYP--QTLYPTPPCVSKCQEGYPGEYEKDKIFA 117
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 184
+S+Y+I+ + +I EI NGPVE VY DF +YK+GVY+H TG+++GGHA++L+GWG
Sbjct: 118 LSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKTGVYQHTTGEILGGHAIRLLGWG 177
Query: 185 TSDDG 189
+ DG
Sbjct: 178 KTKDG 182
>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
Length = 218
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/235 (32%), Positives = 115/235 (48%), Gaps = 33/235 (14%)
Query: 7 EHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGG 64
+ + ++ + CG CWAF E +SDRFC+ +N LS L++C GC G
Sbjct: 2 KQLSLIRDEQQCG-CWAFVVAEVVSDRFCVSSKTKVNEVLSPQYLISCDS--NNGGCSYG 58
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
Y +A+++ + G+VTE C P+ G P C +KC+ N
Sbjct: 59 YFDTAFQFVENQGIVTENCFPFVSGEGNY---------IPPCPKKCLAYNPF-------- 101
Query: 125 ISAYRINSD----PEDIMA---EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 177
+ +++N+ P+DI I G + S +Y DF Y+ GVY+H+ G+ M H+
Sbjct: 102 -TLFKVNNSRAFLPQDIQGMQLSIMNGGSLAASLDIYRDFVQYRGGVYRHLVGNYMFTHS 160
Query: 178 VKLIGWGTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
V+++GWG + + YWI N W WG G+F I RGSNEC IE DV P
Sbjct: 161 VRIVGWGITSPQQGSIPYWICGNNWTEEWGMQGWFWILRGSNECNIELDVWETTP 215
>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
Length = 150
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 93/181 (51%), Gaps = 38/181 (20%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +CGSCWAFGA E +SDR CI +S D+L CCG CG GCDG
Sbjct: 5 QTNCGSCWAFGAAEVISDRICIVTKGARQPIISPTDMLDCCGEYCGYGCDGC-------- 56
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 131
P TPKC C K N + K++ SAY +
Sbjct: 57 --------------------------PKAVTPKCALSCQSKYNTEYAKDKNFGSSAYYVG 90
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
+ I EI NGPVE SFTVYEDF YK GVY++ G+V+GGHA+K+IGWGT ++G D
Sbjct: 91 RNFSVIQTEIMTNGPVEASFTVYEDFYIYKKGVYQYTAGEVLGGHAIKIIGWGT-ENGTD 149
Query: 192 Y 192
Y
Sbjct: 150 Y 150
>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
Length = 466
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/236 (34%), Positives = 120/236 (50%), Gaps = 26/236 (11%)
Query: 4 TNSEHVEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGC 61
+N +V + QG CGSC+AF ++ R + + +S D+++C + GC
Sbjct: 245 SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQDVVSCSEY--AQGC 302
Query: 62 DGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 120
GG+P + A +Y G+V E C PY G P E KC R +
Sbjct: 303 AGGFPYLIAGKYGEDFGLVEESCFPY---NGKDEPCKETK---SKCRRHST--------T 348
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDV-----MG 174
+Y + + + +M E+ KNGP+ +SF VY DF HYK G+Y+H GD +
Sbjct: 349 NYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKGGIYQHTGLGDSYNPWQIT 408
Query: 175 GHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAV L+G+GT G+DYWI+ N W WG +G+F+I RG +EC IE + VA P
Sbjct: 409 NHAVLLVGYGTDQKSGKDYWIVKNSWGTKWGENGFFRILRGVDECSIENEAVAVTP 464
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 86/143 (60%), Gaps = 14/143 (9%)
Query: 97 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI-MAEIYKNGPVEVSFTVYE 155
CEP Y ++ KHY S+Y ++ KNGPVE +FTVY
Sbjct: 5 CEPGYSPS------------YKEDKHYGCSSYSVSRGARRRSWQRSSKNGPVEAAFTVYS 52
Query: 156 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 215
DF YKSGVY+H+ GD+MGGHAV+++GWG ++G YW++ N WN WG +G+FKI RG
Sbjct: 53 DFLQYKSGVYQHVAGDMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQ 111
Query: 216 NECGIEEDVVAGLPSSKNLVKEI 238
+ CGIE ++VAG+P + K I
Sbjct: 112 DHCGIESEIVAGIPCTDQYWKRI 134
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 105/218 (48%), Gaps = 21/218 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QG CG CWAF A+ R C G++ + S L++C GC GG W
Sbjct: 99 QGSCGGCWAFSAIGMFGSRRC-AVGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWS 155
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RI 130
+ G T EC Y D C PT C +Q+ + Y Y ++
Sbjct: 156 FLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQV 202
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDG 189
+ IM + GPV+ VY D +Y GVY+H G + G HA++++G+GT+DDG
Sbjct: 203 SKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDG 262
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
DYW + N W WG DGYF+I RG NEC IE+++ A
Sbjct: 263 TDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|417401357|gb|JAA47568.1| Putative dipeptidyl peptidase 1 [Desmodus rotundus]
Length = 463
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/231 (35%), Positives = 118/231 (51%), Gaps = 33/231 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + LS ++++C + GCDGG+P + A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNTQTPILSPQEVVSCSQY--AQGCDGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C+ K +R S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CMLKEDCFRYYTSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIG 182
+ + E+ NGP+ V+F VY DF HY+ G+Y H TG + HAV L+G
Sbjct: 353 GGCNEALMKLELVHNGPMAVAFEVYNDFLHYQEGIYHH-TGLTDPFNPFELTNHAVLLVG 411
Query: 183 WGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+GT G DYWI+ N W +WG DGYF+I+RG++EC IE VA P K
Sbjct: 412 YGTDPATGMDYWIVKNSWGTAWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/223 (39%), Positives = 119/223 (53%), Gaps = 22/223 (9%)
Query: 23 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
A + ++DR CI + LS +L +CC CG GC+GG+P+ A++Y+ GV T
Sbjct: 124 AMSSASVMTDRTCIAYKGEQQPFLSDEELTSCCT-SCGYGCNGGFPLLAFKYWNEIGVPT 182
Query: 81 EECDPYFDSTGCSHPGCEP------AYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINS 132
PY +GC P A TP C KC+ K +L ++ ++Y S Y I S
Sbjct: 183 G--GPYGSKSGCKPFSIAPPTSSSTAAQTPLCQLKCISDYKRKLDKD-RYYGESYYLITS 239
Query: 133 DPE---DIMAEIYKNGPVEVSFTVYEDFAHYKSGVY---KHITGDVMGGHAVKLIGWGTS 186
+ I EI +GPV + ++E F +YKSGVY K +G HAVKLIGWG
Sbjct: 240 SNQPVKTIQREIMDHGPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAVKLIGWG-E 298
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE-DVVAGL 228
YW++ N WN ++G G FKI+RG+NECGIE V AGL
Sbjct: 299 QKRIPYWLVVNSWNTTFGEQGLFKIRRGTNECGIENLHVTAGL 341
>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain 1; AltName: Full=Dipeptidyl peptidase I
heavy chain 1; Contains: RecName: Full=Dipeptidyl
peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
peptidase I heavy chain 2; Contains: RecName:
Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
Full=Dipeptidyl peptidase I heavy chain 3; Contains:
RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
AltName: Full=Dipeptidyl peptidase I heavy chain 4;
Contains: RecName: Full=Dipeptidyl peptidase 1 light
chain; AltName: Full=Dipeptidyl peptidase I light chain;
Flags: Precursor
gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
Length = 435
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 118/228 (51%), Gaps = 28/228 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC+AF + L R I + LS ++++C + GC+GG+P + A +
Sbjct: 225 QASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGK 282
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY G P C+P C R + +S++Y + +
Sbjct: 283 YAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR--------YYSSEYYYVGGFYGA 326
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ ++GP+ V+F VY+DF HY+ G+Y H + HAV L+G+GT
Sbjct: 327 CNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGT 386
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 387 DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 434
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 115/232 (49%), Gaps = 16/232 (6%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ +V QG CGS WA SDR I +N SLS LL+C GC+GGY
Sbjct: 212 INPVVDQGDCGSSWAVSTTGISSDRLAIISEGRINASLSSQQLLSCNQHR-QKGCEGGYL 270
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
AW Y GVV + C PY S PG R+ ++ ++S + ++
Sbjct: 271 DRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYTDRRGLRCPSGSQDSTAFKMT 329
Query: 127 A-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMGGHA 177
Y+++S EDI E+ NGPV+ +F V+EDF Y GVY+H + G H+
Sbjct: 330 PPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHS 389
Query: 178 VKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
V+++GWG ++ YW+ AN W WG DGYFKI RG N C IE V+
Sbjct: 390 VRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVIG 441
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 105/218 (48%), Gaps = 21/218 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QG CG CWAF A+ R C G++ + S L++C GC GG W
Sbjct: 99 QGSCGGCWAFSAIGMFGSRRC-AVGIDKAAVLYSQQHLISCS--TENFGCSGGDFFPTWS 155
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY-RI 130
+ G T EC Y D C PT C +Q+ + Y Y ++
Sbjct: 156 FLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI----QFYKAHGYGQL 202
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDG 189
+ IM + GPV+ VY D +Y GVY+H G + G HA++++G+GT+DDG
Sbjct: 203 SKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHALEMVGYGTTDDG 262
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
DYW + N W WG DGYF+I RG NEC IE+++ A
Sbjct: 263 TDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
Length = 807
Score = 128 bits (321), Expect = 3e-27, Method: Composition-based stats.
Identities = 87/247 (35%), Positives = 132/247 (53%), Gaps = 23/247 (9%)
Query: 3 FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
FT +H + +I QG CG C+A VE ++ R C+ F + +S+ DL+ C +L
Sbjct: 64 FTYRDHKCVQIINQGSCGCCYAAATVEMVTARRCLQFNDSKLVSLEDLVTCDHTKYLNVQ 123
Query: 58 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
+GC GG +++ ++ G+V + C+ Y++ T +P YPT C C K
Sbjct: 124 NNGCRGGNALASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA-HYKSGVYKHITG---DVM 173
R K+ + YR+ S + +M +IY+NGP+ VS + DF K +Y ++G +
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPPKDKKSIY--VSGPNTKLS 230
Query: 174 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 233
GGHAV ++GWG ++G YW AN + +WG GYF+IKRGSNE IE A LP + N
Sbjct: 231 GGHAVMIVGWG-EENGVPYWDCANTYGTNWGDHGYFRIKRGSNELKIETWPGAALPIASN 289
Query: 234 LVKEITS 240
E S
Sbjct: 290 SQPETPS 296
>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 305
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 115/214 (53%), Gaps = 19/214 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
Q C C+AF + ALS R CI +SLSV +++C G+ GC GG S+W
Sbjct: 101 QKECSCCYAFATLGALSTRRCIAKLDPQAVSLSVQHMVSCDS---GEAGCQGGEFESSWA 157
Query: 72 YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
+ G V +C PY TG S +C C + + + SA R+
Sbjct: 158 FLETEGAVKSDCLPYTSGETGKSG----------ECPTTCQDGTPVESAFHYKAASASRL 207
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
S+ +IM + +GPV+ F V+EDF +Y G+Y + G +GGHAV ++G+G+ ++
Sbjct: 208 -SNYNEIMVSLLADGPVQTGFYVHEDFLYYVGGIYHKVYGTSLGGHAVLIVGYGSMNN-H 265
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
DYWI+ N W WG +GYF+I RG+NECGIE++
Sbjct: 266 DYWIVRNSWGSDWGENGYFRILRGTNECGIEKNA 299
>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
Length = 459
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 118/228 (51%), Gaps = 28/228 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC+AF + L R I + LS ++++C + GC+GG+P + A +
Sbjct: 249 QASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGK 306
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY G P C+P C R + +S++Y + +
Sbjct: 307 YAQDFGLVEEACFPY---AGSDSP-CKPN----DCFR--------YYSSEYYYVGGFYGA 350
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ ++GP+ V+F VY+DF HY+ G+Y H + HAV L+G+GT
Sbjct: 351 CNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGT 410
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 411 DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAATPIPK 458
>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 315
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 81/219 (36%), Positives = 112/219 (51%), Gaps = 28/219 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
QGHCGSCWAF + A D C+ G++ + S L++C L GC GG
Sbjct: 101 QGHCGSCWAFASSRAFGDTRCMQ-GLDPVPVLYSPQYLVSCS--LQNMGCTGGTMEDVGD 157
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G+ T+ C PY D E A+ P C CV + + R + + R +
Sbjct: 158 FLRDTGIATDTCVPYVD---------EDAHWEP-CPVSCVDGSPI-RTVQ--LMDFVRYD 204
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE- 190
+ E +M I NGP+ S +YEDF +Y+SG+Y I G G HA++L+G+GT G+
Sbjct: 205 GNLEAMMEAIAMNGPIHASMMIYEDFMYYQSGIYHFIYGSGCGMHAIELVGYGTDISGDS 264
Query: 191 --------DYWILANQWNRSWGADGYFKIKRGSNECGIE 221
DYWI N W WG +GYF+I RG+NECGIE
Sbjct: 265 EAGEEVRVDYWIARNSWGEDWGENGYFRIVRGNNECGIE 303
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 82/226 (36%), Positives = 110/226 (48%), Gaps = 19/226 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG----DGCDGGYPIS 68
Q C +CW + L+DR CI G LSV +CC G GC GG +
Sbjct: 60 QSACHNCWTVSSTGMLNDRVCIKSGGTFRDILSVGYFTSCCNPANGCPKAKGCQGGNLLE 119
Query: 69 AWRYFVHHGVVT-EECDP---YFDSTGC---SHPGCEPA-YPTPKCVRKCVKK--NQLWR 118
+ +HG+VT +E P + GC P C+ A Y +P C KC K +
Sbjct: 120 GLNFLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKHAGYSSPACQTKCTNKAYKTSLQ 179
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 178
H + S R+ + P++I EI+ NGPV ++YED YK+GVY H TG G H +
Sbjct: 180 QDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGMLSIYEDIRVYKAGVYVHQTGSFQGIHTL 239
Query: 179 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
K+IGWG + G+DYW+ N WN WG G K+ G GIE V
Sbjct: 240 KIIGWGV-ESGQDYWLAVNSWNEEWGDHGMIKLAVG--RTGIENSV 282
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 83/233 (35%), Positives = 115/233 (49%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWF 250
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSI 125
GVV++ C P+ D G + + P + R+ + NQ+ N +
Sbjct: 251 LRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVT 310
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHA 177
AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H + G H+
Sbjct: 311 PAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHS 370
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 371 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 423
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/237 (35%), Positives = 123/237 (51%), Gaps = 16/237 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA SDR+ I + LS LL+C GC GG+ AW +
Sbjct: 210 QGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLLSCNKGQ--RGCQGGHLSRAWTF 267
Query: 73 FVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
G+V + C P+ + T C P P + + + L R+ + AY+I
Sbjct: 268 IRKFGLVDDYCYPWTGTPTKCKIPK-RPNFDALSSICPPSLGSNL-RSELYRVGPAYKIQ 325
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----MGGHAVKLIGWGTSD 187
D +DIM EI ++GPV+ + VY+DF YKSGVY + G H+VK++GWG
Sbjct: 326 -DEKDIMEEIMQSGPVQATMKVYQDFFSYKSGVYTKSNTERESSNFGYHSVKILGWGEET 384
Query: 188 D--GE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITS 240
+ G+ YW+ AN W + WG +G+FKI+RG+NEC IEE V+A + + +EI +
Sbjct: 385 NIYGQPIKYWLAANSWGQQWGENGFFKIRRGTNECEIEEFVLAAWAETNDPSREIIT 441
>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 360
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 116/215 (53%), Gaps = 22/215 (10%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPIS 68
+V QG+CGSCWAF +V+ +D C G++ +S SV +L C GC+GG P++
Sbjct: 157 VVDQGNCGSCWAFSSVQTFADHRC-RSGLDATGVSYSVQYVLDC--DRKDHGCNGGEPVN 213
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
A+ + + G V C Y C KC +N + + S
Sbjct: 214 AFNFLHNTGTVLASCVGYTAGDDAVVKFCPQ-----KCDDGSAVENVV-------ATSGS 261
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
+ S + ++A +GPV +F V +DF +YKSGVY+H G +GGHAV++IG+G +D
Sbjct: 262 KSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVEIIGYGVTDS 317
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G DYW + N W WG DGYF+I RG +ECGIE +
Sbjct: 318 GLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEHE 352
>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
Length = 463
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSSQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
Length = 158
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 94/169 (55%), Gaps = 13/169 (7%)
Query: 63 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 122
GG+ ++ WR+ G +E+C PY S G + P C ++ C + S
Sbjct: 1 GGFLVATWRFLAAVGTASEQCVPYV-SFGGAVPACN--------IKSCAVSGE---KSPF 48
Query: 123 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 182
Y + + R D+MA++ NGP++ + VY+DF YKSGVY H++G ++G HA+K++G
Sbjct: 49 YKVKSARKLKGMVDMMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVG 108
Query: 183 WGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
WG S YWI AN W WG DGYF I RG ECG+ + V +G P+
Sbjct: 109 WGVDSASKLPYWICANSWGEDWGLDGYFWIARGRGECGLGKTVWSGKPA 157
>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
Length = 462
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 115/228 (50%), Gaps = 28/228 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC+AF ++ L R I + + LS +++C + GCDGG+P + A +
Sbjct: 252 QASCGSCYAFASMAMLEARIRILTNNSKTPVLSTQQIVSCSEY--SQGCDGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY G P C P C R V S ++ + +
Sbjct: 310 YVQDFGVVEENCFPYL---GHDSP-CSPK----NCTRYYV--------SDYHYVGGFYGA 353
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ +NGP+ V+F VY DF HY+ GVY H + HAV L+G+GT
Sbjct: 354 CNEALMKLELVENGPMAVAFEVYNDFIHYQKGVYHHTGLRDSFNPFEITNHAVLLVGYGT 413
Query: 186 SDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+ GE YWI+ N W WG DGYF+I RG++ECGIE V+ P K
Sbjct: 414 DEKTGEHYWIVKNSWGSYWGEDGYFRILRGTDECGIESIAVSATPIPK 461
>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 363
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 116/215 (53%), Gaps = 22/215 (10%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPIS 68
+V QG CGSCWAF +++ +D C G++ +S SV +L C GC+GG P++
Sbjct: 160 VVDQGSCGSCWAFSSIQTFADHRC-RSGLDATGVSYSVQYVLDCD--RKDHGCNGGEPVN 216
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
A+ + + G V C Y C KC +N + + S
Sbjct: 217 AFNFLHNTGTVLTSCVEYTAGDDAVVKFCPQ-----KCDDGSAVENIV-------ATSGA 264
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
+ S + ++A +GPV +F V +DF +YKSGVY+H G +GGHAV+++G+G +D
Sbjct: 265 KSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEIVGYGVTDS 320
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G DYW + N W WG DGYF+I RG +ECGIE++
Sbjct: 321 GLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEQE 355
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 83/233 (35%), Positives = 115/233 (49%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSI 125
GVV++ C P+ D G + + P + R+ + NQ+ N +
Sbjct: 282 LRRRGVVSDHCYPFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVT 341
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHA 177
AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H + G H+
Sbjct: 342 PAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHS 401
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 117/239 (48%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG+ SAW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWF 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P F G + G P P+C+ R+ + +Q+ N
Sbjct: 281 LRRRGVVSDHCYP-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHAN 334
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ S ++IM E+ +NGPV+ V+EDF Y++G+Y H +
Sbjct: 335 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYR 394
Query: 173 -MGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 395 RHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 453
>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
Length = 804
Score = 127 bits (318), Expect = 5e-27, Method: Composition-based stats.
Identities = 85/234 (36%), Positives = 126/234 (53%), Gaps = 19/234 (8%)
Query: 3 FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFLC-- 57
FT H I +I QG CG C+A AVE ++ R C+ + +S+ DL+ C +L
Sbjct: 64 FTYRGHRCIQIINQGSCGCCYAAAAVEMVTARRCLQLNDSRLVSLEDLVTCDHTKYLNIQ 123
Query: 58 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
+GC GG +++ ++ G+V + C+ Y++ T +P YPT C C K
Sbjct: 124 NNGCRGGNSLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKHITGDVMGG- 175
R K+ + YR+ S + +M +IY+NGP+ VS + DF + K G+Y +GG
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLGGG 232
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAV ++GWG ++G YW AN + +WG GYFKIKRGSNE IE + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285
>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 296
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 79/215 (36%), Positives = 111/215 (51%), Gaps = 22/215 (10%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMN---LSLSVNDLLACCGFLCGDGCDGGYPIS 68
+V QG CGSCWAF +++ +D C G++ +S SV +L C GC+GG P
Sbjct: 93 VVDQGSCGSCWAFSSIQTFADHRC-RSGLDATGVSYSVQYVLDC--DRKDHGCNGGEPTK 149
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
A+ + G V C Y C PK ++ S S SA
Sbjct: 150 AFDFLHSTGTVLTSCVDYTAGADNVVKFC------PKTCDDGSAVENVFAASGSKSGSAI 203
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
+ + +GPV +F V +DF +YKSGVY+H G +GGHAV+++G+G +D
Sbjct: 204 DV----------LLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVEVVGYGVTDS 253
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G DYW + N W WG DGYF+I RGS+ECGIE++
Sbjct: 254 GLDYWTVRNSWGPDWGEDGYFRIVRGSDECGIEQE 288
>gi|403355691|gb|EJY77431.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 104/213 (48%), Gaps = 29/213 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCW F V AL F + +G +LS L+ C G GC+GG P A+ Y
Sbjct: 153 QGKCGSCWTFSTVGALESHFLLKYGQFRNLSEQQLVDCAGNYDNHGCNGGLPSHAFEYLK 212
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS--AYRINS 132
+G + EE +YP C K + S+ + A ++
Sbjct: 213 DNGGIAEET----------------SYPYVAVTNTCALK----KGSQSVGVKGGAVNVSL 252
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-----KHITGDVMGGHAVKLIGWGTSD 187
+D+ IY +GPV ++F V DF Y++GVY K+ DV HAV +G+GT +
Sbjct: 253 SEDDLKQAIYSHGPVSIAFQVASDFRDYRAGVYTSKVCKNGPQDV--NHAVLAVGFGTDE 310
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
+ DYWI+ N W WG GYFK++RG N CG+
Sbjct: 311 NKVDYWIIKNSWGAVWGDQGYFKMERGVNMCGV 343
>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
P15]
Length = 627
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 86/234 (36%), Positives = 127/234 (54%), Gaps = 19/234 (8%)
Query: 3 FTNSEHVEILVI-QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLAC--CGFL--C 57
FT H I +I QG CG C+A AVE ++ R C+ F + +S+ DL+ C +L
Sbjct: 64 FTYRGHRCIQIINQGSCGCCYAAAAVEMVTARRCLQFNDSKLVSLEDLVTCDHTKYLNIQ 123
Query: 58 GDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 117
+GC GG +++ ++ G+V + C+ Y++ T +P YPT C C K
Sbjct: 124 NNGCRGGNSLASLKFGETTGMVYDTCEDYWNRT---YP-----YPTETCKTVCKDKRPKD 175
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF-AHYKSGVYKH-ITGDVMGG 175
R K+ + YR+ S + +M +IY+NGP+ VS + DF + K G+Y + GG
Sbjct: 176 RTIKNKA--PYRL-SGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGG 232
Query: 176 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
HAV ++GWG ++G YW AN + +WG GYFKIKRGSNE IE + LP
Sbjct: 233 HAVMIVGWG-EENGVPYWDCANTYGTNWGDQGYFKIKRGSNELKIETWPGSALP 285
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 110/233 (47%), Gaps = 31/233 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACC----GFLCGDGCDGGYPIS 68
QG CG+CWA E L+DR CI + LS + +CC G L GC+GG +
Sbjct: 56 QGRCGNCWAVCPTEVLNDRLCIKSSGKIQEILSAGYVTSCCNPAHGCLHAKGCNGGRLVE 115
Query: 69 AWRYFVHHGVVT-------------EECDPY-------FDSTGCSHPGCEPA--YPTPKC 106
A + HGVVT + C PY + G +P C+ P P C
Sbjct: 116 AMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDVVQQPVPPC 175
Query: 107 VRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
C K + H + S ++ +D + I EI+ NGPV +F +Y+DF +YKSGV
Sbjct: 176 RTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVFSAFEMYKDFRYYKSGV 235
Query: 165 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 217
Y T +V H +K+IGWG +D +YW+ N WN WG G K+ G N
Sbjct: 236 YVPTTKEVDCLHVIKIIGWG-ADSVREYWLAMNAWNEEWGDHGLIKMAFGKNR 287
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 83/233 (35%), Positives = 114/233 (48%), Gaps = 16/233 (6%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ + QG CGS WA SDR I +N SLS LL+C GC+GGY
Sbjct: 272 IHPIADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLLSCNQHR-QKGCEGGYL 330
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
AW Y GVV + C PY S PG R+ ++ ++S + ++
Sbjct: 331 DRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYTNRQGLRCPSGSQDSTAFKMT 389
Query: 127 A-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMGGHA 177
Y+++S EDI E+ NGPV+ +F V+EDF Y GVY+H + G H+
Sbjct: 390 PPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHS 449
Query: 178 VKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
V+++GWG ++ YW+ AN W WG DGYFKI RG N C IE V+
Sbjct: 450 VRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 502
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 102/179 (56%), Gaps = 18/179 (10%)
Query: 25 GAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-- 80
GAVEA++DR CIH + +S DLL+CC CG GC GG+P AW +++ +G+VT
Sbjct: 1 GAVEAMTDRLCIHSNATIKKHISSTDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGG 59
Query: 81 -----EECDPYFDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
C Y C+H P E +PTP C + C + K + S+Y
Sbjct: 60 SKENPSGCRSY-PFPKCNHHGKGPDAPCPEKIFPTPACNKTCDTPEVNYILDKTKAKSSY 118
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
+ + + IM EI +NGPVE +F VYEDF HY+SGVY H G ++GGHA++++GWG +
Sbjct: 119 NVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEEN 177
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 116/228 (50%), Gaps = 17/228 (7%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYP 66
+ + QG CG+ WA + + SDRF I + LS LL+C GC GG+
Sbjct: 214 ISKITDQGWCGASWAISSAQVASDRFAIMSKGTDAVELSAQHLLSC-NNRGQQGCSGGHL 272
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 126
AW + G+V E C P+ ST C T C R +
Sbjct: 273 DRAWMFMRRFGLVDENCYPWKASTE----TCRLRKRTDLRSAGCAPPPNPLRTELYKVGP 328
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH-ITGDVMGG--HAVKLIGW 183
AYR+ ++ DIM EI +GPV+ + VY+DF Y+SGVYKH +T ++ H+V++IGW
Sbjct: 329 AYRLANE-TDIMQEILTSGPVQATMRVYQDFFSYESGVYKHSVTAELYESDYHSVRIIGW 387
Query: 184 G------TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
G + + YW++AN W + WG +G F+I++G+NEC IE V+
Sbjct: 388 GEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQKGTNECEIESFVL 435
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 79/224 (35%), Positives = 108/224 (48%), Gaps = 18/224 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA A SDRF + ++ LS LL+C C GGY AW Y
Sbjct: 222 QGWCGASWAISATRVASDRFALMSKGADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLY 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V E+C P+ + C+ T C R + AYR+ +
Sbjct: 281 MRKFGLVDEDCYPWEGTNA----QCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGN 336
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDG 189
+ DIM EI +GPV+ + VY+DF Y+SG+YKH G H+V++IGWG
Sbjct: 337 E-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSA 395
Query: 190 E-------DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
YW++ N W + WG G F+I+RG+NEC IE VVA
Sbjct: 396 HRHHNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 115/239 (48%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH ++S LS +LL+C GC GG AW +
Sbjct: 184 QGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWF 242
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------N 119
GVV++ C P+ S G + A P P C+ + R N
Sbjct: 243 LRRRGVVSDHCYPF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHAN 296
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H +
Sbjct: 297 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 356
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 357 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 415
>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|403355865|gb|EJY77523.1| Cathepsin B [Oxytricha trifallax]
Length = 299
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 76/202 (37%), Positives = 102/202 (50%), Gaps = 21/202 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCW+F A L DR C+H +N+ LS D+++C GC GG+ Y
Sbjct: 96 QQSCGSCWSFAATSMLQDRLCLHSNGAVNVQLSQQDMVSC--DFDNAGCSGGWLSHTINY 153
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY--SISAYRI 130
V HGVVT +C Y G +C +C N + K Y ++ ++
Sbjct: 154 LVVHGVVTSQCLAYASVDGAGR----------ECSFRCDDANTEY---KKYGCKFNSLKM 200
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLIGWGTSDDG 189
+ E++M EIY NGPV V F VY DF Y G Y+ + + GGHAV + GWG + G
Sbjct: 201 TTSKEEMMEEIYLNGPVMVGFIVYSDFMSYGGGYYEVSPSASISGGHAVIVHGWGY-NGG 259
Query: 190 EDYWILANQWNRSWGADGYFKI 211
YWI NQW +WG+ GYF I
Sbjct: 260 RLYWIAQNQWGTTWGSSGYFNI 281
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 113/239 (47%), Gaps = 35/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C + WAF SDR I M LS +L++C DGC GG AW +
Sbjct: 220 QGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISC-DTRHQDGCAGGRIDGAWWF 278
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKC-------------VKKNQLWRN 119
GVVT++C P+ P + A +C+ + + + N
Sbjct: 279 MRRRGVVTQDCYPF-------SPPEQSAVEVARCMMQSRAVGRGKRQATAHCPNSHSYHN 331
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM------ 173
+ S YR++++ +IM EI NGPV+ V+EDF YKSG+++H +
Sbjct: 332 DIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYR 391
Query: 174 --GGHAVKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+V++ GWG D YWI AN W ++WG DGYF+I RG NEC IE V+
Sbjct: 392 KHATHSVRITGWGEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIETFVIG 450
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 119/230 (51%), Gaps = 13/230 (5%)
Query: 8 HVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY 65
++ + QG CG+ WA V+ +DRF I +S LS LL+C L GC GG+
Sbjct: 208 YISSPIDQGWCGASWAITTVQVTTDRFGIMSKRAISDVLSPQHLLSC-NNLNQQGCQGGH 266
Query: 66 PISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
AW + G++TEEC P+ + C+ P + +C + N ++ +
Sbjct: 267 LTRAWNWIRKFGLITEECYPWQGRMSTCAVPK-KKKETMAQCPSRVRSNNDRTTKTRLHR 325
Query: 125 IS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK---HITGDVMGGHAVKL 180
+ YR+ ++ E IM EI +GPV+ V DF YKSGVYK +G G H+V++
Sbjct: 326 VGPVYRVATE-EGIMHEILTSGPVQAVMKVSRDFFMYKSGVYKCSNLASGSRTGYHSVRI 384
Query: 181 IGWGTSDDG---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+GWG G YWI +N W WG +GYF+I +G +EC IE+ V+A
Sbjct: 385 VGWGEEYQGGKIVKYWIASNSWGSWWGENGYFRILKGVDECEIEDFVIAA 434
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/217 (36%), Positives = 106/217 (48%), Gaps = 30/217 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDG----CDGGYPIS 68
QG+C S WA +DR CI + LS +L++C GDG CDGG
Sbjct: 49 QGNCASSWAVAVASTFTDRLCIASNGQFTDNLSAQNLMSC-----GDGEKMGCDGGSAFK 103
Query: 69 AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
AW ++ G+VT E C PY + C H G C T C +KCV K
Sbjct: 104 AWELTMNKGIVTGGNFDSNEGCQPYKNRP-CDHYGDSRLTNCSSLRRTQMTVCRKKCVNK 162
Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
N + + H + Y + ++ + I EI GPV VYE+F YK G+YK TG
Sbjct: 163 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTTG 222
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 207
+++G H VKLIGWG DG +YW+ N WN +WG DG
Sbjct: 223 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGNDG 259
>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
gigas]
Length = 464
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 106/219 (48%), Gaps = 17/219 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q +C S WAF V+ +DR I L+ LS L++C GC GG AW +
Sbjct: 213 QKNCASSWAFSTVDVAADRLAIESEGLLTNQLSPQHLVSCNTGRGQRGCRGGSTEKAWWF 272
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G++TEEC PY S G C T C N Y YR+
Sbjct: 273 VKRRGIITEECYPYTASDG----ECLDGETT------CPNANSSTAKIVLYVTPPYRVRQ 322
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH-AVKLIGWG----TSD 187
D EDI AEIY+NGPV+ +F V DF Y+SGVY+H D+ +V++IGWG
Sbjct: 323 DEEDIKAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIGWGEKTNKKG 382
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
YWI N W WG G F+I RG N GIEE+V+A
Sbjct: 383 KKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEENVLA 421
>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
Length = 463
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
Length = 463
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 121/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 79/215 (36%), Positives = 102/215 (47%), Gaps = 18/215 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+CWAF A L+ R CI N+ LS + C C GGY +W +
Sbjct: 123 QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 180
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+ G + C PY G G P +C + ++ Y R +
Sbjct: 181 LENTGTPLDTCIPYASGRGTFSSGTCPT----QCKIASMSMSK-------YKAKNTRYIT 229
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+I I G V+ FTVY D YKSGVYKH+ V+GGHAV LIG+G + G +Y
Sbjct: 230 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNY 288
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
W+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 289 WLAANSWGANWGMSGYFKIAQG--EGGIENQVYAG 321
>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
Length = 463
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 120/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLSVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 118/239 (49%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH ++S LS +LL+C GC GG AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCHGGRLDGAWWF 176
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ S G + A P P C+ R+ + + + N
Sbjct: 177 LRRRGVVSDHCYPF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHAN 230
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H +
Sbjct: 231 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 290
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 291 RHGTHSVKITGWGEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 349
>gi|403331769|gb|EJY64852.1| hypothetical protein OXYTRI_15000 [Oxytricha trifallax]
Length = 259
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 114/222 (51%), Gaps = 24/222 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSC+AF A +SDR CI +NL LS +L++C GC GG+ + Y
Sbjct: 53 QGSCGSCYAFAASGMMSDRLCIKSNGQINLVLSPQELVSC--DYQNYGCSGGWMTNTLYY 110
Query: 73 FVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
+ +G+ +E C PY F+S T C +C N + K ++ +I
Sbjct: 111 LMSYGIPSETCLPYDMFNSE------------TKACSGRCDSPNYEYTRHKCKKGTS-KI 157
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
SDPE IM +I +NGP V+F +EDF ++ G+YK+ +G + GHA KL GWG G
Sbjct: 158 MSDPETIMRDIMENGPSIVAFQAFEDFLNFGGGIYKYTSGKFLVGHATKLTGWGLDSAGR 217
Query: 191 DYWILANQWNRSWGAD---GYFKIKRGSNECGIEEDVVAGLP 229
YWI NQ+ WG G++KI G E G V + +P
Sbjct: 218 LYWIGQNQFGLGWGGRGDYGFYKIYDG--EVGFGSAVWSCIP 257
>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
Length = 455
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 111/228 (48%), Gaps = 28/228 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSC++F + L R I S +++C + GCDGG+P +Y
Sbjct: 245 QAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVSCSQY--SQGCDGGFPYLIGKY 302
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V E+C PY TG P PA KC K + S ++ + +
Sbjct: 303 IQDFGIVEEDCFPY---TGSDSPCNLPA--------KCTK----YYASDYHYVGGFYGGC 347
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWGT 185
+M E+ KNGP+ V+ VY DF +YK G+Y H TG + HAV L+G+G
Sbjct: 348 SESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHH-TGLRDANNPFELTNHAVLLVGYGQ 406
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GE YWI+ N W WG +G+F+I+RG++EC IE VA P K
Sbjct: 407 CHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATPIPK 454
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/221 (36%), Positives = 112/221 (50%), Gaps = 15/221 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA + SDRF I + LS LL+C GC GGY AW +
Sbjct: 223 QGWCGASWAVSTADVASDRFAIMSKGAETVELSAQHLLSC-NNRGQQGCKGGYLDRAWLF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V EEC P+ TG + C + C R + AYR+ +
Sbjct: 282 MRKFGLVDEECYPW---TG-RNDQCRLRKRSNLKTAGCQNPPNSLRTELYKVGPAYRLGN 337
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
+ DIM EI +GPV+ + VY+DF Y+SGVY+H + G H+V++IGWG
Sbjct: 338 E-TDIMQEILTSGPVQATMRVYQDFFVYQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSY 396
Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
YW++AN W +WG +G F+I++G+NEC IE V+A
Sbjct: 397 RGPPLKYWLVANSWGHNWGENGLFRIQKGTNECEIESYVLA 437
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 111/221 (50%), Gaps = 14/221 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF-GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYF 73
QG CG+ WA + SDRF + G + L L C GCDGGY AW +
Sbjct: 217 QGWCGASWAISTAQVASDRFAVMSKGTDSVLLSAQHLLSCNKKGQRGCDGGYLDRAWLFM 276
Query: 74 VHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSD 133
G+V E+C P+ + C+ T C R + AYR+ ++
Sbjct: 277 RKFGLVDEQCYPWKGV----YEQCKLQKRTNLEAAGCRAPANPLRKELYKVGPAYRLGNE 332
Query: 134 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWG---TSD 187
DIM EI +GPV+ + VY+DF Y+SG+Y H + G H+V++IGWG ++D
Sbjct: 333 -TDIMREILTSGPVQATMKVYQDFFSYESGIYMHTPIAELYESGYHSVRIIGWGEDISTD 391
Query: 188 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G YW++ N W + WG +G F+I+RG NEC IE VVA
Sbjct: 392 SGLPIKYWLVVNSWGQEWGENGLFRIRRGINECDIESFVVA 432
>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
Length = 463
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 118/228 (51%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + LS ++++C + GCDGG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNTQTPILSPQEVVSCSQY--AQGCDGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY TG P C+P CVR + +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP-CKPK---EDCVR--------YYSSEYHYVGGFYGG 354
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ +GP+ V+F VY DF HY+ G+Y H + HAV L+G+GT
Sbjct: 355 CNEALMKLELVHHGPMAVAFEVYNDFLHYRKGIYYHTGLRDPFNPFELTNHAVLLVGYGT 414
Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 415 DPVSGMDYWIVKNSWGIGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 104/215 (48%), Gaps = 18/215 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+CWAF A L+ R CI N+ LS + C C GGY +W +
Sbjct: 160 QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 217
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+ G + C PY G G C +C K SK+ + + I S
Sbjct: 218 LENTGTPLDSCIPYASGRGTFSSGT--------CPTQC--KIASMSMSKYKAKNTVYI-S 266
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+I I G V+ FTVY D YKSGVYKHI V+GGHAV LIG+G + G +Y
Sbjct: 267 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENTVLGGHAVALIGFGV-EGGSNY 325
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
W+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 326 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 358
>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
Length = 446
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/225 (35%), Positives = 111/225 (49%), Gaps = 31/225 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV---NDLLACCGFLCGDGCDGGYP-ISAW 70
QG CGSC+AF ++ R + N + V D++ CC + GCDGG+P +
Sbjct: 241 QGGCGSCYAFSSMAMNEARIRV-MSNNTQMPVFSPQDIVDCCQY--SQGCDGGFPYLVGG 297
Query: 71 RYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
+Y G+V E CDPY RKC + R + Y
Sbjct: 298 KYAEDFGLVDESCDPYVGED-----------------RKCKSTSCSRRYATRYRYVGGYY 340
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDV----MGGHAVKLIGWG 184
+ E M + GP+ VSF VY+DF HYKSGVY+H +T + HAV L+G+G
Sbjct: 341 GACNEQEMKLALQRGPLSVSFMVYDDFMHYKSGVYRHSGLTDKYNPFEITNHAVLLVGYG 400
Query: 185 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
+D+G YWI+ N W + WG +GYF+I RG++EC IE V P
Sbjct: 401 -ADEGTKYWIVKNSWGKGWGEEGYFRILRGADECAIESIAVETFP 444
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 110/221 (49%), Gaps = 15/221 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA + +DRF I + LS LL+C GC GGY AW +
Sbjct: 281 QGWCGASWAISTADVATDRFSIMSKGAEDAELSAQHLLSC-NNRGQQGCRGGYLDRAWLF 339
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V ++C P+ G C+ C K R + AYR+ +
Sbjct: 340 MRKFGLVDKDCYPWTGKNG----QCKLRKRNNLQAAGCRKPPNPLRTELYKVGPAYRLGN 395
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
+ DIM EI +GPV+ + VY+DF YK+G+Y+H + G H+V++IGWG
Sbjct: 396 E-TDIMQEILTSGPVQATMRVYQDFFVYKNGIYRHSQSAELHDSGYHSVRIIGWGEERSY 454
Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
YW++ N W +WG +G FKI+RG+NEC IE V+A
Sbjct: 455 RGPPLKYWLVVNSWGYNWGENGLFKIQRGTNECEIESYVLA 495
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/238 (35%), Positives = 118/238 (49%), Gaps = 34/238 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH ++S LS +LL+C GC GG AW +
Sbjct: 290 QGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWF 348
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ S G + A P P C+ R+ + + + N
Sbjct: 349 LRRRGVVSDHCYPF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHAN 402
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H +
Sbjct: 403 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 462
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 463 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVL 520
>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 105/217 (48%), Gaps = 18/217 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+CWAF A L+ R CI N+ LS + C C GGY +W +
Sbjct: 108 QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 165
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+ G + C PY G G C +C K SK+ + + I S
Sbjct: 166 LENTGTPLDTCIPYASGGGTFSSGT--------CPTQC--KIASMSMSKYKAKNTVYI-S 214
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+I I G V+ FTVY D YKSGVYKH+ V+GGHAV LIG+G + G +Y
Sbjct: 215 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVSTVLGGHAVALIGFGV-EGGSNY 273
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
W+ AN W +WG GYFKI +G E GIE V AG P
Sbjct: 274 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGEP 308
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 81/242 (33%), Positives = 113/242 (46%), Gaps = 17/242 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS WA SDRF I + L+ LLAC C GG+ +AW+Y
Sbjct: 316 QGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLLACVRR--QQACSGGHLDTAWQY 373
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
GVV +EC PY + C+ C + R + + AY +N+
Sbjct: 374 LRRVGVVNDECYPYIAAKN----QCKINDGDTLVSANCELPANVNRTAMYRMGPAYSLNN 429
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-----DVMGGHAVKLIGWGTSD 187
+ DIM EI + G V+ VY DF Y++G+Y+H + H+V+LIGWG
Sbjct: 430 E-TDIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSAAATPAEERSAYHSVRLIGWGEER 488
Query: 188 DGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMF 244
G D YWI N W WG +G F+I RG+NEC IE V+A P V+ + +
Sbjct: 489 VGYDMVKYWIAVNSWGTWWGENGRFRILRGTNECEIESYVLASNPYVHQHVQTVRNVGDL 548
Query: 245 ED 246
++
Sbjct: 549 QE 550
>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
leucogenys]
Length = 463
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 119/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY+ G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYEKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 115/235 (48%), Gaps = 22/235 (9%)
Query: 9 VEILVIQGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP 66
+ + QG CGS WA SDR I +N SLS LL+C GC+GGY
Sbjct: 216 IHPVADQGDCGSSWAVSTTGISSDRLSIISEGRINASLSSQQLLSCNQHR-QKGCEGGYL 274
Query: 67 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGC---EPAYPTPKCVRKCVKKNQLWRNSKHY 123
AW Y GVV + C PY C + Y + +R C +Q +S +
Sbjct: 275 DRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPKRDYTNRQGLR-CPSGDQ---DSTAF 330
Query: 124 SISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMG 174
++ Y+++S EDI E+ NGPV+ +F V+EDF Y GVY+H + G
Sbjct: 331 KMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEG 390
Query: 175 GHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
H+V+++GWG ++ YW+ AN W WG DGYFKI RG N C IE V+
Sbjct: 391 YHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIG 445
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/223 (35%), Positives = 111/223 (49%), Gaps = 16/223 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA V+ SDRF I + LS L++C GC GGY AW +
Sbjct: 254 QGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHLISC-NNRGQRGCKGGYLDRAWLF 312
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI-SAYRIN 131
GVV E+C P+ C C ++N ++ Y + AYR+
Sbjct: 313 MRKFGVVDEDCYPWLSG---RSDKCRIPRRGKLSDAGCQRRNSYNLRNEMYKVGPAYRLG 369
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH---ITGDVMGGHAVKLIGWGTSDD 188
++ DIM EI +GPV+ + V+ DF HY+SG+Y H G H+V+++GWG
Sbjct: 370 NE-TDIMQEILTSGPVQATMRVHRDFFHYESGIYVHSRPFDTRQSGYHSVRIVGWGEEPS 428
Query: 189 GED-----YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
+ +W +AN W R WG DGYF+I RG+NEC IE V+
Sbjct: 429 PYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNNECEIESFVLG 471
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 76/104 (73%), Gaps = 1/104 (0%)
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
S+Y + DIM EI KNGPV+ F ++EDF YKSG+Y + TG ++GGHA+++IGWG
Sbjct: 10 SSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV 69
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
++G YW++AN WN WG GYF+++RG+NECGIE + AGLP
Sbjct: 70 -ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/191 (38%), Positives = 103/191 (53%), Gaps = 24/191 (12%)
Query: 54 GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCE 98
G +C DGC G P +AW + +G+ TE C PY + C H P E
Sbjct: 152 GHVCCDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPY-NFPKCGHHQQDSKYQPCPE 210
Query: 99 PAYPTPKCVRKCVKKN--QLWRNSKHYS--ISAYRINSDPEDIMAEIYKNGPVEVSFTVY 154
Y TP C+ +C KN +H++ S Y++ ++I EI NGP +F++Y
Sbjct: 211 KNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGT-DNIKKEIMTNGPTSAAFSMY 269
Query: 155 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 214
+DF Y+SGVYKH +G +MG H V++IGWGT G DYW++ N WN WG G FKI +G
Sbjct: 270 DDFLSYESGVYKHTSGTLMGEHGVEIIGWGTK-QGVDYWLVMNSWNEGWGVHGTFKIAQG 328
Query: 215 SNECGIEEDVV 225
+CGI + +
Sbjct: 329 --DCGINDMAI 337
>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
Length = 463
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 120/230 (52%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P ++A +
Sbjct: 252 QASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLTAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGNDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W SWG DGYF+I RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTSWGEDGYFRIHRGTDECAIESIAVAATPIPK 462
>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 515
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 89/167 (53%), Gaps = 18/167 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA+SDR CIH G LS DLL+CC + CG GCDGG+P AW Y
Sbjct: 103 QSSCGSCWAFGAVEAMSDRLCIHSGAKYQKGLSAVDLLSCC-WKCGYGCDGGFPAQAWNY 161
Query: 73 FVHHGVVT-------EECDPY------FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQLWR 118
+ G+VT C Y D G HP C Y TP+C +KC +
Sbjct: 162 WSTDGIVTGGSKENPSGCRSYPFPSCSHDERG-RHPLCPSEIYHTPRCTKKCDTDKLHYS 220
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
+ S+Y + +IM EI NGPVE F VYEDF Y+ G+Y
Sbjct: 221 AELTKANSSYNVLDSDREIMMEIMNNGPVEAVFDVYEDFLQYEKGIY 267
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 93/166 (56%), Gaps = 17/166 (10%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSCWAFGAVEA++DR CI G S LS DL++CC CGDGC GG+P AW Y
Sbjct: 112 QSRCGSCWAFGAVEAMTDRICIQSGGQQSAELSALDLISCCED-CGDGCQGGFPGVAWDY 170
Query: 73 FVHHGVVT-------EECDPY-----FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WR 118
+V G+VT C PY T +P C Y TP+C + C K + +
Sbjct: 171 WVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYE 230
Query: 119 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 164
KHY +Y + S+ + I EI GPVE +F VYEDF +YKSG+
Sbjct: 231 QDKHYGDESYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGI 276
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 118/229 (51%), Gaps = 32/229 (13%)
Query: 8 HVEILVIQGHCGSCWAFGAVEALSDRFCIH-FGM-NLSLSVNDLLACCGFLCGD-GCDGG 64
++ + Q CGS WA + DRF I FG N+ +S LL+C L G GC+GG
Sbjct: 198 YISPIADQDWCGSDWAVSIASIVGDRFSIQSFGTENVRMSSQTLLSC--HLKGQRGCNGG 255
Query: 65 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 124
A+ + HG+V+E+C PY V + ++ + + Y
Sbjct: 256 NLDIAFDFVKTHGLVSEQCFPY---------------------EGAVTQCRIGNDCRRYR 294
Query: 125 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GDVM--GGHAVKLI 181
+ S EDIM +I +GP TVY+DF HY+ G+Y+H GD + G H+V+++
Sbjct: 295 VGVPFSISKEEDIMYDIMTSGPALGIMTVYQDFFHYREGIYRHTRHGDQLMRGLHSVRIV 354
Query: 182 GWGTSDDGED-YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
GWG +D ED YWI+AN W SWG GYF+I RG + GIE V+ LP
Sbjct: 355 GWG--EDAEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIESSVLTVLP 401
>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 102/215 (47%), Gaps = 18/215 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+CWAF A L+ R CI N+ LS + C C GGY +W +
Sbjct: 23 QQTCGACWAFSANYVLAHRLCIATNGQTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 80
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+ G + C PY G + + C +C + + Y R +
Sbjct: 81 LENTGTPLDTCIPYASGRG--------TFSSGTCPTQCKIASM---SMSKYKAKNTRYIT 129
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+I I G V+ FTVY D YKSGVYKH+ V+GGHAV LIG+G + G +Y
Sbjct: 130 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNY 188
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
W+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 189 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 221
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 57/93 (61%), Positives = 71/93 (76%), Gaps = 1/93 (1%)
Query: 138 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 197
MAEI K GPVE +FTVY DF YKSGVY+H TG+ +GGHA+K++GWG ++DG DYW++AN
Sbjct: 1 MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWG-NEDGHDYWLVAN 59
Query: 198 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 230
WN WG G+FKI RG +ECGIE + AG P
Sbjct: 60 SWNEDWGDQGFFKILRGVDECGIESQITAGSPK 92
>gi|290975817|ref|XP_002670638.1| predicted protein [Naegleria gruberi]
gi|284084199|gb|EFC37894.1| predicted protein [Naegleria gruberi]
Length = 528
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/231 (31%), Positives = 113/231 (48%), Gaps = 32/231 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSV---NDLLACCGFLCGDGCDGGYPISAWR 71
QG CGSC++F + R + F N V ++++C + GCDGG+ +
Sbjct: 315 QGQCGSCYSFSTTAMMEARKRV-FTQNKEQPVYSPENIISCSFY--SQGCDGGFAYLISK 371
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKC-VRKCVKKNQLWRNSKHYSISAYRI 130
+ G++ E+CDPY TG H KC + + Q W N ++ Y
Sbjct: 372 WGEDFGIIAEQCDPY---TGTPH----------KCNLNQACSTRQYWTNYRY--TGGYYG 416
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG----------GHAVKL 180
E++ ++ K GP+ VS VY D +Y SG+Y+H++ + H V +
Sbjct: 417 AVTVENMQLDVLKYGPLSVSMEVYNDLFNYHSGIYRHVSSSKLTSPVPNPFELTNHVVLI 476
Query: 181 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 231
+GWG ++ GE YWI+ N W S+G DGYF I RG +EC IE + + +P+
Sbjct: 477 VGWGENEKGEKYWIVKNSWGTSFGMDGYFLIARGVDECAIESENASAIPTQ 527
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 113/229 (49%), Gaps = 20/229 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+CG+ +AF +DR IH G L LS L++C GC+GG+ AW
Sbjct: 233 QGNCGASYAFSTSTVAADRLSIHSGGELKDMLSAQYLISCTTDHHQKGCEGGHVDRAWWQ 292
Query: 73 FVHHGVVTEECDPYFDSTGCSHPG--CEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YR 129
G V+++C PY S + PG Y PK +C + SK Y S YR
Sbjct: 293 LRRVGTVSKDCYPY-TSGDTNDPGKCLMSKYKLPKKNIECPVGQGI--TSKLYQASPPYR 349
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG---------HAVKL 180
I + +IM EI NGPV+ V +DF Y+ GVYKH H+V++
Sbjct: 350 IAAKEREIMNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRI 409
Query: 181 IGWGTSDDGED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
IGWGT G+D YW+ AN W R WG G+F+I RGS+E IE VV
Sbjct: 410 IGWGTDYTGDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVG 458
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 84/233 (36%), Positives = 114/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIH-FG-MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH G M LS +LLAC GC GG AW +
Sbjct: 78 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLACDTH-HQQGCRGGRLDGAWWF 136
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ + N N+ Y ++
Sbjct: 137 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVT 196
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 197 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 256
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 257 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 309
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 18/224 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA SDRF + ++ LS LL+C C GGY AW Y
Sbjct: 222 QGWCGASWAISTTRVASDRFALMSKGADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLY 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V E+C P+ + + C+ T C R + AYR+ +
Sbjct: 281 MRKFGLVDEDCYPWEGT----NVQCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGN 336
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDG 189
+ DIM EI +GPV+ + VY+DF Y+SG+YKH G H+V++IGWG
Sbjct: 337 E-TDIMYEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSA 395
Query: 190 E-------DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
YW++ N W + WG G F+I+RG+NEC IE VVA
Sbjct: 396 HRYRNLPIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 81/215 (37%), Positives = 105/215 (48%), Gaps = 18/215 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CG+CWAF A L+ R CI N+ LS + C C GGY +W +
Sbjct: 108 QQTCGACWAFSANYVLAHRLCIATNGKTNVVLSPEYQVQCDTM--NKACQGGYLKYSWTF 165
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
+ G + C PY G + + C +C K SK+ + + I S
Sbjct: 166 LENTGTPLDTCIPYASGRG--------TFSSGTCPTQC--KIASMSMSKYKAKNTVYI-S 214
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
+I I G V+ FTVY D YKSGVYKH+ V+GGHAV LIG+G + G +Y
Sbjct: 215 GINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVALIGFGV-EGGSNY 273
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
W+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 274 WLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 306
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 100/183 (54%), Gaps = 19/183 (10%)
Query: 23 AFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 80
A AV A+SDR CI G ++ LS DL++CC CG GCDGG+P AW Y+V HG+VT
Sbjct: 42 AVSAVGAMSDRICIQSGGKQSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVT 100
Query: 81 -------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSI 125
C PY C H P C + Y TP+C RKC K + + KHY
Sbjct: 101 GGSKENHTGCQPY-PFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGG 159
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
+ + + I EI GPVE ++EDF +YKSG+Y++ TG +G H V++IGWG
Sbjct: 160 ISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI 219
Query: 186 SDD 188
++
Sbjct: 220 ENE 222
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/152 (44%), Positives = 91/152 (59%), Gaps = 11/152 (7%)
Query: 82 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDI 137
EC + D+ G C+ P+P C C +N ++ S +H++ + ++I
Sbjct: 229 ECSHHVDTKGME--PCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEI 284
Query: 138 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 197
EI NGPV +FTVYEDF +YKSGVYKH+ G +GGHAVK+IGWG D E YW++ N
Sbjct: 285 KREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGI-DQNEQYWLVMN 343
Query: 198 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
WN +WG G FKI G ECGI+ +V AG+P
Sbjct: 344 SWNVNWGDQGIFKIAIG--ECGIDSEVTAGIP 373
>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
Length = 469
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 118/236 (50%), Gaps = 29/236 (12%)
Query: 8 HVEILVIQGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGY 65
+V + Q CGSC++F ++ L R I + + LS +++C + GCDGG+
Sbjct: 251 YVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSTQQIVSCSEY--SQGCDGGF 308
Query: 66 P-ISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHY 123
P + A +Y GVV E+C PY T C P +C R + S +
Sbjct: 309 PYLIAGKYTQDFGVVEEDCFPYTARDTQC--------VPKKECPR--------YYASDYQ 352
Query: 124 SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHA 177
+ + + + E+ ++GP+ V+F VY DF HY+ GVY H + HA
Sbjct: 353 YVGGFYGGCNEALMKLELVRHGPMAVAFEVYNDFLHYREGVYHHTGLRDPFNPFELTNHA 412
Query: 178 VKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
V L+G+GT G DYWI+ N W +WG DGYF+I+RGS+EC IE VA P +
Sbjct: 413 VLLVGYGTDPATGLDYWIVKNSWGTAWGEDGYFRIRRGSDECAIESIAVAATPIPR 468
>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
Length = 463
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
Length = 455
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 109/229 (47%), Gaps = 30/229 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGSC+ F + L R I + S LS +++C + GCDGG+P +Y
Sbjct: 245 QGSCGSCYCFATMGMLEARLRILTNNSQSPVLSPQQVVSCSEY--SQGCDGGFPYLTGKY 302
Query: 73 FVHHGVVTEECDPYF--DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
G+V E C PY DS C Y +++ + +
Sbjct: 303 VQDFGIVDESCFPYMGKDSPCGISQSCRRGYA-----------------AEYKYVGGFYG 345
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--ITGDV----MGGHAVKLIGWG 184
+M E+ KNGP+ V+ VY DF YK G+Y H +T V + HAV L+G+G
Sbjct: 346 GCSEAAMMVELVKNGPMAVALEVYSDFMSYKGGIYHHTGLTDHVNPFELTNHAVLLVGYG 405
Query: 185 TSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G+ YWI+ N W SWG DGYF+I+RGS+EC IE VA P K
Sbjct: 406 RCHMTGQKYWIVKNSWGSSWGEDGYFRIRRGSDECAIESIAVAASPIPK 454
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 111/221 (50%), Gaps = 15/221 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA + SDR+ I LS LL+C GC GGY AW +
Sbjct: 223 QGWCGASWAVSTADVASDRYSIMSKGAEAPELSAQQLLSC-NNRGQQGCRGGYLDRAWLF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 132
G+V +EC P+ C+ + C K + R + AYR+ +
Sbjct: 282 MRKFGLVDKECYPWSGKND----QCKLRKRSTLKAAGCRKPSHPLRTELYKVGPAYRLGN 337
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDG 189
+ DIM EI +GPV+ + VY+DF YKSG+Y+H + G H+V++IGWG
Sbjct: 338 E-TDIMQEILTSGPVQATMRVYQDFFIYKSGIYRHSRSAELHDSGYHSVRIIGWGEERSY 396
Query: 190 E----DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
YW++AN W +WG +G FKI++G+NEC IE V+A
Sbjct: 397 RGPPLKYWLVANSWGYNWGDNGLFKIQKGTNECEIESYVLA 437
>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
Length = 463
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
Length = 463
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 81/215 (37%), Positives = 107/215 (49%), Gaps = 23/215 (10%)
Query: 15 QGH-CGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLAC--CGFLCGDGCDGGYPISA 69
QG C SCWA A L+DR C+ G + LS +L+ C G L GC GG +
Sbjct: 53 QGQKCSSCWAMTATGVLADRLCVASGGKVKKVLSPQELIDCDRNGNL---GCGGGRLDTP 109
Query: 70 WRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
YF +GVVTE+C+ Y A C C +K++S YR
Sbjct: 110 LAYFRDNGVVTEKCESY------------KATQASSCSNTCDDGTSFSNTTKYHSKDCYR 157
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDD 188
++S E A+IY NGP+ F +Y D +YKSGVY K + HA ++IGWG +D
Sbjct: 158 LSS-IEQAKADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAGRVIGWGV-ED 215
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 223
G YW+ AN W WG G FKI+ G+NE G E +
Sbjct: 216 GVQYWLAANSWGTGWGQQGLFKIRSGTNEVGFEAN 250
>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
Length = 487
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 85/243 (34%), Positives = 118/243 (48%), Gaps = 17/243 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA + +DRF I M +LS LL+C L GC GG+ SAW +
Sbjct: 242 QGWCGASWAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDL-QRGCQGGHLTSAWNW 300
Query: 73 FVHHGVVTEECDPY-FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G+VTEEC P+ +T C+ + K + L R Y ++
Sbjct: 301 VMTFGLVTEECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSPLRRVGLMYRVAT---- 356
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDD 188
E IM EI G V+ V ++F Y+SGVYK D+ G H V+++GWG
Sbjct: 357 --EEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQ 414
Query: 189 G---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFE 245
YWI++N W WG GYF+I +G+NEC IE+ VVA +P N I+ E
Sbjct: 415 NGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMPDIDNFCN-ISDQSFRE 473
Query: 246 DAS 248
+AS
Sbjct: 474 NAS 476
>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
Length = 458
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 115/228 (50%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC+AF ++ L R I ++ LS +++C + GC+GG+P + A +
Sbjct: 247 QASCGSCYAFSSMGMLESRIQIRSQLSQKPILSPQQVVSCSNY--SQGCEGGFPYLIAGK 304
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y +G+V E PY TG P C K Q + ++++ + +
Sbjct: 305 YVSDYGIVEESDLPY---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGG 349
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ GP+ V+F VY+DF HY+SGVY H + HAV L+G+GT
Sbjct: 350 CNEAYMKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGT 409
Query: 186 SDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GE YWI+ N W SWG GYF+I+RG++EC IE V+ P K
Sbjct: 410 DQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSAEPIIK 457
>gi|426252217|ref|XP_004019812.1| PREDICTED: dipeptidyl peptidase 1, partial [Ovis aries]
Length = 455
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 119/228 (52%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC++F ++ + R I + LS ++++C + GC+GG+P + A +
Sbjct: 244 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 301
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E+C PY TG P C K C + + +S+++ + +
Sbjct: 302 YAQDFGLVEEDCFPY---TGTDSP-C-------KLKEGCFR----YYSSEYHYVGGFYGG 346
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ GP+ V+F VY DF HY+ GVY H + HAV L+G+GT
Sbjct: 347 CNEALMKLELVHRGPMAVAFEVYNDFLHYRQGVYHHTGLRDPFNPFELTNHAVLLVGYGT 406
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+ G DYWI+ N W SWG DGYF+I+RG++EC IE +A P K
Sbjct: 407 DAASGLDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIALAATPIPK 454
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 57/102 (55%), Positives = 71/102 (69%), Gaps = 1/102 (0%)
Query: 126 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 185
SAY + I EI NGPV FT+YED YKSGVY+H G ++GGHA+K+IGWGT
Sbjct: 113 SAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 172
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+G YW++AN W WG +G+FKI+RG NECGIE +VVAG
Sbjct: 173 -QNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAG 213
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 20 SCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDG 63
SCWAFGA E +SDR CI +S D++ CCG CG GCDG
Sbjct: 66 SCWAFGAAEVISDRICIATKGARQPIISPMDMVDCCGKYCGYGCDG 111
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 111/239 (46%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCQGGRLDGAWWF 176
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------N 119
GVV++ C P+ H E A P P+C+ + R N
Sbjct: 177 LRRRGVVSDHCYPF-----SGHERNE-AGPAPRCMMHSRAMGRGKRQATARCPNSYVHAN 230
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-------- 171
+ AYR+ S+ +DIM E+ +NGPV+ V+EDF Y+SG+Y H
Sbjct: 231 DIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPERYR 290
Query: 172 VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 291 RHGTHSVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLG 349
>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
Length = 458
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 115/230 (50%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC+AF ++ L R I ++ LS +++C + GCDGG+P + A +
Sbjct: 247 QGSCGSCYAFASMGMLESRIQIQSQLSQKPILSPQQVVSCSNY--SQGCDGGFPYLIAGK 304
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN--QLWRNSKHYSISAYR 129
Y G+V E PY G P C K+ Q + ++++ + +
Sbjct: 305 YLNDFGIVEESDFPYI---GSDSP--------------CTLKDSYQRYYTAEYHYVGGFY 347
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ GP+ V+F VY+DF HY+SGVY H + HAV L+G+
Sbjct: 348 GGCNEAYMKLELVLGGPLSVAFEVYDDFIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGY 407
Query: 184 GTSDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT GE YWI+ N W SWG G+F+I+RGS+EC IE V+ P K
Sbjct: 408 GTDQQTGEKYWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVSANPIIK 457
>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
Length = 463
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 120/228 (52%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARLRILTNNSQTPVLSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY T P C K + C + + +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TATDSP-C-------KVKKDCFR----YYSSEYHYVGGFYGG 354
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ +GPV VSF VY+DF HY G+Y H + HAV L+G+GT
Sbjct: 355 CNEALMKLELVNHGPVVVSFEVYDDFIHYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGT 414
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
S G DYWI+ N W+ +WG DGYF+I+RG++ECGIE + P K
Sbjct: 415 DSASGLDYWIVKNSWSATWGEDGYFRIRRGTDECGIESIALTATPIPK 462
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 80/231 (34%), Positives = 116/231 (50%), Gaps = 23/231 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q +CGSCWA + +SDR C+ + +S++ + A + GDGC+GG A+ F+
Sbjct: 95 QSNCGSCWAVSSAGVMSDRICVATNGKVKVSISGI-ATASCVGGDGCNGGLEEVAFEKFI 153
Query: 75 HHGVVT-------EECDPYFDSTGCSH-------PGCE--PAYPTPKCVRKCVKK-NQLW 117
+G T + C PY C+H P C+ P Y C +C K ++ +
Sbjct: 154 ENGFPTGSEVDKHQGCQPY-PFKHCAHHVNSTEYPPCDSVPEYKADTCSHECQKDYDRKY 212
Query: 118 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-VMGGH 176
+Y Y SD I EI NGPV VSFTVYE F +Y G+Y+ G+ + G H
Sbjct: 213 EEDLYYGKEQYGF-SDEAPIQREIMTNGPVAVSFTVYESFLYYSGGIYRSTPGERIKGYH 271
Query: 177 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK-IKRGSNECGIEEDVVA 226
AV+++GWG ++G YW +AN WN WG + G +E IE+ VA
Sbjct: 272 AVRVVGWGV-ENGTKYWKIANSWNEQWGRERLLPHTPAGVDESDIEDGGVA 321
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 112/226 (49%), Gaps = 16/226 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CGS W+ SDR I +N +LS LL+C GC+GGY AW Y
Sbjct: 204 QGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHR-QKGCEGGYLDRAWWY 262
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRIN 131
GVV + C PY S PG R+ ++ ++S + ++ Y+++
Sbjct: 263 IRKLGVVGDHCYPYV-SGQSREPGHCLIPKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVS 321
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH--------ITGDVMGGHAVKLIGW 183
S EDI E+ NGPV+ +F V+EDF Y GVY+H + G H+V+++GW
Sbjct: 322 SREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGW 381
Query: 184 G---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G ++ YW+ AN W WG DGYFK+ RG N C IE V+
Sbjct: 382 GVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIG 427
>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
Length = 314
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 106/223 (47%), Gaps = 24/223 (10%)
Query: 12 LVIQGHCGSCWAFGAVEALSDRFCIHFGMNL---SLSVNDLLACCGFLCGDGCDGGYPIS 68
++ Q CGSCWAF + E LSDR CI +LS L+AC DGC GG P
Sbjct: 105 ILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVAC-DVYGNDGCSGGIPQL 163
Query: 69 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKN--QLWRNSKHYSIS 126
AW Y G+ T+ C PY G + C R C L+R +K +++
Sbjct: 164 AWEYMELKGLPTDSCVPYTAGNGTVY----------SCQRSCSDSEDYSLYR-AKPFTL- 211
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG-DVMGGHAVKLIGWGT 185
+ S + I I GP+ + VYEDF Y SGVY G ++GGHA+K++GWG
Sbjct: 212 --KTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGHAIKIVGWGF 269
Query: 186 SDDGE-DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 227
+ +YWI+AN W WG G+F I C I D A
Sbjct: 270 DQTSQLNYWIVANSWGADWGQQGFFFISM--ETCSISSDASAA 310
>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
Length = 464
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 79/232 (34%), Positives = 119/232 (51%), Gaps = 31/232 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K L
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKLL 464
>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
Length = 464
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 79/232 (34%), Positives = 119/232 (51%), Gaps = 31/232 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 234
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K L
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKLL 464
>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
Length = 461
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 116/228 (50%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC+AF ++ L R I + LS ++++C + GC+GG+P + A +
Sbjct: 250 QASCGSCYAFASMGMLEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 307
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY + P P C R + +S ++ + +
Sbjct: 308 YAQDFGLVEEACFPYMGAD-------FPCKPKKDCFR--------YYSSDYHYVGGFYGG 352
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ +GP+ V+F VY+DF HY++G+Y H + HAV L+G+GT
Sbjct: 353 CNEALMKLELVHHGPIAVAFQVYDDFFHYRTGIYYHTGLRDPFNPFELTNHAVLLVGYGT 412
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
+ G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 DTASGMDYWIVKNSWGAGWGENGYFRIRRGTDECAIESIAVAATPVPK 460
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 84/233 (36%), Positives = 114/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG+ AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCNTHH-QQGCRGGHLDGAWWF 281
Query: 73 FVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G P + T + R+ N N+ Y ++
Sbjct: 282 LRRRGVVSDHCYPFLGRERDKAGPVPPCMMHSRATGRGKRQATAHCPNGHVNNNNIYQVT 341
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
AYR+ S+ +IM E+ +NGPV+ V+EDF YK G+Y H ++ G H+
Sbjct: 342 PAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHS 401
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 402 VKITGWGEETWPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 112/239 (46%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 171 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWF 229
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ A PTP+C+ R+ + Q+ N
Sbjct: 230 LRRRGVVSDNCYPFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSN 283
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H
Sbjct: 284 DIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 343
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 344 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 402
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 125/264 (47%), Gaps = 37/264 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCG----DGCDGGYPIS 68
Q C +CWA +V +DR CI G ++ LS+ L +CC G DGC G
Sbjct: 62 QAECHNCWASASVGMFNDRVCIQSGGRITDILSLAYLTSCCNHANGCPKSDGCRRGSVAE 121
Query: 69 AWRYFVHHGVVT-------------EECDPYFDSTGCSH-PGCEPAYPTPKCVRK----- 109
+ +HG+VT + C PY C+H PG + YP +C K
Sbjct: 122 GLIFMKNHGIVTGGEYKPPKKLGNDDGCWPY-PFPKCNHVPGMKVKYP--RCGSKVGRLA 178
Query: 110 ----CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
C + H + S R+ PE I EI+ NGPV T++EDF YKSGVY
Sbjct: 179 APSHCDGLHCRRAGDVHRAKSWGRLPISPEKIKQEIFDNGPVAAIMTIHEDFRLYKSGVY 238
Query: 166 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
++ TG ++G H +KLIGWG + G++YW+ N WN WG G K+ G N ++E+
Sbjct: 239 EYKTGAMVGAHTLKLIGWGV-EAGQEYWLAVNSWNEEWGDQGKIKLAVGKN--ALDEESR 295
Query: 226 AGLPSSKNLVKEITSADMFEDASA 249
+P + V E+ M ++ A
Sbjct: 296 QQVP--RRAVNELDEDAMMAESGA 317
>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 305
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 75/214 (35%), Positives = 106/214 (49%), Gaps = 19/214 (8%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDG---CDGGYPISAWR 71
Q C C+AF + ALS R CI L SV L A C G C GG ++W
Sbjct: 101 QSDCSCCYAFATLGALSTRRCI---AKLDASVVPLSAQHMVSCDHGEAGCQGGGFNTSWA 157
Query: 72 YFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI 130
+ G + +C PY TG S +C C + L ++ HY +
Sbjct: 158 FLETEGAIMRDCLPYVSGETGLSG----------ECPTTC-QDGTLLNDTIHYKAVSASH 206
Query: 131 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 190
+ +IM + GPV+ F V+EDF +Y G+Y G +GGHAV ++G+G+ ++
Sbjct: 207 LKNYNEIMTSLLNEGPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAVLIVGYGSMNN-H 265
Query: 191 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 224
DYWI+ N W WG +GYF+I RG+NECGIE +
Sbjct: 266 DYWIVRNSWGSDWGENGYFRILRGTNECGIENNA 299
>gi|403354695|gb|EJY76909.1| Cathepsin B [Oxytricha trifallax]
Length = 311
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 99/203 (48%), Gaps = 16/203 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNL--SLSVNDLLACCGFLCGD-GCDGGYPISAWR 71
Q CGSCWAF L R+C+ LS +L++C F GCDGGY +
Sbjct: 107 QAQCGSCWAFATTNVLEYRYCMATKGKKYPELSPQNLISC--FNSASWGCDGGYIDQTFL 164
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GV TE+C PY G C KC L+ N + + +
Sbjct: 165 YLEMMGVNTEQCMPYKSGDGN----------MTACPSKCANGENLYMNKYYCRPGSTQYM 214
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 191
+ ++ GP+ F V+EDF +Y G+Y ++GD +G HAVKL+G+G ++ +
Sbjct: 215 RGEQQFKNYLFNKGPMVAVFDVFEDFINYGGGIYNKVSGDKLGKHAVKLLGYGV-ENSTN 273
Query: 192 YWILANQWNRSWGADGYFKIKRG 214
Y+I NQW + WG DGYF+IK G
Sbjct: 274 YYIGVNQWGKDWGEDGYFRIKAG 296
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 74/222 (33%), Positives = 109/222 (49%), Gaps = 18/222 (8%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLC---GDGCDGGYPISAW 70
I +CGSCW+F +V ++SDR + V+DL C +GC GG+P++A+
Sbjct: 66 IPQYCGSCWSFASVSSVSDR--LKLMTKGKWPVHDLSPQVILNCDHNSNGCQGGHPLTAF 123
Query: 71 RYFVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLW--RNSKHYSISA 127
+Y HGV E C Y + C+ R C + + +N Y +
Sbjct: 124 KYMHDHGVPEEGCMRYMAKNMECTDI---------NICRDCDSEKGCFAVKNYTKYYVDE 174
Query: 128 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 187
Y + +++M EIY GP+ S V +D YK G+Y+ TG HA+ ++GWG +
Sbjct: 175 YGSVAGEKNMMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHAISVVGWG-EE 233
Query: 188 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
DG+ YWI N W WG G+F+I RG N GIE D +P
Sbjct: 234 DGQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVP 275
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 92/206 (44%), Gaps = 16/206 (7%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
I +CGSCWA ALSDR + + LS +++ C CDGG +
Sbjct: 350 IPQYCGSCWAQAPTSALSDRINLMRKGKWPTVELSAQEVINCSN---AGTCDGGSDADVF 406
Query: 71 RYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
Y + G+ + C Y D C P C ++ K Y +S Y
Sbjct: 407 EYAFNEGIPDQTCQVYEAIDKECNDMARCMDCPPGEDCYPV--------KDYKRYKVSEY 458
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
+I AEI+ GPV S V E+F Y+ G++ G ++G HAV++ GWG ++D
Sbjct: 459 GEVKGEMEIKAEIFARGPVSCSMIVTEEFLAYQGGIFVDDRGHIVGYHAVEVAGWGETED 518
Query: 189 GEDYWILANQWNRSWGADGYFKIKRG 214
G YWI N W WG G+F++ G
Sbjct: 519 GTKYWIARNSWGPYWGEHGWFRMIVG 544
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ + N N+ Y ++
Sbjct: 177 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVT 236
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 237 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 296
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 297 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349
>gi|258618831|gb|ACV84238.1| cysteine proteinase L [Anisakis simplex]
Length = 411
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 79/211 (37%), Positives = 106/211 (50%), Gaps = 28/211 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
Q CGSCWAFGAV + I +SLS L+ C + +GCDGGY A +Y
Sbjct: 215 QQRCGSCWAFGAVGVVESMNAIAKNPLVSLSEQQLVDCD--MNDNGCDGGYRPYALQYIR 272
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
H+G+V EE PY G C+ + K VK + RN
Sbjct: 273 HNGIVPEELYPY---AGKELDSCKLNTTVQRVYVKTVK--YIRRN--------------- 312
Query: 135 EDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----MGGHAVKLIGWGTSDDG 189
E MA+ ++ GP+ V V +D HY+SGV+ D G HA+ ++G+G S +G
Sbjct: 313 ESAMADFVFYKGPLSVGINVTKDLFHYQSGVFTPSKEDCEQNPQGTHALAVVGYG-SQNG 371
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGI 220
EDYWI+ N W + WG DG+F KRG+N CGI
Sbjct: 372 EDYWIIKNSWGKRWGMDGFFLYKRGANSCGI 402
>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
Length = 316
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 105 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 162
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 163 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 205
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 206 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 265
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 266 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 315
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 112/239 (46%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWF 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ A PTP+C+ R+ + Q+ N
Sbjct: 281 LRRRGVVSDNCYPFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSN 334
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H
Sbjct: 335 DIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 394
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 395 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453
>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
Length = 463
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSKY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y GVV E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGVVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY+ G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G YWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 103/202 (50%), Gaps = 22/202 (10%)
Query: 20 SCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHG 77
SCWA A ++DR C+ + +S D+L+CCG CG GC GG I AW++ + +G
Sbjct: 1 SCWAVSAASVMTDRLCVQSKGRIKRFISDTDILSCCGRFCGYGCRGGANIRAWKHVMRNG 60
Query: 78 VVT-------EECDPY-FDSTGCSHPGC------EPAYPTPKCVRKCVKK--NQLWRNSK 121
V T C PY F G +Y TP+C + C + + +
Sbjct: 61 VCTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDR 120
Query: 122 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 181
+Y+ SAY + +D + IM EI + GPV ++ Y DF YK GVY+H G+ GGH++K++
Sbjct: 121 YYAASAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVYEHTAGERTGGHSIKIM 180
Query: 182 GWGTSDDGED----YWILANQW 199
GWG YW++AN W
Sbjct: 181 GWGNYKHPNGTVIPYWLVANSW 202
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/217 (36%), Positives = 103/217 (47%), Gaps = 30/217 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGD----GCDGGYPIS 68
QG+C S WA SDR CI + LS +LL+C GD GCDGG
Sbjct: 49 QGNCASSWAVAVASTFSDRLCIASNGQFTDNLSAQNLLSC-----GDEEKMGCDGGSAFK 103
Query: 69 AWRYFVHHGVVT-------EECDPYFDSTGCSHPG------CEPAYPTPK--CVRKCVKK 113
AW + G+VT E C PY C+H G C T C KCV K
Sbjct: 104 AWELTMSKGIVTGGNFDSNEGCQPY-KIRPCNHYGNGNLKNCSSLRRTQMTVCREKCVNK 162
Query: 114 NQL--WRNSKHYSISAYRIN-SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG 170
N + + H + Y + ++ + I EI GPV VYE+F YK G+YK G
Sbjct: 163 NYKVKYEDDLHKTSIVYMTSWTNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGIYKSTAG 222
Query: 171 DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 207
+++G H VKLIGWG DG +YW+ N WN +WG +G
Sbjct: 223 ELIGYHHVKLIGWGVDGDGTEYWLAMNSWNSNWGTNG 259
>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
Length = 463
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 119/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC++F ++ + R I + LS ++++C + GC+GG+P + A +
Sbjct: 252 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E+C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEDCFPY---TGTDSP--------------CRLKEGCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ GP+ V+F VY+DF HY+ GVY H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT + G DYWI+ N W SWG +GYF+I+RG++EC IE +A P K
Sbjct: 413 GTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 82/239 (34%), Positives = 112/239 (46%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 210 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQRGCHGGRLDGAWWF 268
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------N 119
GVV++ C P+ + A P P+C+ + R N
Sbjct: 269 LRRRGVVSDHCYPFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHAHAN 322
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H +
Sbjct: 323 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYR 382
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 383 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 441
>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
C
Length = 441
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 228 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 285
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 286 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 328
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 329 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 388
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 389 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 438
>gi|1582221|prf||2118248A prepro-cathepsin C
Length = 463
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
Length = 463
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
Length = 463
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 235 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 292
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 293 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 335
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 336 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 395
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 396 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 445
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 104/221 (47%), Gaps = 15/221 (6%)
Query: 22 WAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGV 78
WA+ L+DR CI + N LS +L+ C G G G + W Y HG+
Sbjct: 115 WAYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGL 172
Query: 79 VTEECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINS 132
V+ Y + GC P P C +C N + H +S Y
Sbjct: 173 VS--GGKYNTNDGCQPSKIPPIGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIK 230
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGED 191
EDI E+ GPV V F VY+DF YKSGVY + + H KLIGWG ++G D
Sbjct: 231 SNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVD 289
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
YW+L N W WG +G FKIKRG+NE +E+ V AG P K
Sbjct: 290 YWLLVNSWGNEWGQNGLFKIKRGTNEVHVEDYVYAGEPEIK 330
>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
Length = 458
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 119/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC++F ++ + R I + LS ++++C + GC+GG+P + A +
Sbjct: 247 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 304
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E+C PY TG P C K +R +S+++ + +
Sbjct: 305 YAQDFGLVEEDCFPY---TGTDSP--------------CRLKEGCFRYYSSEYHYVGGFY 347
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ GP+ V+F VY+DF HY+ GVY H + HAV L+G+
Sbjct: 348 GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGY 407
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT + G DYWI+ N W SWG +GYF+I+RG++EC IE +A P K
Sbjct: 408 GTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 457
>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
Length = 463
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
Length = 463
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
Length = 463
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 119/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC++F ++ + R I + LS ++++C + GC+GG+P + A +
Sbjct: 252 QGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E+C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEDCFPY---TGTDSP--------------CRLKEGCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ GP+ V+F VY+DF HY+ GVY H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT + G DYWI+ N W SWG +GYF+I+RG++EC IE +A P K
Sbjct: 413 GTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAATPIPK 462
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/239 (35%), Positives = 114/239 (47%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNS 120
GVV++ C P+ S + A PTP C+ R+ N N+
Sbjct: 177 LRRRGVVSDHCYPF------SGRERDEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNN 230
Query: 121 KHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H +
Sbjct: 231 DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYR 290
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 291 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349
>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
Length = 462
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 121/228 (53%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 251 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 308
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY TG P C K + C++ + +S+++ + +
Sbjct: 309 YAQDFGLVEESCFPY---TGTDAP-C-------KMKKDCIR----YYSSEYHYVGGFYGG 353
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ +GP+ V+F VY+DF HY+ G+Y+H + HAV L+G+GT
Sbjct: 354 CNEALMKLELVHHGPMAVAFEVYDDFLHYQKGIYQHTGLRDPFNPFELTNHAVLLVGYGT 413
Query: 186 S-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G DYWI+ N W SWG DG+F+I+RG +EC IE +A P K
Sbjct: 414 DLASGMDYWIVKNSWGTSWGEDGFFRIRRGIDECSIESIAMAATPIPK 461
>gi|348565723|ref|XP_003468652.1| PREDICTED: dipeptidyl peptidase 1-like [Cavia porcellus]
Length = 463
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 119/229 (51%), Gaps = 29/229 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC++F +V L R I + LS ++++C + GC+GG+P + A +
Sbjct: 252 QGSCGSCYSFASVGMLEARIRILTNNTQTPILSPQEIVSCSQY--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY G P C K + CV+ + S+++ + +
Sbjct: 310 YAQDFGLVEESCFPY---KGIDVP-C-------KVKKDCVR----YYTSEYHYVGGFYGG 354
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWG 184
+ + E+ ++GP+ V+F VY+DF HY G+Y H TG + HAV L+G+G
Sbjct: 355 CNEALMKLELVQHGPMAVAFEVYDDFLHYHKGIY-HRTGLRDPFNPFELTNHAVLLVGYG 413
Query: 185 TSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
T G DYWI+ N W WG DGYF+I RG++EC IE +A P K
Sbjct: 414 TDPVSGRDYWIVKNSWGTGWGEDGYFRILRGTDECAIESIAMAATPIPK 462
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/239 (34%), Positives = 114/239 (47%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDKHN-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCV-----KKNQLWRNSKH----- 122
GVV++ C P+ A P P+C+ K+ + R H
Sbjct: 282 LRRRGVVSDHCYPFSGQER------NEAGPEPRCMMHSRAMGRGKRQAIARCPNHHVHAN 335
Query: 123 --YSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
Y ++ AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+ G+Y H +
Sbjct: 336 DIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYR 395
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLG 454
>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
Length = 356
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 119/228 (52%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 145 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 202
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY TG P C K + C + + +S +Y + +
Sbjct: 203 YAQDFGVVEEGCFPY---TGTDSP-C-------KLKKDCFR----YYSSDYYYVGGFYGG 247
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ I E+ +GP+ V+F VY DF HY G+Y H + HAV L+G+GT
Sbjct: 248 CNEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGT 307
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
S G+DYWI+ N W SWG DGYF+I+RG++EC IE +A P K
Sbjct: 308 DSASGQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 355
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 177 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 236
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 237 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 296
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 297 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWF 250
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 251 LRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVT 310
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
AYR+ S+ +IM E+ +NGPV+ V+EDF YK G+Y H ++ G H+
Sbjct: 311 PAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHS 370
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 371 VKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVT 341
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
AYR+ S+ +IM E+ +NGPV+ V+EDF YK G+Y H ++ G H+
Sbjct: 342 PAYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHS 401
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 402 VKITGWGEETRPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/239 (33%), Positives = 114/239 (47%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ + A P P+C+ R+ + + + N
Sbjct: 282 LRRRGVVSDHCYPFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHVHAN 335
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ AYR+ ++ ++IM E+ +NGPV+ V+EDF Y+ G+Y H +
Sbjct: 336 DIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYR 395
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454
>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
Length = 363
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 109/214 (50%), Gaps = 32/214 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCW F V L F I + + +LS L+ C G GC+GG P A++Y
Sbjct: 153 QGSCGSCWTFSTVGTLEAHFLIKYQQSRNLSEQQLVDCAGAYDNYGCNGGLPSHAFQYIS 212
Query: 75 HH-GVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN-S 132
+ G+ TE PYF R C + ++ K + +N +
Sbjct: 213 DNGGIATEAAYPYFAKD-----------------RPCT----IQQSQKSVGVVGGSVNLT 251
Query: 133 DPEDIMA-EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-----HAVKLIGWGTS 186
ED +A I+++GPV +++ V +DF Y SGVY T D G HAV +G+GT
Sbjct: 252 KSEDELAIAIFQHGPVSIAYEVIDDFMDYHSGVY--TTKDCKNGPDDVNHAVVAVGFGT- 308
Query: 187 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 220
++G DYW++ N W+ WG +GYFKI+RG N CGI
Sbjct: 309 ENGVDYWLVKNSWSTKWGDNGYFKIQRGVNMCGI 342
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/238 (34%), Positives = 114/238 (47%), Gaps = 32/238 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPYF----DSTGCSHP--------GCEPAYPTPKCVRKCVKKNQLWRNS 120
GVV++ C P+ D G + P G T +C V N +++ +
Sbjct: 282 LRRRGVVSDHCYPFSGHEQDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVHANDIYQVT 341
Query: 121 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------- 172
AYR+ S+ ++IM E+ +NGPV+ V+EDF Y+ G+Y H +
Sbjct: 342 -----PAYRLGSNEKEIMKELLENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRR 396
Query: 173 MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 397 HGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLG 454
>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
Length = 460
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 117/230 (50%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 249 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 306
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y GVV E C PY TG P C K +R +S+++ + +
Sbjct: 307 YAQDFGVVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 349
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY G+Y H + HAV L+G+
Sbjct: 350 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGY 409
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G YWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 410 GTDSASGIHYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 459
>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
Length = 466
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 116/228 (50%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I S LS ++++C + GC+GG+P + A +
Sbjct: 255 QASCGSCYSFASMGMLEARIRILTNNTQSPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 312
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y G+V E C PY TG P C K C++ + S+++ + +
Sbjct: 313 YAQDFGLVEEACFPY---TGTDSP-C-------KMKEDCIR----YYTSEYHYVGGFYGG 357
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ +GP+ V+F VY+DF HY G+Y H + HAV L+G+GT
Sbjct: 358 CNEALMKLELVHHGPMAVAFEVYDDFLHYNQGIYHHTGLKDPFNPFELTNHAVLLVGYGT 417
Query: 186 S-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G DYWI+ N W SWG GYF+I+RG++EC IE +A P K
Sbjct: 418 DPKTGLDYWIVKNSWGTSWGEQGYFRIRRGTDECAIESIAMAATPIPK 465
>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
Length = 463
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 118/230 (51%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F +V L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 252 QESCGSCYSFASVGMLEARIRILTNNSQTPILSPQEIVSCSQY--AQGCNGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E+C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEDCFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT G DYWI+ N W SWG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDPATGVDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
Length = 478
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 119/228 (52%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GC+GG+P + A +
Sbjct: 267 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQY--AQGCEGGFPYLIAGK 324
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY TG P C K + C + + +S +Y + +
Sbjct: 325 YAQDFGVVEEGCFPY---TGTDSP-C-------KLKKDCFR----YYSSDYYYVGGFYGG 369
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ I E+ +GP+ V+F VY DF HY G+Y H + HAV L+G+GT
Sbjct: 370 CNEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGT 429
Query: 186 -SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
S G+DYWI+ N W SWG DGYF+I+RG++EC IE +A P K
Sbjct: 430 DSASGQDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAMAATPIPK 477
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 104/221 (47%), Gaps = 15/221 (6%)
Query: 22 WAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGV 78
WA+ L+DR CI + N LS +L+ C G G G + W Y HG+
Sbjct: 115 WAYATAGVLADRMCIATNGSYNQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGL 172
Query: 79 VTEECDPYFDSTGCSHPGCEPAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINS 132
V+ Y + GC P P C +C N + H +S Y
Sbjct: 173 VS--GGKYNTNDGCQPSKIPPIGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIK 230
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGED 191
EDI E+ GPV V F VY+DF YKSGVY + + H KLIGWG ++G D
Sbjct: 231 SNEDIQKEVQTYGPVSVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVD 289
Query: 192 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
YW+L N W WG +G FKIKRG+NE +E+ V AG P K
Sbjct: 290 YWLLVNFWGNEWGQNGLFKIKRGTNEVHVEDYVYAGEPEIK 330
>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
Length = 470
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 112/227 (49%), Gaps = 31/227 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCI--HFGMNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
QG CGSC+AF ++ L R + + LS ++++C + GC+GG+P + A +
Sbjct: 258 QGQCGSCYAFASMGMLEARLRVLTNNTQQFVLSPQEIVSCGKY--SQGCEGGFPYLIAGK 315
Query: 72 YFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 129
Y GVV EEC PY DS+ C Y T + + +
Sbjct: 316 YAEDFGVVLEECYPYEGKDSSCKDTSRCGRGYAT-----------------NYRYVGGFY 358
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ E + E+ KNGP+ V+F VY DF HYK GVY+H + HAV L+G+
Sbjct: 359 GGCNEELMQLELVKNGPMAVAFEVYSDFMHYKGGVYEHTGLSDPFNPFEITNHAVLLVGY 418
Query: 184 GTS-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G + G +W + N W WG +G+F+I+RG++EC IE VA P
Sbjct: 419 GRDPETGAKFWTVKNSWGEKWGEEGFFRIRRGTDECAIESIAVAADP 465
>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
Length = 463
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 116/230 (50%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ + R I + LS ++++C + GC GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQY--AQGCAGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CTVKEGCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HY+ G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GTS-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT G DYWI+ N W SWG DGYF+I+RG++EC IE VA P K
Sbjct: 413 GTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
Length = 470
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 112/221 (50%), Gaps = 27/221 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC+AF ++ L R I ++ LS +++C + GC+GG+P + A +
Sbjct: 247 QASCGSCYAFSSMGMLESRIQIRSQLSQKPILSPQQVVSCSNY--SQGCEGGFPYLIAGK 304
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y +G+V E PY TG P C K Q + ++++ + +
Sbjct: 305 YVSDYGIVEESDLPY---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGG 349
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ GP+ V+F VY+DF HY+SGVY H + HAV L+G+GT
Sbjct: 350 CNEAYMKLELVLGGPLSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGT 409
Query: 186 SDD-GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 225
GE YWI+ N W SWG GYF+I+RG++EC IE V
Sbjct: 410 DQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAV 450
>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
Length = 541
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 110/229 (48%), Gaps = 18/229 (7%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIH---FGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWR 71
Q + G+ WAF LSDR I F + + LS L++C F +G G W
Sbjct: 308 QENEGTSWAFSTTSVLSDRLAIQSKNFTV-VELSPQHLVSC--FSSHEG-RGERLDRTWW 363
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV+ C P S G C N + N + + YR++
Sbjct: 364 YLRKKGVVSTVCYPESRSKSTQGIGSCGLVAHSSGAHICPNGNVISSNEIYKTSPVYRVS 423
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGW 183
S+ E+IM EI++NGPV+ V DF YKSGVY D + H+VK+IGW
Sbjct: 424 SNEENIMKEIFENGPVQAVMRVQPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSVKIIGW 483
Query: 184 G---TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G + + YWI+ N W +WG GYF+I++G NECGIEE ++A P
Sbjct: 484 GEKKSKTNSGKYWIVQNSWGANWGEGGYFRIRKGVNECGIEEMILAAWP 532
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 113/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ + N N+ Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVT 341
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 342 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 401
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Equus caballus]
Length = 436
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 82/239 (34%), Positives = 113/239 (47%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG+ AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGHLDGAWWF 250
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCV---KKNQLWRN 119
GVV++ C P+ + A P P+C+ R+ +++ N
Sbjct: 251 LRRRGVVSDHCYPFSGRER------DEAGPAPRCMMHSRAMGRGKRQATAHCPNSRVHTN 304
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-------- 171
+ AYR+ S ++IM E+ +NGPV+ V+EDF Y+ GVY H
Sbjct: 305 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYR 364
Query: 172 VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 365 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 423
>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
[Equus caballus]
Length = 467
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 82/239 (34%), Positives = 113/239 (47%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG+ AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGHLDGAWWF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCV---KKNQLWRN 119
GVV++ C P+ + A P P+C+ R+ +++ N
Sbjct: 282 LRRRGVVSDHCYPFSGRER------DEAGPAPRCMMHSRAMGRGKRQATAHCPNSRVHTN 335
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD-------- 171
+ AYR+ S ++IM E+ +NGPV+ V+EDF Y+ GVY H
Sbjct: 336 DIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQGGVYSHTPVSHGRPERYR 395
Query: 172 VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 88/157 (56%), Gaps = 14/157 (8%)
Query: 83 CDPYFDSTGCSH-------PGCEPA-YPTPKCVRKC--VKKNQLWRNSKHYSISAYRINS 132
C PY D C+H P C YPTP CV +C K R+ +H+ + + +
Sbjct: 3 CWPY-DFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61
Query: 133 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 192
D I +GPV SFTVYEDF Y+SGVYKH +G +GGHAVK+IGWG G+ Y
Sbjct: 62 SVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK-SGQAY 120
Query: 193 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
W+ N WN WG G FKI G+ CGI++D++ G P
Sbjct: 121 WLAVNSWNEDWGDHGLFKIALGN--CGIDDDLLGGTP 155
>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
Length = 463
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 117/230 (50%), Gaps = 31/230 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C GC+GG+P + A +
Sbjct: 252 QASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQH--AQGCEGGFPYLIAGK 309
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR--NSKHYSISAYR 129
Y G+V E C PY TG P C K +R +S+++ + +
Sbjct: 310 YAQDFGLVEEACFPY---TGTDSP--------------CKMKEDCFRYYSSEYHYVGGFY 352
Query: 130 INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGW 183
+ + E+ +GP+ V+F VY+DF HYK G+Y H + HAV L+G+
Sbjct: 353 GGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 412
Query: 184 GT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 413 GTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPK 462
>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
[Nomascus leucogenys]
Length = 362
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 81/233 (34%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 118 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 176
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 177 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSHVNNNDIYQVT 236
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ +++M E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 237 PVYRLGSNDKEVMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 296
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 297 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349
>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
niloticus]
Length = 455
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 30/229 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSC++F + L R I + +LS +++C + GCDGG+P +Y
Sbjct: 245 QESCGSCYSFATMGMLEARIRILTNNSDAPTLSPQQVVSCSEY--SQGCDGGFPYLIGKY 302
Query: 73 FVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
G+V E C PY +T C P +K Q +++ + +
Sbjct: 303 TQDFGIVDESCFPYVGQNTPCGVP----------------QKCQRIYAAEYNYVGGFYGG 346
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWG 184
+M E+ KNGP+ V+F VY DF +YK G+Y H TG + HAV L+G+G
Sbjct: 347 CSEAAMMLELVKNGPMAVAFEVYPDFMNYKEGIYHH-TGLADPFNPFELTNHAVLLVGYG 405
Query: 185 T-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G++YWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 406 RCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 454
>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
[Pongo abelii]
Length = 436
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 113/239 (47%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 250
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNS 120
GVV++ C P+ + A PTP C+ R+ N N+
Sbjct: 251 LRRRGVVSDHCYPFSGRER------DEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNN 304
Query: 121 KHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H +
Sbjct: 305 DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYR 364
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 365 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 218 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 276
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 277 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSQAMGRGKRQATAHCPNSYVNNNDIYQVT 336
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 337 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 396
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 397 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 449
>gi|268581031|ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
Length = 379
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/219 (33%), Positives = 108/219 (49%), Gaps = 28/219 (12%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFV 74
QG CGSCWAF V A+ + I G+ +SLS +++ C G +GC GGY A R+
Sbjct: 182 QGQCGSCWAFATVAAIEAQHAIKKGILVSLSEQEMVDCDGR--NNGCSGGYRPYAMRFVK 239
Query: 75 HHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDP 134
+G+ TE+ PY + H C L +N I YR+ S
Sbjct: 240 ENGLETEKSYPY---SALKHDQC-----------------MLHQNDTKVYIDDYRMLSTS 279
Query: 135 EDIMAE-IYKNGPVEVSFTVYEDFAHYKSGVYKHITGD----VMGGHAVKLIGWGTSDDG 189
E+ +A+ + GPV V + Y+SG++ D MG HA+ ++G+G +
Sbjct: 280 EENIADWVGTKGPVTFGMNVVKAMYSYRSGIFNPSAEDCAEKSMGAHALTIVGYG-GEGT 338
Query: 190 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 228
YWI+ N W SWG+DGYF++ RG N CG+ VVA +
Sbjct: 339 SAYWIVKNSWGTSWGSDGYFRLARGVNSCGLANTVVAPI 377
>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
niloticus]
Length = 461
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 30/229 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMN--LSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
Q CGSC++F + L R I + +LS +++C + GCDGG+P +Y
Sbjct: 251 QESCGSCYSFATMGMLEARIRILTNNSDAPTLSPQQVVSCSEY--SQGCDGGFPYLIGKY 308
Query: 73 FVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
G+V E C PY +T C P +K Q +++ + +
Sbjct: 309 TQDFGIVDESCFPYVGQNTPCGVP----------------QKCQRIYAAEYNYVGGFYGG 352
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-------MGGHAVKLIGWG 184
+M E+ KNGP+ V+F VY DF +YK G+Y H TG + HAV L+G+G
Sbjct: 353 CSEAAMMLELVKNGPMAVAFEVYPDFMNYKEGIYHH-TGLADPFNPFELTNHAVLLVGYG 411
Query: 185 T-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G++YWI+ N W WG +GYF+I+RG++EC IE VA P K
Sbjct: 412 RCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPK 460
>gi|302848309|ref|XP_002955687.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
nagariensis]
gi|300259096|gb|EFJ43327.1| hypothetical protein VOLCADRAFT_106905 [Volvox carteri f.
nagariensis]
Length = 846
Score = 120 bits (302), Expect = 4e-25, Method: Composition-based stats.
Identities = 71/216 (32%), Positives = 105/216 (48%), Gaps = 13/216 (6%)
Query: 17 HCGSCWAFGAVEALSDRFCIHF---GMNLSLSVNDLLACCGFL-CGDGCDGGYPISAWRY 72
+CG CW G++ + DR I ++ LS LL C F G GCDGG + + Y
Sbjct: 566 YCGGCWVHGSLSMIQDRLKIKKRAKSPDVMLSRQTLLNCAAFEGYGHGCDGGDTVDVFSY 625
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL---W---RNSKHYSIS 126
G+ E C Y + PG +C+ C+ N + W R K+Y +
Sbjct: 626 MAEFGLPDEGCMTYNATDHTKFPGVSHCPVEGQCL-NCMPINGVDTCWPIERPVKYYLNA 684
Query: 127 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA-HYKSGVYKHITGDVMGGHAVKLIGWGT 185
++ E +M+EIY GP+ +DF HYK G+YK +GD H V+++GWG
Sbjct: 685 WGNLDKSVEAMMSEIYHRGPITCGIACPDDFTWHYKGGIYKDTSGDTELDHDVEVVGWGV 744
Query: 186 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
+DG YW++ N W WG G+F+++RG N IE
Sbjct: 745 -EDGVKYWVVRNSWGTYWGEMGFFRVERGVNALQIE 779
>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Pongo abelii]
Length = 467
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 113/239 (47%), Gaps = 34/239 (14%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNS 120
GVV++ C P+ + A PTP C+ R+ N N+
Sbjct: 282 LRRRGVVSDHCYPFSGRER------DEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNN 335
Query: 121 KHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
Y ++ YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H +
Sbjct: 336 DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYR 395
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
paniscus]
Length = 436
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 250
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 251 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 310
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 311 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 370
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 371 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423
>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens]
gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
Length = 436
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 192 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 250
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 251 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 310
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 311 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 370
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 371 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423
>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
Peptide, 462 aa]
Length = 462
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 117/228 (51%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GCDGG+P + A +
Sbjct: 251 QESCGSCYSFASIGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 308
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY + P P C+R + +S++Y + +
Sbjct: 309 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYSSEYYYVGGFYGG 353
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ K+GP+ V+F V++DF HY SG+Y H + HAV L+G+G
Sbjct: 354 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGK 413
Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G DYWI+ N W WG GYF+I+RG++EC IE +A +P K
Sbjct: 414 DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 341
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 342 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 401
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 228 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 286
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 287 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 346
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 347 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 406
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 407 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 459
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 112/233 (48%), Gaps = 22/233 (9%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 223 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWF 281
Query: 73 FVHHGVVTEECDPY----FDSTGCSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSIS 126
GVV++ C P+ D G + P + + R+ N N+ Y ++
Sbjct: 282 LRRRGVVSDHCYPFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVT 341
Query: 127 -AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHA 177
YR+ S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+
Sbjct: 342 PVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHS 401
Query: 178 VKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
VK+ GWG T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 402 VKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 85/247 (34%), Positives = 120/247 (48%), Gaps = 47/247 (19%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C S +A A A SDR CI N +S +++CC +LCG GCDGG +W Y
Sbjct: 83 QGNCRSSYAVAAASAASDRICIQSNGTKNPIMSAQQIISCC-YLCGHGCDGGSLFESWDY 141
Query: 73 FVHHGVVT-------EECDPYFDSTGCSHPGCE------PAYP--------TPKCVRKCV 111
+ HG V+ + C PY + P C+ P + TP C +KC
Sbjct: 142 YRRHGFVSGGDYNSNQGCQPY------TIPPCKLMNEKPPGHSCTTYHREETPICEKKCY 195
Query: 112 KKNQLWR------NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 165
N K+Y +S Y M +I+ NGP+ F +Y D YKSGVY
Sbjct: 196 NPNYYTSFRTDIYKGKYYKLSPYMA-------MKDIFDNGPITTQFYMYRDLVDYKSGVY 248
Query: 166 KHITG---DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 222
++ D H+VK+ GWG ++G YW++AN + WG +G FKI RG++ C +E
Sbjct: 249 QYDEQSDFDFFTVHSVKIFGWG-EENGVPYWLVANSFGTDWGYNGTFKISRGNDGCFFQE 307
Query: 223 DVVAGLP 229
+ AGLP
Sbjct: 308 KMYAGLP 314
>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
norvegicus]
gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; Flags:
Precursor
gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
Length = 467
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 83/239 (34%), Positives = 112/239 (46%), Gaps = 33/239 (13%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHF--GMNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG+C WAF SDR IH M LS +LL+C GC GG AW +
Sbjct: 222 QGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QKGCRGGRLDGAWWF 280
Query: 73 FVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRN 119
GVV++ C P+ + A PTP+C+ R+ + +Q+ N
Sbjct: 281 LRRRGVVSDNCYPF-----SGREQNDEASPTPRCMMHSRAMGRGKRQATSRCPNSQVDSN 335
Query: 120 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV------- 172
+ YR+ SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H
Sbjct: 336 DIYQVTPVYRLASDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYR 395
Query: 173 -MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 226
G H+VK+ GWG T DG YW AN W WG G+F+I RG NEC IE V+
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLG 454
>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 487
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 83/243 (34%), Positives = 116/243 (47%), Gaps = 17/243 (6%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFG--MNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY 72
QG CG+ WA + +DRF I M +LS LL+C L GC GG+ SAW +
Sbjct: 242 QGWCGASWAISTAQVTTDRFVIMTKGLMRDALSPKHLLSCNNDL-QRGCQGGHLTSAWNW 300
Query: 73 FVHHGVVTEECDPY-FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
+ G+VTEEC P+ +T C+ + K + L R Y ++
Sbjct: 301 VMTFGLVTEECYPWDGRATDCAVSNQRSNNNLIVTCPRSAKTSPLRRVGLMYRVAT---- 356
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK---HITGDVMGGHAVKLIGWGTSDD 188
E IM EI G V+ V ++F Y+SGVY+ G G H V+++GWG
Sbjct: 357 --EEGIMYEIMNWGSVQAMMKVSKEFFMYESGVYRCSNLALGSKTGYHTVRIVGWGEEQQ 414
Query: 189 G---EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFE 245
YWI++N W WG GYF+I +G+NEC IE+ VVA + N I+ E
Sbjct: 415 NGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVAAMADIGNFC-SISDKSFRE 473
Query: 246 DAS 248
+AS
Sbjct: 474 NAS 476
>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
Cysteine Protease Of The Papain Family
Length = 438
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 117/228 (51%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GCDGG+P + A +
Sbjct: 227 QESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 284
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY + P P C+R + +S++Y + +
Sbjct: 285 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYSSEYYYVGGFYGG 329
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ K+GP+ V+F V++DF HY SG+Y H + HAV L+G+G
Sbjct: 330 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGK 389
Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G DYWI+ N W WG GYF+I+RG++EC IE +A +P K
Sbjct: 390 DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 437
>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
Length = 462
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 117/228 (51%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GCDGG+P + A +
Sbjct: 251 QESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVSCSPY--AQGCDGGFPYLIAGK 308
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY + P P C+R + +S++Y + +
Sbjct: 309 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYSSEYYYVGGFYGG 353
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ K+GP+ V+F V++DF HY SG+Y H + HAV L+G+G
Sbjct: 354 CNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGK 413
Query: 186 SD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
G DYWI+ N W WG GYF+I+RG++EC IE +A +P K
Sbjct: 414 DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPK 461
>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
Length = 483
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 114/228 (50%), Gaps = 27/228 (11%)
Query: 15 QGHCGSCWAFGAVEALSDRFCIHFGMNLS--LSVNDLLACCGFLCGDGCDGGYP-ISAWR 71
Q CGSC++F ++ L R I + + LS ++++C + GCDGG+P + A +
Sbjct: 272 QESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSMY--AQGCDGGFPYLIAGK 329
Query: 72 YFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRIN 131
Y GVV E C PY + P P C+R + S +Y + +
Sbjct: 330 YAQDFGVVEENCFPYTATDA-------PCKPKENCLR--------YYTSGYYYVGGFYGG 374
Query: 132 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGT 185
+ + E+ ++GP+ V+F V +DF HY SG+Y H + HAV L+G+G
Sbjct: 375 CNEALMKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTGLRDPFNPFELTNHAVLLVGYGR 434
Query: 186 S-DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 232
D G DYW + N W WG GYF+I+RG++EC IE VA +P K
Sbjct: 435 DPDTGTDYWTVKNSWGTEWGESGYFRIRRGTDECAIESIAVAAIPIPK 482
>gi|300121755|emb|CBK22330.2| unnamed protein product [Blastocystis hominis]
Length = 562
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 107/221 (48%), Gaps = 16/221 (7%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFGMNLSLSVNDLLACCGFLC---GDGCDGGYPISAW 70
I +CGSCW+F +V ++SDR + V+DL C +GC GG+P++A+
Sbjct: 66 IPQYCGSCWSFASVSSVSDR--LKLMTKGKWPVHDLSPQVILNCDHNSNGCQGGHPLTAF 123
Query: 71 RYFVHHGVVTEECDPYF-DSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
+Y HGV E C Y + C+ C P C +N Y + Y
Sbjct: 124 KYMHDHGVPEEGCMRYMAKNMECTDINICRDCDPDKGCFAV--------KNYTKYYVDEY 175
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
+ +++M EIY GP+ + E+ YK G+Y+ TG H++ ++GWG +D
Sbjct: 176 GSVAGEKNMMKEIYARGPITCTIADPEELMEYKGGIYRDTTGAKSLDHSISVVGWG-EED 234
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 229
G+ YWI N W WG G+F+I RG N GIE D +P
Sbjct: 235 GQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVP 275
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 100/213 (46%), Gaps = 16/213 (7%)
Query: 14 IQGHCGSCWAFGAVEALSDRFCIHFG---MNLSLSVNDLLACCGFLCGDGCDGGYPISAW 70
I +CGSCWA ALSDR + + LSV +++ C G C+GG+ +
Sbjct: 350 IPQYCGSCWAQAPTSALSDRINLMRKGKWPTVELSVQEIINCSG---KGSCEGGWQSGVY 406
Query: 71 RYFVHHGVVTEECDPY--FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 128
+Y H G+ + C Y D C P +C ++ K Y +S Y
Sbjct: 407 QYAYHQGIPDQTCQVYEAIDKECNDMARCMDCPPGKECGPV--------KDYKRYKVSEY 458
Query: 129 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 188
S +I AEI+ GPV V ++F Y+ G++K + +G H+V++ GWG ++D
Sbjct: 459 GYASGEAEIKAEIFARGPVSCDIWVTQEFLDYQGGIFKENGSEYLGRHSVEVAGWGETED 518
Query: 189 GEDYWILANQWNRSWGADGYFKIKRGSNECGIE 221
G YWI N W WG G+F+I G G++
Sbjct: 519 GTKYWIGRNSWGTYWGEHGWFRIIIGEKGLGLD 551
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.138 0.462
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,607,587,380
Number of Sequences: 23463169
Number of extensions: 209917486
Number of successful extensions: 411675
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5533
Number of HSP's successfully gapped in prelim test: 1609
Number of HSP's that attempted gapping in prelim test: 393082
Number of HSP's gapped (non-prelim): 8625
length of query: 249
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 110
effective length of database: 9,097,814,876
effective search space: 1000759636360
effective search space used: 1000759636360
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 75 (33.5 bits)
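
The bit scores and Expect values in this report can be checked against the Lambda, K, and effective search space printed above. The sketch below uses the standard Karlin-Altschul relations, S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). It assumes that the first Lambda/K pair is the ungapped set and the second is the gapped set; the report itself does not label them. The function names are hypothetical and are not part of any BLAST tool.

```python
import math

def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

def expect_value(bits, search_space):
    """Expected number of chance hits: E = search_space * 2 ** (-S')."""
    return search_space * 2.0 ** (-bits)

# Values copied from the statistics block of this report.
UNGAPPED = (0.320, 0.138)      # first Lambda/K pair (assumed ungapped)
GAPPED = (0.267, 0.0410)       # second Lambda/K pair (assumed gapped)
SEARCH_SPACE = 1000759636360   # "effective search space used"

if __name__ == "__main__":
    # The report lists S1: 41 (21.8 bits) and S2: 75 (33.5 bits).
    print(round(bit_score(41, *UNGAPPED), 1))
    print(round(bit_score(75, *GAPPED), 1))
    # The hits with raw score 300 are reported as 120 bits, Expect = 7e-25.
    bits = bit_score(300, *GAPPED)
    print(round(bits), "%.0e" % expect_value(bits, SEARCH_SPACE))
```

Under these assumptions, S1 is reproduced with the ungapped parameters and S2 with the gapped ones, which suggests that the two cutoffs are computed from different parameter sets.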