BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027054
(229 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 388 bits (997), Expect = e-106, Method: Compositional matrix adjust.
Identities = 175/212 (82%), Positives = 195/212 (91%), Gaps = 2/212 (0%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCGDGCDGGYP+ AWRYFVHHGVVTEECDPYFD+ GCSHPGCEP
Sbjct: 165 MNISLSVNDLLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPGCEP 224
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
+PTPKCVRKC+ KNQLWR SKHYS++AYRI+SDP D+MAE+YKNGPVEVSFTVYEDFAH
Sbjct: 225 GFPTPKCVRKCIDKNQLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAH 284
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG+VMGGHAVKLIGWGTSD+GEDYW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 285 YKSGVYKHITGEVMGGHAVKLIGWGTSDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECG 344
Query: 200 IEEDVVAGLPSSKN--LVKEITSADMFEDASA 229
IE+D VAGLPS++N LV+E+ S D EDA A
Sbjct: 345 IEDDAVAGLPSARNLDLVREVASMDALEDAFA 376
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 174/210 (82%), Positives = 189/210 (90%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCG GC+GGYPISAWRYFVHHGVVTEECDPYFD GCSHPGCEP
Sbjct: 148 MNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEP 207
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
YPTPKC RKCV KNQLW+ SKHY + YRI+SDPE IMAEIYKNGPVEV+FTVYEDFAH
Sbjct: 208 GYPTPKCARKCVNKNQLWKKSKHYGVKPYRIDSDPESIMAEIYKNGPVEVAFTVYEDFAH 267
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +MGGHAVKLIGWGTS+DGE YW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 268 YKSGVYKHITGGMMGGHAVKLIGWGTSEDGEAYWLLANQWNRGWGDDGYFKIRRGTNECG 327
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
IE DVVAGLPS++NLV+E+ S D EDASA
Sbjct: 328 IEGDVVAGLPSTRNLVREVVSVDAREDASA 357
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 382 bits (980), Expect = e-104, Method: Compositional matrix adjust.
Identities = 168/197 (85%), Positives = 186/197 (94%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LSVNDLLACCGF+CGDGCDGGYPISAWRYFV HGVVTE+CDPYFD+TGCSHPGCEPA
Sbjct: 150 NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPA 209
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP+CVR CV KNQ+WR +KHY +SAYR+ DP DIMAE+YKNGPVEVSFTVYEDFAHY
Sbjct: 210 YPTPRCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHY 269
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITGDVMGGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 270 KSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGI 329
Query: 201 EEDVVAGLPSSKNLVKE 217
EEDVVAGLPS+KN+ +E
Sbjct: 330 EEDVVAGLPSTKNIARE 346
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 168/197 (85%), Positives = 186/197 (94%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LSVNDLLACCGF+CGDGCDGGYPISAWRYFV HGVVTE+CDPYFD+TGCSHPGCEPA
Sbjct: 149 NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPA 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP+CVR CV KNQ+WR +KHY +SAYR+ DP DIMAE+YKNGPVEVSFTVYEDFAHY
Sbjct: 209 YPTPRCVRHCVDKNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITGDVMGGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 269 KSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGI 328
Query: 201 EEDVVAGLPSSKNLVKE 217
EEDVVAGLPS+KN+ +E
Sbjct: 329 EEDVVAGLPSTKNIARE 345
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 169/208 (81%), Positives = 188/208 (90%), Gaps = 2/208 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY HHGVVTEECDPYFD GCSHPGCEPA
Sbjct: 149 NISLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCV+KCV NQ+W+ SKHYS+SAYR+NSDP DIMAE+YKNGPVEV+FTVYEDFA+Y
Sbjct: 209 YRTPKCVKKCVSGNQVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 269 KSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGI 328
Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
EEDV AGLPS+KNLV+E+T DM DA+
Sbjct: 329 EEDVTAGLPSTKNLVREVT--DMDADAA 354
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 168/203 (82%), Positives = 186/203 (91%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 153 MNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEP 212
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTPKC RKCV NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 213 AYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAH 272
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECG
Sbjct: 273 YKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECG 332
Query: 200 IEEDVVAGLPSSKNLVKEITSAD 222
IE VVAGLPS +N+VK IT++D
Sbjct: 333 IEHGVVAGLPSDRNVVKGITTSD 355
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 372 bits (956), Expect = e-101, Method: Compositional matrix adjust.
Identities = 167/203 (82%), Positives = 185/203 (91%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 151 MNISLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEP 210
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTPKC RKCV NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 211 AYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAH 270
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECG
Sbjct: 271 YKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECG 330
Query: 200 IEEDVVAGLPSSKNLVKEITSAD 222
IE VVAGLPS +N+ K IT++D
Sbjct: 331 IEHGVVAGLPSDRNVFKGITTSD 353
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 372 bits (956), Expect = e-101, Method: Compositional matrix adjust.
Identities = 169/210 (80%), Positives = 186/210 (88%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
NLSLSVNDLLACCG++CGDGCDGGYPI AWRYFV GVVTEECDPYFD GCSHPGCEP
Sbjct: 116 MNLSLSVNDLLACCGWMCGDGCDGGYPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCEP 175
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
+PTPKC RKC KN+LW SKH+S++AYRI+SDP IMAE+ NGPVEV+FTVYEDFAH
Sbjct: 176 GFPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAFTVYEDFAH 235
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 236 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECG 295
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
IEEDVVAGLPS++NLV+E+ D E ASA
Sbjct: 296 IEEDVVAGLPSTRNLVREVAKIDAHEHASA 325
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 372 bits (955), Expect = e-101, Method: Compositional matrix adjust.
Identities = 164/203 (80%), Positives = 184/203 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GCDGG PI AWRY HHGVVTEECDPYFD GCSHPGCEPA
Sbjct: 149 NISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE+YKNGPVEV+FTV+EDFAHY
Sbjct: 209 YQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGI
Sbjct: 269 KSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGI 328
Query: 201 EEDVVAGLPSSKNLVKEITSADM 223
E+DV AGLPS+KN+V+E+T D+
Sbjct: 329 EDDVTAGLPSTKNIVREVTDMDV 351
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 372 bits (954), Expect = e-101, Method: Compositional matrix adjust.
Identities = 164/203 (80%), Positives = 184/203 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GCDGG PI AWRY HHGVVTEECDPYFD GCSHPGCEPA
Sbjct: 151 NISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 210
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIMAE+YKNGPVEV+FTV+EDFAHY
Sbjct: 211 YQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHY 270
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGI
Sbjct: 271 KSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGI 330
Query: 201 EEDVVAGLPSSKNLVKEITSADM 223
E+DV AGLPS+KN+V+E+T D+
Sbjct: 331 EDDVTAGLPSTKNIVREVTDMDV 353
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 371 bits (953), Expect = e-101, Method: Compositional matrix adjust.
Identities = 168/203 (82%), Positives = 186/203 (91%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCG GC+GGYPI+AWRYF HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 84 MNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEP 143
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTPKC RKCV NQLWR SKHY +SAY++ S P+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 144 AYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAH 203
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECG
Sbjct: 204 YKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECG 263
Query: 200 IEEDVVAGLPSSKNLVKEITSAD 222
IE VVAGLPS +N+VK IT++D
Sbjct: 264 IEHGVVAGLPSDRNVVKGITTSD 286
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 163/203 (80%), Positives = 183/203 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GCDGG PI AWRY HHGVVTEECDPYFD GCSHPGCEPA
Sbjct: 151 NISLSVNDLLACCGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 210
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCVRKCVK NQ+W+ SKHYS+ AYR+ SDP+DIM E+YKNGPVEV+FTV+EDFAHY
Sbjct: 211 YQTPKCVRKCVKGNQIWKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHY 270
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGI
Sbjct: 271 KSGVYKHITGSALGGHAVKLIGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGI 330
Query: 201 EEDVVAGLPSSKNLVKEITSADM 223
E+DV AGLPS+KN+V+E+T D+
Sbjct: 331 EDDVTAGLPSTKNIVREVTDMDV 353
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 369 bits (946), Expect = e-100, Method: Compositional matrix adjust.
Identities = 165/208 (79%), Positives = 187/208 (89%), Gaps = 2/208 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GCDGGYP+ AW+Y HHGVVTEECDPYFD GCSHPGCEPA
Sbjct: 148 NISLSVNDLLACCGFLCGSGCDGGYPLYAWQYLAHHGVVTEECDPYFDQIGCSHPGCEPA 207
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCV+KCV NQ+W+ SKHYS++AYR++SDP DIM E+YKNGPVEV+FTVYEDFAHY
Sbjct: 208 YRTPKCVKKCVSGNQVWKKSKHYSVNAYRVSSDPHDIMTEVYKNGPVEVAFTVYEDFAHY 267
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGT++DGEDYW+LANQWNR WG DGYFKI+RG+NECGI
Sbjct: 268 KSGVYKHITGYELGGHAVKLIGWGTTEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGI 327
Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
EEDV AGLPS+KNLV+E+T DM DA+
Sbjct: 328 EEDVTAGLPSTKNLVREVT--DMDADAA 353
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 363 bits (933), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 164/210 (78%), Positives = 185/210 (88%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD TGCSHPGCEP
Sbjct: 153 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEP 212
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTPKC+RKCV NQLW SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 213 AYPTPKCMRKCVSGNQLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAH 272
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKLIGWGT+D+GEDYW+LANQWNRSWG DGYF I+RG+NECG
Sbjct: 273 YKSGVYKHITGSNIGGHAVKLIGWGTTDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECG 332
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
IE++ VAGLPSS+N+ K IT +D AS
Sbjct: 333 IEDEPVAGLPSSRNVFKVITGSDDLSVASV 362
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 363 bits (932), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 163/196 (83%), Positives = 179/196 (91%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCG GCDGGYP+ AWRYF+HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 148 MNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEP 207
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
YPTPKCVRKC +NQLWR +K Y SAYRI+SDP IMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 208 GYPTPKCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAH 267
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
Y+SGVY++ TGDVMGGHAVKLIGWGT+DDGEDYWILANQWNR+WG DGYF I+RG NECG
Sbjct: 268 YESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGVNECG 327
Query: 200 IEEDVVAGLPSSKNLV 215
IEE VVAGLPSSKNL+
Sbjct: 328 IEEGVVAGLPSSKNLM 343
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 362 bits (930), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 163/196 (83%), Positives = 179/196 (91%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCG GCDGGYP+ AWRYF+HHGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 182 MNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEP 241
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
YPTPKCVRKC +NQLWR +K Y SAYRI+SDP IMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 242 GYPTPKCVRKCTDENQLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAH 301
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
Y+SGVY++ TGDVMGGHAVKLIGWGT+DDGEDYWILANQWNR+WG DGYF I+RG NECG
Sbjct: 302 YESGVYRYTTGDVMGGHAVKLIGWGTTDDGEDYWILANQWNRNWGDDGYFMIRRGVNECG 361
Query: 200 IEEDVVAGLPSSKNLV 215
IEE VVAGLPSSKNL+
Sbjct: 362 IEEGVVAGLPSSKNLM 377
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 362 bits (928), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 161/208 (77%), Positives = 185/208 (88%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLS NDLLACCGFLCGDGCDGGYP+ AW+YFV GVVT+ECDPYFD+ GCSHPGCEPA
Sbjct: 148 NISLSANDLLACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPA 207
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC RKCVK+N LW SKH+ ++AY I+SDP IM E+YKNGPVEVSFTVYEDFAHY
Sbjct: 208 YPTPKCHRKCVKQNLLWSKSKHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHY 267
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKH+TGDVMGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG++EC I
Sbjct: 268 KSGVYKHVTGDVMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEI 327
Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
E++VVAGLPS++NL E+ +D F DA+
Sbjct: 328 EDEVVAGLPSARNLNMELDVSDAFLDAA 355
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 360 bits (925), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 160/208 (76%), Positives = 184/208 (88%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLS NDL ACCGFLCGDGCDGGYP+ AW+YFV GVVT+ECDPYFD+ GCSHPGCEPA
Sbjct: 148 NISLSANDLYACCGFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPA 207
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC RKCVK+N LW SKH+ ++AY I+SDP IM E+YKNGPVEVSFTVYEDFAHY
Sbjct: 208 YPTPKCHRKCVKQNLLWSRSKHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHY 267
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKH+TGD+MGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKI+RG+NEC I
Sbjct: 268 KSGVYKHVTGDIMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEI 327
Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
E++VVAGLPS++NL E+ +D F DA+
Sbjct: 328 EDEVVAGLPSARNLNVELDVSDAFLDAA 355
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 360 bits (925), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 160/208 (76%), Positives = 182/208 (87%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY HHGVVTEECDPYFD GCSHPGCEPA
Sbjct: 149 NVSLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCVRKCVK NQ+W+ SK++S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YQTPKCVRKCVKGNQIWKKSKYFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGI
Sbjct: 269 KSGVYKHITGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGI 328
Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
EEDV AGLPS+KN+ + + D D S
Sbjct: 329 EEDVTAGLPSTKNMGRWVMDMDADADVS 356
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 357 bits (917), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 163/209 (77%), Positives = 181/209 (86%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
NLSLSVNDLLACCG++CG GCDGG PI AWRYFV GVVTEECDPYFD GCSHPGCEP
Sbjct: 131 NLSLSVNDLLACCGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCEPG 190
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
+PTPKC RKC KN+LW SKH+S++AYRI+SDP IMAE+ NGPVEV+FTVYEDFAHY
Sbjct: 191 FPTPKCERKCADKNKLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVAFTVYEDFAHY 250
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITGD MGGHAVKLIGWGTS+DGEDYW+LANQWNR WG DGYFKIKRG+NECGI
Sbjct: 251 KSGVYKHITGDAMGGHAVKLIGWGTSEDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI 310
Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDASA 229
E VVAGLPS++NLV+E+ D E A+A
Sbjct: 311 EGAVVAGLPSTRNLVREVAGIDGHEHATA 339
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 357 bits (915), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 158/202 (78%), Positives = 181/202 (89%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP
Sbjct: 149 NVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPT 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC RKCV +NQLW SKHY + AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYK+ITG +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGI
Sbjct: 269 KSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGI 328
Query: 201 EEDVVAGLPSSKNLVKEITSAD 222
E+ VVAGLPS KN+ K IT++D
Sbjct: 329 EQSVVAGLPSEKNVFKGITTSD 350
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 356 bits (914), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 158/202 (78%), Positives = 181/202 (89%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP
Sbjct: 171 NVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPT 230
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC RKCV +NQLW SKHY + AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 231 YPTPKCERKCVSRNQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHY 290
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYK+ITG +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGI
Sbjct: 291 KSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGI 350
Query: 201 EEDVVAGLPSSKNLVKEITSAD 222
E+ VVAGLPS KN+ K IT++D
Sbjct: 351 EQSVVAGLPSEKNVFKGITTSD 372
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 355 bits (912), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 157/202 (77%), Positives = 180/202 (89%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEP
Sbjct: 194 MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEP 253
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTP C +KC +NQ+W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAH
Sbjct: 254 AYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAH 313
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECG
Sbjct: 314 YKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECG 373
Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
IEEDVVAG+PS+KN+V+ SA
Sbjct: 374 IEEDVVAGMPSTKNMVRNYDSA 395
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 355 bits (911), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 157/202 (77%), Positives = 180/202 (89%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEP
Sbjct: 149 MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEP 208
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTP C +KC +NQ+W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAH
Sbjct: 209 AYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAH 268
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECG
Sbjct: 269 YKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECG 328
Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
IEEDVVAG+PS+KN+V+ SA
Sbjct: 329 IEEDVVAGMPSTKNMVRNYDSA 350
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 354 bits (908), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 159/209 (76%), Positives = 184/209 (88%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 150 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEP 209
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTP+C+RKCV N+LW SKHYS+S Y +NS P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 210 AYPTPRCLRKCVSDNKLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSFTVYEDFAH 269
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKLIGWGTS++GEDYW++ANQWNR WG DGYF I+RG+NECG
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSNEGEDYWLMANQWNRGWGDDGYFMIRRGTNECG 329
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
IE++ VAGLPSS+N+ K T ++ AS
Sbjct: 330 IEDEPVAGLPSSRNVFKVDTGSNDLPVAS 358
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 353 bits (907), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 159/201 (79%), Positives = 177/201 (88%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEPA
Sbjct: 141 SILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPA 200
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC +KC ++NQ+W+ KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 201 YPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 260
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 261 KSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 320
Query: 201 EEDVVAGLPSSKNLVKEITSA 221
EE VVAG+PS+KN+V A
Sbjct: 321 EEGVVAGMPSTKNMVPNFGGA 341
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 353 bits (907), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 159/201 (79%), Positives = 177/201 (88%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEPA
Sbjct: 141 SILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEPA 200
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC +KC ++NQ+W+ KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 201 YPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 260
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 261 KSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 320
Query: 201 EEDVVAGLPSSKNLVKEITSA 221
EE VVAG+PS+KN+V A
Sbjct: 321 EEGVVAGMPSTKNMVPNFGGA 341
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 353 bits (906), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 158/209 (75%), Positives = 177/209 (84%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVND+LACCG LCG GC GG P SAW Y HHGVVTEECDPYFD GCSHPGCEP
Sbjct: 142 MNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGCSHPGCEP 201
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
Y TPKCV+KCV NQLW SKHYS+ AY +NSDP+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 202 TYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAH 261
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKL+GWGTS +GEDYW+LANQWN +WG DGYFKIKRG+NECG
Sbjct: 262 YKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECG 321
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
IE V AGLPS+KN+V+E+T D+ D S
Sbjct: 322 IENAVTAGLPSTKNIVREVTDMDVDADVS 350
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 353 bits (906), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 158/209 (75%), Positives = 177/209 (84%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVND+LACCG LCG GC GG P SAW Y HHGVVTEECDPYFD GCSHPGCEP
Sbjct: 147 MNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGCSHPGCEP 206
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
Y TPKCV+KCV NQLW SKHYS+ AY +NSDP+DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 207 TYRTPKCVKKCVNGNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAH 266
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKL+GWGTS +GEDYW+LANQWN +WG DGYFKIKRG+NECG
Sbjct: 267 YKSGVYKHITGFALGGHAVKLVGWGTSHEGEDYWLLANQWNTNWGDDGYFKIKRGTNECG 326
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
IE V AGLPS+KN+V+E+T D+ D S
Sbjct: 327 IENAVTAGLPSTKNIVREVTDMDVDADVS 355
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 352 bits (903), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 159/202 (78%), Positives = 177/202 (87%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
++ LSVNDLLACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEP
Sbjct: 1 MSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYFVQNGVVTDECDPYFDPVGCKHPGCEP 60
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTPKC +KC ++NQ+W+ KH+SI AYRINSDP DIMAE+YKNGPVEV+FTVYEDFAH
Sbjct: 61 AYPTPKCEKKCKEQNQVWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAH 120
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +MGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECG
Sbjct: 121 YKSGVYKHITGGIMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECG 180
Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
IEE VVAG+PS+KN+V A
Sbjct: 181 IEEGVVAGMPSTKNMVPNFGGA 202
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 352 bits (903), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 157/202 (77%), Positives = 180/202 (89%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDL+ACCGF+CGDGCDGGYPI AWRYFV +GVVT+ECDPYFD GC HPGCEP
Sbjct: 25 MNISLSVNDLVACCGFMCGDGCDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEP 84
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTP C +KC +NQ+W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAH
Sbjct: 85 AYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAH 144
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECG
Sbjct: 145 YKSGVYKHITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECG 204
Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
IEEDVVAG+PS+KN+V+ SA
Sbjct: 205 IEEDVVAGMPSTKNMVRNYDSA 226
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 352 bits (902), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 155/196 (79%), Positives = 178/196 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDL+ACCGF+CGDGCDGGYPISAW+Y V +GVVT+ECDPYFD GC HPGCEPA
Sbjct: 146 NISLSVNDLVACCGFMCGDGCDGGYPISAWQYLVENGVVTDECDPYFDQVGCKHPGCEPA 205
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP C +KC +NQ+W+ KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 206 YPTPACEKKCKVQNQVWQEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 265
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVY+HITG++MGGHAVKLIGWGTS DG+DYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 266 KSGVYEHITGEMMGGHAVKLIGWGTSADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGI 325
Query: 201 EEDVVAGLPSSKNLVK 216
EEDVVAG+PS+KN V+
Sbjct: 326 EEDVVAGMPSTKNTVR 341
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 351 bits (901), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 159/210 (75%), Positives = 182/210 (86%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 150 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEP 209
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTPKC RKCV N+LW SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 210 AYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAH 269
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKLIGWGTS +GEDYW++ANQWNR WG DGYF I+RG+NECG
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECG 329
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDASA 229
IE++ VAGLPSSKN+ + T ++ AS
Sbjct: 330 IEDEPVAGLPSSKNVFRVDTGSNDLPVASV 359
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 351 bits (901), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 159/209 (76%), Positives = 182/209 (87%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGF CGDGCDGGYPI+AW+YF + GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 150 MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEP 209
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AYPTPKC RKCV N+LW SKHYS+S Y + S+P+DIMAE+YKNGPVEVSFTVYEDFAH
Sbjct: 210 AYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAH 269
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG +GGHAVKLIGWGTS +GEDYW++ANQWNR WG DGYF I+RG+NECG
Sbjct: 270 YKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECG 329
Query: 200 IEEDVVAGLPSSKNLVKEITSADMFEDAS 228
IE++ VAGLPSSKN+ + T ++ AS
Sbjct: 330 IEDEPVAGLPSSKNVFRVDTGSNDLPVAS 358
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 350 bits (899), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 157/194 (80%), Positives = 176/194 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++SLSVNDLLACCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPA
Sbjct: 149 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC RKC +NQ+W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YPTPKCHRKCKVENQVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 269 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 328
Query: 201 EEDVVAGLPSSKNL 214
EEDV AG+PS+KN+
Sbjct: 329 EEDVTAGMPSTKNM 342
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 350 bits (897), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 158/192 (82%), Positives = 176/192 (91%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LSVNDLLACCGF CGDGCDGGYPISAW+YF + GVVTEECDPYFD TGCSHPGCEPA
Sbjct: 152 NITLSVNDLLACCGFRCGDGCDGGYPISAWQYFSYSGVVTEECDPYFDQTGCSHPGCEPA 211
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TP+C+RKCV +NQLW SKHYSI+ Y + S+P+DIMAEIYKNGPVEVSFTVYEDFAHY
Sbjct: 212 YNTPQCLRKCVGRNQLWSESKHYSINTYVVESNPQDIMAEIYKNGPVEVSFTVYEDFAHY 271
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGT+DDGEDYW+LANQWNRSWG DGYF I+RG+NECGI
Sbjct: 272 KSGVYKHITGSNIGGHAVKLIGWGTTDDGEDYWLLANQWNRSWGDDGYFMIRRGTNECGI 331
Query: 201 EEDVVAGLPSSK 212
E++ VAGLPSSK
Sbjct: 332 EDEPVAGLPSSK 343
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 350 bits (897), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 157/201 (78%), Positives = 175/201 (87%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLS NDL+ACCGF+CGDGCDGGYPI AW+YFV GVVTEECDPYFD GC HPGCEPA
Sbjct: 142 NISLSANDLVACCGFMCGDGCDGGYPIKAWQYFVQSGVVTEECDPYFDQVGCKHPGCEPA 201
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKC +KC +NQ+W KH+SI+AYR+NSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 202 YDTPKCEKKCKVQNQVWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 261
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKH+TG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 262 KSGVYKHVTGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 321
Query: 201 EEDVVAGLPSSKNLVKEITSA 221
EE+VVAG+PS+KN+ SA
Sbjct: 322 EEEVVAGMPSTKNMAGNHGSA 342
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 349 bits (896), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 157/194 (80%), Positives = 175/194 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++SLSVNDLLACCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPA
Sbjct: 149 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC RKC +NQ+W+ +KH S++AYR++S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 209 YPTPKCHRKCKVENQVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 269 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGGDGYFKIIRGKNECGI 328
Query: 201 EEDVVAGLPSSKNL 214
EEDV AG+PS+KN+
Sbjct: 329 EEDVTAGMPSTKNM 342
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 349 bits (895), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 159/208 (76%), Positives = 178/208 (85%), Gaps = 2/208 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++SLSVNDLLACC FLCG GCDGGYPI+AWRYF GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 148 SISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTGCSHPGCEPL 207
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKC RKCVK N LWR SKHY ++AYR++ DP+ IMAE+YKNGPVEVSFTVYEDFAHY
Sbjct: 208 YPTPKCHRKCVKGNVLWRKSKHYGVNAYRVSHDPQSIMAEVYKNGPVEVSFTVYEDFAHY 267
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKH+TG MGGHAVKLIGWGTS+ GEDYW++ N WNR WG DGYFKI+RG+NECGI
Sbjct: 268 KSGVYKHVTGGNMGGHAVKLIGWGTSEQGEDYWLIVNSWNRGWGEDGYFKIRRGTNECGI 327
Query: 201 EEDVVAGLPSSKNLVKEITSADMFEDAS 228
E VVAGLPS++NL E+ D DAS
Sbjct: 328 EHSVVAGLPSARNLNVEL--GDAVLDAS 353
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 349 bits (895), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 156/195 (80%), Positives = 173/195 (88%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLS NDL+ACCGF+CGDGCDGGYPISAW+YFV +GVVTEECDPYFD GC HPGCEPA
Sbjct: 144 NISLSANDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTEECDPYFDQVGCKHPGCEPA 203
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP C +KC +NQ+W+ KH+SI AY++NSDP DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 204 YPTPVCEKKCKVQNQVWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHY 263
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 264 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 323
Query: 201 EEDVVAGLPSSKNLV 215
EEDV AG+PS KN+
Sbjct: 324 EEDVTAGMPSMKNIA 338
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 345 bits (886), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 153/195 (78%), Positives = 173/195 (88%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LS NDL+ACCGF+CGDGCDGGYPISAW+YFV +GVVT+ECDPYFD GC HPGCEPA
Sbjct: 105 NITLSANDLVACCGFMCGDGCDGGYPISAWQYFVQNGVVTDECDPYFDQVGCKHPGCEPA 164
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP C +KC +NQ+W KH+SI+AY++NSDP DIMAE+Y NGPVEV+FTVYEDFAHY
Sbjct: 165 YPTPVCEKKCKVQNQVWEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHY 224
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECGI
Sbjct: 225 KSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECGI 284
Query: 201 EEDVVAGLPSSKNLV 215
EEDV AG+PS+KN+
Sbjct: 285 EEDVTAGMPSTKNIA 299
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 345 bits (885), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 155/194 (79%), Positives = 172/194 (88%), Gaps = 1/194 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF GVVT ECDPYFD TGCSHPGCEPA
Sbjct: 144 NVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPGCEPA 203
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP C +KCVKKN LW SKH+S++AYR+NSD IM E+Y NGP EVSFTVYEDFAHY
Sbjct: 204 YPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYEDFAHY 263
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKH+TG MGGHAVKLIGWGTS+DGEDYW+LANQWNRSWG DGYFKI RG+NECGI
Sbjct: 264 KSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGDDGYFKIIRGTNECGI 323
Query: 201 EEDVVAGLPSSKNL 214
EDV AG+PS+KNL
Sbjct: 324 -EDVTAGMPSTKNL 336
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 345 bits (884), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 155/194 (79%), Positives = 171/194 (88%), Gaps = 1/194 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LSVNDLLACCGFLCG+GCDGGYPI+AW+YF GVVT ECDPYFD TGCSHPGCEPA
Sbjct: 144 NVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFKRTGVVTSECDPYFDQTGCSHPGCEPA 203
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP C +KCVKKN LW SKH+S++AYR+NSD IM E+Y NGP EVSFTVYEDFAHY
Sbjct: 204 YPTPACEKKCVKKNLLWSESKHFSVNAYRVNSDQHSIMTEVYTNGPAEVSFTVYEDFAHY 263
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKH+TG MGGHAVKLIGWGTS+DGEDYW+LANQWNRSWG DGYFKI RG+NECGI
Sbjct: 264 KSGVYKHVTGSEMGGHAVKLIGWGTSEDGEDYWLLANQWNRSWGGDGYFKIIRGTNECGI 323
Query: 201 EEDVVAGLPSSKNL 214
EDV AG PS+KNL
Sbjct: 324 -EDVTAGTPSTKNL 336
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 344 bits (882), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 154/202 (76%), Positives = 174/202 (86%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N+SLSVNDLLACCGFLCG GC+GGYPISAWRYF GVVT+ECDPYFD GC HPGCEP
Sbjct: 144 MNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRKGVVTDECDPYFDQVGCKHPGCEP 203
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
AY TPKC +KC +N++W+ KH+S+ AYR++S+P DIMAE+Y NGPVEV+FTVYEDFAH
Sbjct: 204 AYRTPKCEKKCKVQNEVWKEQKHFSVDAYRVHSNPHDIMAEVYTNGPVEVAFTVYEDFAH 263
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NECG
Sbjct: 264 YKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGKNECG 323
Query: 200 IEEDVVAGLPSSKNLVKEITSA 221
IEEDVVAG+PS+KN+ + A
Sbjct: 324 IEEDVVAGMPSTKNMARNYDDA 345
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 338 bits (866), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 149/179 (83%), Positives = 164/179 (91%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GCDGGYP+ AWRY HHGVVTEECDPYFD GCSHPGCEPA
Sbjct: 149 NISLSVNDLLACCGFLCGSGCDGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPA 208
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCV+KCV NQ+W+ SKHYS+SAYR+NSDP DIMAE+YKNGPVEV+FTVYEDFA+Y
Sbjct: 209 YRTPKCVKKCVSGNQVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYY 268
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
KSGVYKHITG +GGHAVKLIGWGT+DDGEDYW+LANQWNR WG DGYFKI+RG+NECG
Sbjct: 269 KSGVYKHITGYELGGHAVKLIGWGTTDDGEDYWLLANQWNREWGDDGYFKIRRGTNECG 327
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 335 bits (860), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 153/198 (77%), Positives = 172/198 (86%), Gaps = 2/198 (1%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++SLSVNDLLACCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPA
Sbjct: 145 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 204
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFA 138
YPTPKC RKC +NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT + DFA
Sbjct: 205 YPTPKCQRKCKVENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFA 264
Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
HYKSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG NEC
Sbjct: 265 HYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGENEC 324
Query: 199 GIEEDVVAGLPSSKNLVK 216
GIE DV AG+PS+KN +
Sbjct: 325 GIEGDVTAGMPSTKNTAR 342
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 334 bits (857), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 149/195 (76%), Positives = 170/195 (87%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDL+ACCGFLCGDGCDGGYPI AW+YFV +GVVT+ECDP+FD GC HPGCEPA
Sbjct: 145 NVSLSVNDLVACCGFLCGDGCDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQHPGCEPA 204
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP C +KC +NQ+W KH+SI AY++NSDP DIMAE+YKNGPVEVSF +YEDFAHY
Sbjct: 205 YPTPVCEKKCKVQNQVWEEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHY 264
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYK ITG ++GGHA KLIGWGTSD GEDYW+LANQWNR WG DGYFKI RG+NECGI
Sbjct: 265 KSGVYKQITGRMVGGHAAKLIGWGTSDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGI 324
Query: 201 EEDVVAGLPSSKNLV 215
E DV AG+PS+KN+
Sbjct: 325 EGDVNAGMPSTKNIA 339
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 330 bits (847), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 160/202 (79%), Positives = 183/202 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLS ND++ACCG LCG GC+GG+P+ AW YF +HGVVTEECDPYFD+TGCSHPGCEP
Sbjct: 151 NVSLSANDVVACCGLLCGLGCNGGFPMGAWLYFKYHGVVTEECDPYFDNTGCSHPGCEPG 210
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTPKCVRKCV +NQLW SKHY +SAYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 211 YPTPKCVRKCVSENQLWGESKHYGVSAYRINHDPQDIMAEVYKNGPVEVAFTVYEDFAHY 270
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYKHITG +GGHAVKLIGWGTSDDGEDYW+LANQWNRSWG DGYFKI+RG+NECGI
Sbjct: 271 KSGVYKHITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGI 330
Query: 201 EEDVVAGLPSSKNLVKEITSAD 222
E VVAGLPS +N+ K++T++D
Sbjct: 331 EHGVVAGLPSDRNVFKDVTTSD 352
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 324 bits (830), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 145/197 (73%), Positives = 169/197 (85%), Gaps = 1/197 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+ GVVT ECDPYFD GC HPGCEP
Sbjct: 144 NVTLSENDLVACCGFMCGDGCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPL 203
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP+CV++C +NQ W NSK +S +AYRI+S P DIMAE+Y NGPVEVSF+VYEDFAHY
Sbjct: 204 YPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHY 263
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYK+ GD MGGHAVKL+GWGT +DG DYW++AN WN +WG DGYFKI RGSNECGI
Sbjct: 264 KSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWLVANSWNTAWGEDGYFKIARGSNECGI 322
Query: 201 EEDVVAGLPSSKNLVKE 217
E DVVAG+PS+KNLV +
Sbjct: 323 EGDVVAGMPSTKNLVMD 339
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 324 bits (830), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 145/197 (73%), Positives = 169/197 (85%), Gaps = 1/197 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LS NDL+ACCGF+CGDGCDGGYPISAW+YF+ GVVT ECDPYFD GC HPGCEP
Sbjct: 144 NVTLSENDLVACCGFMCGDGCDGGYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPL 203
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP+CV++C +NQ W NSK +S +AYRI+S P DIMAE+Y NGPVEVSF+VYEDFAHY
Sbjct: 204 YPTPQCVKQCKDENQKWGNSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHY 263
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYK+ GD MGGHAVKL+GWGT +DG DYW++AN WN +WG DGYFKI RGSNECGI
Sbjct: 264 KSGVYKYTKGDYMGGHAVKLVGWGT-EDGTDYWLVANSWNTAWGEDGYFKIARGSNECGI 322
Query: 201 EEDVVAGLPSSKNLVKE 217
E DVVAG+PS+KNLV +
Sbjct: 323 EGDVVAGMPSTKNLVMD 339
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 321 bits (823), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 144/200 (72%), Positives = 164/200 (82%)
Query: 29 LLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 88
L F G GGYP+ AWRY HHGVVTEECDPYFD GCSHPGCEPAY TPKCVR
Sbjct: 9 FLHAVAFSVGLAVMGGYPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAYQTPKCVR 68
Query: 89 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 148
KCVK NQ+W+ SKH+S++AY + SDP DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYKHI
Sbjct: 69 KCVKGNQIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHI 128
Query: 149 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
TG +GGHAVKLIGWGT+D+GEDYW++ANQWNRSWG DGYF I+RG+NECGIEEDV AGL
Sbjct: 129 TGSQLGGHAVKLIGWGTTDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVTAGL 188
Query: 209 PSSKNLVKEITSADMFEDAS 228
PS+KN+ + + D D S
Sbjct: 189 PSTKNMGRWVMDMDADADVS 208
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 319 bits (817), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 143/197 (72%), Positives = 166/197 (84%), Gaps = 1/197 (0%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LS NDL+ACCGF CGDGCDGGYP+SAW+YF+ GVVT ECDPYFD GC HPGCEP
Sbjct: 144 NVTLSENDLVACCGFRCGDGCDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQHPGCEPL 203
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
YPTP+CV++C +NQ W NSK +S +AYRI S P DIMAE+Y GPVEV F VYEDFAHY
Sbjct: 204 YPTPQCVKQCKDENQNWGNSKRFSATAYRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHY 263
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVYK+ITGD +GGHAVKLIGWGT ++G DYW++AN WN +WG DGYFKI RGSNEC I
Sbjct: 264 KSGVYKYITGDFLGGHAVKLIGWGT-ENGTDYWLVANSWNTAWGEDGYFKIARGSNECSI 322
Query: 201 EEDVVAGLPSSKNLVKE 217
EEDVVAG+PS+KNLV +
Sbjct: 323 EEDVVAGMPSTKNLVMD 339
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 317 bits (812), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 143/212 (67%), Positives = 175/212 (82%), Gaps = 2/212 (0%)
Query: 9 DALSSSPYVSLQ-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF 67
+ALS + Q N +LS NDL+ACCGF CG GC+GG+P+SAWRYF GVVT+ECDPYF
Sbjct: 130 EALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGFPLSAWRYFSRRGVVTDECDPYF 189
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
D+ GC+HPGCEP+YPTP+CV+ C K NQ W +SKHYS +AYRI SDP +IMAE++ NGPV
Sbjct: 190 DNDGCNHPGCEPSYPTPRCVKNC-KDNQRWSHSKHYSANAYRIKSDPYNIMAEVFNNGPV 248
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
EVSF+VYEDFAHY++GVYKH+ G +GGHAVKLIGWGT+DDG DYW++AN WN +WG G
Sbjct: 249 EVSFSVYEDFAHYETGVYKHVQGRYLGGHAVKLIGWGTTDDGIDYWLIANSWNTAWGEGG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 219
YFKI RG NECGIE D VAG+PS+KNL+++ T
Sbjct: 309 YFKIARGVNECGIERDPVAGMPSAKNLIQDPT 340
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 311 bits (797), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 140/176 (79%), Positives = 158/176 (89%)
Query: 47 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS 106
+ AW YF +HGVVT+ECDPYFD+TGCSHPGCEP YPTPKC RKCV +NQLW SKHY +
Sbjct: 1 MGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTYPTPKCERKCVSRNQLWGESKHYGVG 60
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
AYRIN DP+DIMAE+YKNGPVEV+FTVYEDFAHYKSGVYK+ITG +GGHAVKLIGWGTS
Sbjct: 61 AYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTS 120
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
DDGEDYW+LANQWNRSWG DGYFKI+RG+NECGIE+ VVAGLPS KN+ K IT++D
Sbjct: 121 DDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKNVFKGITTSD 176
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 303 bits (775), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 134/166 (80%), Positives = 150/166 (90%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
CDGGYPISAW+YF HHGVVTEECDPYFD GCSHPGCEP Y TPKCVRKCVK NQ+W+ S
Sbjct: 1 CDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGCEPGYQTPKCVRKCVKGNQVWKKS 60
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
KHYS+ Y++NSDP++IM E+YKNGPVEV+F+VYEDFAHYKSGVYKHITG +GGHAVKL
Sbjct: 61 KHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKSGVYKHITGSALGGHAVKL 120
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
GWGTSD+GEDYW+LANQWN +WG DGYFKIKRG+NECGIEEDV A
Sbjct: 121 NGWGTSDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVTA 166
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 295 bits (754), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 135/199 (67%), Positives = 160/199 (80%), Gaps = 2/199 (1%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+N+SLS NDL+ACC CG GCDGGYP +AW YF GVVT +CDPYFD GC HPGCEP
Sbjct: 146 ENVSLSENDLVACCS-SCGFGCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEP 204
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
Y TP CV++CV N+ WR+SKH+++ Y +NSD DI AEIYKNGPVEVS+TVYEDFAH
Sbjct: 205 EYDTPVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAH 263
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKH+ G+V+GGHAVK IGWGT+DDG+DYWI+AN WNRSWG DG+F+I RGSNECG
Sbjct: 264 YKSGVYKHVFGEVLGGHAVKFIGWGTTDDGKDYWIVANSWNRSWGEDGFFQISRGSNECG 323
Query: 200 IEEDVVAGLPSSKNLVKEI 218
IE + VAG+P K +I
Sbjct: 324 IESEPVAGIPLKKTGFSDI 342
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 293 bits (750), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 134/199 (67%), Positives = 159/199 (79%), Gaps = 2/199 (1%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+N+SLS NDL+ACC CG GC+GGYP +AW YF GVVT +CDPYFD GC HPGCEP
Sbjct: 135 ENVSLSENDLVACCS-SCGFGCEGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEP 193
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
Y TP CV++CV N+ WR+SKH+++ Y +NSD DI AEIYKNGPVEVS+TVYEDFAH
Sbjct: 194 EYDTPVCVKQCVD-NEQWRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAH 252
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YKSGVYKH+ G V+GGHAVK IGWGT+DDG+DYWI+AN WNRSWG DG+F+I RGSNECG
Sbjct: 253 YKSGVYKHVFGQVLGGHAVKFIGWGTTDDGKDYWIVANSWNRSWGEDGFFQISRGSNECG 312
Query: 200 IEEDVVAGLPSSKNLVKEI 218
IE + VAG+P K +I
Sbjct: 313 IESEPVAGIPLKKTGFSDI 331
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 290 bits (741), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 130/195 (66%), Positives = 153/195 (78%), Gaps = 1/195 (0%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+++SLS NDLLACCGF CGDGCDGGYPI AWRYF GVVT +CDPYFD GC HPGC P
Sbjct: 142 ESVSLSENDLLACCGFECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGCGHPGCYP 201
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
Y TPKCV+ CV ++LW SKH S++AY ++ +PED+MAE+Y NGP+EVSF V+EDFAH
Sbjct: 202 TYRTPKCVKHCVD-DELWVKSKHLSVNAYEVSKEPEDLMAELYTNGPIEVSFEVFEDFAH 260
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YK+GVYKH+ G +GGHAVKLIGWGT+DDG DYW + N WN +WG G F+I RG NECG
Sbjct: 261 YKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVDYWTIVNSWNTNWGEHGLFRIARGGNECG 320
Query: 200 IEEDVVAGLPSSKNL 214
IE VAGLP K L
Sbjct: 321 IESYAVAGLPFDKGL 335
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 289 bits (740), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 131/166 (78%), Positives = 147/166 (88%), Gaps = 2/166 (1%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++SLSVNDLLACCGFLCG GC+GGYPISAWRYF GVVTEECDPYFD TGC HPGCEPA
Sbjct: 145 SVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQHPGCEPA 204
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE--DFA 138
YPTPKC RKC +NQ W+ +KH+S++AYR++S+P DIMAE+YKNGPVEV+FT + DFA
Sbjct: 205 YPTPKCQRKCKVENQAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTYCQILDFA 264
Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
HYKSGVYKHITG VMGGHAVKLIGWGTSD GEDYW+LANQWNR WG
Sbjct: 265 HYKSGVYKHITGGVMGGHAVKLIGWGTSDAGEDYWLLANQWNRGWG 310
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 289 bits (739), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 128/195 (65%), Positives = 156/195 (80%), Gaps = 1/195 (0%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+++SLS NDLLACCGF CG GC+GGYPI AW+YF H GVVT +CDPYFD GC+HPGC P
Sbjct: 150 ESVSLSENDLLACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGCAHPGCYP 209
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
Y TPKC ++CV ++ W SKH ++AY ++ +PED+MAE+Y NGPVEV+F VYEDFAH
Sbjct: 210 TYETPKCEKQCVD-DEFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAH 268
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YK+GVYKH+ G MGGHAVKLIGWGT+DDG DYW + N WN +WG DG F+I RG++ECG
Sbjct: 269 YKTGVYKHLFGGFMGGHAVKLIGWGTTDDGVDYWTIVNSWNTNWGEDGLFRIVRGNDECG 328
Query: 200 IEEDVVAGLPSSKNL 214
IE + VAGLPS K L
Sbjct: 329 IESNAVAGLPSRKGL 343
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 284 bits (726), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 124/195 (63%), Positives = 157/195 (80%), Gaps = 1/195 (0%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+++SLS NDLLACCGF CGDGC+GGYPI AW+YF GVVT +CDPYFD GC HPGC P
Sbjct: 148 ESVSLSENDLLACCGFECGDGCEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGCGHPGCYP 207
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
Y TPKC ++CV ++LW +SKH +SAY ++ +PE++MAE++ NGP+EV+F V+EDFAH
Sbjct: 208 TYDTPKCFKRCVD-DELWVSSKHLGVSAYEVSMEPEELMAELFTNGPIEVAFDVFEDFAH 266
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
YK+GVYKH+ G +GGHAVKL+GWGT+DDG DYW + N WN +WG DG F+I RG +ECG
Sbjct: 267 YKTGVYKHLYGGYIGGHAVKLVGWGTTDDGVDYWSMVNSWNTNWGEDGTFRILRGKDECG 326
Query: 200 IEEDVVAGLPSSKNL 214
IE + VAGLPS+K L
Sbjct: 327 IESNAVAGLPSNKGL 341
>gi|149941230|emb|CAO02547.1| putative cathepsin B-like cysteine protease [Vigna unguiculata]
Length = 201
Score = 243 bits (621), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 108/132 (81%), Positives = 121/132 (91%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GC+GGYP+SAWRY +HGVVTEECDPYFD TGCSHPGCEPA
Sbjct: 66 NISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPA 125
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCV+KCV NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 126 YRTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 185
Query: 141 KSGVYKHITGDV 152
KSGVYKH+TG V
Sbjct: 186 KSGVYKHVTGYV 197
>gi|149941232|emb|CAO02548.1| putative cathepsin B-like cysteine protease,putative [Vigna
unguiculata]
Length = 195
Score = 242 bits (617), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 107/130 (82%), Positives = 120/130 (92%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+SLSVNDLLACCGFLCG GC+GGYP+SAWRY +HGVVTEECDPYFD TGCSHPGCEPA
Sbjct: 66 NISLSVNDLLACCGFLCGSGCNGGYPLSAWRYLSNHGVVTEECDPYFDQTGCSHPGCEPA 125
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
Y TPKCV+KCV NQLW+ SKHYS+SAY++ S+P DIMAE+YKNGPVEV+FTVYEDFAHY
Sbjct: 126 YRTPKCVKKCVSGNQLWKKSKHYSVSAYKVKSNPHDIMAEVYKNGPVEVAFTVYEDFAHY 185
Query: 141 KSGVYKHITG 150
KSGVYKH+TG
Sbjct: 186 KSGVYKHVTG 195
>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
Length = 142
Score = 229 bits (584), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 102/134 (76%), Positives = 120/134 (89%)
Query: 88 RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH 147
+KC +NQ+W KH+S++AYR+NSDP DIMAE+Y+NGPVEV+FTVYEDFAHYKSGVYKH
Sbjct: 1 KKCKVQNQVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVYKH 60
Query: 148 ITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
ITG +MGGHAVKLIGWGT+D GEDYW+LANQWNR WG DGYFKI RG+NECGIEEDVVAG
Sbjct: 61 ITGGMMGGHAVKLIGWGTTDAGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAG 120
Query: 208 LPSSKNLVKEITSA 221
+PS+KN+V+ SA
Sbjct: 121 MPSTKNMVRNYDSA 134
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 108/228 (47%), Positives = 143/228 (62%), Gaps = 23/228 (10%)
Query: 4 TRTNRDALSSSPYVSLQNLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTE 61
T ++R ++S+ V N+ +S DLL+CC G+ CGDGC+GGYPI AWRY+VH+G+VT
Sbjct: 111 TISDRTCIASNGEV---NVLISAEDLLSCCTGGYNCGDGCEGGYPIQAWRYWVHNGLVTG 167
Query: 62 E-------CDPYFDS------TGCSHPGCEP-AYPTPKCVRKCVKKNQL---WRNSKHYS 104
C PY + G + P C TP+CV++C K+ + KHY
Sbjct: 168 GSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYG 227
Query: 105 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
SAY I + I EI +NGPVEV F VY DF YKSG+YKH+ G +GGHAVK++GWG
Sbjct: 228 SSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIYKHVAGRELGGHAVKILGWG 287
Query: 165 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
++G YW+ AN WN +WG GYF+I+RG+NECGIE VVAG+P K
Sbjct: 288 V-ENGTPYWLAANSWNVNWGEKGYFRIRRGTNECGIESSVVAGIPDLK 334
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 204 bits (518), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 104/203 (51%), Positives = 130/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S DLL+CCGF CGDGC+GG+P SAW+Y+ G+VT C PY C
Sbjct: 162 QVEISAEDLLSCCGFECGDGCNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPY-QIKPCE 220
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP CV KC + N KHY +S+Y + SDP I EI +GP
Sbjct: 221 HHVPGDRPKCSEGGGTPSCVSKCKGNTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGP 280
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVYKH+TG V+GGHA++++GWG S++G YW++AN WN WG
Sbjct: 281 VEGAFTVYADFPTYKSGVYKHVTGGVLGGHAIRILGWG-SENGVAYWLVANSWNTDWGDK 339
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYFKI RGS+ECGIE VVAG+P
Sbjct: 340 GYFKILRGSDECGIESSVVAGIP 362
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 200 bits (509), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 134/218 (61%), Gaps = 20/218 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
T+R ++S Q +S DLL CC F CGDGC+GGYP +AW Y+ + G+VT
Sbjct: 128 TDRTCIASK---GAQTPHISAEDLLTCCTFTCGDGCNGGYPAAAWEYWKNQGIVTGGQYD 184
Query: 61 --EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
+ C PY +TG P C PTP C R C + N + N KH+ S+Y +
Sbjct: 185 SNQGCQPYSLAKCEHHTTGPYKP-CGDIVPTPACKRSCRQGYNVTYPNDKHFGASSYGVR 243
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ I EI NGPVE +FTVY DF YKSGVY+H +G +GGHA+K+IGWG DG D
Sbjct: 244 G-VDQIATEIMTNGPVEAAFTVYSDFLSYKSGVYQHTSGQPLGGHAIKIIGWGVQ-DGTD 301
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YWI+AN WN SWG DG+F IK+G++ECGIE VVAGLP
Sbjct: 302 YWIVANSWNDSWGNDGFFWIKKGTDECGIESQVVAGLP 339
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 101/228 (44%), Positives = 149/228 (65%), Gaps = 20/228 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ Y++++ +S DLL+CCG CG+GC+GG+P AW+Y++ G+V+
Sbjct: 120 SDRICVHTNGYITIE---VSAEDLLSCCGLQCGEGCNGGFPAGAWKYWIKKGLVSGGLYD 176
Query: 63 ----CDPYFDSTGCSH--PGCEPAYP-----TPKCVRKC-VKKNQLWRNSKHYSISAYRI 110
C PY C H G PA TPKC +KC + +++ KHY +AY +
Sbjct: 177 SHVGCRPY-SIPPCEHHVNGSRPACTGEGGDTPKCNKKCEAGYSPDYKDDKHYGTTAYNV 235
Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
S ++IMAEIYKNGPVE +F VY DF YKSGVY+H+TGD++GGHA++++GWG +DG
Sbjct: 236 PSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSGVYQHVTGDMLGGHAIRVLGWGV-EDGV 294
Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW+ AN WN WG +G+FKI RG + CGIE ++VAG+P ++ K+I
Sbjct: 295 PYWLAANSWNTDWGDNGFFKILRGKDHCGIESEMVAGIPRTEQYWKKI 342
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 199 bits (507), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 100/204 (49%), Positives = 135/204 (66%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ +S DLL CC CG GC+GGYP SAW +F G+VT + C PY C
Sbjct: 128 NQVRISTEDLLTCCD-SCGFGCNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPY-AIPAC 185
Query: 73 SH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H C + PTPKC + C K N ++N KHY +++Y IN+D +IM EI NG
Sbjct: 186 DHHVPHSKNPCNGSLPTPKCEKVCEKGYNITYKNDKHYGVTSYSINNDQNEIMREIMTNG 245
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTV+ DF +YKSGVY+H++G+ +GGHA+K++GWG ++ YW++AN WN SWG
Sbjct: 246 PVEAAFTVFADFPNYKSGVYQHVSGEELGGHAIKILGWGVENN-TPYWLVANSWNPSWGD 304
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RGS+ECGIE++VVAGLP
Sbjct: 305 NGFFKILRGSDECGIEDEVVAGLP 328
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 199 bits (505), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 100/209 (47%), Positives = 133/209 (63%), Gaps = 20/209 (9%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
LS DLL+CCG++CG+GC+GG+P +AW Y+V +G+V+ + TGC EP
Sbjct: 146 FDLSSEDLLSCCGYVCGNGCNGGFPQAAWEYWVQNGLVS---GGLYHGTGCQPYAIEPCE 202
Query: 82 ---------------PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
TPKC KCV + KHY AYRI ++ + IM EIYKNG
Sbjct: 203 HHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQDKHYGSVAYRIPANEKAIMNEIYKNG 262
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF YKSGVY H TG +GGHA++++GWG ++GE YW+ N WN WG
Sbjct: 263 PVEGAFIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWG-EENGEKYWLCGNSWNTDWGN 321
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
+G+FKIKRG NECGIE ++V G+P+S++L
Sbjct: 322 NGFFKIKRGVNECGIESEMVGGIPASESL 350
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW + G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFLTRKGLVSGGLYD 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 197 bits (502), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 197 bits (502), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ ++S++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 130 SDRICIHTNAHISVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYD 186
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 187 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 245
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+HITG++MGGHA++++GWG ++G
Sbjct: 246 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHITGEMMGGHAIRILGWGV-ENGTP 304
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 305 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 351
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 197 bits (502), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 91/120 (75%), Positives = 105/120 (87%)
Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
R +SDP IM E+YKNGPVEV+FTVYEDFAHYKSGVYKH+TGD +GGHAVKLIGWGTS+D
Sbjct: 2 RGSSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHAVKLIGWGTSED 61
Query: 169 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 228
GEDYW+LANQWNR WG DGYFKI+RG+NEC IE++VVAG+PS KNL E+ +D F DAS
Sbjct: 62 GEDYWLLANQWNRGWGDDGYFKIRRGTNECDIEDEVVAGMPSPKNLNMELDVSDAFLDAS 121
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 122/200 (61%), Gaps = 16/200 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
+S DL+ CC F CG GC GGYP +AW +F G+VT + C PY C H
Sbjct: 142 ISAQDLMTCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPY-SLPNCDHHV 200
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C PTP C + C N + N KH+ +AY + + + I EI NGPVE
Sbjct: 201 SGQYPACSGEGPTPACKKSCEAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEG 260
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+FTVYED YKSGVY+H TG V+GGHA+K+IGWG + G DYW +AN WN WG +G+F
Sbjct: 261 AFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGV-ESGVDYWWVANSWNNDWGDNGFF 319
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KIK+G +ECGIE +VAG+P
Sbjct: 320 KIKKGVDECGIESQIVAGMP 339
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 102/208 (49%), Positives = 126/208 (60%), Gaps = 20/208 (9%)
Query: 21 NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
N LS DLL+CC F CG+GC+GGYPI AW+++V HG+VT C PY +
Sbjct: 132 NTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPC 191
Query: 70 ----TGCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
G P C E PTPKCV C KN + KH+ +AY + E I EI
Sbjct: 192 GETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEI 251
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
NGP+EV+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW++AN WN
Sbjct: 252 LTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNV 310
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
+WG GYF+I RG NECGIE VAG+P
Sbjct: 311 AWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 197 bits (500), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 40 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 96
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 97 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 155
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 156 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 214
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 215 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 261
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 146/227 (64%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 24 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 80
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 81 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 139
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 140 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 198
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 199 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 245
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 105/216 (48%), Positives = 137/216 (63%), Gaps = 30/216 (13%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
+++ LS +L++CC CGDGC+GGYP +A +YFV G+VT + C Y C
Sbjct: 145 EDVRLSTENLVSCCSS-CGDGCNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAY-SFPPC 202
Query: 73 SH-------PGCEPAYPTPKCVRKC-----VKK---NQLWRNSKHYSISAYRINSDPEDI 117
+H P C+ PTP+C +KC VK+ L++ K YS+S SDP+ I
Sbjct: 203 AHHVASTKYPPCKGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSVS-----SDPKAI 257
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
M EI NGPVEV+FTVYEDF YKSGVY+H+TG+ +GGHAVK+IGWG +D YW++ N
Sbjct: 258 MTEIMNNGPVEVAFTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVEND-TPYWLIVN 316
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
WN +WG G FKI RGSNECGIE++VV LP K
Sbjct: 317 SWNETWGDQGTFKILRGSNECGIEDEVVTALPQKKQ 352
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 135/212 (63%), Gaps = 16/212 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KHY S+Y ++ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVY+H+TG++MGGHAV+++GWG +DG YW++ N WN WG +
Sbjct: 249 VEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G+FKI RG + CGIE ++VAG+P + K+I
Sbjct: 308 GFFKILRGRDHCGIESEIVAGIPCTDQYWKKI 339
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 196 bits (499), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 98/226 (43%), Positives = 146/226 (64%), Gaps = 17/226 (7%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 45 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 101
Query: 63 ----CDPYFDSTGCSH-----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINS 112
C PY +H P C TPKC + C + ++ KHY ++Y +++
Sbjct: 102 SHVGCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSN 161
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
+DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G Y
Sbjct: 162 SEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPY 220
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
W++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 221 WLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 266
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 196 bits (499), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 134/212 (63%), Gaps = 15/212 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
++ +S DLL+CCGF CG GC+GGYP AWRY+ G+V+ C PY
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEH 189
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
G P TP+C R C + ++ KHY I++Y + ++IMAEIYKNGP
Sbjct: 190 HVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGP 249
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F VYEDF YKSGVY+H+TG+ +GGHA++L+GWG D+G YW+ AN WN WG +
Sbjct: 250 VEGAFIVYEDFLMYKSGVYQHVTGEQVGGHAIRLLGWGV-DNGTPYWLAANSWNTDWGDN 308
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G+FKI RG + CGIE ++VAG+PS++ K +
Sbjct: 309 GFFKILRGEDHCGIESEIVAGIPSTERYWKRV 340
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 196 bits (499), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 143/220 (65%), Gaps = 19/220 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 102 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 158
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 159 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 217
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 218 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 276
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 277 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 316
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 196 bits (498), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 131/206 (63%), Gaps = 14/206 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
+FKI RG N CGIE ++VAG+P +++
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQD 334
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 196 bits (498), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 101/203 (49%), Positives = 125/203 (61%), Gaps = 17/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
N LS DL +CC CG GC+GGYP +AW YF G+VT + C PY
Sbjct: 266 NNFYLSAEDLTSCCDS-CGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACD 324
Query: 67 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
TG P C PTP C C + N W + KH+ S+Y + +D + IM EIY NGP
Sbjct: 325 HHVTGKYQP-CGDIQPTPACANSC-QNNATWSSDKHFGASSYSVGTDQQSIMTEIYTNGP 382
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE S+ VY DF YKSGVY+H+TGD +GGHAVK+IGWG D YWI+AN WN WG +
Sbjct: 383 VEASYDVYADFVSYKSGVYQHVTGDYLGGHAVKIIGWGV-DGSTPYWIVANSWNNDWGNN 441
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+F I RGS+ECGIE+ +VAG+P
Sbjct: 442 GFFNILRGSDECGIEDGIVAGIP 464
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 143/227 (62%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 109 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYD 165
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY +Y ++
Sbjct: 166 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVS 224
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
++ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 225 NNERDIMAEIYKNGPVEGAFSVYADFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 283
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++ N WN WG +G+FKI RG + CGIE +VVAG+P + + I
Sbjct: 284 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWRNI 330
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 143/220 (65%), Gaps = 19/220 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 41 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 97
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 98 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 156
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 157 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 215
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 216 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 255
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 103/208 (49%), Positives = 129/208 (62%), Gaps = 20/208 (9%)
Query: 21 NLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
N LS DLL+CC G L CG+GC+GGYPI AW+++V HG+VT C PY +
Sbjct: 133 NTLLSSQDLLSCCTGLLSCGNGCEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPC 192
Query: 70 ----TGCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
G + P C + PTPKCV C N + KH+ +AY + E I EI
Sbjct: 193 GQTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEI 252
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
KNGPVEV+FTVYEDF Y +GVY H +G +GGHAVK++GWG D+G YW++AN WN
Sbjct: 253 LKNGPVEVAFTVYEDFYQYTTGVYVHTSGASLGGHAVKILGWGV-DNGTPYWLVANSWNV 311
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
+WG GYF+I RG NECGIE VAG+P
Sbjct: 312 NWGEKGYFRIIRGLNECGIEHSAVAGIP 339
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 196 bits (497), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 143/220 (65%), Gaps = 19/220 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 39 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 95
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 96 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 154
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 155 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 213
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 214 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 253
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 196 bits (497), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGP E +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPAEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 196 bits (497), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 99/227 (43%), Positives = 145/227 (63%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGSRCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 196 bits (497), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 133/212 (62%), Gaps = 15/212 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
N+ +S DLL+CCGF CG GC+GGYP AW+Y+ G+V+ C PY
Sbjct: 120 NVEISAEDLLSCCGFECGMGCNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEH 179
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
+ G P TP+CV+KC ++ KHY +++Y I ++IMAEIYKNGP
Sbjct: 180 HTNGTRPPCSGEGGETPECVKKCEDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGP 239
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F VY DF YKSGVY+H++G+ +GGHA++++GWG D+G YW+ AN WN WG D
Sbjct: 240 VEGAFVVYSDFLMYKSGVYQHVSGEEVGGHAIRILGWGV-DNGTPYWLAANSWNTDWGED 298
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G+F+I RG + CGIE ++VAG+P + K +
Sbjct: 299 GFFRILRGQDHCGIESEIVAGIPKTSEYWKML 330
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 195 bits (495), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 136/211 (64%), Gaps = 16/211 (7%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
+ +S DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY C H
Sbjct: 1 VEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEH 59
Query: 75 ------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPV
Sbjct: 60 HVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV 119
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 120 EGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNG 178
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 179 FFKILRGQDHCGIESEVVAGIPRTDQYWEKI 209
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/207 (46%), Positives = 131/207 (63%), Gaps = 16/207 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 113 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CE 171
Query: 74 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGP
Sbjct: 172 HHVNGARPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 231
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +
Sbjct: 232 VEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDN 290
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKN 213
G+FKI RG N CGIE ++VAG+P ++
Sbjct: 291 GFFKILRGENHCGIESEIVAGIPRTQQ 317
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 131/206 (63%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S DLL CC CG GC+GGYP +AW Y+ G+VT + C PY C
Sbjct: 135 QVDISAEDLLDCCDS-CGAGCNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPY-SLAPCE 192
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C PTPKCV C K + +++ KH+ Y I+SD + I EI+KNGP
Sbjct: 193 HHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGP 252
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE FTVY DF YKSGVY+H +GDV+GGHA++++GWGT ++G YW++AN WN WG
Sbjct: 253 VEADFTVYADFLSYKSGVYQHQSGDVLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGDH 311
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
GYFKI RG +ECGIE+D+ AG+P ++
Sbjct: 312 GYFKILRGKDECGIEDDINAGIPKNE 337
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 140/220 (63%), Gaps = 19/220 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 130 SDRICIHTNAHVSVE---VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYD 186
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C ++ KHY ++Y ++
Sbjct: 187 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKSCEPGYTPTYKQDKHYGYNSYSVS 245
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 246 NSERDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 304
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
YW++ N WN WG +G+FKI RG + CGIE +VVAG+P +
Sbjct: 305 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 344
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 131/206 (63%), Gaps = 16/206 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 51 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CE 109
Query: 74 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGP
Sbjct: 110 HHVNGARPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 169
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +
Sbjct: 170 VEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDN 228
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
G+FKI RG N CGIE ++VAG+P ++
Sbjct: 229 GFFKILRGENHCGIESEIVAGIPRTQ 254
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/206 (47%), Positives = 131/206 (63%), Gaps = 16/206 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 57 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPP-CE 115
Query: 74 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGP
Sbjct: 116 HHVNGARPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGP 175
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +
Sbjct: 176 VEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNADWGDN 234
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
G+FKI RG N CGIE ++VAG+P ++
Sbjct: 235 GFFKILRGENHCGIESEIVAGIPRTQ 260
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
+FKI RG N CGIE ++VAG+P ++
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQQ 334
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 194 bits (493), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 95/205 (46%), Positives = 134/205 (65%), Gaps = 16/205 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 2 SVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCE 60
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGP
Sbjct: 61 HHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGP 120
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +
Sbjct: 121 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDN 179
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSS 211
G+FKI RG + CGIE +VVAG+P +
Sbjct: 180 GFFKILRGQDHCGIESEVVAGIPRT 204
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 194 bits (493), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
+FKI RG N CGIE ++VAG+P ++
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRTQQ 334
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 194 bits (492), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 130/206 (63%), Gaps = 14/206 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 62 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH 121
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TPKC + C + ++ KHY ++Y ++ ++IMAEIYKNGPV
Sbjct: 122 HVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPV 181
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G
Sbjct: 182 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNG 240
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
+FKI RG N CGIE ++VAG+P ++
Sbjct: 241 FFKILRGENHCGIESEIVAGIPRTQQ 266
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 194 bits (492), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 128/207 (61%), Gaps = 15/207 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
N+ +S DLL CCGF CG+GC+GG+P AW ++ G+V+ C PY
Sbjct: 153 NVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH 212
Query: 68 DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
G P TPKC R C ++ KH+ S+Y + S +IMAEIYKNGP
Sbjct: 213 HVNGSRPPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGP 272
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H+TG++MGGHAV+++GWG +DG YW++ N WN WG
Sbjct: 273 VEAAFSVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDS 331
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKN 213
G+FKI RG + CGIE ++VAGLP ++
Sbjct: 332 GFFKILRGQDHCGIESEIVAGLPCTEQ 358
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 193 bits (490), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 100/206 (48%), Positives = 127/206 (61%), Gaps = 20/206 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT Y + TGC +P C
Sbjct: 147 QLSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQEKTGCKPYPYPPC 204
Query: 78 E-----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
E P+ YPT KC R C L + H+ SAY ++ +I EI
Sbjct: 205 EHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMT 264
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVEV+F+VYEDF HY GVY H G +GGHAVK++GWG D+G YW+ AN WN W
Sbjct: 265 HGPVEVAFSVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDW 323
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
G +GYF+I RG NECGIE VV G+P
Sbjct: 324 GENGYFRIIRGVNECGIESGVVGGIP 349
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 193 bits (490), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 132/208 (63%), Gaps = 16/208 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 249 VEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
G+FKI RG + CGIE ++VAG+P + +
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPCTPHF 335
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 192 bits (489), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 99/206 (48%), Positives = 126/206 (61%), Gaps = 20/206 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
+S+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT Y + +GC +P C
Sbjct: 147 QISISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQEKSGCKPYPYPPC 204
Query: 78 E-----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
E P+ YPT KC C L + H+ SAY ++ P +I EI
Sbjct: 205 EHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMT 264
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVEV+FTVYEDF HY GVY H G +GGHAVK++GWG D+G YW+ AN WN W
Sbjct: 265 HGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDW 323
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
G +GYF+I RG NECGIE VV G P
Sbjct: 324 GENGYFRIIRGVNECGIESGVVGGTP 349
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 192 bits (489), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 94/208 (45%), Positives = 132/208 (63%), Gaps = 16/208 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C ++ KH+ S+Y I+ + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVY+H+TGD+MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 249 VEGAFTVYSDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
G+FKI RG + CGIE ++VAG+P + +
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPCTPHF 335
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 192 bits (489), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 99/205 (48%), Positives = 122/205 (59%), Gaps = 17/205 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N+ LS D+L+CCG CG GC GGYPI AWRYF+ HGV T + C PY C
Sbjct: 145 NVGLSATDILSCCGTTCGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHP-CG 203
Query: 74 HPGCEPAY--------PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H E Y PTP+C + C + + K Y SAY + ++ + I EI N
Sbjct: 204 HHRNEIYYGECPKEIFPTPQCTQSCQAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTN 263
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV+ +F VYEDF+ Y+SG+Y H G GGHAVKLIGWG DDG YW+ AN WN WG
Sbjct: 264 GPVQAAFMVYEDFSRYRSGIYVHTAGRREGGHAVKLIGWGVDDDGNKYWLAANSWNSDWG 323
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+GYF+I RG + CGIE VVAG+P
Sbjct: 324 ENGYFRIVRGVDHCGIESAVVAGMP 348
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 192 bits (489), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 98/206 (47%), Positives = 131/206 (63%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
+++S DLL CC CG GCDGGYP +AW Y+ G+V++ C PY C
Sbjct: 135 QVNISAEDLLDCCDS-CGAGCDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPY-SLAPCE 192
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C PTPKCV C K + +++ KH+ Y I+S+ + I EI+KNGP
Sbjct: 193 HHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGP 252
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE FTVY DF YKSGVY+H +GDV+GGHA++++GWGT ++G YW++AN WN WG
Sbjct: 253 VEADFTVYADFLSYKSGVYQHHSGDVLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGDH 311
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
GYFKI RG +ECGIE+D+ AG+P +
Sbjct: 312 GYFKILRGKDECGIEDDINAGIPKDE 337
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 192 bits (489), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 105/223 (47%), Positives = 136/223 (60%), Gaps = 23/223 (10%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEE- 62
++R ++S+ V N LS D+L CC + CGDGC+GGYPI AW+Y+V +G+VT
Sbjct: 119 SDRTCIASNGVV---NTLLSAEDILTCCIGEYYCGDGCEGGYPIQAWKYWVKNGLVTGGS 175
Query: 63 ------CDPYFDS------TGCSHPGCEPA-YPTPKCVRKCVKKNQL---WRNSKHYSIS 106
C PY + G + P C + TPKCV C + + KHY +
Sbjct: 176 YESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYGAT 235
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
AY ++ + I +EI KNGPVEV FTVY DF YKSGVY H+ G +GGHAVKL+GWG
Sbjct: 236 AYAVSRKVDQIQSEILKNGPVEVGFTVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGV- 294
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
D+G YW+ AN WN +WG +GYF+I RG NECGIE VVAG+P
Sbjct: 295 DNGTPYWLAANSWNTNWGENGYFRILRGVNECGIESQVVAGMP 337
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 192 bits (488), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 100/208 (48%), Positives = 126/208 (60%), Gaps = 20/208 (9%)
Query: 21 NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
N LS DLL+CC F CG+GC+GGYPI AW+++ HG+VT C PY +
Sbjct: 132 NTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPC 191
Query: 70 ----TGCSHPGC-EPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEI 121
G + P C E PTPKCV C + + KH+ +AY + E I EI
Sbjct: 192 GQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEI 251
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
KNGP+EV+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW++AN WN
Sbjct: 252 LKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNI 310
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
+WG GYF+I RG NECGIE VAG+P
Sbjct: 311 NWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/208 (48%), Positives = 126/208 (60%), Gaps = 20/208 (9%)
Query: 21 NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS-- 69
N LS DLL+CC F CG+GC+GGYPI AW+++ HG+VT C PY +
Sbjct: 132 NTLLSSEDLLSCCTGIFSCGNGCEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPC 191
Query: 70 ----TGCSHPGC-EPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEI 121
G + P C E PTPKCV C + + KH+ +AY + E I EI
Sbjct: 192 GQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEI 251
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
KNGP+EV+FTVYEDF Y +GVY H G +GGHAVK++GWG D+G YW++AN WN
Sbjct: 252 LKNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWNI 310
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
+WG GYF+I RG NECGIE VAG+P
Sbjct: 311 NWGEKGYFRIIRGLNECGIEHSAVAGIP 338
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 94/212 (44%), Positives = 132/212 (62%), Gaps = 16/212 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C ++ KHY ++Y +++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+V+ DF YKSGVY+H+TG++MGGHAV+++GWG +D YW++ N WN WG
Sbjct: 249 VEAAFSVFSDFLQYKSGVYQHVTGEMMGGHAVRILGWGVEND-TPYWLVGNSWNTDWGDH 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G+FKI RG + CGIE +VVAG+P ++ K I
Sbjct: 308 GFFKILRGRDHCGIESEVVAGIPCTEQYWKRI 339
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 191 bits (484), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 126/204 (61%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N +S DLLACC CG+GC GG+P AWRY+ G+VT + C PY C
Sbjct: 145 NAEISAEDLLACCSS-CGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYM-IPACD 202
Query: 74 H-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P + TPKC +KC N +++ KHY ++Y ++S E IM EI NG
Sbjct: 203 HHVVGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVDSV-EKIMTEIMTNG 261
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVYEDF YKSGVY+H TG +GGHAVK++GWG D+G YWI+AN WN WG
Sbjct: 262 PVEAAFTVYEDFLSYKSGVYQHRTGQELGGHAVKILGWG-EDNGTPYWIVANSWNPDWGN 320
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+F I RG +ECGIE +VAGLP
Sbjct: 321 QGFFNILRGKDECGIESQIVAGLP 344
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 135/215 (62%), Gaps = 17/215 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
LQN+ +S DLL CCGF CG+GC+GG+P AW ++ G+V+ C PY
Sbjct: 128 LQNVEVSAEDLLTCCGFQCGEGCNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPY-SIPP 186
Query: 72 CSH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C H P C TPKC + C + ++ KH+ Y + SD ++IM EIYK
Sbjct: 187 CEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVPSDEKEIMVEIYK 246
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPVE +F+VY DF YKSGVY+H+TG+++GGHAV+++GWG ++G YW++ N WN W
Sbjct: 247 NGPVEAAFSVYSDFLLYKSGVYQHVTGEMVGGHAVRILGWGV-ENGTPYWLVGNSWNTDW 305
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G +G+FKI RG + CGIE ++VAG+P + + + I
Sbjct: 306 GDNGFFKILRGRDHCGIESEIVAGIPCTGHYSERI 340
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 101/203 (49%), Positives = 127/203 (62%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
+N +S DLL CCGF CG GC+GG AW +F + G VT E C PY
Sbjct: 127 KNPHISAEDLLTCCGFWCGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCE 186
Query: 67 FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
++G P CE + PTPKC R C + N + + KH S Y I +D E I EIY NG
Sbjct: 187 HHTSGSKKP-CEGSEPTPKCKRSCREGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLNG 245
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY DF +YKSGVYK+ TG+ +GGHA+K++GWG ++ YW++AN WN WG
Sbjct: 246 PVEAAFTVYSDFPNYKSGVYKYTTGNALGGHAIKILGWGVENN-VPYWLVANSWNPDWGD 304
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G+FKI RGSNECGIE VVAG+
Sbjct: 305 KGFFKILRGSNECGIEASVVAGM 327
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/206 (47%), Positives = 124/206 (60%), Gaps = 20/206 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
+S+S +D+ ACCG CG+GC+GGYPI AWR++V +G VT Y + TGC +P C
Sbjct: 147 QVSISADDINACCGMACGNGCNGGYPIEAWRHYVKNGYVTG--GSYQEKTGCKPYPYPPC 204
Query: 78 E-------------PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
E YPT KC R C L ++ H+ SAY ++ +I EI
Sbjct: 205 EHHVNGTHYKPCPSDMYPTDKCERSCQAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMT 264
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPVEV+FTVY DF Y GVY H G +GGHAVK++GWG D+G YW+ AN WN W
Sbjct: 265 NGPVEVAFTVYADFEVYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDW 323
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
G +GYF+I RG NECGIE VV G+P
Sbjct: 324 GENGYFRIIRGVNECGIEHGVVGGIP 349
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 190 bits (482), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 135/218 (61%), Gaps = 20/218 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S +SL+ +S DLL+CC CG GC GGYP SAW ++ G+VT
Sbjct: 115 SDRLCIHSGSKISLE---ISAEDLLSCCD-ECGMGCSGGYPSSAWEFWTKKGLVTGGLCG 170
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRIN 111
C PY + C H P C+ TPKC +KC+ + KH+ +Y +
Sbjct: 171 SEVGCRPYSIAP-CEHHVNGTRPPCQGTQETPKCEKKCIDGYLTSYLKDKHFGKRSYSLP 229
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S E IM E+YKNGPVE +FTVY DF YK+GVY+H+TG+V+GGHA+K++GWG + G
Sbjct: 230 SQQEQIMTELYKNGPVEAAFTVYADFLLYKTGVYQHVTGEVLGGHAIKILGWG-EESGTP 288
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG G+FKIKRG++ECGIE ++VAG P
Sbjct: 289 YWLAANSWNGDWGDKGFFKIKRGNDECGIESEMVAGTP 326
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 190 bits (482), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 135/206 (65%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 131 NVEVSAEDLLSCCGSECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 189
Query: 74 H--PGCEPAYP-----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G PA TP C +KC + + +++ K+Y ++Y + S ++IMAEIYKNG
Sbjct: 190 HHVNGSRPACTGEEGDTPTCRKKCEEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEIYKNG 249
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F+VYEDF HYKSGVY+H+ G+++GGHA++++GWG ++G YW+ AN WN WG
Sbjct: 250 PVEGAFSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGV-ENGIRYWLAANSWNIDWGD 308
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
+G+FK RG N CGIE +++AG+P +
Sbjct: 309 NGFFKFLRGKNHCGIESEIIAGIPRT 334
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 190 bits (482), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 130/204 (63%), Gaps = 14/204 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW +++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGSQCGDGCNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TPKC + C + ++ KHY ++Y ++++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPQCTGEGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++ YW++AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDIMGGHAIRILGWGV-ENSVPYWLVANSWNVDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
FKI RG + CGIE ++VAG+P +
Sbjct: 309 LFKILRGEDHCGIESEIVAGIPRT 332
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 189 bits (481), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 138/220 (62%), Gaps = 20/220 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S +SL+ +S DLL CC CG GC GG+P +AW ++ + G+VT
Sbjct: 117 SDRICIQSGGKISLE---ISAEDLLTCCD-ECGMGCFGGFPSAAWEFWTNKGLVTGGLFD 172
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRIN 111
C PY + C H P C+ TPKCV +C L + KH+ +Y I
Sbjct: 173 SKVGCRPYTLAP-CEHHVNGSRPPCQGEVETPKCVTQCNNGYSLSYPKDKHFGQRSYSIP 231
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S E IM E+YKNGPVE +F+VY DF YK+GVY+H+TGD++GGHAVK++GWG ++G
Sbjct: 232 SQQEQIMTELYKNGPVEAAFSVYADFLLYKNGVYQHVTGDMLGGHAVKILGWG-EENGTP 290
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
YW++AN WN WG G+FKIKRG++ECGIE ++VAG P S
Sbjct: 291 YWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVAGAPLS 330
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 189 bits (481), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 94/210 (44%), Positives = 131/210 (62%), Gaps = 17/210 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ +++S D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193
Query: 73 SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C PTP C RKC +++R K Y AY + + I +EI KN
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKN 253
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV SF VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
GYF+I RGSN+CGIE + AG+ +++L
Sbjct: 313 EKGYFRIVRGSNDCGIEGTIAAGIVDTESL 342
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 189 bits (480), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 129/204 (63%), Gaps = 14/204 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+ AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
+FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 189 bits (480), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 93/207 (44%), Positives = 131/207 (63%), Gaps = 15/207 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
++ +S DLL+CCGF CG GC+GGYP AWRY+ G+V+ C PY
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEH 189
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
G P TP+C R C + ++ KHY I++Y + ++IMAEIYKNGP
Sbjct: 190 HVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGP 249
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+ AN WN WG +
Sbjct: 250 VEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDN 308
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKN 213
G+FKI RG + CGIE ++VAG+P ++
Sbjct: 309 GFFKILRGEDHCGIESEIVAGVPRTEQ 335
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 189 bits (480), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 100/205 (48%), Positives = 129/205 (62%), Gaps = 18/205 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N LS D+L+CC CG GCDGGYPI+AW+Y V G T C PY +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189
Query: 69 STG-CSHPGC-EPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ G + P C + Y TP CV KC K N +++ KH+ +AY + I AEI +
Sbjct: 190 TVGNVTWPDCPDDGYNTPACVNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAH 249
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYEDF YKSGVY H TG +GGHA++++GWGT D+G YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+GYF+I RG+NECGIE VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 189 bits (479), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 101/211 (47%), Positives = 133/211 (63%), Gaps = 18/211 (8%)
Query: 14 SPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY 66
SP + + LS +DLL+CC CG+GC+GG+P SAW ++V G+VT + C PY
Sbjct: 137 SPSGGPKRVHLSADDLLSCC-RTCGNGCNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPY 195
Query: 67 FDSTGCSH-------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIM 118
C H P + PTP+CV C K + + + KHY S+Y + S+ + I
Sbjct: 196 -PIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDVDYHDDKHYGKSSYSVPSEEKQIQ 254
Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 178
AEI NGPVE FTVY DF HYKSGVY+ T + +GGHA++L+GWG ++G YW+ AN
Sbjct: 255 AEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDEALGGHAIRLLGWGV-ENGVPYWLAANS 313
Query: 179 WNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
WN WG G+FKI RGS+ECGIE+DVVAGLP
Sbjct: 314 WNTEWGDKGFFKILRGSDECGIEDDVVAGLP 344
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 189 bits (479), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 100/201 (49%), Positives = 127/201 (63%), Gaps = 18/201 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
L+ +D+L+CC CG GC+GG+P SAW Y+VH G+VT E C PY C H
Sbjct: 174 LAADDVLSCC-TECGAGCNGGFPGSAWSYWVHKGIVTGGNYDSDEGCMPY-PIKACDHHV 231
Query: 75 -----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
P + PTP+CVR C K + + + KHY AY + + + I AEI NGPVE
Sbjct: 232 NGTLGPCDKTIPPTPRCVRMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVE 291
Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
FTVYEDF HYKSGVY+ T +GGHA++L+GWG ++G YW+ AN WN WG G+
Sbjct: 292 ADFTVYEDFLHYKSGVYQRHTDSALGGHAIRLLGWGV-ENGVPYWLAANSWNTEWGDKGF 350
Query: 189 FKIKRGSNECGIEEDVVAGLP 209
FKI RGS+ECGIE D+VAGLP
Sbjct: 351 FKILRGSDECGIESDIVAGLP 371
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 126/204 (61%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
Q+ +S DLL+CC CG GC GG+P AW Y+ G+VT C PY C
Sbjct: 124 QSPEISAEDLLSCCD-QCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPY-SIAPC 181
Query: 73 SH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C TPKC C+ K + ++ KH+ Y + SD + IM E+Y NG
Sbjct: 182 EHHVNGTRPPCSGEQDTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYTNG 241
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVYEDF YKSGVY+H+TG +GGHAVK++GWG ++G +W++AN WN WG
Sbjct: 242 PVEAAFTVYEDFPLYKSGVYQHLTGSALGGHAVKILGWG-EENGTPFWLVANSWNSDWGD 300
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+GYFKI RG +ECGIE ++VAGLP
Sbjct: 301 NGYFKILRGHDECGIESEMVAGLP 324
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 101/202 (50%), Positives = 130/202 (64%), Gaps = 17/202 (8%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
+ L+ +D+L+CC + CG GC+GG+P +AW Y+V G+VT E C PY C H
Sbjct: 140 VHLAADDVLSCC-WGCGSGCNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPY-PVPSCDH 197
Query: 75 P------GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C PTPKCVR C K N +++ KHY S+Y + S+ I EI KNGPV
Sbjct: 198 HVNGTLGPCGQDPPTPKCVRLCRKGYNVDFKDDKHYGKSSYSVPSNETQIQMEIMKNGPV 257
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTVY DF YKSGVYK + D +GGHA++++GWG +D YW++AN WN WG G
Sbjct: 258 EGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVEND-VPYWLVANSWNTEWGDKG 316
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YFKI RGSNECGIEED+VAG+P
Sbjct: 317 YFKILRGSNECGIEEDIVAGIP 338
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 99/202 (49%), Positives = 132/202 (65%), Gaps = 17/202 (8%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
+ L+ +D+L+CC + CG GC+GG+P +AW Y+V G+VT E C PY C H
Sbjct: 139 VHLAADDVLSCC-WGCGSGCNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPY-PVPSCDH 196
Query: 75 ------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C PTPKCVR C K + +++ KHY S+Y ++S+ I EI KNGPV
Sbjct: 197 HVNGTLGPCGQDPPTPKCVRLCRKGYNIDFKDDKHYGKSSYSVSSNETQIQMEIMKNGPV 256
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTVY DF YKSGVYK + D +GGHA++++GWG ++G +W++AN WN WG G
Sbjct: 257 EGAFTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGV-ENGVPFWLVANSWNTEWGDKG 315
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YFKI RGSNECGIEED+VAG+P
Sbjct: 316 YFKILRGSNECGIEEDIVAGIP 337
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/203 (49%), Positives = 127/203 (62%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL CC CG GC+GG+P SAW Y+V G+VT C PY ++ C
Sbjct: 133 NVEISAEDLLTCCD-SCGMGCNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIAS-CE 190
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP+CV C K N +R K++ +Y I+ + I EI NGP
Sbjct: 191 HHTKGKLPPCGDIVDTPQCVHMCEKGYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNGP 250
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVY+H+TG+ MGGHAV+++GWGT + G YW++AN WN WG
Sbjct: 251 VEAAFTVYADFVTYKSGVYRHVTGEEMGGHAVRILGWGT-ESGTPYWLVANSWNTDWGDK 309
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYFKI RGS+ECGIE +VAGLP
Sbjct: 310 GYFKILRGSDECGIESSIVAGLP 332
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 131/210 (62%), Gaps = 17/210 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ +++S D++ CC CGDGC+GG+PI AW+YF++ GVV+ + C PY C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPY-PIHPC 193
Query: 73 SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C PTP C RKC +++R K Y AY + + I +EI +N
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRN 253
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV SF VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
GYF+I RG+N+CGIE + AG+ +++L
Sbjct: 313 EKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 128/203 (63%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+++S DLL CC CG GC+GGYP +AW ++ G+VT + C PY+ C
Sbjct: 134 QVNISAEDLLTCCD-SCGAGCNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPP-CE 191
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C PTP+CVR C K + + KHY+ Y +++D I EI+KNGP
Sbjct: 192 HHTVGPLPNCTGIKPTPQCVRDCRKGYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGP 251
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE FTVY DF YKSGVY+ + D +GGHA++++GWGT ++G YW++AN WN WG
Sbjct: 252 VEADFTVYADFVSYKSGVYQRHSDDALGGHAIRILGWGT-ENGVPYWLVANSWNEDWGDK 310
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYFKI RG++ECGIE+D+ AG+P
Sbjct: 311 GYFKILRGNDECGIEDDINAGIP 333
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 102/207 (49%), Positives = 126/207 (60%), Gaps = 21/207 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q +++S +DLL+CC CG GCDGG P +AW Y+V +G+VT Y +GC +P
Sbjct: 144 QKVTISADDLLSCCD-ECGFGCDGGDPYAAWSYWVSNGIVTGS--NYTSKSGCKPYPYPP 200
Query: 77 CE-------------PAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIY 122
CE YPT C KC + NS KHY S Y + D I EI
Sbjct: 201 CEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQDVASIQKEIM 260
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
NGPVEV+F VYEDF HY SG+YKH TGD +GGHAVK++GWGT ++G DYWI AN WN
Sbjct: 261 TNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGT-ENGTDYWICANSWNSD 319
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
WG +G+F+I RG +EC IE VVAG P
Sbjct: 320 WGENGFFRILRGVDECQIESSVVAGEP 346
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + +++ KH+ S+Y ++S+ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG +D YW++ N WN WG
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDK 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 132/205 (64%), Gaps = 16/205 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGSECGDGCNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + +++ KH+ S+Y ++S+ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG +D YW++ N WN WG
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEMMGGHAIRILGWGVEND-TPYWLVGNSWNTDWGDK 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSS 211
G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMPCT 332
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 133/206 (64%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CCG LCG+GC+GGYP AW+Y+ G+V+ C PY C
Sbjct: 130 NVEVSAEDLLSCCGPLCGEGCNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPY-SIPPCE 188
Query: 74 H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C TPKC + C + ++ K+Y S+Y + S ++IMAEIYKNG
Sbjct: 189 HHVNGTRPKCTGEGGDTPKCSKTCEPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F+V+ DF YKSGVYKH+ G+V+GGHA++++GWG ++G YW++ N WN WG
Sbjct: 249 PVEAAFSVFSDFLTYKSGVYKHVAGEVLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
+G+FKI RG + CGIE +VVAG+P +
Sbjct: 308 NGFFKILRGEDHCGIESEVVAGIPRT 333
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 187 bits (475), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 98/200 (49%), Positives = 127/200 (63%), Gaps = 17/200 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
+S DLL+CC CG GC+GGYP SAW ++ G+VT + C PY C H
Sbjct: 57 ISAEDLLSCC-ETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPY-KIAACDHHV 114
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C+ PTPKC RKC N + + KH+ SAY + SDP +I EI NGPVE
Sbjct: 115 VGKLKPCKGDSPTPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEG 174
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+FTVY DF YKSGVY+H +G +GGHA+K++GWG ++G YW++AN WN WG +G+F
Sbjct: 175 AFTVYADFPTYKSGVYQHTSGSALGGHAIKILGWG-EENGTPYWLVANSWNSDWGDEGFF 233
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KIKRG++ECGIE +V GLP
Sbjct: 234 KIKRGNDECGIESGIVGGLP 253
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 187 bits (475), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 18/205 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N LS D+L+CC CG GC+GGYPI+AW+Y V G T C PY +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189
Query: 69 STG-CSHPGCEP-AYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ G + P C Y TP CV KC N +++ KH+ +AY + I AEI +
Sbjct: 190 TVGNTTWPACPTDGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAH 249
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYEDF YKSGVY H TG+ +GGHA++++GWGT D+G YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKSGVYVHTTGEELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+GYF+I RG+NECGIE VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 187 bits (474), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 94/205 (45%), Positives = 130/205 (63%), Gaps = 19/205 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGC 77
++ +S DLL+CCGF CG GC+GGYP AWRY+ G+V+ Y GC + P C
Sbjct: 130 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGL--YDSHVGCRAYTIPPC 187
Query: 78 E------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
E TP+C R C + ++ KHY I++Y + ++IMAEIYKN
Sbjct: 188 EHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKN 247
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +F VYEDF YKSGVY+H++G+ +GGHA++++GWG ++G YW+ AN WN WG
Sbjct: 248 GPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWG 306
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 307 ITGFFKILRGEDHCGIESEIVAGVP 331
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 187 bits (474), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 99/214 (46%), Positives = 130/214 (60%), Gaps = 20/214 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
++LS +DLL+CC CG GC+GG P+ AW+Y+V HG+VT + C PY C
Sbjct: 171 QVTLSADDLLSCCR-TCGFGCEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPY-PFPPCE 228
Query: 74 H--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
H P YPTPKC +KCV K + + + + Y +AY + +D I EI
Sbjct: 229 HHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILT 288
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVEV+F VYEDF HY G+Y H G + GGHAVKLIGWG D G YW++AN WN W
Sbjct: 289 HGPVEVAFEVYEDFLHYAGGIYVHTGGKLGGGHAVKLIGWGI-DQGTPYWLIANSWNTDW 347
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
G +G+F+I RG +ECGIE VV G+P S N+ +
Sbjct: 348 GEEGFFRILRGVDECGIESGVVGGIPKSTNIQRR 381
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 186 bits (473), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 96/207 (46%), Positives = 125/207 (60%), Gaps = 24/207 (11%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG-------- 71
Q++ LS ++L CC CG GC+GGYP SA Y+V G+VT + +++TG
Sbjct: 140 QDIRLSAQNMLTCCA-TCGQGCNGGYPASAMSYYVKTGLVTGD---LYNTTGWCQAYSFA 195
Query: 72 -CSH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 122
C+H P C PTPKC + C Q + + H AY + E IM EI
Sbjct: 196 PCAHHVDTPLYPACTGELPTPKCAKTCDSGSGQTY--TVHKGSKAYSVGKTQEAIMTEIQ 253
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
NGPVE +FTVYEDF +YKSGVYKH+TG +GGHA+K++GWG ++ YWI+ N WN++
Sbjct: 254 TNGPVEAAFTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENN-TPYWIVVNSWNQT 312
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
WG +G FKI RG NECGIE VV LP
Sbjct: 313 WGDNGTFKILRGKNECGIEAQVVTALP 339
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 186 bits (473), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 133/206 (64%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CCGF CG GC+GGYP AW+++ G+V+ C PY C
Sbjct: 130 NVEVSAEDLLSCCGFECGMGCNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPY-SIPPCE 188
Query: 74 H--PGCEPAYP-----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G PA TPKCV++C ++ + KH+ ++Y + S ++IMAEIYKNG
Sbjct: 189 HHVNGSRPACKGEEGDTPKCVKQCEDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VY DF YKSGVY+H TG+ +GGHA+K++GWG ++G YW+ AN WN WG
Sbjct: 249 PVEGAFLVYADFPMYKSGVYQHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
+G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 NGFFKILRGKDHCGIESEIVAGIPKN 333
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 186 bits (473), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 97/207 (46%), Positives = 125/207 (60%), Gaps = 21/207 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS---- 69
N +S DLL+CC CGDGCDGGYP+ AWRY+V G+V+ C PY +
Sbjct: 128 NTFVSAEDLLSCCT-SCGDGCDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQ 186
Query: 70 --TGCSHPGCEPAY--PTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 122
G + P C PA TP+C C K+ + KHY +SAY + I EI
Sbjct: 187 TVNGVTWPKC-PAQEEATPECASHCTSKSSYSVAYEKDKHYGLSAYPVGRKEAQIQTEIL 245
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
++GPVE F VY DF YKSG+Y H++G +GGHAVK++GWG ++G YW++AN WN +
Sbjct: 246 QHGPVEAGFLVYSDFYRYKSGIYTHVSGQELGGHAVKILGWGV-ENGTKYWLVANSWNIN 304
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
WG GYF+I RG NECGIE VVAG+P
Sbjct: 305 WGEKGYFRILRGRNECGIESAVVAGIP 331
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 186 bits (473), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 128/205 (62%), Gaps = 18/205 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N LS D+L+CC CG GC+GGYPI+AW+Y V G T C PY +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGE 189
Query: 69 STG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ G + P C + Y TP CV KC N +++ KH+ +AY + I AEI +
Sbjct: 190 TVGNTTWPDCPQDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAH 249
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYEDF YKSGVY H TG +GGHA++++GWGT D+G YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKSGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+GYF+I RG+NECGIE VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 186 bits (473), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 125/208 (60%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
Q++ LS +LL CC CGDGCDGG+P +A Y+V+ G+VT + C Y + C
Sbjct: 141 QDIRLSTQNLLTCCA-ACGDGCDGGWPEAAMDYYVNTGLVTGDLYGNNSWCQAYTFAP-C 198
Query: 73 SH-------PGCEPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 122
+H P C PTP C+ C + + H AY I D + IMAEIY
Sbjct: 199 AHHVTSDIYPPCTGELPTPPCINSCDSNSTHTIPYSKDIHRGSKAYGIAKDEKAIMAEIY 258
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
KNGP+EV+ TVYEDF YK+GVY+H+TGD +GGHAVK++GWG ++G YW + N WN S
Sbjct: 259 KNGPIEVALTVYEDFLTYKTGVYQHVTGDELGGHAVKMVGWGV-ENGTPYWTIVNSWNES 317
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPS 210
WG G FKI RG NECGIE V LP+
Sbjct: 318 WGDKGTFKILRGKNECGIESSCVTALPA 345
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 186 bits (473), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 133/206 (64%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CCG CGDGC+GGYP +AW+Y+ G+V+ C PY C
Sbjct: 130 NVEVSAEDLLSCCGLECGDGCNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C TPKC + C + ++ KH+ +Y ++S+ ++IMAEIYKNG
Sbjct: 189 HHVNGTRPQCTGEGGDTPKCSKTCEPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTV+ DF YK+GVYKH+ G+++GGHA++++GWG ++G YW++ N WN WG
Sbjct: 249 PVEGAFTVFSDFLMYKTGVYKHLAGEMLGGHAIRILGWG-KENGVPYWLVGNSWNVDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 SGFFKIVRGEDHCGIESEIVAGIPRT 333
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 186 bits (472), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 127/205 (61%), Gaps = 18/205 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N LS D+L+CC CG GC+GGYPI+AW+Y V G T C PY +
Sbjct: 131 NTLLSAEDVLSCCSN-CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGE 189
Query: 69 STG-CSHPGC-EPAYPTPKCVRKCVKKNQ--LWRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ G + P C + Y TP CV KC KN + KH+ +AY + I AEI +
Sbjct: 190 TVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAH 249
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYEDF YK+GVY H TG +GGHA++++GWGT D+G YW++AN WN +WG
Sbjct: 250 GPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNVNWG 308
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+GYF+I RG+NECGIE VV G+P
Sbjct: 309 ENGYFRIIRGTNECGIEHAVVGGVP 333
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 186 bits (472), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 134/212 (63%), Gaps = 16/212 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C ++ KHY S+Y ++S ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKFCEPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVY+H+TG++MGGHAV+++GWG ++G YW++ N WN WG +
Sbjct: 249 VEAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G+FKI RG + CGIE ++VAG+P + K+I
Sbjct: 308 GFFKILRGRDHCGIESEIVAGIPCTDQYWKKI 339
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 186 bits (471), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 14/204 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GD+MGGHA++++ WG ++G YW+ AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWLAANSWNLDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
+FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 186 bits (471), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 95/213 (44%), Positives = 133/213 (62%), Gaps = 17/213 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL CC CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDLLTCCHMECGDGCNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C+ TPKC + C + ++ KHY S+Y + S ++IMAEIYKNG
Sbjct: 189 HHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F+VY DF YKSGVY+H+TG+ +GGHA++++GWG ++G YW+ AN WN WG
Sbjct: 249 PVEGAFSVYTDFLVYKSGVYQHVTGEEVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
+G+FKI RG + CGIE ++VAG+P + K+I
Sbjct: 308 NGFFKILRGQDHCGIESEIVAGIPRTDQYWKKI 340
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 186 bits (471), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 131/203 (64%), Gaps = 17/203 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHP- 75
+S DLL+CC CG GC+GG+P +AW YF G+V+ + C PY + C H
Sbjct: 175 ISSEDLLSCCSS-CGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAP-CEHHV 232
Query: 76 -----GCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C PTPKC R C K ++ + + K++ +AY +++D + IM EI NGPVE
Sbjct: 233 NGTRLPCSGEGPTPKCERTCEKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEG 292
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+FTVY DF YKSGVY+H++G +GGHA++++GWG +DG YW++AN WN WG +G+F
Sbjct: 293 AFTVYADFPTYKSGVYQHVSGGELGGHAIRVLGWGV-EDGTPYWLVANSWNSDWGDNGFF 351
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
KI RG NECGIE ++VAGLP +
Sbjct: 352 KILRGQNECGIEGEIVAGLPKKQ 374
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 186 bits (471), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 127/204 (62%), Gaps = 17/204 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY + C H
Sbjct: 147 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHV 204
Query: 75 ----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ TP C C N + K Y YRI+S+PE IM E+ +NGPVEV
Sbjct: 205 IGPLPSCDGDVETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVEV 264
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW++AN WN WG GYF
Sbjct: 265 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNSDWGDKGYF 323
Query: 190 KIKRGSNECGIEEDVVAGLPSSKN 213
KI RG NECGIE DV AG+P KN
Sbjct: 324 KIVRGKNECGIESDVNAGIPKIKN 347
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 186 bits (471), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 127/204 (62%), Gaps = 14/204 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKN PV
Sbjct: 190 HVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GD+MGGHA++++GWG +G YW+ AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVG-NGVPYWLAANSWNLDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
+FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 186 bits (471), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 128/204 (62%), Gaps = 14/204 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-----D 68
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S P C T +C + C + ++ KH+ ++Y +++ ++IMAEIYKNGPV
Sbjct: 190 HVNGSRPPCTGEGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPV 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+ AN WN WG +G
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
+FKI RG N CGIE ++VAG+P +
Sbjct: 309 FFKILRGENHCGIESEIVAGIPRT 332
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 96/206 (46%), Positives = 127/206 (61%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+++S DLL CC CG GC+GG P +AW Y+ G+VT + C PY C
Sbjct: 135 QVNISAEDLLDCCDS-CGAGCNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPY-SLAPCE 192
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C PTPKCV C K + +++ KH+ Y I+SD + I EI+KNGP
Sbjct: 193 HHTKGSLPNCTGTVPTPKCVHLCRKGYGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGP 252
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE F V DF YKSGVY+H + DV+GGHA++++GWGT ++G YW+ AN WN WG
Sbjct: 253 VEADFIVLADFLSYKSGVYQHHSDDVIGGHAIRILGWGT-ENGTPYWLAANSWNEDWGDH 311
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSK 212
GYFKI RG +ECGIEED+ AG+P ++
Sbjct: 312 GYFKILRGKDECGIEEDINAGIPKNR 337
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 131/205 (63%), Gaps = 16/205 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CC CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCDGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSS 211
G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMPCT 332
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 185 bits (469), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/192 (48%), Positives = 120/192 (62%), Gaps = 17/192 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH-------PG 76
LS D+L+CC CG GC+GG+P AWR+F HG+ TE PY C H
Sbjct: 68 LSAEDMLSCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKYPYVFPP-CEHHINKTHYKP 126
Query: 77 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C P+ PTPKCVR KK +++ S Y ++ P I AEI NGPVE +FTVY+D
Sbjct: 127 CGPSQPTPKCVRASEKK------PRYHGKSVYSVS--PAKIQAEIMTNGPVEAAFTVYQD 178
Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
F Y+SGVY+H++G +GGHA+K++GWG + G YW++AN WN WG G FKI RG +
Sbjct: 179 FLAYQSGVYRHVSGPELGGHAIKIMGWGV-EAGNKYWLVANSWNEDWGDKGTFKIARGDD 237
Query: 197 ECGIEEDVVAGL 208
ECGIE VVAG+
Sbjct: 238 ECGIESSVVAGM 249
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 185 bits (469), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/214 (46%), Positives = 130/214 (60%), Gaps = 23/214 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
+ ++LS +DLL+CC CG GC GG P++AW+Y+V G+VT Y + +GC P
Sbjct: 38 KQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLRGIVTG--SEYTNHSGCRPYPFPP 94
Query: 77 CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
CE YPTPKCV+KC K + ++ K+Y S Y + S+ E I EI
Sbjct: 95 CEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGQSVYNVESNVESIQKEIM 154
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
GPVE SF VY DF +Y G+YKH+ G + GGHAVK++GWG D G YW+ AN WN
Sbjct: 155 TLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGI-DQGVPYWLAANSWNTD 213
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 216
WG DGYF+I RG NECGIE ++AG+P K L K
Sbjct: 214 WGEDGYFRILRGVNECGIESGIIAGIP--KQLAK 245
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 185 bits (469), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/205 (47%), Positives = 126/205 (61%), Gaps = 19/205 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+N+ +S DL +CC CG+GC+GG+P +AW Y+ G+VT + C PY C
Sbjct: 138 ENVHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPY-TIKAC 195
Query: 73 SH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P + PTPKC C N + KHY +SAY ++ E IM EI N
Sbjct: 196 DHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKDKHYGMSAYSVHG-VEKIMTEIMTN 254
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVY DF YKSGVYKH TG +GGHA+K++GWGT ++G+DYW++AN WN WG
Sbjct: 255 GPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGT-ENGDDYWLVANSWNPDWG 313
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG +ECGIE + AG P
Sbjct: 314 DQGFFKILRGQDECGIESQISAGEP 338
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 185 bits (469), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 125/204 (61%), Gaps = 16/204 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST-- 70
Q+ +S DL+ACC CG GC+GGY +AWRYF H G+VT E C PY ++
Sbjct: 125 QSAHISAEDLMACCE-TCGMGCNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCD 183
Query: 71 ----GCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
G P TP+C + C + + KH+ SAY + S E I EI NG
Sbjct: 184 HHVVGKKQPCASKEEHTPRCSKTCEAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNG 243
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY DF YKSGVY+H +G ++GGHA++++GWGT ++G YW++AN WN WGA
Sbjct: 244 PVEGAFTVYADFPTYKSGVYQHTSGAMLGGHAIRILGWGT-ENGTPYWLVANSWNEDWGA 302
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
GYFKI RG ++CGIE + AG+P
Sbjct: 303 MGYFKIIRGKDDCGIESQITAGMP 326
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 95/204 (46%), Positives = 125/204 (61%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S DLL+CC CG GC GGYP +AW Y+ G+VT + C PY C
Sbjct: 128 TVEISAEDLLSCCE-ECGMGCFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPY-SIPPCE 185
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C+ TPKC KC+ + K++ Y + S E IM E+YKNGP
Sbjct: 186 HHVNGTRPPCQGEGDTPKCQTKCIDGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGP 245
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VYEDF YKSGVY+H+TGD++GGHA+K++GWG ++ YW+ AN WN WG
Sbjct: 246 VEAAFSVYEDFLLYKSGVYQHLTGDMLGGHAIKILGWGKENN-TPYWLAANSWNTDWGNQ 304
Query: 187 GYFKIKRGSNECGIEEDVVAGLPS 210
G+FKI RG +ECGIE +VVAG+P
Sbjct: 305 GFFKILRGGDECGIESEVVAGIPQ 328
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 95/206 (46%), Positives = 132/206 (64%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CCGF CG GC+GGYP AWR++ G+V+ C PY C
Sbjct: 130 NVEVSAEDLLSCCGFKCGMGCNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C+ TPKC++ C + + + KH+ ++Y + S ++IMA+IYKNG
Sbjct: 189 HHVNGSRPSCKGEEGDTPKCMKTCEEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VY DF YKSGVY+H TG+ +GGHA+K++GWG ++G YW+ AN WN WG
Sbjct: 249 PVEGAFVVYADFPLYKSGVYQHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
+G+FKI RG + CGIE +VVAG+P +
Sbjct: 308 NGFFKILRGKDHCGIESEVVAGIPKN 333
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 95/195 (48%), Positives = 121/195 (62%), Gaps = 10/195 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S +DLL+CCG CG+GC+GGYPI A R++ GVVT C PY C+
Sbjct: 134 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCT 192
Query: 74 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C P TP C C + + KH+ +SAY + + I AEIY NGPVE +F+
Sbjct: 193 SGNC-PESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFS 251
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVYKH G +GGHA+K+IGWGT + G YW++AN W +WG G+FKI
Sbjct: 252 VYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIY 310
Query: 193 RGSNECGIEEDVVAG 207
RG ++CGIE VVAG
Sbjct: 311 RGDDQCGIESAVVAG 325
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 99/214 (46%), Positives = 129/214 (60%), Gaps = 23/214 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
+ ++LS +DLL+CC CG GC GG P++AW+Y+V G+VT Y + +GC P
Sbjct: 172 KQVTLSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLRGIVTG--SEYTNHSGCRPYPFPP 228
Query: 77 CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
CE YPTPKCV+KC K + ++ K+Y Y + S+ E I EI
Sbjct: 229 CEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESNVESIQKEIM 288
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
GPVE SF VY DF +Y G+YKH+ G + GGHAVK++GWG D G YW+ AN WN
Sbjct: 289 TLGPVEASFEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGI-DQGVPYWLAANSWNTD 347
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 216
WG DGYF+I RG NECGIE ++AG+P K L K
Sbjct: 348 WGEDGYFRILRGVNECGIESGIIAGIP--KQLAK 379
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 184 bits (467), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 99/223 (44%), Positives = 139/223 (62%), Gaps = 19/223 (8%)
Query: 2 SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 61
S ++R + S+ +S++ LS DLL+CC CG GC+GGYP +AW ++ G+V+
Sbjct: 29 SEAMSDRICIHSNAKISVE---LSAEDLLSCC-ESCGMGCNGGYPSAAWDFWTKDGLVSG 84
Query: 62 E-------CDPYF-----DSTGCSHPGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISA 107
C PY S P C TP+CV +C ++ KHY ++
Sbjct: 85 GLYDSHIGCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKTS 144
Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
Y ++SD +DI EIYKNGPVE +FTVYEDF YK+GVY+H+TG +GGHA+K++GWG +
Sbjct: 145 YSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGVYQHVTGSALGGHAIKILGWG-EE 203
Query: 168 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
+G YW+ AN WN WG +G+FKI RGSN CGIE ++VAG+P+
Sbjct: 204 NGIPYWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAGIPN 246
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 131/218 (60%), Gaps = 20/218 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
++R ++S+ V + LS +L+ACC CG GC GG+P +AW Y+ G+VT
Sbjct: 118 SDRTCVASNGKVQVH---LSSENLMACCE-TCGMGCHGGFPEAAWEYWKQDGLVTGGPYG 173
Query: 61 --EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
+ C PY + C H P C PTP+C + C N + KHY+ SAY ++
Sbjct: 174 SMQGCQPY-EIAPCEHHINGSRPACGKIEPTPRCKKTCESGYNVTFNKDKHYAKSAYSVS 232
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S + I EI NGPVE +FTVY DF HYKSGVY+H +G +GGHAVK+IGWG +
Sbjct: 233 SKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGM-EGSTP 291
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW++AN WN WG G+FKI RG +ECGIE D+VAG P
Sbjct: 292 YWLIANSWNSDWGDMGFFKILRGQDECGIERDIVAGEP 329
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/204 (46%), Positives = 132/204 (64%), Gaps = 22/204 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S D+L+CCG CG GC GGY I A +Y+++ GVVT C PY S
Sbjct: 144 QQPIISAEDILSCCGSTCGKGCQGGYTIEAMKYWMNSGVVTGGDYNGAGCMPY------S 197
Query: 74 HPGCEPA----YPTPKCVRKCVKKNQL--WRNSKHYSISAYRINSDPE---DIMAEIYKN 124
P C+ + + TP C C +K ++N KH++ SAY++++ I EIY N
Sbjct: 198 FPPCKKSPCVEFSTPSCKTTCQEKYTTADYKNDKHFATSAYKLSTTKNAVPTIQYEIYHN 257
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE S+ V+EDF YKSGVY H++G+++GGHAVK+IGWGT ++G DYW++AN W S+G
Sbjct: 258 GPVEASYRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGT-ENGVDYWLVANSWGTSFG 316
Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
G+FKI+RG+NEC IE ++VAGL
Sbjct: 317 EKGFFKIRRGTNECQIESNIVAGL 340
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/208 (47%), Positives = 129/208 (62%), Gaps = 17/208 (8%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
L LS +L+ACC CG GC+GG+P SAW Y+ G+VT + C PY +
Sbjct: 140 LHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPY-EFPP 197
Query: 72 CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
C H P CE TPKC C N + K Y + YR++S+ E IM E+ ++
Sbjct: 198 CEHHVVGPRPSCEGDVETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEH 257
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++G YW++AN WN WG
Sbjct: 258 GPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWG-EENGVPYWLIANSWNSDWG 316
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSK 212
+GYFKI RG NECGIE DV AG+P K
Sbjct: 317 DNGYFKIIRGRNECGIESDVNAGIPKLK 344
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 124/200 (62%), Gaps = 17/200 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
+S DL +CC CG+GC+GG+P +AW Y+ G+VT + C PY + C H
Sbjct: 133 ISAEDLNSCCKS-CGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPY-EIKPCEHHI 190
Query: 75 ----PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C PTP+C + C N + KHY+ +AY ++S + I EI NGPVE
Sbjct: 191 NGSRPACGKLEPTPRCKKSCESGYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEA 250
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+FTVY DF HYKSGVY+H +G +GGHAVK+IGWGT + YW++AN WN WG G+F
Sbjct: 251 AFTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGT-EGSTPYWLIANSWNTDWGNMGFF 309
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KI RG +ECGIE D+VAG P
Sbjct: 310 KILRGQDECGIERDIVAGEP 329
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 120/194 (61%), Gaps = 20/194 (10%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE 78
LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT Y D TGC +P CE
Sbjct: 148 LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQDKTGCKPYPYPPCE 205
Query: 79 -----------PA--YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
P+ YPT KC R C L ++ H+ SAY ++ +I EI +
Sbjct: 206 HHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTH 265
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVEV+FTVYEDF HY GVY H G +GGHAVK++GWG D+G YW+ AN WN WG
Sbjct: 266 GPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDWG 324
Query: 185 ADGYFKIKRGSNEC 198
+GYF+I RG NEC
Sbjct: 325 ENGYFRIIRGVNEC 338
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 183 bits (465), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 95/204 (46%), Positives = 125/204 (61%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V T+ C PY + C
Sbjct: 131 KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPY-EIPPC 188
Query: 73 SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC++KC N ++ KHY Y + + I AE+YKNG
Sbjct: 189 EHHVPGNRLPCSGDTKTPKCIKKCEDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D YKSGVYKH+ GD +GGHA+K++GWG ++G YW++AN WN WG
Sbjct: 249 PVEGAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGV-ENGNKYWLIANSWNSDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +VAG P
Sbjct: 308 NGFFKILRGEDHCGIESSIVAGEP 331
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 183 bits (465), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 97/200 (48%), Positives = 123/200 (61%), Gaps = 17/200 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
+S DL+ CC CG GC GGYP AW Y+V +G+VT + C PY C H
Sbjct: 141 ISPEDLVDCCAD-CGMGCQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPY-SFPPCEHHV 198
Query: 77 CEPAYP------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P P TP+CV+KC + + + N K Y + AY I+SD E IM ++ GP+EV
Sbjct: 199 VGPRKPCTGDPTTPQCVKKCQPEYPKTYENDKWYGLKAYSIHSDQEAIMRDLMTYGPLEV 258
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY DF Y SGVY+H+ G ++GGHAV+L+GWG +DG DYW++AN WN WG GYF
Sbjct: 259 DFEVYADFPSYSSGVYRHVAGGLLGGHAVRLVGWGV-EDGADYWLIANSWNTDWGDGGYF 317
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KI+RG NECGIE D AG P
Sbjct: 318 KIRRGVNECGIESDANAGHP 337
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 183 bits (464), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 130/210 (61%), Gaps = 17/210 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ +++S D++ CC CGDGC+GG+PI AW+YF++ GVV+ C PY C
Sbjct: 135 KQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPY-PIHPC 193
Query: 73 SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C PTP C ++C +++R K Y AY + + I +EI +N
Sbjct: 194 GHHGNDTYYGECRGTAPTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRN 253
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV SF VYEDF HYKSG+YKH G++ G HAVK+IGWG +++ D+W++AN W+ WG
Sbjct: 254 GPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG-NENNTDFWLIANSWHNDWG 312
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
GYF+I RG+N+CGIE + AG+ +++L
Sbjct: 313 EKGYFRIIRGTNDCGIEGTIAAGIVDTESL 342
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 183 bits (464), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 95/191 (49%), Positives = 121/191 (63%), Gaps = 12/191 (6%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
+S D++ CCG CG GCDGGY I A R++V GVVT + C PY C+ GC
Sbjct: 140 ISPMDMVDCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQGDGCKPY---QFCNSAGC 196
Query: 78 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
P TP+C C K N + K++ SAY + I +I NGPVE SF VYED
Sbjct: 197 -PDAVTPECALSCQSKYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYED 255
Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
F YKSGVYK+I G ++GGHA+K+IGWGT ++G YW++AN W WG +G+FKI+RG N
Sbjct: 256 FYKYKSGVYKYIAGKMLGGHAIKIIGWGT-ENGTAYWLIANSWGTKWGENGFFKIRRGVN 314
Query: 197 ECGIEEDVVAG 207
ECGIE +VVAG
Sbjct: 315 ECGIENNVVAG 325
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 183 bits (464), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 97/226 (42%), Positives = 142/226 (62%), Gaps = 19/226 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S D+L CCG CGDGC+GG+P AW ++ G+V+
Sbjct: 118 SDRICIRTNGHVSVE---VSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTXXGLVSGGLYD 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C ++ KHY S+Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S ++IMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHAV+++GWG ++G
Sbjct: 234 SSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGVYQHVTGEMMGGHAVRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
YW++ N WN WG +G+FKI RG + CGIE ++VAG+P + K+
Sbjct: 293 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTDQYWKK 338
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 182 bits (463), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 129/212 (60%), Gaps = 30/212 (14%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ Y+DS H GC P
Sbjct: 130 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVS---GGYYDS----HIGCLP- 181
Query: 81 YPTPKC----------------VRKCVKKNQL-----WRNSKHYSISAYRINSDPEDIMA 119
Y P C R+C K + ++ KH+ ++Y +++ + IMA
Sbjct: 182 YTIPPCEHHVNGSRPPCTGEGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKKIMA 241
Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
EIYKNGPVE +FTV+ DF YKSGVYKH GD+MGGHA++++ WG ++G YW AN W
Sbjct: 242 EIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILVWGV-ENGVPYWAAANSW 300
Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
N WG +G+FKI RG N CGIE ++VAG+P +
Sbjct: 301 NLDWGDNGFFKILRGENHCGIESEIVAGIPRT 332
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 182 bits (462), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 96/227 (42%), Positives = 140/227 (61%), Gaps = 18/227 (7%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S D+L CCG CGDGC+GG+P AW ++ G+V+
Sbjct: 118 SDRICIRTNGHVSVE---VSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVSGGLYD 174
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY G P TPKC + C + ++ KHY S+Y ++
Sbjct: 175 SHVGCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVS 234
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S ++IMAEI+KNGPVE +FTVY DF YKSGVY+H+ GD+MGGHAV+++GWG ++G
Sbjct: 235 SSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHAVRILGWGV-ENGTP 293
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++ N WN WG +G+FKI RG + CGIE ++VAG+P + K I
Sbjct: 294 YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTDQYWKRI 340
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 127/204 (62%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C
Sbjct: 131 KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 188
Query: 73 SH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKCV++C ++ ++ KHY Y + + I AE+YKNG
Sbjct: 189 EHHVPGNRLPCSGDTKTPKCVKECESGYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D YKSGVYKH+TGD +GGHA+K++GWG ++G YW++AN WN WG
Sbjct: 249 PVEGAFTVYADLLSYKSGVYKHVTGDALGGHAIKIMGWGV-ENGNKYWLIANSWNSDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +VAG P
Sbjct: 308 NGFFKILRGEDHCGIESSIVAGEP 331
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 120/208 (57%), Gaps = 20/208 (9%)
Query: 21 NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST- 70
N LS D+L CC F CGDGC+GGYPI AWRY+V +G+VT C PY +
Sbjct: 123 NTLLSAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPC 182
Query: 71 -----GCSHPGCEPAYP-TPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
G + P C TPKC C N + KH+ SAY I + I EI
Sbjct: 183 GETIDGVTWPECPMKISDTPKCEHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEI 242
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
+GPVEV F VYEDF YK+G+Y H+ G +GGHAVK++GWG D+G YW+ AN WN
Sbjct: 243 LAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGV-DNGTPYWLAANSWNT 301
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
WG GYF+I RG +ECGIE VAG+P
Sbjct: 302 VWGEKGYFRILRGVDECGIESAAVAGMP 329
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 132/218 (60%), Gaps = 20/218 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
T+R + S+ V + +S DL+ CC CG GC+GG+ AW Y+V++G+VT
Sbjct: 120 TDRICIHSNGKVKVH---ISAEDLMTCCT-SCGMGCNGGFLPQAWHYWVNNGIVTGGQYH 175
Query: 61 --EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
+ C PY + C H C PTPKC +KC N+ + KH+ +Y I
Sbjct: 176 SHKGCQPY-EIPKCEHHVKGPFKACGKELPTPKCSQKCQPGYNKTFNQDKHFGKKSYSIT 234
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
++ + I EI NGPVE +FTVY DF YKSGVY+H TG +GGHAVK++GWGT ++
Sbjct: 235 NNIQQIQKEIMMNGPVEAAFTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTENN-TP 293
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW++AN WN +WG GYFKI RG +ECGIE +VAG+P
Sbjct: 294 YWLIANSWNPTWGDKGYFKIIRGKDECGIESSIVAGMP 331
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 101/207 (48%), Positives = 125/207 (60%), Gaps = 21/207 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q +++S +DLL+CC CG GCDG P +AW Y+V +G+VT Y +GC +P
Sbjct: 29 QKVTISADDLLSCCD-ECGFGCDGRDPYAAWSYWVSNGIVTGS--NYTSKSGCKPYPYPP 85
Query: 77 CE-------------PAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIY 122
CE YPT C KC + NS KHY S Y + D I EI
Sbjct: 86 CEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQDVASIQKEIM 145
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
NGPVEV+F VYEDF HY SG+YKH TGD +GGHAVK++GWGT ++G DYWI AN WN
Sbjct: 146 TNGPVEVAFDVYEDFEHYSSGIYKHTTGDYLGGHAVKMLGWGT-ENGTDYWICANSWNSD 204
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
WG +G+F+I RG +EC IE VVAG P
Sbjct: 205 WGENGFFRILRGVDECEIESGVVAGEP 231
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/203 (46%), Positives = 125/203 (61%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------F 67
++ +S DLL CC CG GC+GGYP +AW ++ G+VT C PY
Sbjct: 129 SVEISSQDLLTCCDS-CGMGCNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEPCEH 187
Query: 68 DSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
G P TP+CV +C ++ KHY ++Y + S+ E I +EIYKNGP
Sbjct: 188 HVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGP 247
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F VYEDF YKSGVY+H+TG +GGHA+K+IGWG ++G YW+ AN WN WG +
Sbjct: 248 VEGAFIVYEDFPSYKSGVYQHVTGSALGGHAIKMIGWG-EENGVPYWLCANSWNTDWGDN 306
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RGSN CGIE +VVAG+P
Sbjct: 307 GFFKILRGSNHCGIESEVVAGIP 329
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/195 (48%), Positives = 117/195 (60%), Gaps = 10/195 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S +DLL+CCG CG+GC+GGYPI A R++ GVVT C PY + C+
Sbjct: 135 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CT 193
Query: 74 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C P TP C C + KH+ SAY + I EI NGPVE +FT
Sbjct: 194 SGSC-PESKTPACSLSCQSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFT 252
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVYKH G +GGHA+K+IGWGT + G YW++AN W SWG G+FKI
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIF 311
Query: 193 RGSNECGIEEDVVAG 207
RG ++CGIE VVAG
Sbjct: 312 RGDDQCGIESAVVAG 326
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY + C H
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHNT 205
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ TP C R C N + N K Y YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
KI RG NECGIE DV AG+P K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/205 (47%), Positives = 124/205 (60%), Gaps = 19/205 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+N +S DL +CC CG+GC+GG+P +AW Y+ G+VT + C PY C
Sbjct: 138 ENTHISAEDLTSCC-RTCGNGCEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPY-TIKAC 195
Query: 73 SH-------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P + PTPKC C N + KHY SAY ++ E IM EI N
Sbjct: 196 DHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKDKHYGSSAYSVHG-VEKIMTEIMTN 254
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVY DF YKSGVYKH TG +GGHA+K++GWGT ++G+DYW++AN WN WG
Sbjct: 255 GPVEGAFTVYADFPQYKSGVYKHTTGQPLGGHAIKILGWGT-ENGDDYWLVANSWNPDWG 313
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG +ECGIE + AG P
Sbjct: 314 DQGFFKILRGQDECGIESQISAGEP 338
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY + C H
Sbjct: 148 LSAENLVSCCS-SCGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ TP C R C N + N K Y YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
KI RG NECGIE DV AG+P K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 132/206 (64%), Gaps = 17/206 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CCG CG GC+GGYP AW+++ G+V+ C PY C
Sbjct: 130 NVEVSAEDLLSCCGDECGMGCNGGYPSGAWQFWTETGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H--PGCEPAYP-----TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G PA TPKCV++C + + + KH+ ++Y + + ++IMAEIYKNG
Sbjct: 189 HHVNGSRPACKGEEGDTPKCVKQCEEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VY DF YKSGVY+H TG+ +GGHA+K++GWG ++G YW+ AN WN WG
Sbjct: 249 PVEGAFLVYADFPLYKSGVYQHETGEELGGHAIKILGWGV-ENGTPYWLCANSWNTDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
+G+FKI RG + CGIE ++VAG+P +
Sbjct: 308 NGFFKILRGKDHCGIESEIVAGVPKN 333
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/210 (46%), Positives = 127/210 (60%), Gaps = 24/210 (11%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
+ + LS +DLL+CC CG GC GG P++AW+Y+V G+VT Y + +GC P
Sbjct: 170 KQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSGIVTG--SDYTNHSGCRPYPFPP 226
Query: 77 CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
CE YPTPKC ++C K + ++ K+Y AY + +D E I EI
Sbjct: 227 CEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIM 286
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
GPVE SF VY DF HY SG+YKH+ G V GGHAVK++GWG D G YW+ AN WN
Sbjct: 287 TLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNND 345
Query: 183 WGAD---GYFKIKRGSNECGIEEDVVAGLP 209
WG D GYF+I RG++ECGIE +VAG+P
Sbjct: 346 WGEDVFSGYFRILRGADECGIESGIVAGIP 375
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
D + + + LQ ++LS +DLL+CC CG GC+GG P++AWRY+V G+VT
Sbjct: 143 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 200
Query: 62 ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
C PY C H P YPTPKC +KCV ++ + K + SAY +
Sbjct: 201 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 259
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
D E I E+ +GP+E++F VYEDF +Y GVY H G + GGHAVKLIGWG DDG
Sbjct: 260 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 318
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
YW +AN WN WG DG+F+I RG +ECGIE VV G+P +L
Sbjct: 319 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 362
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 134/218 (61%), Gaps = 20/218 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ V ++ +S DLL+CC CG GCDGG+P SAW ++V G+ T
Sbjct: 34 SDRHCIHSNGKVKIE---VSPEDLLSCCS-SCGMGCDGGFPPSAWEFWVDKGIATGGLWN 89
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY + C H P C TPKCV C K N +R+ KH+ +Y I
Sbjct: 90 SHIGCQPY-EIPACEHHTTGDRPPCSDIVDTPKCVHLCEKGYNTSYRDDKHFGKKSYSIE 148
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S + I EI+KNGPVE +F+VY DF +YKSGVY+H +G+ +GGHA++++GWG +D
Sbjct: 149 SLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGVYQHHSGESLGGHAIRVLGWGYEND-VP 207
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG GYFKI RGS+ECGIE +VAG+P
Sbjct: 208 YWLCANSWNTDWGDKGYFKILRGSDECGIESSIVAGIP 245
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY + C H
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ TP C R C N + N K Y YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
KI RG NECGIE DV AG+P K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
D + + + LQ ++LS +DLL+CC CG GC+GG P++AWRY+V G+VT
Sbjct: 144 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 201
Query: 62 ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
C PY C H P YPTPKC +KCV ++ + K + SAY +
Sbjct: 202 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 260
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
D E I E+ +GP+E++F VYEDF +Y GVY H G + GGHAVKLIGWG DDG
Sbjct: 261 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 319
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
YW +AN WN WG DG+F+I RG +ECGIE VV G+P +L
Sbjct: 320 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 363
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/195 (48%), Positives = 117/195 (60%), Gaps = 10/195 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S +DLL+CCG CG+GC+GGYPI A R++ GVVT C PY + C+
Sbjct: 135 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CT 193
Query: 74 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C P TP C C + KH+ SAY + I EI NGPVE +FT
Sbjct: 194 SGSC-PESKTPACSLSCQPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFT 252
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVYKH G +GGHA+K+IGWGT + G YW++AN W SWG G+FKI
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTSWGESGFFKIF 311
Query: 193 RGSNECGIEEDVVAG 207
RG ++CGIE VVAG
Sbjct: 312 RGDDQCGIESAVVAG 326
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 127/203 (62%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ LS +L++CC CG GCDGGYP SAW Y+ + G+V+ + C PY + C
Sbjct: 131 QVHLSAENLVSCCDS-CGFGCDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAP-CE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP C +C K++ + + +Y SAY + + + I AEI KNGP
Sbjct: 189 HHVPGPRPACSGEGSTPDCRNQCDKRSGISYDKDLYYGESAYSLEDEAKQIQAEILKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVYED +YK GVY+H+ G V+GGHA+K++GWG +D YW++AN WN WG +
Sbjct: 249 VEAAFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVEND-TPYWLVANSWNTDWGNN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG +ECGIE DV AGLP
Sbjct: 308 GFFKILRGKDECGIEIDVSAGLP 330
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 127/203 (62%), Gaps = 17/203 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY + C H
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ TP C R C N + N K Y YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW++AN WN WG +GYF
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIANSWNTDWGDNGYF 324
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
KI RG NECGIE DV AG+P K
Sbjct: 325 KIIRGKNECGIESDVNAGIPKIK 347
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 126/203 (62%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+++S DLL CC CG GC+GGYP +AW+++ G+VT + C PY+ C
Sbjct: 13 QVNISAEDLLTCCD-SCGSGCNGGYPSAAWQFYKDEGIVTGGLYGTEDGCQPYYFPP-CE 70
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C PTP+C + C + + + KH+ Y I+SD I EI KNGP
Sbjct: 71 HHTVGPLPNCTGIKPTPECAKTCREGYEKSYTRDKHFGKKVYSISSDETQIKTEICKNGP 130
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE F VY DF YKSGVY+ + +++GGHA++++GWGT +DG YW++AN WN WG
Sbjct: 131 VEADFNVYADFPSYKSGVYQRHSKEMLGGHAIRILGWGT-EDGVPYWLVANSWNEDWGDK 189
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYFKI+RG++ECGIE D+ AG+P
Sbjct: 190 GYFKIRRGNDECGIENDINAGIP 212
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/224 (45%), Positives = 132/224 (58%), Gaps = 21/224 (9%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------E 61
D + + + LQ ++LS +DLL+CC CG GC+GG P++AWRY+V G+VT
Sbjct: 134 DRICIASHGELQ-VTLSADDLLSCCKS-CGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 191
Query: 62 ECDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRIN 111
C PY C H P YPTPKC +KCV ++ + K + SAY +
Sbjct: 192 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVK 250
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
D E I E+ +GP+E++F VYEDF +Y GVY H G + GGHAVKLIGWG DDG
Sbjct: 251 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 309
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
YW +AN WN WG DG+F+I RG +ECGIE VV G+P +L
Sbjct: 310 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLT 353
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/208 (47%), Positives = 127/208 (61%), Gaps = 17/208 (8%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
L LS +L+ACC CG GC+GG+P SAW Y+ G+VT + C PY +
Sbjct: 140 LHKPFLSAENLVACCS-SCGMGCNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPY-EFPP 197
Query: 72 CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
C H P C TPKC C N + K Y + YR++S+ E IM E+ +
Sbjct: 198 CEHHVVGPRPSCGGDVETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDH 257
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVEV F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++G YW++AN WN WG
Sbjct: 258 GPVEVDFEVYADFPNYKSGVYQHVSGGLLGGHAVRLLGWG-EENGVPYWLIANSWNSDWG 316
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSK 212
+GYFKI RG NECGIE DV AG+P K
Sbjct: 317 DNGYFKIIRGRNECGIESDVNAGIPKLK 344
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 123/203 (60%), Gaps = 18/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ + LS +D+L+CC + CGDGCDGGYPISAW YFV GVVT + C PY + C
Sbjct: 143 KTVELSADDILSCC-YDCGDGCDGGYPISAWEYFVETGVVTGGLYGTKDSCRPY-EIPPC 200
Query: 73 SHPGCEPAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
H E Y TP CV C + + + K + +Y I S I EI
Sbjct: 201 GHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTY 260
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV +F VYEDF HY G+YKH++G GGHAV+++GWG + G YW++AN WN WG
Sbjct: 261 GPVTAAFIVYEDFFHYHRGIYKHVSGGEEGGHAVRILGWG-EEKGTAYWLVANSWNTDWG 319
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RGSNECGIEE+VVAG
Sbjct: 320 ENGYFRILRGSNECGIEENVVAG 342
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/204 (47%), Positives = 128/204 (62%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
++ +S DLL CC CG GC+GGYP SAW ++ G+V+ C PY S C
Sbjct: 129 SVEISSEDLLTCCDS-CGMGCNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPYTISP-CE 186
Query: 74 H------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C TP+C+ +C + ++ KHY S+Y + E I AEI KNG
Sbjct: 187 HHVNGSRPPCTGEGGDTPECISRCEAGYSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNG 246
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVYEDF YKSGVY+H++G V+GGHA+K++GWG +DG YW+ AN WN WG
Sbjct: 247 PVEGAFTVYEDFVMYKSGVYQHVSGSVLGGHAIKVLGWG-EEDGIPYWLCANSWNTDWGD 305
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RGSN CGIE ++VAG+P
Sbjct: 306 NGFFKILRGSNHCGIESEIVAGIP 329
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/195 (48%), Positives = 116/195 (59%), Gaps = 10/195 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S +DLL+CCG CG+GC+GGYPI A R++ GVVT C PY C+
Sbjct: 171 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPY-PIAPCT 229
Query: 74 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C P TP C C + KH+ SAY + I EI NGPVE +FT
Sbjct: 230 SGNC-PESKTPSCSLSCQSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFT 288
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVYKH G +GGHA+K+IGWGT + G YW++AN W SWG G+F+I
Sbjct: 289 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGNSWGESGFFRIF 347
Query: 193 RGSNECGIEEDVVAG 207
RG ++CGIE VVAG
Sbjct: 348 RGDDQCGIESAVVAG 362
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 118/201 (58%), Gaps = 16/201 (7%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
+S+S DL CC + CGDGC+GG+P AW Y+ G+VT + C Y C H
Sbjct: 136 VSISTEDLNTCC-YECGDGCNGGWPAEAWAYWAETGIVTGGKYETKDGCKAY-TVPPCEH 193
Query: 75 ------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
P C PTP+C ++C + S SAY+ +SD I EI NGPVE
Sbjct: 194 HTEGDLPACGDIVPTPQCKKECDAGVDIEYKSDLRKGSAYQTSSDESQIQTEIMTNGPVE 253
Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
F VYEDF +YKSGVY+ TG+ GGHA+K++GWG +DG YW+ AN WN WG GY
Sbjct: 254 ADFDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGV-EDGTPYWLAANSWNEDWGDKGY 312
Query: 189 FKIKRGSNECGIEEDVVAGLP 209
FKI RG NECGIE D++ G+P
Sbjct: 313 FKILRGQNECGIESDIIGGIP 333
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 180 bits (457), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/203 (43%), Positives = 126/203 (62%), Gaps = 15/203 (7%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE------------ECDPYFDS 69
+ LS +DLL+CC CG GC+GG+P AW ++ H G+V+ E P
Sbjct: 151 VRLSADDLLSCCRD-CGMGCNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEHH 209
Query: 70 TGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
+ P CE PTPKC C ++ ++ ++ KHY++ Y ++S+ + I E+ +GPVE
Sbjct: 210 VNGTRPPCEGDAPTPKCKNVCQEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVE 269
Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
F VY DF YKSGVY+H++G ++GGHA+KL+GWG +DG YW+ AN WN WG G+
Sbjct: 270 ADFEVYADFPTYKSGVYQHVSGALLGGHAIKLMGWG-EEDGVPYWLCANSWNTDWGEGGF 328
Query: 189 FKIKRGSNECGIEEDVVAGLPSS 211
FKI RG N CGIE D+VAG+P +
Sbjct: 329 FKILRGKNHCGIESDIVAGIPQN 351
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 16/205 (7%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
+ +S +DLL+CCG CG GC+GG P +AWRY+ G+V+ C PY + C H
Sbjct: 135 VRISADDLLSCCGLFCGFGCNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPY-EIPPCEH 193
Query: 75 ------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
P C+ TPKC R+CV+ + ++ KH++ + Y + + EDIM EI GPV
Sbjct: 194 HTSGNRPDCKGNSKTPKCQRQCVESFDGKYQADKHFASNVYNVRASEEDIMNEILVYGPV 253
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E F VY DF YKSGVY+H+ G +GGHAVK++GWG ++G YW+ AN WN WG G
Sbjct: 254 EADFIVYADFLTYKSGVYQHVKGGFLGGHAVKILGWG-EENGVPYWLCANSWNTDWGDGG 312
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSK 212
+FKI RG N C IE D+ AG+P +
Sbjct: 313 FFKILRGYNHCKIEADINAGIPKIR 337
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 122/202 (60%), Gaps = 19/202 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
L+ D L+CC + CG GC GGYP AW Y++ G+VT C P+ T C H G
Sbjct: 139 LAAADPLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVG 196
Query: 77 -------CEP-AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C YPTP C R C N+ + K Y S+Y + IM EI KNGPV
Sbjct: 197 DSRKYSRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPV 256
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
EV+F +++DF Y+SG+Y H+ G +G HAV++IGWG ++G +YW++AN WN WG +G
Sbjct: 257 EVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWLMANSWNEEWGENG 315
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF++ RG NECGIE +VVAG+P
Sbjct: 316 YFRMVRGRNECGIESEVVAGMP 337
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 125/203 (61%), Gaps = 18/203 (8%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
+ L+ +D+L+CC CG GC+GG+P +AW Y+VH G+VT E C PY C H
Sbjct: 140 VHLAADDVLSCC-MSCGSGCNGGFPGAAWSYWVHKGIVTGGNYDSDEGCMPY-PIKACDH 197
Query: 75 -------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
P + PTP+CVR C K N + + KHY +Y + S+ I EI NGP
Sbjct: 198 HVNGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDKHYGKKSYSVPSNVTQIQVEIMTNGP 257
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE FTVY DF YKSGVY+ T +GGHA++L+GWG + G YW+ AN WN WG
Sbjct: 258 VEADFTVYADFPLYKSGVYQRHTDQALGGHAIRLLGWGV-EKGVPYWLAANSWNTEWGDK 316
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RGS+ECGIE+DVVAG+P
Sbjct: 317 GFFKILRGSDECGIEDDVVAGIP 339
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 125/208 (60%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
Q++ LS +L+ CC CG GCDGG+P +A Y+V++G+VT + C Y C
Sbjct: 141 QDIRLSTQNLVTCCD-ECGFGCDGGWPEAAMDYYVNNGLVTGDLYGNNSWCQAY-SLAPC 198
Query: 73 SH-------PGCEPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIY 122
+H P C PTP CV+ C + + H AY I+ + + IM EI
Sbjct: 199 AHHVTSDVYPPCTGELPTPPCVKSCDSNSTYTIPYPKDLHKGSKAYSIDQNEQAIMTEIQ 258
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
NGP+EV+FTVYEDF YKSGVY+H+TG +GGHAVK++GWG ++G YWI+ N WN S
Sbjct: 259 TNGPIEVAFTVYEDFLTYKSGVYQHVTGSELGGHAVKMVGWGV-ENGTPYWIIVNSWNES 317
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPS 210
WG G FKI RG NECGIE + V LP+
Sbjct: 318 WGDKGTFKILRGQNECGIESECVTALPA 345
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 179 bits (455), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 128/206 (62%), Gaps = 18/206 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
+N S +L++CC + CG GC+GG+P +AW Y+ G+V+ PY + GC
Sbjct: 140 KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAP 196
Query: 73 -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ C+ TP CV+KC + ++ + H+ SAY I +D + I EIY N
Sbjct: 197 CEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW++AN WN WG
Sbjct: 257 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDWG 316
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
+DG+FKI RGS+ECGIE + AGLP+
Sbjct: 317 SDGFFKILRGSDECGIEGQINAGLPA 342
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 179 bits (455), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 94/195 (48%), Positives = 118/195 (60%), Gaps = 10/195 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S +DLL+CCG CG+GC+GGYPI A R++ GVVT C PY + C+
Sbjct: 135 QQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAP-CT 193
Query: 74 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C P TP C C + + KH+ SAY + I EI NGPVE +FT
Sbjct: 194 SGNC-PESKTPACSLSCQSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFT 252
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVYKH G +GGHA+K+IGWGT + G YW++AN W +WG G+FKI
Sbjct: 253 VYEDFYKYKSGVYKHTAGKALGGHAIKIIGWGT-ESGSPYWLVANSWGTNWGESGFFKIL 311
Query: 193 RGSNECGIEEDVVAG 207
RG ++CGIE VVAG
Sbjct: 312 RGDDQCGIEGAVVAG 326
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 179 bits (454), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 101/222 (45%), Positives = 131/222 (59%), Gaps = 19/222 (8%)
Query: 4 TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
+ ++R + S +S++ LS +LL+CC CG GC GG P AW Y+ + G+VT
Sbjct: 51 SMSDRICIHSKNKISVE---LSAINLLSCCT-RCGFGCRGGIPGMAWDYWKYEGIVTGGS 106
Query: 61 ----EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSISAY 108
C PY S+ S+P CE Y PTP+C C + ++ K Y S+Y
Sbjct: 107 NETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGKSSY 166
Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
+ S+ IM EI NGPVE F VYEDF +YKSGVYKHITG +GGHA+++IGWG +
Sbjct: 167 NVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSGVYKHITGSYLGGHAIRIIGWGIQQN 226
Query: 169 GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
YW+ AN WN WG GYFKI RG+NECGIE V AGLP+
Sbjct: 227 HIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAGLPN 268
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 179 bits (454), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 125/204 (61%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V T+ C PY + C
Sbjct: 132 KHFHFSSEDLLSCCP-ICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPY-EIPPC 189
Query: 73 SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC + C N +++ K Y Y +++ + I AE+YKNG
Sbjct: 190 EHHVPGNRMPCSGDTKTPKCQKNCENGYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNG 249
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D YKSGVYKHI GD +GGHA+K++GWG +D + YW++AN WN WG
Sbjct: 250 PVEGAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNK-YWLVANSWNTDWGD 308
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG N CGIE ++AG P
Sbjct: 309 NGFFKILRGENHCGIEGSIIAGEP 332
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 179 bits (454), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 127/203 (62%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----D 68
N S +DL++CC + CG GC+GGYP +AW Y+V G+V+ + C PY
Sbjct: 127 NFHFSSDDLVSCC-WTCGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEH 185
Query: 69 STGCSHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
T S P C+ + TPKC + C ++ + N H+ AY I+SD + I AEI +NGP
Sbjct: 186 HTNGSRPACDASEGNTPKCAKSCESNYKINYSNDLHFGSKAYSISSDVKQIQAEILQNGP 245
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF +YK+GVY+HI G +GGHA+++ GWG ++ YW++AN WN WG
Sbjct: 246 VEGAFSVYADFVNYKTGVYQHIKGQFLGGHAIRIFGWGVENN-TPYWLIANSWNTDWGDS 304
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI RGS+ CGIE +VAGLP
Sbjct: 305 GTFKILRGSDHCGIESGIVAGLP 327
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/204 (46%), Positives = 128/204 (62%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL CC CG GC+GGYP +AW ++ G+V+ C PY + C
Sbjct: 129 NVEISSEDLLTCCDS-CGMGCNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAP-CE 186
Query: 74 H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C TP+CVR+C + KHY ++Y + SD + I EIYKNG
Sbjct: 187 HHVNGSRPPCTGEGGDTPECVRQCESGYTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNG 246
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVYEDF YK+GVY+H++G +GGHA+K++GWG ++G YW+ AN WN WG
Sbjct: 247 PVEGAFTVYEDFLLYKTGVYQHVSGSAVGGHAIKVLGWG-EENGTPYWLCANSWNTDWGD 305
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+GYFKI RGS+ CGIE ++VAG+P
Sbjct: 306 NGYFKILRGSDHCGIESEIVAGIP 329
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 93/210 (44%), Positives = 126/210 (60%), Gaps = 17/210 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ +++S DL+ CC CG GCDGG+ I AW YF + G+V+ C PY C
Sbjct: 134 KQVNISATDLVTCCTPTCGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPY-PIHPC 192
Query: 73 SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C TP C +KC +L+R K Y A+++ E I E+ KN
Sbjct: 193 GHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKN 252
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV SF VYEDF+ YKSG+Y+H G++ G HAVK+IGWGT ++ DYW++AN W+ WG
Sbjct: 253 GPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGT-ENRTDYWLIANSWHDDWG 311
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
+GYF+I RG N+CGIEE+V AGL ++L
Sbjct: 312 ENGYFRIIRGINDCGIEENVAAGLIDVESL 341
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 134/222 (60%), Gaps = 20/222 (9%)
Query: 4 TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
+ ++R + S +S++ LS +LL+CC CG GC+GG P AW Y+ G+VT
Sbjct: 128 SMSDRICIHSKGRISIE---LSAVNLLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGS 183
Query: 61 ----EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAY 108
C PY ST +H CE Y TP+C + C + + N K+Y S+Y
Sbjct: 184 NETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSY 243
Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD- 167
+ SD IM EI NGPVE +F V++DF +YK+GVYK++TG ++GGHA+++IGWG S
Sbjct: 244 YVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTGVYKYVTGSLLGGHAIRIIGWGVSTL 303
Query: 168 DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+ YW+ AN WN+ WG GYFKI RGSNECGIE V AGLP
Sbjct: 304 NHTPYWLCANSWNKQWGDKGYFKILRGSNECGIESMVTAGLP 345
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/202 (47%), Positives = 127/202 (62%), Gaps = 24/202 (11%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGCEPAYP 82
+S ++LL+CC F+CG GC GG P AW ++V G+ TE+C PY FD CSH G YP
Sbjct: 150 MSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATEDCQPYPFDP--CSHHGNSEKYP 206
Query: 83 --------TPKCVRKCVKKNQL----WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
TPKC C ++N++ ++ S YS+ + ++M E+ NGP+E++
Sbjct: 207 PCPSTIYDTPKCNTTC-ERNEMDLVKYKGSTSYSVKGEK------ELMIELMTNGPLELT 259
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
VY DF YKSGVYKH+ GD +GGHAVKL+GWGT DG YW +AN WN WG GYF
Sbjct: 260 MQVYSDFVGYKSGVYKHVLGDFLGGHAVKLVGWGT-QDGVPYWKVANSWNTDWGDKGYFL 318
Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
I+RG+NEC IE VAG+P+ +
Sbjct: 319 IQRGNNECKIESGGVAGIPAQE 340
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 179 bits (453), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 123/200 (61%), Gaps = 17/200 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
LS DLL CC CG GC+GG+P AW +F GV T + C+ Y + C H
Sbjct: 122 LSDQDLLTCCE-SCGFGCNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAY-EFPKCDHHV 179
Query: 75 ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C PTP+CV KC + + ++ KH+ AY + S+ E I E+ NGP+EV
Sbjct: 180 EGKYPPCGETQPTPECVEKCQEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEV 239
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F+VYEDF YKSG+Y+H+ G +GGHAVKL+GWG +DG +YW +AN WN WG +GYF
Sbjct: 240 DFSVYEDFMTYKSGIYQHVAGKYLGGHAVKLVGWGV-EDGVEYWKIANSWNEDWGENGYF 298
Query: 190 KIKRGSNECGIEEDVVAGLP 209
+I G NECGIE D VAG+P
Sbjct: 299 RIIAGKNECGIESDGVAGIP 318
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 123/200 (61%), Gaps = 15/200 (7%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSHPG 76
SLS DL++CCG+ CG GC GGYP +AW ++ +G+VT + DP + CSH G
Sbjct: 132 SLSSIDLVSCCGY-CGFGCQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHG 190
Query: 77 CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
+ Y TPKCV KC N + K + Y + IM EI NGPVE
Sbjct: 191 SKKYPPCPHRIYDTPKCVPKCDTPNIDYETDKTRANITYNVQRSQMAIMKEIMINGPVEA 250
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+F VYEDF YK GVY H TG+ +GGHA++++GWG ++G YW++AN WN WG DGYF
Sbjct: 251 AFEVYEDFFGYKQGVYFHSTGEFIGGHAIRILGWG-EENGTPYWLIANSWNEGWGEDGYF 309
Query: 190 KIKRGSNECGIEEDVVAGLP 209
K+ RG NECGIE++V AGLP
Sbjct: 310 KMLRGKNECGIEDEVTAGLP 329
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 125/202 (61%), Gaps = 16/202 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
QN+ LS DLL CC CG GC+GG AW Y+V G+VT C+PY
Sbjct: 138 QNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCE 196
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C +K + + KH S+Y + +D + I EI K G
Sbjct: 197 HHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++ YW++AN WN WG
Sbjct: 257 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RG +EC IE +V+AG
Sbjct: 316 NGYFRIVRGRDECSIESEVIAG 337
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 128/205 (62%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+N+++S +LL+CC + CG GC+GG+P +AWR++ + G+V+ + C PY C
Sbjct: 129 KNVNISAENLLSCC-YTCGFGCNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEP-C 186
Query: 73 SH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKN 124
H C TPKC + C KN K S S+Y I SDP+ I +I N
Sbjct: 187 EHHVNGTRKPCAEGGRTPKCHKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTN 246
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +F+VY DF YKSGVY+H+ G ++GGHA++++GWG + G YW++AN WN WG
Sbjct: 247 GPVEAAFSVYSDFMSYKSGVYRHVKGSLLGGHAIRILGWGM-EKGTPYWLVANSWNTDWG 305
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+G FKI RGS+ CGIE+ VVAGLP
Sbjct: 306 DNGTFKILRGSDHCGIEDSVVAGLP 330
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 122/202 (60%), Gaps = 19/202 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
L+ D L+CC + CG GC GGYP AW Y++ G+VT C P+ T C H G
Sbjct: 47 LAAADPLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVG 104
Query: 77 -------C-EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C YPTP C R C N+ + K Y S+Y + IM EI KNGPV
Sbjct: 105 DSRKYSRCPHYTYPTPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPV 164
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
EV+F +++DF Y+SG+Y H+ G +G HAV++IGWG ++G +YW++AN WN WG +G
Sbjct: 165 EVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYWLMANSWNEEWGENG 223
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF++ RG NECGIE +VVAG+P
Sbjct: 224 YFRMVRGRNECGIESEVVAGMP 245
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 125/200 (62%), Gaps = 18/200 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
+S N+LLACC CGDGC+GGYP +AW F H GVVT + C PY + C H
Sbjct: 12 VSANELLACC-ESCGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAA-CDHHV 69
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C+ TP+C +KC N +++ KHY +Y ++S DIM E+ GPVE
Sbjct: 70 VGKLKPCKGDGKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-VNDIMEELVTRGPVEA 128
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+FTVY DF Y SGVY+H TG +GGHAVK++G+G ++G+ YW++AN WN WG G+F
Sbjct: 129 AFTVYSDFLQYHSGVYRHTTGSALGGHAVKILGYGV-ENGDKYWLVANSWNPDWGDQGFF 187
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KI RG +ECGIE +VAG P
Sbjct: 188 KILRGVDECGIEGQIVAGEP 207
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/199 (46%), Positives = 124/199 (62%), Gaps = 14/199 (7%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSH 74
+LS L CC + CG+GCDGG P SAW +F+ HG+VT + C PY G
Sbjct: 143 NLSAEQLNTCC-YRCGNGCDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPCGKGR 201
Query: 75 PGCEPAYP-TPKC-VRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
C P TP C ++ C N + +R HY + Y ++ EDIM ++YKNGPV+ +
Sbjct: 202 NTCIEDDPDTPDCSIKTCTNSNYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKNGPVQAA 261
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF +YKSGVY + G + GGHA+K++GWG DDG YW+ AN W+RSWG +G F+
Sbjct: 262 FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGV-DDGTKYWLCANSWSRSWGENGLFR 320
Query: 191 IKRGSNECGIEEDVVAGLP 209
I RG+NEC IE+ V+AG+P
Sbjct: 321 ILRGNNECHIEDRVIAGMP 339
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 132/218 (60%), Gaps = 19/218 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL CC CG GC+GGYP +AW ++ G+VT
Sbjct: 117 SDRVCIHSNARVSVE---ISSEDLLTCCES-CGMGCNGGYPTAAWDFWTKEGLVTGGLYD 172
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY G P TP+C+ +C ++ KHY ++Y +
Sbjct: 173 SHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCINQCESGYTPSYKKDKHYGKTSYSVE 232
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
++ I EIYKNGPVE +F VYEDF YKSGVY+H++G ++GGHA+K++GWG +DG
Sbjct: 233 ANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGVYQHVSGSLIGGHAIKILGWGV-EDGVP 291
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +GYFKI RGS+ CGIE +VVAG+P
Sbjct: 292 YWLCANSWNTDWGDNGYFKILRGSDHCGIESEVVAGIP 329
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/229 (43%), Positives = 135/229 (58%), Gaps = 23/229 (10%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 68
D + + + LQ +SLS +DLL+CC CG GC+GG P++AWRY+V G+VT Y
Sbjct: 145 DRICIASHGELQ-VSLSADDLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGS--NYTA 200
Query: 69 STGCS---HPGCE-------------PAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRI 110
++GC P CE YPTPKC +KC+ ++ + K Y SAY +
Sbjct: 201 NSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGV 260
Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
D E I E+ +GP+E++F VYEDF +Y GVY H G + GGHAVKLIGWG +DG
Sbjct: 261 KDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGI 319
Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEIT 219
YW AN WN WG DG+F+I RG +ECGIE VV G+P ++ ++
Sbjct: 320 PYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSVSSRLS 368
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/214 (44%), Positives = 124/214 (57%), Gaps = 20/214 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+SLS +DLL+CC CG GCDGG P++AW+Y+V G+VT + C PY C
Sbjct: 130 QVSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PFPPCE 187
Query: 74 H--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
H P YPTPKC +KC + + + K + +AY + D I EI
Sbjct: 188 HHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILT 247
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVEV+F VYEDF Y G+Y H G + GGHAVK++GWG + G YW++AN WN W
Sbjct: 248 HGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWLVANSWNTDW 306
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
G DG+F+I RG +ECGIE VV GLP K+
Sbjct: 307 GEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKK 340
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 122/204 (59%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V+ + C PY C
Sbjct: 12 KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHMGLVSGGNYNSSQGCSPYVIPP-C 69
Query: 73 SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC + C N L++ K Y Y + + I AE++KNG
Sbjct: 70 EHHVPGNRLPCNGDTKTPKCSKTCENGYNVLYKKDKRYGKHVYAVRGGEDHIKAELFKNG 129
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D YKSGVYKH+ GD +GGHA+K+IGWG ++G YW++AN WN WG
Sbjct: 130 PVEAAFTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGV-ENGNKYWLIANSWNTDWGN 188
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +VAG P
Sbjct: 189 NGFFKILRGEDHCGIESSIVAGEP 212
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 97/216 (44%), Positives = 130/216 (60%), Gaps = 18/216 (8%)
Query: 9 DALSSSPYVSLQ-NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------- 60
+A+S VS Q N+ +S +L+ CC F CG+GC GG+ AW Y+V G+VT
Sbjct: 117 EAMSDRYCVSFQENVHISAENLMTCCKF-CGNGCAGGFLQQAWEYWVKDGLVTGGQYGSD 175
Query: 61 EECDPYFDSTGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSD 113
E C PY C+H PG C TP+C R C + HY AY ++ +
Sbjct: 176 EGCQPYLIPK-CNHHEPGPYENCTGEGKTPQCERTCRSGYTTSYEADLHYGEKAYAVHRE 234
Query: 114 PEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
E I EI NGPVE +FTVY DF YKSGVY+H+ G +GGHA++++GWGT ++G YW
Sbjct: 235 VEAIQTEIMTNGPVEGAFTVYSDFPTYKSGVYQHVVGHALGGHAIRILGWGT-ENGVPYW 293
Query: 174 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
++AN WN SWG GYFK+ RG ++CGIE ++VAG P
Sbjct: 294 LIANSWNPSWGDKGYFKMIRGKDDCGIESNIVAGTP 329
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 96/219 (43%), Positives = 135/219 (61%), Gaps = 21/219 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL CC CG GC+GGYP +AW ++ G+V+
Sbjct: 117 SDRVCIHSNAKVSVE---ISSEDLLTCC-MSCGMGCNGGYPSAAWDFWTKEGLVSGGLYD 172
Query: 63 ----CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRI 110
C PY + C H P C TP+C+ KC ++ KH+ ++Y +
Sbjct: 173 SHIGCRPYTIAP-CEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSYKEDKHFGKTSYTV 231
Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
SD E I +EI+KNGPVE +F VYEDF YKSGVY+H++G +GGHA+K++GWG +DG
Sbjct: 232 LSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGVYQHVSGSAVGGHAIKILGWGV-EDGV 290
Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +G+FK RGS+ CGIE +VVAG+P
Sbjct: 291 PYWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAGIP 329
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 124/204 (60%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q+ S DLL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C
Sbjct: 133 QHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 190
Query: 73 SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC + C N +R K Y + ++S + I AE++KNG
Sbjct: 191 EHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNG 250
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D +YK+GVYKH GD +GGHAVK++GWG ++G YW++AN WN WG
Sbjct: 251 PVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWLIANSWNSDWGD 309
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +VAG P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/214 (44%), Positives = 124/214 (57%), Gaps = 20/214 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+SLS +DLL+CC CG GCDGG P++AW+Y+V G+VT + C PY C
Sbjct: 171 QVSLSADDLLSCCK-SCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPY-PFPPCE 228
Query: 74 H--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
H P YPTPKC +KC + + + K + +AY + D I EI
Sbjct: 229 HHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILT 288
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVEV+F VYEDF Y G+Y H G + GGHAVK++GWG + G YW++AN WN W
Sbjct: 289 HGPVEVAFEVYEDFLMYDGGIYVHTGGKIGGGHAVKMLGWGV-EQGVPYWLVANSWNTDW 347
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKE 217
G DG+F+I RG +ECGIE VV GLP K+
Sbjct: 348 GEDGFFRIIRGIDECGIESSVVGGLPKLNRTYKK 381
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 126/202 (62%), Gaps = 16/202 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
QN+ LS DLL+CC CG GC+GG AW ++V G+VT C+PY
Sbjct: 143 QNVELSAVDLLSCCES-CGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCE 201
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C KK + + KH S+Y + +D + I EI K G
Sbjct: 202 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 261
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++ YW++AN WN WG
Sbjct: 262 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 320
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RG +EC IE +V+AG
Sbjct: 321 NGYFRIVRGRDECFIESEVIAG 342
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 130/218 (59%), Gaps = 21/218 (9%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE------ 62
D + + + LQ +SLS +DLL+CC CG GC+GG P++AWRY+V G+VT
Sbjct: 159 DRICIASHGELQ-VSLSADDLLSCC-RSCGFGCNGGDPLAAWRYWVKDGIVTGSNFTANS 216
Query: 63 -CDPYFDSTGCSH--------PGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRIN 111
C PY C H P YPTPKC ++C + ++ + K Y SAY +
Sbjct: 217 GCKPY-PFPPCEHHSKKTHFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVK 275
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
D E I E+ +GP+E++F VYEDF +Y GVY H G + GGHAVKLIGWG +DG
Sbjct: 276 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-EDGIP 334
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW +AN WN WG DG+F+I RG +ECGIE VV G+P
Sbjct: 335 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIP 372
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 133/218 (61%), Gaps = 19/218 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL CC CG GC+GGYP +AW ++ G+VT
Sbjct: 117 SDRVCIQSNAKVSVE---ISSQDLLTCCDS-CGMGCNGGYPSAAWDFWTTDGLVTGGLYN 172
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY G P TP C KC + L++ KH+ ++Y +
Sbjct: 173 SHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYKEDKHFGKTSYSVP 232
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S+ IMAE++KNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG ++G
Sbjct: 233 SNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGGHAIKILGWG-EENGVP 291
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 292 YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 124/204 (60%), Gaps = 16/204 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
+++ LS DLL+CC CG GC GG+P +AW Y+V G+VT C PY
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEEGIVTGSSKENHTGCQPYPFPKCE 197
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C E Y TPKC +KC K + ++ K+Y +Y + S + I EI +G
Sbjct: 198 HHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYKKDKYYGKLSYNVLSKEDAIKKEIMMHG 257
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY DF +YKSG+YKH+ G V+GGHAV++IGWG + YW++AN WN WG
Sbjct: 258 PVEAAFTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWGE 316
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
GYF+I RG + CGIE V AGLP
Sbjct: 317 KGYFRILRGKDVCGIESAVTAGLP 340
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 124/204 (60%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q+ S DLL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C
Sbjct: 133 QHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 190
Query: 73 SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC + C N +R K Y + ++S + I AE++KNG
Sbjct: 191 EHHVPGNRMPCNGDSKTPKCEKTCESNYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNG 250
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D +YK+GVYKH GD +GGHAVK++GWG ++G YW++AN WN WG
Sbjct: 251 PVEGAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGV-ENGNKYWLIANSWNSDWGD 309
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +VAG P
Sbjct: 310 NGFFKILRGEDHCGIESSIVAGEP 333
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 128/206 (62%), Gaps = 18/206 (8%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
L N+ +S DLL+CC CG GC+GGYP +AW ++ G+V+ C PY +
Sbjct: 127 LMNVEISAEDLLSCCDS-CGMGCNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAP- 184
Query: 72 CSH------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C H P C TP+C +KC + KHY +Y ++ ++I EIYK
Sbjct: 185 CEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYK 244
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPVE +FTVYEDF YK+GVY+H+TG +GGHA+K++GWG ++G YW+ AN WN W
Sbjct: 245 NGPVEGAFTVYEDFLLYKTGVYQHVTGSAVGGHAIKVLGWG-EENGTPYWLCANSWNTDW 303
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
G +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 304 GDNGFFKILRGSDHCGIESEIVAGIP 329
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 124/202 (61%), Gaps = 16/202 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
QN+ LS DLL+CC CG GC+GG AW Y+V G+VT C+PY
Sbjct: 138 QNVELSAVDLLSCCES-CGLGCEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCE 196
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C KK + + KH S+Y + +D + I EI K G
Sbjct: 197 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE FTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++ YW++AN WN WG
Sbjct: 257 PVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RG +EC IE +V AG
Sbjct: 316 NGYFRIVRGRDECSIESEVTAG 337
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 93/218 (42%), Positives = 133/218 (61%), Gaps = 20/218 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL+CC CG GC GG+P +AW Y+ G+VT
Sbjct: 117 SDRYCIHSNGKVSVE---ISAEDLLSCCD-ACGMGCMGGFPSAAWDYWAESGLVTGGLYG 172
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRIN 111
C PY + C H P C TPKCV +C ++ K + Y +
Sbjct: 173 SNIGCRPYSIAP-CEHHVNGTRPPCTGEGDTPKCVSECNAGYTPSYKKDKRFGKQTYSVP 231
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ IM E+YKNGPVE +F+VYEDF YK+GVY+H+TG ++GGHA+K++GWG ++
Sbjct: 232 PKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGVYQHVTGQMLGGHAIKILGWG-KENNTP 290
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW++AN WN WG +G+FKI RG +ECGIE ++VAG+P
Sbjct: 291 YWLVANSWNTDWGDNGFFKILRGKDECGIESEIVAGIP 328
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 124/202 (61%), Gaps = 16/202 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ LS +L++CC CG GCDGG+P SAW Y+ + G+V+ + C PY + C
Sbjct: 131 QVHLSAENLVSCCD-SCGYGCDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAP-CE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
H P C TP C +C + + + + HY + + I AEI KNGPV
Sbjct: 189 HHVPGSRPACSGGGDTPDCRNQCDEGSGISYDQDHYYGETVYTLDEAKQIQAEILKNGPV 248
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTVYED +YK GVY+H+ G+ +GGHA+K++GWG +D YW++AN WN WG +G
Sbjct: 249 EAAFTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVEND-TPYWLVANSWNTDWGNNG 307
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
+FKI RGS+ECGIE+ +VAGLP
Sbjct: 308 FFKILRGSDECGIEDQIVAGLP 329
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 90/206 (43%), Positives = 126/206 (61%), Gaps = 18/206 (8%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
S + +S +D+L+CCG CG GC GG+PI A+++ GVVT + C PY
Sbjct: 136 STIRVMISDSDILSCCGISCGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPY-AFY 194
Query: 71 GCSHPGCEPAY--------PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEI 121
C H +P Y PTPKC + C +K N+ ++ KH++ AY + ++ +I EI
Sbjct: 195 PCGHHQNDPYYGPCPGGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNERNIRQEI 254
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
YKNGPV +F VY+DF++YK G+Y H G G HAVK++GWG ++ DYW++AN WN
Sbjct: 255 YKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVGWG-RENATDYWLIANSWNT 313
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
WG GYF+I RG+NECGIE +V G
Sbjct: 314 DWGESGYFRIVRGTNECGIEAQMVGG 339
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 98/213 (46%), Positives = 126/213 (59%), Gaps = 24/213 (11%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
+ + LS +DLL+CC CG GC GG P++AW+Y+V G+VT Y + +GC P
Sbjct: 185 KQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSGIVTG--SDYTNHSGCRPYPFPP 241
Query: 77 CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
CE YPTPKC R+C K + ++ K+Y AY + +D E I EI
Sbjct: 242 CEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKADKYYGEQAYNVENDVELIQKEIM 301
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
GPVE SF VY DF HY G+YKH+ G V GGHAVK++GWG D G YW+ AN WN
Sbjct: 302 TLGPVEASFEVYTDFLHYIGGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNTD 360
Query: 183 WGAD---GYFKIKRGSNECGIEEDVVAGLPSSK 212
WG D GYF+I RG +ECGIE +VAG+P +
Sbjct: 361 WGEDVFSGYFRILRGVDECGIESGIVAGIPRKE 393
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 177 bits (448), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 98/219 (44%), Positives = 131/219 (59%), Gaps = 23/219 (10%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD 68
D + + + LQ +SLS +DLL+CC CG GC+GG P++AWRY+V G+VT Y
Sbjct: 144 DRICIASHGELQ-VSLSADDLLSCCRS-CGFGCNGGDPLAAWRYWVKDGIVTGS--NYTA 199
Query: 69 STGCS---HPGCE-------------PAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRI 110
++GC P CE YPTPKC +KC+ ++ + K Y SAY +
Sbjct: 200 NSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGV 259
Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
D E I E+ +GP+E++F VYEDF +Y GVY H G + GGHAVKL+GWG ++G
Sbjct: 260 KDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLVGWGI-ENGI 318
Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW AN WN WG DG+F+I RG +ECGIE VV G+P
Sbjct: 319 PYWTCANSWNTDWGEDGFFRILRGVDECGIESGVVGGVP 357
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 95/197 (48%), Positives = 121/197 (61%), Gaps = 14/197 (7%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
+S ++LL+CC F+CG GC GG P AW ++V G+ TE C PY CSH G YP
Sbjct: 155 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 212
Query: 83 -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
TPKC C K K+ ++Y + + E +M E+ NGP+EV+ VY
Sbjct: 213 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 269
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSGVYKH++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGS
Sbjct: 270 DFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 328
Query: 196 NECGIEEDVVAGLPSSK 212
NECGIE VAG P+ +
Sbjct: 329 NECGIESGGVAGTPAQE 345
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
Q +S DLL+CC +CG GC GG P AW ++V +G+VT + C PY
Sbjct: 151 QKPHISSTDLLSCCK-ICGFGCQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCN 209
Query: 67 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNG 125
S G P PTP C + C ++ N K+Y + AY +++ D+ E+ NG
Sbjct: 210 HHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYNKDKYYGLKAYSLHNKASDLQKELMMNG 269
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
P+EV+F VYEDF YK+GVY+H TG V+GGHAV+L+GWG ++G YW+LAN WN WG
Sbjct: 270 PMEVAFEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWG-EENGVPYWLLANSWNTEWGD 328
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G+FKI RG NECGIE + VAGL
Sbjct: 329 KGFFKIYRGRNECGIESEAVAGL 351
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 95/197 (48%), Positives = 121/197 (61%), Gaps = 14/197 (7%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
+S ++LL+CC F+CG GC GG P AW ++V G+ TE C PY CSH G YP
Sbjct: 150 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 207
Query: 83 -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
TPKC C K K+ ++Y + + E +M E+ NGP+EV+ VY
Sbjct: 208 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 264
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSGVYKH++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGS
Sbjct: 265 DFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 323
Query: 196 NECGIEEDVVAGLPSSK 212
NECGIE VAG P+ +
Sbjct: 324 NECGIESGGVAGTPAQE 340
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 93/198 (46%), Positives = 120/198 (60%), Gaps = 10/198 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +SV D+L+CCG CG GC GGY I A R++ +G VT C PY +
Sbjct: 142 QQPIISVEDILSCCGTTCGKGCQGGYSIEAMRFWKSNGAVTGGDYNGNGCMPYSFAPCQK 201
Query: 74 HPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI---NSDPEDIMAEIYKNGPVEVS 130
P E PT K + + KHY SAYR+ N+ I EIY NGPVE S
Sbjct: 202 SPCVESTTPTCKTTCQSSYTTANYTTDKHYGTSAYRLATTNNVVSTIQYEIYHNGPVEAS 261
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
+ VYEDF YKSGVY +++G ++GGHAVK+IGWGT +D DYW++AN W +G G+FK
Sbjct: 262 YKVYEDFYQYKSGVYHYVSGKLVGGHAVKIIGWGTEND-VDYWLVANSWGIKFGEGGFFK 320
Query: 191 IKRGSNECGIEEDVVAGL 208
I+RG+NEC IE +VVAG+
Sbjct: 321 IRRGTNECQIESNVVAGV 338
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 88/197 (44%), Positives = 126/197 (63%), Gaps = 16/197 (8%)
Query: 36 LCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYP 82
+CGDGC+GGYP AW ++ G+V+ C PY C H P C
Sbjct: 1 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGD 59
Query: 83 TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
TPKC + C + ++ KHY ++Y +++ + IMAEIYKNGPVE +F+VY DF YK
Sbjct: 60 TPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPVEGAFSVYSDFLLYK 119
Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
SGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI RG + CGIE
Sbjct: 120 SGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIE 178
Query: 202 EDVVAGLPSSKNLVKEI 218
+VVAG+P + ++I
Sbjct: 179 SEVVAGIPRTDQYWEKI 195
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 95/197 (48%), Positives = 121/197 (61%), Gaps = 14/197 (7%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
+S ++LL+CC F+CG GC GG P AW ++V G+ TE C PY CSH G YP
Sbjct: 150 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 207
Query: 83 -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
TPKC C K K+ ++Y + + E +M E+ NGP+EV+ VY
Sbjct: 208 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 264
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSGVYKH++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGS
Sbjct: 265 DFVGYKSGVYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 323
Query: 196 NECGIEEDVVAGLPSSK 212
NECGIE VAG P+ +
Sbjct: 324 NECGIESGGVAGTPAQE 340
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 121/202 (59%), Gaps = 15/202 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPY------F 67
N SLS DLL+CC CGDGCDGG+P AW ++ HG+VT EE C PY
Sbjct: 136 NKSLSAVDLLSCCK-DCGDGCDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQH 194
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S G P YPTPKCV+ C ++ K + ++Y ++ IM EI NGPV
Sbjct: 195 HSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPV 254
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +F V+EDF YKSG+Y H G +GGHA++++GWG ++G YW++AN WN WG G
Sbjct: 255 EATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWG-EENGVPYWLIANSWNEDWGEKG 313
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
Y + RG NECGIEE+ AGLP
Sbjct: 314 YLRFLRGHNECGIEEEATAGLP 335
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 91/197 (46%), Positives = 120/197 (60%), Gaps = 10/197 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q ++S D+LACCG CGDGC+GGYPI A+R++ GVVT C PY C+
Sbjct: 85 QQPTISPTDMLACCGRSCGDGCEGGYPIQAFRWWNSRGVVTGGDFRGSGCRPY-PFAPCN 143
Query: 74 HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C P TP C C + + K + +SAY + + I EI NGPV +FT
Sbjct: 144 SYKC-PEEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFT 202
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
+YED YKSGVY+H G ++GGHA+K+IGWGT +G YW++AN W WG +G+ K++
Sbjct: 203 MYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGADWGENGFLKMR 261
Query: 193 RGSNECGIEEDVVAGLP 209
RG NECGIE VVAG+P
Sbjct: 262 RGVNECGIESAVVAGMP 278
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/62 (58%), Positives = 46/62 (74%), Gaps = 1/62 (1%)
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPVE SFTVYEDF YK GVY++ G V+G HA+K++GWGT + G DYW++AN W
Sbjct: 3 NGPVEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGT-EHGTDYWLIANSWGAQC 61
Query: 184 GA 185
G+
Sbjct: 62 GS 63
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 176 bits (446), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 92/198 (46%), Positives = 120/198 (60%), Gaps = 11/198 (5%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTG 71
+ Q +SV D+L+CCG CG GC GGY I A R++ G VT C PY
Sbjct: 144 ATQTPIISVEDILSCCGVSCGKGCQGGYSIEALRFWKSSGAVTGGDYNGAGCMPY-SFAP 202
Query: 72 CSHPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C C TP C C K + KH+ +AY+I + I EIY NGPVE
Sbjct: 203 CKKDSCAQG-TTPSCKTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEA 261
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
SF VYEDF YKSGVY++ +G ++GGHAVK+IGWGT ++G DYW++AN W ++G G+F
Sbjct: 262 SFKVYEDFYKYKSGVYQYTSGKLVGGHAVKIIGWGT-ENGVDYWLIANSWGTTFGDSGFF 320
Query: 190 KIKRGSNECGIEEDVVAG 207
K++RG+NE GIE +VVAG
Sbjct: 321 KMRRGTNEVGIEGNVVAG 338
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 176 bits (446), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 93/198 (46%), Positives = 123/198 (62%), Gaps = 16/198 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGCEPAYP 82
+S ++LL+CC F+CG GC GG P AW ++V G+ TE+C PY FD CSH G YP
Sbjct: 150 MSTSNLLSCC-FICGLGCHGGIPTVAWLWWVWVGIATEDCQPYPFDP--CSHHGNSEKYP 206
Query: 83 --------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
TPKC C + K+ ++Y + + E +M E+ NGP+E++ VY
Sbjct: 207 PCPSTIYDTPKCNTTCERSEM--DLVKYKGSTSYSVKGEKE-LMIELMTNGPLELTMQVY 263
Query: 135 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
DF YKSGVYKH+ G+ +GGHAVKL+GWGT DG YW +AN WN WG GYF I+RG
Sbjct: 264 SDFVGYKSGVYKHVLGEFLGGHAVKLVGWGT-QDGVPYWKVANSWNTDWGDKGYFLIQRG 322
Query: 195 SNECGIEEDVVAGLPSSK 212
+NEC IE VAG+P+ +
Sbjct: 323 NNECKIESGGVAGIPAQE 340
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 176 bits (446), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 90/197 (45%), Positives = 119/197 (60%), Gaps = 12/197 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q ++S D+LACCG CGDGC GGYPI A+R++ GVVT C PY + S
Sbjct: 130 QQPTISPTDMLACCGNSCGDGCKGGYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCIS 189
Query: 74 HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
P TP C C + + K + +SAY + + I EI NGPV +FT
Sbjct: 190 CP----EEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFT 245
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
+YED YKSGVY+H G ++GGHA+K+IGWGT +G YW++AN W +WG +G+ K++
Sbjct: 246 MYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMR 304
Query: 193 RGSNECGIEEDVVAGLP 209
RG NECGIE VVAG+P
Sbjct: 305 RGVNECGIERAVVAGMP 321
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 176 bits (445), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 98/217 (45%), Positives = 129/217 (59%), Gaps = 20/217 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +SV D+L+CCG CG GC GGY I A R++ G VT C PY S
Sbjct: 141 QQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPC 198
Query: 74 HPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPE--DIMAEIYKNGPVEV 129
C P TP C C K + ++ KHY SAY++ + +I EIY GPVE
Sbjct: 199 TKNC-PESTTPSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEA 257
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG ++G DYW++AN W S+G G+F
Sbjct: 258 SYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFGEKGFF 316
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
KI+RG+NEC IE +VVAG + K T ++ +ED
Sbjct: 317 KIRRGTNECQIEGNVVAG------IAKLGTHSETYED 347
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 176 bits (445), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 122/203 (60%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N LS +L++CC + CG GC+GG+P +AW ++V G+VT + C PY C
Sbjct: 139 NAHLSAENLVSCC-YTCGFGCNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPA-CE 196
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC++ C + + HY S+Y ++ EDI EI NGP
Sbjct: 197 HHTTGDRPPCSEGGGTPKCLKTCEDGYTVDYTQDLHYGASSYSVHKRMEDIQLEIMNNGP 256
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE + TVYEDF YKSGVY+H+ G +GGHA++++GWG ++G YW++AN WN WG +
Sbjct: 257 VEGALTVYEDFPTYKSGVYQHVHGKALGGHAIRILGWGV-EEGVPYWLIANSWNTDWGDN 315
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GY K+ RG + CGIE + AGLP
Sbjct: 316 GYIKLLRGKDHCGIESQITAGLP 338
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 176 bits (445), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 98/223 (43%), Positives = 133/223 (59%), Gaps = 30/223 (13%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV------ 59
T+RD + S+ +N S +L++CC LCG GC+GG+P +A++Y+VH G+V
Sbjct: 117 TDRDCIHSN---GTKNFHYSAENLVSCC-HLCGFGCNGGFPGAAFQYWVHSGIVSGGAFN 172
Query: 60 -TEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVK------KNQLWRNSKHYSIS 106
T+ C PY + C H P C TPKC + C ++ L SKHYS+
Sbjct: 173 STQGCQPY-EIAPCEHHVSGPRPKCAEGGSTPKCHKNCESNYVVDYESDLHHGSKHYSV- 230
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
+ D I +I NGPVE +FTVY DF HYKSGVY+H G +GGHA++++GWG
Sbjct: 231 ----DKDETQIKYDIMTNGPVEGAFTVYVDFLHYKSGVYQHTHGLPLGGHAIRVLGWG-E 285
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+DG YW+ AN WN WG +GYFKI RGS+ CGIE ++ AGLP
Sbjct: 286 EDGTPYWLCANSWNTDWGDNGYFKILRGSDHCGIESEISAGLP 328
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 176 bits (445), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 122/205 (59%), Gaps = 20/205 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-------- 72
N S +DL++CC CG GC+GG+P +AW Y+V G+V+ PY S GC
Sbjct: 138 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWGYWVRKGIVSG--GPYGSSQGCRPYEIAPC 194
Query: 73 ------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ P CE Y TP+C KC ++ ++ KH+ AY I+ + DI EI N
Sbjct: 195 EHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTN 254
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYED YK GVY+H+ G +GGHA+++IGWG D YW++AN WN WG
Sbjct: 255 GPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-TPYWLIANSWNTDWG 313
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE + AGLP
Sbjct: 314 NNGFFKILRGKDHCGIESSISAGLP 338
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 176 bits (445), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 123/202 (60%), Gaps = 16/202 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
QN+ LS DLL+CC CG GC+GG AW Y+V G+VT C+PY
Sbjct: 52 QNVELSAVDLLSCCE-SCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCE 110
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C KK + + KH S+Y + +D + I EI K G
Sbjct: 111 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 170
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE FTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG + YW++AN WN WG
Sbjct: 171 PVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKA-PYWLIANSWNEDWGE 229
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RG +EC IE +V AG
Sbjct: 230 NGYFRIVRGRDECSIESEVTAG 251
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 126/203 (62%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
N S +L++CC LCG GC+GG+P +A++Y+VH G+V T+ C PY + C
Sbjct: 130 NFHYSSENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCE 187
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKCV++C + + + H+ AY I D + I EI KNGP
Sbjct: 188 HHVPGPRPKCSEGGGTPKCVKRCENGYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGP 247
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF HYKSGVY+H G +GGHA++++GWG ++G YW+ AN WN WG +
Sbjct: 248 VEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRILGWG-EENGTPYWLCANSWNTDWGDN 306
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI RGS+ CGIE ++ AGLP
Sbjct: 307 GLFKILRGSDHCGIESEISAGLP 329
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 123/200 (61%), Gaps = 17/200 (8%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GG+P AW ++V HG+V+E C PY F S C+H
Sbjct: 139 VRDLRISAGDLMSCCD-VCGYGCNGGFPEVAWVFYVVHGLVSEYCQPYPFPS--CAHHVN 195
Query: 75 ----PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C Y TPKC C KK L R ++S + S E E+ NGP EV
Sbjct: 196 SSDLAPCSGDYKTPKCNSTCTEKKIPLIRYRGNHSY----VLSGEEHFKRELLLNGPFEV 251
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+F VY DF Y GVYKH+ GD++GGHAV+L+GWG +GE YW +AN WN WG +GYF
Sbjct: 252 AFEVYADFMAYTGGVYKHVAGDLLGGHAVRLVGWG-ELNGEPYWKIANSWNHEWGMNGYF 310
Query: 190 KIKRGSNECGIEEDVVAGLP 209
I RG NECGIE + VAG P
Sbjct: 311 LIARGVNECGIESNGVAGTP 330
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 120/202 (59%), Gaps = 18/202 (8%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH------- 74
+++S D L CC +CG GC+GG P AW ++ +G+VT Y D+ GC
Sbjct: 136 VNISAEDPLDCC-TICGMGCNGGMPAMAWLHWTVNGIVTG--GNYEDTNGCKAYSFAPCE 192
Query: 75 -------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
P C P PTP C ++C + L + S Y I+ P+ I EI NGPV
Sbjct: 193 HHVDGDLPPCGPTKPTPDCKKECDSGSSLTYQNDLTHGSNYGIDPYPKQIQTEIMTNGPV 252
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E SF+VYEDF YKSGVY+H+ G+ GGHA+K++GWG +D YW++AN WN WG G
Sbjct: 253 EASFSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVEND-TPYWLVANSWNEDWGDKG 311
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YFKI RGSNECGIE +VAG+P
Sbjct: 312 YFKILRGSNECGIEGSIVAGIP 333
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 125/201 (62%), Gaps = 16/201 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-----F 67
QN+ LS DLL+CC CG GC+GG AW ++V G+VT C+PY
Sbjct: 138 QNVELSAVDLLSCCES-CGLGCEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCE 196
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C KK + + KH S+Y + +D + I EI K G
Sbjct: 197 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYG 256
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++ YW++AN WN WG
Sbjct: 257 PVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGV-ENKTPYWLIANSWNEDWGE 315
Query: 186 DGYFKIKRGSNECGIEEDVVA 206
+GYF+I RG +EC IE +V+A
Sbjct: 316 NGYFRIVRGRDECFIESEVIA 336
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 92/204 (45%), Positives = 122/204 (59%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S DLL+CC +CG GC GG P AW Y+ H G+V+ + C PY + C
Sbjct: 132 KHFHFSAEDLLSCCP-ICGLGCSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 189
Query: 73 SH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC +KC + ++ K Y Y ++ D + I AE++KNG
Sbjct: 190 EHHVPGNRMPCSGDTKTPKCTKKCESGYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNG 249
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D YKSGVYKH GD +GGHAVK++GWG +D + YW++AN WN WG
Sbjct: 250 PVEGAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNK-YWLIANSWNSDWGD 308
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +V G P
Sbjct: 309 NGFFKILRGEDHCGIESSIVTGEP 332
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 88/206 (42%), Positives = 124/206 (60%), Gaps = 18/206 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
+N S +L++CC CG GC+GG+P +AW Y+ G+V+ PY GC
Sbjct: 138 KNFHFSAENLVSCC-RTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAP 194
Query: 73 -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ C+ TP CV+KC ++ + H SAY + +D + I EIY N
Sbjct: 195 CEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTN 254
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW++AN WN WG
Sbjct: 255 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNSDWG 314
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
+DG+FKI RGS+ECGIE + AGLP+
Sbjct: 315 SDGFFKILRGSDECGIEGQINAGLPA 340
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 175 bits (443), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 94/205 (45%), Positives = 124/205 (60%), Gaps = 17/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C
Sbjct: 131 KHFHFSAEDLLSCCP-VCGLGCNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPY-EIPPC 188
Query: 73 SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC R C K+ +++ K Y Y + E I AEI+KNG
Sbjct: 189 EHHVPGNRIPCNGETSTPKCHRSCRKEYTNSYKSDKKYGKHVYSVGGGEEHIKAEIFKNG 248
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D YKSGVYKH G+ +GGHA+K++GWG ++G YW++AN WN WG
Sbjct: 249 PVEGAFTVYADLLTYKSGVYKHTEGEALGGHAIKIMGWGV-ENGNKYWLIANSWNSDWGD 307
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
+G+FKI RG + CGIE +VAG PS
Sbjct: 308 NGFFKILRGEDHCGIESSIVAGEPS 332
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 175 bits (443), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/194 (49%), Positives = 118/194 (60%), Gaps = 14/194 (7%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
+S +LL+CC F+CG GC GG P AW ++V GV TE C PY CSH G YP
Sbjct: 150 ISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTELCQPY-PFGPCSHHGNSSKYPP 207
Query: 83 -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
TPKC C N K+ +S+Y I + E +M E+ NGP+EV+ VY
Sbjct: 208 CPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGERE-LMVELMNNGPLEVAMQVYA 264
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSGVYKH++GD +GGHAVKL+GWG DG YW +AN WN WG GYF I+RG+
Sbjct: 265 DFVAYKSGVYKHVSGDHLGGHAVKLVGWGV-KDGIPYWKIANSWNTDWGDKGYFLIQRGN 323
Query: 196 NECGIEEDVVAGLP 209
+ECGIE VAG P
Sbjct: 324 DECGIESSGVAGKP 337
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 98/219 (44%), Positives = 130/219 (59%), Gaps = 21/219 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S VS++ +S DLL CC CG GC+GGYP +AW ++ G+V+
Sbjct: 117 SDRVCIHSGSKVSVE---ISSEDLLTCCD-ACGMGCNGGYPSAAWDFWTKEGLVSGGLYN 172
Query: 63 ----CDPYFDSTGCSH------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRI 110
C PY C H P C TPKCV C + + KHY S+Y +
Sbjct: 173 SHIGCRPYTIPP-CEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYTKDKHYGKSSYSV 231
Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
+ E I AEI +NGPVE +F VYEDF YKSGVY+H TG +GGHA+K++GWG +DG
Sbjct: 232 EASVEQIQAEISQNGPVEGAFIVYEDFVMYKSGVYQHTTGSALGGHAIKVLGWG-EEDGV 290
Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +G+FKI RGS+ CGIE ++VAG+P
Sbjct: 291 PYWLCANSWNTDWGENGFFKILRGSDHCGIESEIVAGIP 329
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 124/209 (59%), Gaps = 18/209 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
++ + D+L+CC CG+GC+GGYP++A YFV G+VT + C PY C
Sbjct: 144 DVMYAAEDVLSCC-LTCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPY-TLEACE 201
Query: 74 H------PGCEPAYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C TPKC +C+ + +++ K + AY + +D I EI G
Sbjct: 202 HHVPGDRPPCTEGGGTPKCSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYG 261
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY DF YKSGVY+H +G +GGHA+K+IGWGT + G+DYW++ N WN WG
Sbjct: 262 PVEAAFTVYSDFPSYKSGVYRHTSGSELGGHAIKIIGWGT-EGGDDYWLINNSWNSDWGD 320
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
G FKI RGSNECGIE +VVA + L
Sbjct: 321 KGTFKILRGSNECGIEGEVVAATVDASTL 349
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 132/218 (60%), Gaps = 19/218 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL CC CG GC+GGYP +AW ++ G+V+
Sbjct: 117 SDRLCIHSNAKVSVE---ISAEDLLTCCD-SCGMGCNGGYPSAAWDFWTKEGLVSGGLYD 172
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRIN 111
C PY G P TP+C+ +C +R KHY ++Y +
Sbjct: 173 SHVGCRPYTIPPCEHHVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTSYSVL 232
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
SD +I EIYKNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG ++G
Sbjct: 233 SDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGVYQHVSGSAVGGHAIKVLGWG-EENGVP 291
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +G+FK RGS+ CGIE ++VAG+P
Sbjct: 292 YWLCANSWNTDWGDNGFFKFLRGSDHCGIESEIVAGIP 329
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 122/204 (59%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
++S +DLL+CC CG GCDGG+P +AW Y+V G+V+ C PY
Sbjct: 143 QFTVSADDLLSCCD-ECGFGCDGGFPYAAWNYWVEKGIVSGGSYTSKSGCKPYPFPPCEH 201
Query: 68 DSTGCS-HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
+ G HP + YPT C KC + N K Y AY + + + I EI +G
Sbjct: 202 HTNGTHYHPCPKDLYPTNTCEHKCQSGYATAYTNDKRYGAKAYTVAARVKAIQKEIMLHG 261
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVEV++ VYEDF HY G+YKH G +GGHAVK+IGWGT ++G YWI +N WN WG
Sbjct: 262 PVEVAYDVYEDFEHYLKGIYKHTAGSYLGGHAVKMIGWGT-ENGIPYWICSNSWNSDWGE 320
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+F+I RG++ECGIE VVAGLP
Sbjct: 321 NGFFRILRGTDECGIESGVVAGLP 344
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 123/204 (60%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V T+ C PY + C
Sbjct: 133 KHFHFSAEDLLSCCP-ICGLGCNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPY-EVPPC 190
Query: 73 SH--PG----CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC + C N ++ KHY Y ++ + ++I AE++KNG
Sbjct: 191 EHHVPGNRLPCNGDTKTPKCQKTCEAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNG 250
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D YKSGVY+H G +GGHAVK++GWG ++G YW++AN WN WG
Sbjct: 251 PVEGAFTVYSDLLSYKSGVYQHTDGSALGGHAVKILGWGV-ENGSKYWLIANSWNSDWGD 309
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +V G P
Sbjct: 310 NGFFKILRGEDHCGIESSIVTGEP 333
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/187 (45%), Positives = 119/187 (63%), Gaps = 16/187 (8%)
Query: 37 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 83
CG GC+GGYP +AW+++ +VT + C PY+ C H P C PT
Sbjct: 3 CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPP-CEHHTVGPLPNCTGIKPT 61
Query: 84 PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
P+C + C + Q + KH+ Y I+SD I EIYKNGPVE F+VY DF YKS
Sbjct: 62 PECAKTCREGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVEADFSVYADFPSYKS 121
Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
GVY+ + +++GGHA++++GWGT +DG YW++AN WN WG GYFKI+RG++ECGIE+
Sbjct: 122 GVYQRHSEEMLGGHAIRILGWGT-EDGVPYWLVANSWNEDWGDKGYFKIRRGNDECGIED 180
Query: 203 DVVAGLP 209
D+ AG+P
Sbjct: 181 DINAGIP 187
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 121/203 (59%), Gaps = 17/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ + +S D+L+CCG CG GC+GG+PI A+ YF G VT C PY C
Sbjct: 139 KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 197
Query: 73 SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C TPKCVRKC + ++ + AY + + + I EI KN
Sbjct: 198 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKN 257
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV +FTVYEDF++YK G+YKH G GGHA+K+IGWG + G YW++AN W+ WG
Sbjct: 258 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWG 316
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RGSN CGIEE+VVAG
Sbjct: 317 ENGYFRILRGSNHCGIEENVVAG 339
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 89/186 (47%), Positives = 112/186 (60%), Gaps = 18/186 (9%)
Query: 41 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDS------TGCSHPGC-EPAYPTPKC 86
C+GGYPI AW+++V HG+VT C PY + G + P C E PTPKC
Sbjct: 14 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73
Query: 87 VRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
V C N + KH+ +AY + E I EI +GP+EV+FTVYEDF Y +G
Sbjct: 74 VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTG 133
Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
VY H G +GGHAVK++GWG D+G YW++AN WN +WG GYF+I RG NECGIE
Sbjct: 134 VYVHTAGKSLGGHAVKILGWGV-DNGTPYWLVANSWNVNWGEKGYFRIIRGLNECGIEHS 192
Query: 204 VVAGLP 209
VAGLP
Sbjct: 193 AVAGLP 198
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 174 bits (441), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 125/203 (61%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
N S +L++CC LCG GC+GG+P +A++Y+VH G+V T+ C PY + C
Sbjct: 129 NFHYSAENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCE 186
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C K + + + H+ AY I D + I EI KNGP
Sbjct: 187 HHVPGPRPKCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGP 246
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF HYKSGVY+H G +GGHA++++GWG ++G YW+ AN WN WG +
Sbjct: 247 VEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLCANSWNTDWGDN 305
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI RGS+ CGIE ++ AGLP
Sbjct: 306 GLFKILRGSDHCGIESEISAGLP 328
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 123/204 (60%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
+ S DL++CC CG GC+GG+P +AW Y+V G+V+ C PY + C
Sbjct: 135 HFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAP-CE 192
Query: 74 H------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P CE TPKCV+KC + N ++ K + S+Y I I EI NG
Sbjct: 193 HHVNGTRPSCEGEGGKTPKCVKKCQESYNVPYQKDKRFGASSYSIARHEAQIQKEIMTNG 252
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVYED HYK GVY+H+TG ++GGHA++++GWG ++G YW++AN WN WG
Sbjct: 253 PVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGV-ENGTKYWLIANSWNSDWGD 311
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + GIE + AGLP
Sbjct: 312 NGFFKILRGEDHLGIESSISAGLP 335
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 115/187 (61%), Gaps = 16/187 (8%)
Query: 37 CGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------PGCEPAYPT 83
C C+GG+P SAW Y+ G+VT + C PY C H C+ PT
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPY-QIKSCDHHVNGTKGPCQGEGPT 227
Query: 84 PKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
P+C KC + + KHY++S I+++PE EI NGPVE FTVYEDF YKS
Sbjct: 228 PECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKS 287
Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
GVY+H TG V+GGHA+K++GWG ++G YW++AN WN WG +G+FKI RGSNECGIE
Sbjct: 288 GVYQHTTGGVLGGHAIKILGWGV-EEGTKYWLVANSWNNEWGDNGFFKILRGSNECGIES 346
Query: 203 DVVAGLP 209
D+ G+P
Sbjct: 347 DINFGIP 353
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 122/204 (59%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ H G+V+ E C PY + C
Sbjct: 136 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKEGCRPY-EVEPCE 193
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP+C+ KC + + KH+ AY +N +P DI EI NGP
Sbjct: 194 HHVNGTRPPCHSG-STPRCMHKCESGYSVDYAKDKHFGAKAYSVNRNPLDIQREIMTNGP 252
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YK+GVY+H+ G +GGHA++++GWG D+ YW++ N WN WG
Sbjct: 253 VEGAFTVYEDLILYKTGVYQHVHGRQLGGHAIRILGWGVWGDNKVPYWLIGNSWNTDWGD 312
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+F+I RG + CGIE + AGLP
Sbjct: 313 NGFFRILRGEDHCGIESAISAGLP 336
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 128/204 (62%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N+ LS +DL++CC + CG GC+GG+P +AW Y+V+ G+V+ + C PY + C
Sbjct: 130 NVRLSADDLVSCC-YSCGMGCNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPY-EIAPCE 187
Query: 74 H--PGCEPA-----YPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G P TP C ++C K N ++ K++ AY I+S+ + I EI NG
Sbjct: 188 HHVNGTRPPCTGDDNKTPSCKQQCEKGYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTNG 247
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYED YK GVY+H+ G+ +GGHA++++GWGT + G YW++AN WN WG
Sbjct: 248 PVEGAFEVYEDLLSYKKGVYQHVKGEALGGHAIRILGWGT-EKGTPYWLIANSWNSDWGD 306
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G FKI RG + CGIE +VAG+P
Sbjct: 307 NGTFKILRGEDHCGIESSIVAGIP 330
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 123/202 (60%), Gaps = 19/202 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
LS D+++CC + CG GC+GG P +W Y+ GVVT C PY CSH
Sbjct: 139 LSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGV 196
Query: 75 --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
PG P YPTPKC +KC N+ + K S+Y + DIM EI KNGPV
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPV 256
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+ F ++EDF YKSG+Y + TG ++GGHA+++IGWG ++G YW++AN WN WG G
Sbjct: 257 DGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKG 315
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF+++RG+NECGIE + AGLP
Sbjct: 316 YFRMRRGNNECGIEARINAGLP 337
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 96/197 (48%), Positives = 121/197 (61%), Gaps = 18/197 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++ LS DL+ C DGC+GG +SAW + GVVT+EC PY + P C PA
Sbjct: 126 DVQLSFLDLVTC--DQSDDGCEGGDDVSAWNFLKKQGVVTQECKPY------TIPTCPPA 177
Query: 81 YP-------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
TP CV++C + L + KH Y INS E IM EI NGPVE F+
Sbjct: 178 QQPCLNFVNTPNCVKQCESNSTLIYSQDKHKMAKIYSINS-VEAIMQEISTNGPVEACFS 236
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVY+H TG +GGH VK+ G+GT +G +YW +AN W SWG +G F IK
Sbjct: 237 VYEDFLGYKSGVYQHTTGKFLGGHCVKIFGYGTL-NGVNYWSVANSWTTSWGDNGIFLIK 295
Query: 193 RGSNECGIEEDVVAGLP 209
RGS+ECGIE++VVAG+P
Sbjct: 296 RGSDECGIEDEVVAGIP 312
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 173 bits (439), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 96/208 (46%), Positives = 131/208 (62%), Gaps = 21/208 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRY----FVHHGVVT-------EECDPYFD 68
+ +++S +LL+CC CG GCDGGYP +AWR+ ++ G+VT C PY
Sbjct: 133 EQVNISAENLLSCCE-TCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPY-T 190
Query: 69 STGCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEI 121
C H PG C + TP C R C+ ++ +R+ KHY ++Y I+SD I EI
Sbjct: 191 IPKCDHHEPGPYENCSGSQSTPSCKRSCISSYDKSYRSDKHYGKNSYSISSDVSSIQTEI 250
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
NGPVE +F+VY DF Y SGVY+H TG +GGHA+K++GWGT ++G YW++AN WN
Sbjct: 251 MTNGPVEGAFSVYADFPTYTSGVYQHTTGSFLGGHAIKILGWGT-ENGVPYWLVANSWNP 309
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
SWG G+FKI RG +ECGIE +VAG+P
Sbjct: 310 SWGDSGFFKIIRGKDECGIESSIVAGMP 337
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 173 bits (439), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 120/197 (60%), Gaps = 14/197 (7%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
+S ++LL+CC F+CG GC GG P AW ++V G+ TE C PY CSH G YP
Sbjct: 150 ISTSNLLSCC-FICGFGCYGGIPTMAWLWWVWVGITTEVCQPY-PFGPCSHHGNSDKYPP 207
Query: 83 -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
TPKC C K K+ ++Y + + E +M E+ NGP+EV+ VY
Sbjct: 208 CPNTIYDTPKCNTTCEKSEM--DLVKYKGGTSYSVKGEKE-LMIELMTNGPLEVTMQVYS 264
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSG YKH++GD++GGHAVKL+GWGT G YW +AN WN WG GYF I+RGS
Sbjct: 265 DFVGYKSGGYKHVSGDLLGGHAVKLVGWGT-QGGVPYWKIANSWNTDWGDKGYFLIQRGS 323
Query: 196 NECGIEEDVVAGLPSSK 212
NECGIE VAG P+ +
Sbjct: 324 NECGIESGGVAGTPAQE 340
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 173 bits (439), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 132/219 (60%), Gaps = 19/219 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL CC CG GC+GGYP +AW ++ G+VT
Sbjct: 117 SDRVCIHSNAKVSVE---ISAQDLLTCCDG-CGMGCNGGYPSAAWDFWSSDGLVTGGLYN 172
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY G P TP C C + ++ KH+ ++Y +
Sbjct: 173 SHIGCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMSCEPGYSPSYKQDKHFGKTSYSVP 232
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S+ +DIM E+YKNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG ++G
Sbjct: 233 SNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGVYQHVSGPALGGHAIKILGWG-EENGVP 291
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
YW+ AN WN WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 292 YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIPQ 330
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 122/203 (60%), Gaps = 17/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ + +S D+L+CCG CG GC+GG+PI A+ YF G VT C PY C
Sbjct: 51 KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 109
Query: 73 SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C TPKCVRKC + ++ + AY + + + I EI KN
Sbjct: 110 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKN 169
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV +FTVYEDF++YK G+YKH G GGHA+K+IGWG ++G YW++AN W+ WG
Sbjct: 170 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KENGVPYWLIANSWHNDWG 228
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RGSN CGIEE+VVAG
Sbjct: 229 ENGYFRILRGSNHCGIEENVVAG 251
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 124/203 (61%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
N S +L++CC LCG GC+GG+P +A++Y+VH G+V T+ C PY + C
Sbjct: 129 NFHYSAENLVSCC-HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQGCQPY-EIAPCE 186
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C K + + + H+ AY I D + I EI NGP
Sbjct: 187 HHVSGPRPKCSEGGGTPKCAKTCEKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGP 246
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF HYKSGVY+H G +GGHA++++GWG ++G YW+ AN WN WG +
Sbjct: 247 VEGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWG-EENGTPYWLCANSWNTDWGDN 305
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI RGS+ CGIE ++ AGLP
Sbjct: 306 GLFKILRGSDHCGIESEISAGLP 328
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 173 bits (438), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 89/197 (45%), Positives = 118/197 (59%), Gaps = 12/197 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q ++S D+LACCG CGDGC G YPI A+R++ GVVT C PY + S
Sbjct: 130 QQPTISPTDMLACCGNSCGDGCKGRYPIQAFRWWNSRGVVTGGDFRGSGCRPYPFAPCIS 189
Query: 74 HPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
P TP C C + + K + +SAY + + I EI NGPV +FT
Sbjct: 190 CP----EEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFT 245
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
+YED YKSGVY+H G ++GGHA+K+IGWGT +G YW++AN W +WG +G+ K++
Sbjct: 246 MYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT-QNGIPYWLIANSWGANWGENGFLKMR 304
Query: 193 RGSNECGIEEDVVAGLP 209
RG NECGIE VVAG+P
Sbjct: 305 RGVNECGIERAVVAGMP 321
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/198 (46%), Positives = 120/198 (60%), Gaps = 21/198 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
+ + LS +DLL+CC CG GC GG P++AW+Y+V G+VT Y + +GC P
Sbjct: 126 KQVILSADDLLSCCK-TCGFGCFGGEPMAAWKYWVLSGIVTGS--DYTNHSGCRPYPFPP 182
Query: 77 CE-------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
CE YPTPKC ++C K + ++ K+Y AY + +D E I EI
Sbjct: 183 CEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVENDVESIQKEIM 242
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
GPVE SF VY DF HY SG+YKH+ G V GGHAVK++GWG D G YW+ AN WN
Sbjct: 243 TLGPVEASFEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGI-DQGVSYWLAANSWNND 301
Query: 183 WGADGYFKIKRGSNECGI 200
WG DGYF+I RG++ECG+
Sbjct: 302 WGEDGYFRILRGADECGM 319
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 124/205 (60%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+N+++S +LL+CC + CG GC+GG+P +AW+Y+ G+V+ C PY D C
Sbjct: 130 KNVNISAENLLSCC-YSCGFGCNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPY-DIEPC 187
Query: 73 SH------PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI--SAYRINSDPEDIMAEIYKN 124
H C TPKC R C +N K S S+Y I SDP+ I EI N
Sbjct: 188 EHHVNGTRQPCAEGGRTPKCHRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDN 247
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +F+VY DF + KSGVY+H+ G ++GGHA++++GWG + G YW++AN WN WG
Sbjct: 248 GPVEAAFSVYSDFMNDKSGVYRHVKGSLLGGHAIRILGWGV-EKGTPYWLVANSWNTDWG 306
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G FKI RGS+ CGIE VV GLP
Sbjct: 307 DKGTFKILRGSDHCGIEGSVVTGLP 331
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/204 (44%), Positives = 123/204 (60%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C
Sbjct: 136 KHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPY-EIPPC 193
Query: 73 SH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC + C N + K Y Y ++S + I AE+YKNG
Sbjct: 194 EHHVPGNRMPCNGDSKTPKCHKTCESSYNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNG 253
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D +YK+GVYKH G+ +GGHA+K++GWG ++G YW++AN WN WG
Sbjct: 254 PVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGNKYWLIANSWNSDWGD 312
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +VAG P
Sbjct: 313 NGFFKILRGEDHCGIESSIVAGEP 336
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 122/203 (60%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
Q+ LS DL++CC CG GC GG+P AW Y+V G+VT C PY
Sbjct: 13 QSAELSALDLISCC-EDCGQGCQGGFPGVAWDYWVTQGIVTGGSKENHTGCQPYPFPKCE 71
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C +KC K + ++ KHY +Y + S+ + I EI NG
Sbjct: 72 HHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISNEKAIQKEIMMNG 131
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG YW++AN WN WG
Sbjct: 132 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVKKR-TPYWLIANSWNEDWGE 190
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G F+I RG +EC IE +VVAGL
Sbjct: 191 KGLFRIVRGRDECSIESNVVAGL 213
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 172 bits (437), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 96/202 (47%), Positives = 125/202 (61%), Gaps = 14/202 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S D+L+CCG C +GC GGY I A +Y+++ GVVT C PY CS
Sbjct: 134 QQPIISPEDILSCCGSSCNNGCQGGYTIEAMKYWMNSGVVTGGDYQGAGCIPY-SFRPCS 192
Query: 74 HPGCEPAYPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
C+ P C C K +R S +A N+ + I EIY NGPVEV+
Sbjct: 193 --TCKEPKDAPSCKTTCQASYKAKSAYRLPTTTSSNAIVANA-VQMIQTEIYNNGPVEVA 249
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
+ VY+DF HYKSGVY H+ GD GHAVK+IGWGT + DYW++AN W+ ++G +G+FK
Sbjct: 250 YQVYDDFYHYKSGVYYHVYGDKPSGHAVKIIGWGT-EKKVDYWLVANSWSTTFGENGFFK 308
Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
I+RG+NECGIEE+VVAGLP SK
Sbjct: 309 IRRGTNECGIEENVVAGLPKSK 330
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 90/205 (43%), Positives = 122/205 (59%), Gaps = 20/205 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-------- 72
N S +DL++CC CG GC+GG+P +AW Y+V G+V+ PY S GC
Sbjct: 138 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGIVSG--GPYGSSQGCRPYEIAPC 194
Query: 73 ------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ P CE Y TP+C KC ++ ++ KH+ AY I+ + DI EI +
Sbjct: 195 EHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMTH 254
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYED YK GVY+H+ G +GGHA+++IGWG D YW++AN WN WG
Sbjct: 255 GPVEGAFTVYEDLILYKDGVYEHVHGKELGGHAIRIIGWGVEKD-IPYWLVANSWNTDWG 313
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE + AGLP
Sbjct: 314 NNGFFKILRGKDHCGIESSISAGLP 338
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 123/204 (60%), Gaps = 18/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPYFDSTGC 72
Q+ LS DL++CC CGDGC GG+P AW Y+V G+VT EE C PY C
Sbjct: 139 QSAELSALDLISCCED-CGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPY-PFPKC 196
Query: 73 SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C Y TP+C + C K + + KHY Y + S+ + I EI
Sbjct: 197 EHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMY 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + G+ YW++AN WN WG
Sbjct: 257 GPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGKPYWLIANSWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
G F++ RG +EC IE VVAGL
Sbjct: 316 EKGLFRMVRGRDECSIESHVVAGL 339
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 92/209 (44%), Positives = 122/209 (58%), Gaps = 25/209 (11%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF--------- 67
+S D+L+CCG CG GC+GG+PI AWR+F G T C PY
Sbjct: 136 ISDTDILSCCGLYCGYGCNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHL 195
Query: 68 ---DSTGCSHPG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMA 119
D C + C TP+C R+C+ + + + ++Y SAY + + I
Sbjct: 196 KRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYGKSAYIVKQSVKAIQR 255
Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
EI KNGPV SF VYEDF HYKSG+YKH G++ G HAVK+IGWG ++ D+W++AN W
Sbjct: 256 EIMKNGPVVASFAVYEDFRHYKSGIYKHTAGELRGYHAVKIIGWG-KENNTDFWLIANSW 314
Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGL 208
++ WG GYF+I RG NECGIE DVVAG+
Sbjct: 315 HQDWGEKGYFRIVRGKNECGIETDVVAGI 343
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 172 bits (436), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 122/206 (59%), Gaps = 26/206 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ S D+LACC CGDGC GGY AW+++V GV + PY GC HP
Sbjct: 136 EQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP---- 187
Query: 80 AYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
YP TPKC ++C +W++ + Y AY I +D + IM EIY N
Sbjct: 188 -YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYIN 245
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV+ +F Y+D YKSGVY+H+ G + GGHAVKL+GWG ++G YW++AN W WG
Sbjct: 246 GPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWG 304
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
+G+FKI RG N CGIE+DV AGLPS
Sbjct: 305 DNGFFKIVRGENHCGIEKDVHAGLPS 330
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 172 bits (436), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 122/206 (59%), Gaps = 26/206 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ S D+LACC CGDGC GGY AW+++V GV + PY GC HP
Sbjct: 136 EQFSFGATDMLACC-HACGDGCKGGYLGPAWQFWVEQGVSSG--GPYNSRQGC-HP---- 187
Query: 80 AYP------------TPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
YP TPKC ++C +W++ + Y AY I +D + IM EIY N
Sbjct: 188 -YPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQD-RRYGRVAYSIPNDEQKIMEEIYIN 245
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV+ +F Y+D YKSGVY+H+ G + GGHAVKL+GWG ++G YW++AN W WG
Sbjct: 246 GPVQAAFMTYQDLHAYKSGVYRHVWGHMAGGHAVKLMGWGV-ENGLKYWLVANSWGDDWG 304
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
+G+FKI RG N CGIE+DV AGLPS
Sbjct: 305 DNGFFKIVRGENHCGIEKDVHAGLPS 330
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 172 bits (436), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 98/243 (40%), Positives = 140/243 (57%), Gaps = 40/243 (16%)
Query: 2 SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 61
S ++R + S+ VS++ LS DLL CC CG GC+GGYP SAW ++V G+V+
Sbjct: 113 SEAMSDRVCIHSNAKVSVE---LSAQDLLTCCNS-CGMGCNGGYPSSAWNFWVSDGLVSG 168
Query: 62 -------------------ECDPYFDSTGC--------------SHPGCE-PAYPTPKCV 87
D F S GC S P C TP+C+
Sbjct: 169 GLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCEHHVNGSRPSCSGEGGDTPECI 228
Query: 88 RKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 146
+C + ++ KH+ ++Y ++S+ ++I EIYKNGPVE +FTVYEDF YKSGVY+
Sbjct: 229 FRCEAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEGAFTVYEDFVLYKSGVYQ 288
Query: 147 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
H++G +GGHA+K++GWG ++G YW+ AN WN WG +G+FKI RG++ CGIE ++VA
Sbjct: 289 HVSGSALGGHAIKMLGWG-EENGVPYWLCANSWNTDWGDNGFFKILRGADHCGIESEIVA 347
Query: 207 GLP 209
G P
Sbjct: 348 GNP 350
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 121/203 (59%), Gaps = 17/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ + +S D+L+CCG CG GC+GG+PI A+ YF G VT C PY C
Sbjct: 51 KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 109
Query: 73 SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C TPKCVRKC + ++ + AY + + + I EI KN
Sbjct: 110 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSEKAIQREIMKN 169
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV +FTVYEDF++YK G+YKH G GGHA+K+IGWG + G YW++AN W+ WG
Sbjct: 170 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWG 228
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RGSN CGIEE+VVAG
Sbjct: 229 ENGYFRILRGSNHCGIEENVVAG 251
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 122/202 (60%), Gaps = 19/202 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
LS D+++CC + CG GC+GG P +W Y+ GVVT C PY CSH
Sbjct: 139 LSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGV 196
Query: 75 --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
PG P YPTPKC +KC N+ + K S+Y + D M EI KNGPV
Sbjct: 197 VTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPV 256
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+ F ++EDF YKSG+Y + TG ++GGHA+++IGWG ++G YW++AN WN WG G
Sbjct: 257 DGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVKYWLIANSWNEGWGEKG 315
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF+++RG+NECGIE + AGLP
Sbjct: 316 YFRMRRGNNECGIEARINAGLP 337
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 120/203 (59%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ LS +LL+CC CGDGC GG P SAW Y+ G+V+ + C PY C
Sbjct: 132 QVHLSAENLLSCCD-SCGDGCLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPY-SIAPCE 189
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC ++C K + + + +Y Y I +D + I AEI KNGP
Sbjct: 190 HSIHGSSPACGGVTDTPKCKKQCEKGYSIPYDKAFYYGQPGYAIPNDAQKIQAEILKNGP 249
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
+ SF VYED YK GVY+H+ G+ +GGH +K+ GWG ++G YW++AN WN WG +
Sbjct: 250 IVASFLVYEDLFSYKEGVYQHVAGEFLGGHVIKIFGWGI-ENGTPYWLVANSWNTDWGNN 308
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG +ECGIE DV AGLP
Sbjct: 309 GFFKIPRGKDECGIEIDVSAGLP 331
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 121/203 (59%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
Q+ LS DL++CC CGDGC GG+P AW Y+V G+VT C PY
Sbjct: 13 QSAELSALDLISCC-EDCGDGCQGGFPGQAWDYWVTQGIVTGGSKENHTGCQPYPFPKCE 71
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C K + + KHY +Y + S+ + I EI NG
Sbjct: 72 HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNEKAIQKEIMMNG 131
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW++AN WN WG
Sbjct: 132 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 190
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G F+I RG +EC IE VVAGL
Sbjct: 191 KGLFRIVRGRDECSIESHVVAGL 213
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/219 (43%), Positives = 133/219 (60%), Gaps = 21/219 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S VS++ +S DLL CC CG GC+GGYP +AW ++ G+VT
Sbjct: 117 SDRVCIHSDAKVSVE---ISSQDLLTCCD-SCGMGCNGGYPSAAWDFWATEGLVTGGLYN 172
Query: 63 ----CDPYFDSTGCSH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRI 110
C PY C H P C TP C KC + ++ KH+ ++Y +
Sbjct: 173 SHIGCRPYTIEP-CEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYKQDKHFGKTSYSV 231
Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
S+ IMAE++KNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG ++G
Sbjct: 232 PSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVGGHAIKILGWG-EENGV 290
Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +GYFKI RG + CGIE ++VAG+P
Sbjct: 291 PYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 171 bits (434), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 120/203 (59%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
N S +DL+ CC CG GC+GG+P +AW Y+ G+V TE C PY + C
Sbjct: 138 NFHFSADDLVTCC-HTCGFGCNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPY-EVEPCE 195
Query: 74 HPGCEPAYP-----TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
H P P TP C +C + + KH+ S+Y IN +P +I EI NGPV
Sbjct: 196 HHVDGPRPPCHSGSTPHCKHQCQPNYSVDYEKDKHFGASSYSINRNPRNIQREIMTNGPV 255
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGAD 186
E +FTVYED YK+GVY+H+ G +GGHA+++IGWG + + YW++AN WN WG +
Sbjct: 256 EGAFTVYEDLILYKTGVYQHVHGKQLGGHAIRIIGWGVWGESKVPYWLIANSWNTDWGDN 315
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+F+I RG + CGIE + AGLP
Sbjct: 316 GFFRILRGKDHCGIESQISAGLP 338
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 171 bits (434), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 122/204 (59%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
+ S DL++CC CG GC+GG+P +AW Y+VH G+V+ C PY + C
Sbjct: 133 HFRFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVHKGLVSGGPFGSNLGCQPYAIAP-CE 190
Query: 74 H------PGCE-PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P CE TPKCV+KC + + K Y +Y I + I EI NG
Sbjct: 191 HHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPYAKDKRYGSKSYSIPRHEDQIRKEIMTNG 250
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVYED HYK GVY+H+TG ++GGHA++++GWG ++ + YW++AN WN WG
Sbjct: 251 PVEGAFTVYEDLLHYKEGVYQHVTGKMLGGHAIRILGWGVENNTK-YWLIANSWNSDWGD 309
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + GIE + AGLP
Sbjct: 310 NGFFKILRGEDHLGIESSIAAGLP 333
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q++ +S DLLACC CGDGC+GG P AW YF G+V++ C PY H +
Sbjct: 118 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 176
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
YP TPKC C N + S ++Y + + +D M E++ GP EV+
Sbjct: 177 NGYPPCSQFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 233
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VYEDF Y SGVY H++G +GGHAV+L+GWGTS +G YW +AN WN WG DGYF
Sbjct: 234 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 292
Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
I+RGS+ECGIE+ AG+P + N
Sbjct: 293 IRRGSSECGIEDGGSAGIPLAPN 315
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q++ +S DLLACC CGDGC+GG P AW YF G+V++ C PY H +
Sbjct: 141 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 199
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
YP TPKC C N + S ++Y + + +D M E++ GP EV+
Sbjct: 200 NGYPPCSQFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 256
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VYEDF Y SGVY H++G +GGHAV+L+GWGTS +G YW +AN WN WG DGYF
Sbjct: 257 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 315
Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
I+RGS+ECGIE+ AG+P + N
Sbjct: 316 IRRGSSECGIEDGGSAGIPLAPN 338
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 127/205 (61%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
+++ LS DLL+CC CG GC GG+P +AW Y+V G+VT C PY
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197
Query: 67 FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+TG +P C E Y TPKC +KC K + ++ K+Y +Y + ++ I EI +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTV+ DF +YKSG+YK++TG +GGHAV++IGWG + YW++AN WN WG
Sbjct: 257 GPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
GYF+I RG +ECGIE +V GLP
Sbjct: 316 EKGYFRILRGKDECGIESEVTGGLP 340
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q++ +S DLLACC CGDGC+GG P AW YF G+V++ C PY H +
Sbjct: 119 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 177
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
YP TPKC C N + S ++Y + + +D M E++ GP EV+
Sbjct: 178 NGYPPCSQFNFDTPKCDYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 234
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VYEDF Y SGVY H++G +GGHAV+L+GWGTS +G YW +AN WN WG DGYF
Sbjct: 235 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 293
Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
I+RGS+ECGIE+ AG+P + N
Sbjct: 294 IRRGSSECGIEDGGSAGIPLAPN 316
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 13/203 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q++ +S DLLACC CGDGC+GG P AW YF G+V++ C PY H +
Sbjct: 141 VQDVHISAGDLLACCS-DCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHHSKSK 199
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
YP TPKC C N + S ++Y + + +D M E++ GP EV+
Sbjct: 200 NGYPPCSQFNFDTPKCNYTCDDPTIPVVNYR--SWTSYALQGE-DDYMRELFFRGPFEVA 256
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VYEDF Y SGVY H++G +GGHAV+L+GWGTS +G YW +AN WN WG DGYF
Sbjct: 257 FDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGWGTS-NGVPYWKIANSWNTEWGMDGYFL 315
Query: 191 IKRGSNECGIEEDVVAGLPSSKN 213
I+RGS+ECGIE+ AG+P + N
Sbjct: 316 IRRGSSECGIEDGGSAGIPLAPN 338
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 127/205 (61%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
+++ LS DLL+CC CG GC GG+P +AW Y+V G+VT C PY
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197
Query: 67 FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+TG +P C E Y TPKC +KC K + ++ K+Y +Y + ++ I EI +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTV+ DF +YKSG+YK++TG +GGHAV++IGWG + YW++AN WN WG
Sbjct: 257 GPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
GYF+I RG +ECGIE +V GLP
Sbjct: 316 EKGYFRILRGKDECGIESEVTGGLP 340
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 171 bits (432), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/205 (43%), Positives = 122/205 (59%), Gaps = 17/205 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
+S DL++CCG+ CG GC GG+P +AW ++ G+VT C Y CSH G
Sbjct: 133 ISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHG 190
Query: 77 CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
+ Y TP CV+KC + + K + Y + + IM EI NGPVE
Sbjct: 191 SKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEA 250
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+F VYEDF YKSGVY H G ++GGHA++++GWG ++G YW++AN WN WG DGYF
Sbjct: 251 AFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGYF 309
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
K+ RG NECGIE++V AGLP ++
Sbjct: 310 KMLRGKNECGIEDEVTAGLPELSSI 334
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 120/200 (60%), Gaps = 17/200 (8%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H
Sbjct: 139 VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 195
Query: 75 ----PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C Y TP C C KK L + Y + I S E E+ NGP EV
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCTDKKIPLIK----YRGNTSYILSGEESFKRELLLNGPFEV 251
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
SF+VY DF Y GVYKH+TG +GGHAV+++GWG +GE YW +AN WN WG +GYF
Sbjct: 252 SFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG-ELNGEPYWKIANSWNHEWGMNGYF 310
Query: 190 KIKRGSNECGIEEDVVAGLP 209
I RG +ECGIE VAG+P
Sbjct: 311 LIARGVDECGIEGSGVAGIP 330
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 127/199 (63%), Gaps = 19/199 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKNGPVE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G
Sbjct: 234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTP 292
Query: 172 YWILANQWNRSWGADGYFK 190
YW++AN WN WG +G+FK
Sbjct: 293 YWLVANSWNTDWGDNGFFK 311
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 94/207 (45%), Positives = 120/207 (57%), Gaps = 21/207 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q + LS +D+L+CC CG GC+GG AW Y+ G+VT Y +GC +P
Sbjct: 143 QQVILSADDILSCCT-ECGYGCEGGDTYKAWNYWTTDGIVTGS--NYTTKSGCKPYPYPP 199
Query: 77 CE-------------PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIY 122
CE YPT C KC + + KHY Y + D I EI
Sbjct: 200 CEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTISYDEDKHYGAYPYVLVGDASFIQQEIM 259
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
+GPVEV+F VYEDF HY SG+YKH+ G+ +G HAVK++GWGT ++G DYWI AN WN
Sbjct: 260 NHGPVEVTFDVYEDFEHYSSGIYKHMAGEYVGVHAVKMLGWGT-ENGVDYWICANSWNSD 318
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
WG +G+F+I RG NECGIE +VVAG P
Sbjct: 319 WGENGFFRILRGENECGIESNVVAGKP 345
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 118/202 (58%), Gaps = 21/202 (10%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H
Sbjct: 139 VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 195
Query: 75 ----PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C Y TP C C K +R + Y +S E E+ NGP
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPF 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
EVSF+VY DF Y GVYKH+ G +GGHAV+++GWG +GE YW +AN WNR WG +G
Sbjct: 250 EVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWG-ELNGEPYWKIANSWNREWGMNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF I RG +ECGIE VAG P
Sbjct: 309 YFLIARGVDECGIEGSGVAGTP 330
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 122/192 (63%), Gaps = 16/192 (8%)
Query: 41 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH------PGCEPAYPTPKCV 87
C+GGYP AW ++ G+V+ C PY C H P C TPKC
Sbjct: 87 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCS 145
Query: 88 RKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK 146
+ C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F+VY DF YKSGVY+
Sbjct: 146 KICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ 205
Query: 147 HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI RG + CGIE +VVA
Sbjct: 206 HVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA 264
Query: 207 GLPSSKNLVKEI 218
G+P + ++I
Sbjct: 265 GIPRTDQYWEKI 276
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 86/203 (42%), Positives = 127/203 (62%), Gaps = 18/203 (8%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
+++S +LL+CC + CG GC+GG+P +AW ++ G+V+ + C PY + C H
Sbjct: 133 VNVSAENLLSCC-YSCGFGCNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAP-CEH 190
Query: 75 ------PGCEPAYPTPKCVRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
P C TPKC C ++ + K + S+Y + SDP+ I EI NGP
Sbjct: 191 HANGTRPPCSGGGRTPKCHTFCENEDYSLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGP 250
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF +YKSGVY+H+ G ++GGHA++++GWG ++G YW++AN WN WG +
Sbjct: 251 VEAAFSVYSDFLNYKSGVYRHVKGSLLGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDN 309
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI +GS+ CGIE +VAGLP
Sbjct: 310 GTFKILKGSDHCGIEGSIVAGLP 332
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 92/197 (46%), Positives = 112/197 (56%), Gaps = 10/197 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +S DLL CCG CG+GCDGG+P A++++ GVVT C PY C+
Sbjct: 132 QQPIISPTDLLTCCGMSCGEGCDGGFPYRAFQWWARRGVVTGGDYLGTGCKPY-PIRPCN 190
Query: 74 HPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C TP C C + N K+Y SAY + I A+IY NGPV +F
Sbjct: 191 SDNCV-NLQTPPCRLSCQPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFI 249
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSG+Y+HI G GGHAVKLIGWGT + G YW+ N W WG G F+I
Sbjct: 250 VYEDFEKYKSGIYRHIAGRSKGGHAVKLIGWGT-ERGTPYWLAVNSWGSQWGESGTFRIL 308
Query: 193 RGSNECGIEEDVVAGLP 209
RG +ECGIE +VAGLP
Sbjct: 309 RGVDECGIESRIVAGLP 325
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/205 (43%), Positives = 122/205 (59%), Gaps = 17/205 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
+S DL++CCG+ CG GC GG+P +AW ++ G+VT C Y CSH G
Sbjct: 24 ISAVDLISCCGY-CGFGCQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHG 81
Query: 77 CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
+ Y TP CV+KC + + K + Y + + IM EI NGPVE
Sbjct: 82 SKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEA 141
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+F VYEDF YKSGVY H G ++GGHA++++GWG ++G YW++AN WN WG DGYF
Sbjct: 142 AFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGYF 200
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
K+ RG NECGIE++V AGLP ++
Sbjct: 201 KMLRGKNECGIEDEVTAGLPELSSI 225
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 92/218 (42%), Positives = 132/218 (60%), Gaps = 19/218 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL+CC CG GC+GGYP +AW ++ G+VT
Sbjct: 117 SDRVCIHSNAKVSVE---ISSEDLLSCCDS-CGMGCNGGYPSAAWDFWTTEGLVTGGLYD 172
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRIN 111
C PY G P TP+C +C ++ KH+ ++Y +
Sbjct: 173 SHVGCRPYSIPPCEHHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLP 232
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
S+ + IMAE+ KNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG + G
Sbjct: 233 SEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGVYQHVSGSAVGGHAIKVLGWG-EEGGTP 291
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 292 YWLAANSWNTDWGENGFFKILRGKDHCGIESEMVAGVP 329
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 14/194 (7%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
+S +LL+CC F+CG GC GG P AW ++V GV TE C PY CSH G YP
Sbjct: 150 ISTTNLLSCC-FICGFGCYGGIPAMAWLWWVWVGVTTELCQPY-PFGPCSHHGNSSKYPP 207
Query: 83 -------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
TPKC C N K+ +S+Y I + E + E+ NGP+EV+ VY
Sbjct: 208 CPNTIYNTPKCNTTC--DNVEMELVKYKGVSSYSIKGERE-LDHELMNNGPLEVAMQVYA 264
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSGVYKH++GD +GGHAVKL+GWG DG YW +AN WN WG GYF I+RG+
Sbjct: 265 DFVAYKSGVYKHVSGDHLGGHAVKLVGWGV-KDGIPYWKIANSWNTDWGDKGYFLIQRGN 323
Query: 196 NECGIEEDVVAGLP 209
+ECGIE VAG P
Sbjct: 324 DECGIESSGVAGKP 337
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/204 (44%), Positives = 123/204 (60%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S DL++CC CG GC+GG+P +AW Y+V G+V+ + C PY + C
Sbjct: 138 HFRVSSEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAP-CE 195
Query: 74 H------PGCE-PAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P CE TPKCV+KC N + K Y S+Y I + + I EI NG
Sbjct: 196 HHVNGSRPSCEGEGGKTPKCVKKCQASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNG 255
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVYED +YK GVY H+ G ++GGHA++++GWG +DG YW++AN WN WG
Sbjct: 256 PVEGAFTVYEDLLNYKEGVYHHVHGKMLGGHAIRILGWGV-EDGTKYWLIANSWNSDWGD 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + GIE + AGLP
Sbjct: 315 NGFFKILRGEDHLGIESSIAAGLP 338
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 118/205 (57%), Gaps = 22/205 (10%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
+ LS +DLL+CC CGDGCDGG +W Y+ + G+VT C PY D C+H
Sbjct: 130 VRLSASDLLSCC-TSCGDGCDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPY-DFPACAH 187
Query: 75 PGCEPAYP--------TPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
P YP TPKC + CV + HY S+Y + I EI +
Sbjct: 188 HEASPDYPDCPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNH 247
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVY DF Y+SGVYKH +G V+GGHA+ ++GWGT + G YW++ N WN SWG
Sbjct: 248 GPVEAAFTVYSDFPTYRSGVYKHTSGSVLGGHAISIVGWGT-ESGSPYWLVKNSWNPSWG 306
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG +CGI DVV GLP
Sbjct: 307 DGGFFKILRG--DCGINNDVVGGLP 329
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 170 bits (430), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 124/205 (60%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
+++ LS DLL+CC CG GC GG+P SAW Y+V GVVT C PY
Sbjct: 139 KSVELSAVDLLSCC-RECGLGCLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCE 197
Query: 67 FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
++TG +P C + Y TPKC +KC K + ++ KHY AY + ++ + I EI +
Sbjct: 198 HNTTG-KYPACGQKIYETPKCQKKCQKGYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV FTVY DF +YKSG+YKH+ G +G H V+++GWG + G YW++AN WN WG
Sbjct: 257 GPVGSFFTVYSDFLNYKSGIYKHMKGTEIGVHTVRIVGWGV-EKGTPYWLIANSWNEGWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
GYF+I RG +EC IE V+ GLP
Sbjct: 316 EKGYFRILRGKDECDIESLVIGGLP 340
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 170 bits (430), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 121/202 (59%), Gaps = 15/202 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDS---- 69
+++S DLL CC CG GC GG+P +AW ++ G+V+ + C PY +
Sbjct: 135 QVNISAEDLLDCCD-TCGHGCKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEY 193
Query: 70 -TGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
T C P C P TP+CV C K ++ ++ KH+ Y I+ D + I EI+ NGPV
Sbjct: 194 HTKCRIPNCIPIVHTPECVHHCRKGYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPV 253
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E F VY DF YKSGVY+ + D G HA++++GWGT ++G YW+ AN WN +WG G
Sbjct: 254 EADFHVYGDFLCYKSGVYQRHSNDGRGMHAIRILGWGT-ENGTPYWLAANSWNENWGDKG 312
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YFKI R +NECGIEE + AG+P
Sbjct: 313 YFKILRRTNECGIEEHIYAGIP 334
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 170 bits (430), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 93/204 (45%), Positives = 121/204 (59%), Gaps = 18/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT---EE----CDPYFDSTGC 72
Q+ LS DL++CC CGDGC GG+P AW Y+V G+VT EE C PY C
Sbjct: 101 QSAELSALDLISCCKD-CGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPY-PFPKC 158
Query: 73 SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C Y TP+C + C K + + KHY Y + S+ + I EI
Sbjct: 159 EHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMY 218
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW++AN WN WG
Sbjct: 219 GPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWG 277
Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
G F+I RG +EC IE VVAGL
Sbjct: 278 EKGLFRIVRGRDECSIESHVVAGL 301
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 170 bits (430), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 118/200 (59%), Gaps = 13/200 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q L +S LL+CC CGDGCDGGYP SAW Y+V HG+ + C PY C H G +
Sbjct: 137 VQQLRISAAHLLSCCK-DCGDGCDGGYPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K K+ +Y + +D E+Y NGP V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNDSYVLLHGEDDFKRELYFNGPFVVA 252
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF YK+GVY+H++GD +GGHAV+++GWG +G YW +AN W+ WG +G+F
Sbjct: 253 FQVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 311
Query: 191 IKRGSNECGIEEDVVAGLPS 210
I RG+NECGIE AGLP+
Sbjct: 312 ILRGNNECGIESTGYAGLPA 331
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 170 bits (430), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 123/203 (60%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S DL++CC CG GC+GG+P +AW Y+V G+V+ + C PY S C
Sbjct: 133 HFRVSAEDLVSCC-HTCGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISP-CE 190
Query: 74 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H C TPKCV+KC N + K + S+Y I S + I E++ NGP
Sbjct: 191 HHVNGTRGPCNGEGKTPKCVKKCQASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGP 250
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVYED +YK GVY+H G ++GGHA++++GWG +D + +W++AN WN WG +
Sbjct: 251 VEGAFTVYEDLLNYKEGVYQHTAGKMLGGHAIRILGWGVENDTK-FWLIANSWNSDWGDN 309
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYFKI RGS+ GIE + AGLP
Sbjct: 310 GYFKILRGSDHLGIESSIAAGLP 332
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 169 bits (429), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 89/203 (43%), Positives = 122/203 (60%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
Q+ LS DL++CC CGDGC GG+P AW Y+V G+VT C PY
Sbjct: 139 QSAELSALDLISCCED-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C +KC K + + K+Y Y + S+ + I EI G
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKNYGDQRYNVISNEKAIQREIMMYG 257
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF +YKSG+Y+H+ G ++GGHA+++IGWG + G+ YW++AN WN WG
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGE 316
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
+G F++ RG +EC IE VVAGL
Sbjct: 317 NGLFRMVRGRDECSIESHVVAGL 339
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 91/201 (45%), Positives = 118/201 (58%), Gaps = 12/201 (5%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY------FDSTGC 72
+ +L +S LL+CC F+CG GC GG P AW ++V G+ +E C PY + G
Sbjct: 145 ITDLRVSTGHLLSCC-FVCGMGCQGGIPTMAWLWWVWVGLTSEVCQPYPFPPCGHHTDGG 203
Query: 73 SHPGCEPA-YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+P C Y TP C C + +KH +Y + + E M E+ GP EV+F
Sbjct: 204 KYPACPSTIYDTPTCNSTCADSHTAL--TKHKGEKSYSLRGERE-YMIELMTYGPFEVAF 260
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
VY DF YKSGVY H TG+ +GGHAVKL+GWG +G YW +AN WN WG +GYF I
Sbjct: 261 DVYADFVSYKSGVYSHTTGERLGGHAVKLVGWGV-QNGTPYWKIANSWNSDWGDNGYFLI 319
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++ECGIE VAGLPS K
Sbjct: 320 RRGTDECGIESTGVAGLPSLK 340
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 94/202 (46%), Positives = 120/202 (59%), Gaps = 21/202 (10%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H
Sbjct: 139 VRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 195
Query: 75 ----PGCEPAYPTPKCVRKCV-KKNQL--WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C Y TP C C KK L +R + Y +S E E+ NGP
Sbjct: 196 SSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSYVLSG------EEPFKRELILNGPF 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
EVSF+VY DF Y GVYKH+ G +GGHAV+++GWG +GE YW +AN WNR WG +G
Sbjct: 250 EVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWG-ELNGEPYWKIANSWNREWGMNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF I RG +ECGIE VAG P
Sbjct: 309 YFLIARGVDECGIEGSGVAGTP 330
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 121/203 (59%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
Q+ LS DL++CC CGDGC GG+P AW Y+V G+VT C PY
Sbjct: 139 QSAELSALDLISCCKD-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C K + + KHY +Y + ++ + I +I G
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYG 257
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW++AN WN WG
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 316
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G F++ RG +EC IE DVVAGL
Sbjct: 317 KGLFRMVRGRDECSIESDVVAGL 339
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 169 bits (429), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/205 (44%), Positives = 126/205 (61%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
+++ LS DLL+CC CG GC GG+P +AW Y+V G+VT C PY
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197
Query: 67 FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+TG +P C E Y TPKC +KC K + + K+Y +Y + ++ I EI +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYGKDKYYGRMSYNVLNNENAIKKEIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTV+ DF +YKSG+YK++TG +GGHAV++IGWG + YW++AN WN WG
Sbjct: 257 GPVEAAFTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
GYF+I RG +ECGIE +V GLP
Sbjct: 316 EKGYFRILRGKDECGIESEVTGGLP 340
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 169 bits (429), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 91/199 (45%), Positives = 117/199 (58%), Gaps = 17/199 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
+S DLL CCG CG GC+GG+P AW YF + G+VT + C PY C H
Sbjct: 126 ISTEDLLTCCGITCGMGCNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPY-TFPPCDHHV 184
Query: 75 -----PGCEPAYPTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
C + PTP CV+ C ++ + + + K SI +Y ++S E I EI GPVE
Sbjct: 185 DDGKYGPCGDSQPTPACVKSCTAQSGRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVE 244
Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
SFTVYEDF YKSGVY+++ G +GGHAVK+IGWG + YW++ N WN WG +G
Sbjct: 245 ASFTVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKN-VPYWLVVNSWNEGWGENGL 303
Query: 189 FKIKRGSNECGIEEDVVAG 207
FKI RGSN GIE + AG
Sbjct: 304 FKILRGSNHVGIEGGIYAG 322
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 169 bits (428), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 96/225 (42%), Positives = 125/225 (55%), Gaps = 20/225 (8%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+ T T D + + LQ S+S DLL CC CG+GC GGYP +AW+Y GV T
Sbjct: 118 FAATETYSDRICIASNQELQT-SISSEDLLECCA-TCGNGCQGGYPSAAWKYMKATGVST 175
Query: 61 -------EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSI 105
C PY C H P C P PTPKCV++C + + ++ H+
Sbjct: 176 GGLYGDDSSCKPYVFPP-CDHHVVGQYPPCGPIKPTPKCVKQCNSQYTEKTYQQDLHHPS 234
Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWG 164
Y++ ++ E I EI +GPV+ SF V DF YKSGVY + GGH+VK+IGWG
Sbjct: 235 KVYQLPNNAEAIQREIMAHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVKIIGWG 294
Query: 165 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+ G YW++AN WN WG +G FK+ RG NECGIE +VVAGLP
Sbjct: 295 V-EQGTPYWLIANSWNEDWGENGLFKMLRGKNECGIEAEVVAGLP 338
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 169 bits (428), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 118/205 (57%), Gaps = 19/205 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
QN +S DL +CC CG+GC+GG+ AW Y+ G+VT + C PY C
Sbjct: 163 QNAHISAEDLTSCC-RSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPY-TVKAC 220
Query: 73 SH-------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P + TP C +C N + KHY +AY + + IM EI N
Sbjct: 221 DHHVVGKLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTN 279
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVY DF YKSGVYKH TG +GGHA+K++GWGT + G+DYW++AN WN WG
Sbjct: 280 GPVEGAFTVYADFPQYKSGVYKHTTGSPLGGHAIKIMGWGT-EGGDDYWLVANSWNPDWG 338
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G FKI RG +ECGIE + AG P
Sbjct: 339 NQGTFKILRGRDECGIESQIAAGEP 363
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 169 bits (427), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 95/226 (42%), Positives = 131/226 (57%), Gaps = 45/226 (19%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
+SV D+L+CCG CG+GC GGYP+ +++++ GVVT C PY CS C
Sbjct: 129 ISVEDILSCCGSSCGEGCKGGYPLEGLKFWMNSGVVTGGDYNGTGCQPY-TFPPCSS--C 185
Query: 78 EPAYPTPKCVRKC--------VKKNQLWRNSKH---------YSI--------SAYRINS 112
E + TP C +KC K ++ + N + Y + SAYR+++
Sbjct: 186 EASKSTPSCQKKCQTGYLEATYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLST 245
Query: 113 DPED----------IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 162
I EIY NGPVEVS+ V+EDF YKSGVY +++G + G HAVK+IG
Sbjct: 246 TTSSNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQYKSGVYHYVSGKLTGAHAVKIIG 305
Query: 163 WGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
WGT ++ DYW++AN W +G G+FKI+RG+NECGIEE+VVAGL
Sbjct: 306 WGT-ENKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGL 350
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 118/200 (59%), Gaps = 14/200 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+++L +S DLL+CC CGDGCDGGYP AW YF G+V++ C PY C H G
Sbjct: 138 VRDLGISAGDLLSCCT-SCGDGCDGGYPDEAWLYFTESGLVSDYCQPY-PFPPCKHSGGR 195
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K ++++ +Y + + ED E+Y GP EV+
Sbjct: 196 SKNPSCHDMHFHTPKCNATCTDKRIP--VVRYFASESYSLQGE-EDYKRELYLRGPFEVA 252
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
FTVYEDF Y+SGVYKH++G +GGHAV+++GWG +G YW +AN WN WG +GY
Sbjct: 253 FTVYEDFLAYESGVYKHVSGGPVGGHAVRVVGWG-ERNGVPYWKIANSWNTDWGENGYLY 311
Query: 191 IKRGSNECGIEEDVVAGLPS 210
RG +ECGIE AG PS
Sbjct: 312 FYRGKDECGIESQGSAGTPS 331
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 121/202 (59%), Gaps = 16/202 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ S DLL CC CG GC+GG P +AW Y+V G+V+ + C PY C
Sbjct: 143 HFHFSAEDLLTCCSS-CGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEP-CE 200
Query: 74 HPGCEPAYP-----TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
H P TP+CV++C + + + +H+ SAY + + I E+ NGP
Sbjct: 201 HHVNGTRKPCGEGDTPRCVKRCEEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPA 260
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E + TVY+DF HY++GVY+H++G +GGHAV+L+GWG +DG YW+LAN WN WG +G
Sbjct: 261 EAALTVYDDFLHYRTGVYQHVSGGALGGHAVRLLGWGV-EDGTPYWLLANSWNYDWGDNG 319
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF+I RG +ECGIE D+ GLP
Sbjct: 320 YFRILRGQDECGIESDINGGLP 341
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/217 (42%), Positives = 130/217 (59%), Gaps = 21/217 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----- 60
T+R + S V N LS DL +CC CG+GC+GG+ AW Y G+VT
Sbjct: 124 TDRLCIQSKGIV---NAHLSAEDLTSCC-RTCGNGCNGGFLEGAWNYLKRDGIVTGGPYN 179
Query: 61 --EECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
+ C PY + C H C+ PTP+C ++C N + +H++ + + +
Sbjct: 180 SHQGCLPY-EIKACDHHVVGKLQPCKGDGPTPRCKKECESGYNNTYSKDEHHAKTVHAVE 238
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
E IM EI NGPVE +FTVY DF YKSGVY+H +G +GGHA+K +GWG ++DG+D
Sbjct: 239 G-VEQIMTEIMTNGPVEAAFTVYSDFPTYKSGVYEHKSGGPLGGHAIKTLGWG-NEDGKD 296
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
YW++AN WN WG +G+FKI RG +ECGIE ++VAG+
Sbjct: 297 YWLVANSWNPDWGDNGFFKILRGRDECGIESNIVAGM 333
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 116/203 (57%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
+++S DLL CC + C GC GG P AW ++ G+VT + C PY +
Sbjct: 133 QVNISAQDLLTCCDY-CRTGCKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRY 191
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
+TG P P P C R+C K + + KHY Y ++ D I EI+KNGP
Sbjct: 192 TTTGLLPPPINDLSPMPPCKRECRKSYGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGP 251
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE F VY DF YKSGVY+ + G HA++++GWGT ++G YW+ AN W WG
Sbjct: 252 VEADFAVYADFYSYKSGVYQAHSRVRCGSHAIRILGWGT-ENGVPYWLAANSWTEHWGDK 310
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYFKI+RG+NECGIEED+ AG+P
Sbjct: 311 GYFKIRRGNNECGIEEDINAGIP 333
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 127/205 (61%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY------ 66
+++ LS DLL+CC CG GC GG+P +AW Y+V G+VT C PY
Sbjct: 139 KSVELSAVDLLSCCT-ECGLGCQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCE 197
Query: 67 FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+TG +P C E Y TPKC +KC K + ++ K+Y +Y + ++ I EI +
Sbjct: 198 HHTTG-KYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVEV+FTV+ DF +YKSG+YK++TG +G HAV++IGWG + YW++AN WN WG
Sbjct: 257 GPVEVAFTVHSDFLNYKSGIYKYMTGAEIGEHAVRIIGWGV-EKKTPYWLIANSWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
GYF++ RG +ECGIE V +GLP
Sbjct: 316 EKGYFRMLRGKDECGIESAVTSGLP 340
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/202 (44%), Positives = 118/202 (58%), Gaps = 16/202 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
QN+ LS DLL+CC CGDG +GG+P AW Y+V G+VT C PY
Sbjct: 116 QNVELSAVDLLSCCEH-CGDGFEGGFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCE 174
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C E Y TP C C K + + KH S Y + +D + I EI K G
Sbjct: 175 HHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYG 234
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF +YKSG+YKHITG ++ HA+++IGWG ++ YW++ N WN WG
Sbjct: 235 PVEANFIVYEDFLNYKSGIYKHITGKLVSWHAIRIIGWGV-ENNTPYWLIPNSWNEDWGE 293
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+G F+I RG +EC IE +V AG
Sbjct: 294 NGNFRILRGRHECSIESEVTAG 315
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 123/202 (60%), Gaps = 17/202 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ + +S D+++CC + CGDGC+GG+PISA+R+ GVVT C PY + C
Sbjct: 140 KQVLISAQDVVSCCTW-CGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPY-EIHPC 197
Query: 73 SHPGCEPAY-------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G E Y TP+C R+C+ S Y AY++ + + I +I KNG
Sbjct: 198 GHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNSVKAIQKDIMKNG 257
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV ++TVYEDFAHY+SG+YKH G G HAVK+IGWG + G YWI+AN W+ WG
Sbjct: 258 PVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWG-EEKGTPYWIVANSWHDDWGE 316
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+G+F++ RGSN+CG EE + AG
Sbjct: 317 NGFFRMHRGSNDCGFEERMAAG 338
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 125/208 (60%), Gaps = 12/208 (5%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
T+R + S ++ + LS +L++CC +CG GCDGGYP A+ Y+ G+ T P
Sbjct: 122 TDRICIES---IAAKQPLLSEEELVSCCK-ICGYGCDGGYPDKAFIYWATRGIPTG--GP 175
Query: 66 YFDSTGCS----HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAE 120
Y + GC E TP C R+C+ + +H+ Y +NS+ E IM E
Sbjct: 176 YGSTKGCKPYSIGSNSEDEAETPLCTRQCINEYPYNLSQDRHFGEKPYWVNSNEEQIMQE 235
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
+YKNGPV V+F VYEDF +Y GVY+H G +GGHAVKLIGWG ++ + YW+++N WN
Sbjct: 236 LYKNGPVVVAFNVYEDFMYYIKGVYEHRFGKFLGGHAVKLIGWGI-ENSKKYWLISNSWN 294
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGL 208
+WG +G+FKI RG N C IE VVAG+
Sbjct: 295 TTWGENGFFKIIRGKNCCAIESYVVAGM 322
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 167 bits (424), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/203 (42%), Positives = 123/203 (60%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGCS 73
L +S D+LACCG CGDGC GG+P AW + +GV T C PY +
Sbjct: 135 KLHVSDTDILACCGEFCGDGCSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGN 194
Query: 74 HP-----GCEP--AYPTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G P ++PTP+C + C + + ++ K Y+ +Y + +D ++I +I KNG
Sbjct: 195 HENQVYYGVCPKGSWPTPRCEKFCQRGYIKPYKKDKFYAKKSYWLPNDEKEIRLDIMKNG 254
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV+ +F VYEDF YK G+YKH G GGHAVK+IGWG D+G DYW++AN W++ WG
Sbjct: 255 PVQAAFDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWG-KDNGTDYWLIANSWSKDWGE 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G+F++ RG N+C IE+ + AG+
Sbjct: 314 SGFFRMVRGENDCEIEDMITAGI 336
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/218 (42%), Positives = 124/218 (56%), Gaps = 22/218 (10%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
+ LS DL++CC + CG+GC GG P +AW Y+ +G+VT C PY
Sbjct: 124 MMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQ 181
Query: 72 CSHPGCEP--------AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 122
C HPG YPTP C C ++ + K Y ++Y ++ IM EI
Sbjct: 182 CRHPGSRSQLNPCPRYTYPTPSCYPYCQAGYDKTYEKDKVYGKTSYNVDRHEYTIMEEIM 241
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
KNGPVE F VY DFA YKSG+Y H++G G HA+++IGWG ++G YW+ AN WN
Sbjct: 242 KNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVKYWLTANSWNVG 300
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITS 220
WG +GYF+I RG++EC IE VVAG+P L K IT+
Sbjct: 301 WGENGYFRILRGTDECRIESIVVAGMP---RLQKNITN 335
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 121/202 (59%), Gaps = 15/202 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYF-----D 68
+ +S DL+ CC CG GC GG +AW+Y+ G+V T+ C PY
Sbjct: 21 QVDISAEDLMDCCD-KCGSGCSGGVSAAAWQYWKDAGLVSGGLYNTTDGCKPYSLAPCEH 79
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S+ S P C PTPKC R+C + + + + K+++ + Y IN + I EI++NGPV
Sbjct: 80 SSQGSLPECVGTLPTPKCKRQCREGYERSYDDDKYFAKNVYSINGSEKQIRTEIFQNGPV 139
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E FT Y DF YKSGVY+H + D++G HA++++GWG S+D YW+LAN WN WG G
Sbjct: 140 EAEFTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWG-SEDNNPYWLLANSWNEDWGDHG 198
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YFK+ RG NEC IE V AG+P
Sbjct: 199 YFKMLRGVNECDIESFVNAGIP 220
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 120/211 (56%), Gaps = 30/211 (14%)
Query: 24 LSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVT------------EECDPYFDS 69
+S +LL+CC F CG GC+GGY AW Y+V G+V+ EC PY
Sbjct: 139 ISSENLLSCCRGTFACGMGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPY-SF 197
Query: 70 TGCSH------PGCE--PAYPTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDI 117
CSH C P + TPKC +C +Q +NS H +S+Y + E I
Sbjct: 198 PPCSHHVQGEYQACTDLPQFNTPKCYTEC--NSQYTQNSYEQDLHKGVSSYSVPKSEEQI 255
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
AEIY+ G SF VY DF Y SGVY++ +G MGGHA+K++GWG ++G YW+ AN
Sbjct: 256 KAEIYQYGSTTASFNVYSDFLTYSSGVYQNTSGSYMGGHAIKMLGWGV-ENGTPYWLCAN 314
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
WN SWG +G+FKI RGSNECGIE +VAG
Sbjct: 315 SWNSSWGENGFFKILRGSNECGIESGMVAGF 345
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 87/205 (42%), Positives = 120/205 (58%), Gaps = 18/205 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S DL++CC CG GC+GG+P +AW Y+ H G+V+ E C PY + C
Sbjct: 140 NFHFSAEDLVSCC-HTCGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEGCRPY-EIEPCE 197
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C+ TP C +C + + KH+ +Y I +P +I EI NGP
Sbjct: 198 HHVNGTRPPCKNGR-TPSCKHQCESSYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNGP 256
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
VE +FTVYED YKSGVYKH+ G +GGHA++++GWG D + YW++ N WN WG
Sbjct: 257 VEGAFTVYEDLILYKSGVYKHVHGKELGGHAIRILGWGVWGDSKVPYWLIGNSWNTDWGD 316
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
+G+F+I RG + CGIE + AGLP+
Sbjct: 317 NGFFRIVRGEDHCGIESAISAGLPA 341
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 121/202 (59%), Gaps = 15/202 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF-----D 68
+ LS +LL+CC CG GC GG +AW Y+ G+V+ + C PY
Sbjct: 131 QVHLSAENLLSCCDS-CGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEH 189
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S S P CE TPKC ++C K + + + Y Y I +D + I AEI KNGP+
Sbjct: 190 SIPGSRPACEGVRDTPKCKKQCEKGYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPI 249
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
S VYED YK+GVY+H+ G+V+GGH +K++GWG +D YW++AN WN WG +G
Sbjct: 250 VASILVYEDLFSYKAGVYQHVAGEVLGGHVIKILGWGVEND-TPYWLVANSWNTDWGNNG 308
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
+FKI RGS+ECGIE+ +VAG+P
Sbjct: 309 FFKILRGSDECGIEDQIVAGIP 330
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 167 bits (423), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 119/200 (59%), Gaps = 13/200 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q L +S LL+CC CGDGCDGGYP +AWRY+V HG+ + C PY C H G +
Sbjct: 137 VQQLRISAAHLLSCCK-DCGDGCDGGYPDAAWRYYVSHGLASSYCQPY-PFPHCGHHGGK 194
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K ++ +Y + +D E+Y NGP V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IEYRGNDSYVLLHGEDDFKRELYFNGPFVVA 252
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F V+ DF YK+GVY+H++GD +GGHAV+++GWG +G YW +AN W+ WG +G+F
Sbjct: 253 FQVFSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 311
Query: 191 IKRGSNECGIEEDVVAGLPS 210
RG+NECGIE + AGLP+
Sbjct: 312 FLRGNNECGIEFEGYAGLPA 331
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 93/219 (42%), Positives = 134/219 (61%), Gaps = 21/219 (9%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL+CC CG GC+GGYP +A ++ G+V+
Sbjct: 117 SDRVCIHSNAKVSVE---ISSEDLLSCCES-CGMGCNGGYPSAACDFWTKEGLVSGGLYD 172
Query: 63 ----CDPYFDSTGCSH------PGCEPAY-PTPKCVRKCVKK-NQLWRNSKHYSISAYRI 110
C PY C H P C+ TP+C +C ++ KH+ +Y +
Sbjct: 173 SHIGCRPY-SIPPCEHHVNGTRPPCKGEEGDTPQCTNQCEPGYTPGYKQDKHFGKRSYSV 231
Query: 111 NSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE 170
SD ++IM E+YKNGPVE +FTVYEDF YKSGVY+H++G +GGHA+K++GWG + G
Sbjct: 232 PSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGVYRHVSGSAVGGHAIKVLGWG-EEGGI 290
Query: 171 DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW+ AN WN WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 291 PYWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAGIP 329
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 122/204 (59%), Gaps = 17/204 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S DLL+CC +CG GC+GG P AW Y+ H G+V+ + C PY + C
Sbjct: 136 KHFHFSAEDLLSCCP-VCGLGCNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPY-EIPPC 193
Query: 73 SH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H PG C TPKC + C + + K Y Y ++S + I AE++KNG
Sbjct: 194 EHHVPGNRVPCNGDSKTPKCHKTCEASYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNG 253
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +FTVY D +YK+GVYKH G+ +GGHA+K++GWG ++G Y ++AN WN WG
Sbjct: 254 PVEGAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGV-ENGNKYRLIANSWNSDWGD 312
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+FKI RG + CGIE +VAG P
Sbjct: 313 NGFFKILRGEDHCGIESSIVAGEP 336
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 166 bits (421), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 92/208 (44%), Positives = 119/208 (57%), Gaps = 28/208 (13%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N+ +S D+ CC CG GC+GGYP +AW ++V GVV+ E C PY
Sbjct: 135 NIHISAEDINDCCKS-CGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDH 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVK------KNQLWRNSKHYSISAYRINSDPEDIMAEI 121
+TG P C PTPKC +KC+ N R K Y + + IM E+
Sbjct: 194 HTTGKYQP-CPAVVPTPKCEKKCLTGYPKSYSNDKTRGKKSYGVRGV------QSIMQEL 246
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
NGPV +F VY DF YK+GVY+H TG GGHAVK+IG+GT + G+DYW++AN WN
Sbjct: 247 VDNGPVTAAFDVYSDFLSYKTGVYRHTTGSYEGGHAVKIIGYGT-ESGQDYWLVANSWNE 305
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
WG G+FKI +G +ECGIE +VAG P
Sbjct: 306 DWGDKGFFKIAKGKDECGIESSIVAGDP 333
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 166 bits (420), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 118/203 (58%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ LS +L+ CCG CG GC GG P SAW Y+ G+V+ E C PY C
Sbjct: 131 QVHLSAENLVTCCGS-CGAGCFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPY-SIAPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C T C ++C K + + HY+ Y D ++I EI KNGP
Sbjct: 189 HHIPGSRPPCRGEGHTADCRKQCEKGYSIPYDKDLHYAEFVYSTERDVKEIQTEILKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F VYED YK GVYKH+ G +GGHA+K++GWG ++G YW++AN WN WG +
Sbjct: 249 VEAAFFVYEDLLTYKEGVYKHVAGAPVGGHAIKILGWGV-ENGTPYWLIANSWNTDWGNN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RGS+ECGIE DV AGLP
Sbjct: 308 GFFKILRGSDECGIEIDVSAGLP 330
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 166 bits (420), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 120/205 (58%), Gaps = 17/205 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
+S DL++CCG+ CG GC GG+P AW ++ G+VT C Y CSH G
Sbjct: 133 ISAVDLISCCGY-CGFGCQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSY-PFPRCSHHG 190
Query: 77 CEP-------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
+ Y TP CV+KC + + K + Y + + IM EI NGPVE
Sbjct: 191 SKKYPPCSHRIYDTPNCVQKCDTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEA 250
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+F VYEDF YKSGVY H G ++GGHA++++GWG ++G YW++AN WN WG DG F
Sbjct: 251 AFQVYEDFLGYKSGVYFHSDGTLLGGHAIRILGWG-EENGVAYWLIANSWNDGWGEDGCF 309
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
K+ RG NECGIE++V AGLP ++
Sbjct: 310 KMLRGKNECGIEDEVTAGLPELSSI 334
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 166 bits (419), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 120/203 (59%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
Q+ LS DL++CC CG GC GG+P AW Y+V G+VT C PY
Sbjct: 76 QSAELSALDLISCCE-DCGQGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 134
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C + C K + + KHY +Y + ++ + I +I G
Sbjct: 135 HHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYG 194
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++IGWG + YW++AN WN WG
Sbjct: 195 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKRTPYWLIANSWNEDWGE 253
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
G F+I RG +EC IE +VVAGL
Sbjct: 254 KGLFRIVRGRDECSIESNVVAGL 276
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 166 bits (419), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 115/204 (56%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
N SLS DLL+CC CG GC GGYP AW Y+ HG+VT D +GC P C
Sbjct: 136 NKSLSAVDLLSCCK-DCGFGCRGGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKC 192
Query: 78 E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
E YPTP+CV++C + + K + +Y I + IM EI G
Sbjct: 193 EHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRG 252
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE FT+YEDF Y SGVY H G M GHAV+++GWG + YW++AN WN WG
Sbjct: 253 PVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGN-VPYWLIANSWNEDWGE 311
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+GY K RG NECGIE+DV AGLP
Sbjct: 312 EGYMKFLRGYNECGIEDDVTAGLP 335
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 166 bits (419), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 86/202 (42%), Positives = 118/202 (58%), Gaps = 18/202 (8%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
+S DLL CC CG GCDGG P + W++++ G+V+ P+ GC EP
Sbjct: 134 FRVSAEDLLTCCTN-CGHGCDGGAPGAGWKHWIEKGLVSG--GPFGSDQGCRPYTIEPCV 190
Query: 82 P-------------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
TPKC++KC+ N + K + S Y I +D I EI+ NGPV
Sbjct: 191 HVENGAQSPCKDSITPKCIKKCLPGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPV 250
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTV++DFA YK G+Y+H +G++ G HAV+++GWG ++G YW+ AN WN WG +G
Sbjct: 251 EATFTVFDDFASYKHGIYQHTSGNLAGEHAVRILGWGV-ENGTKYWLAANSWNSDWGDNG 309
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YFKI RGSN IE +VAGLP
Sbjct: 310 YFKILRGSNHVDIESAIVAGLP 331
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 98/233 (42%), Positives = 129/233 (55%), Gaps = 36/233 (15%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
Q +SV D+L+CCG CG GC GGY I A R++ G VT C PY S
Sbjct: 79 QQPVISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPY--SFAPC 136
Query: 74 HPGCEPAYPTPKCVRKCVK--KNQLWRNSKHYS----------------ISAYRINSDPE 115
C P TP C C K + ++ KHY SAY++ +
Sbjct: 137 TKNC-PESTTPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKS 195
Query: 116 --DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
+I EIY GPVE S+ VYEDF HYKSGVY + +G ++GGHAVK+IGWG ++G DYW
Sbjct: 196 VTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYW 254
Query: 174 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
++AN W S+G G+FKI+RG+NEC IE +VVAG + K T ++ +ED
Sbjct: 255 LIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAG------IAKLGTHSETYED 301
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 115/203 (56%), Gaps = 24/203 (11%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
S DLL+CC CGDGC GG AW+++V GV + PY GC HP
Sbjct: 139 QFSFGAYDLLSCC-HSCGDGCQGGNLGPAWQFWVQRGVSSG--GPYNSRQGC-HP----- 189
Query: 81 YP------------TPKCVRKCVKKNQLWRNS--KHYSISAYRINSDPEDIMAEIYKNGP 126
YP TPKC RKC + S + + AY ++ D E I EI++NGP
Sbjct: 190 YPVDVCHSADEDADTPKCTRKCQSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRNGP 249
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
V+ SF VY DF YK+GVY+H+ G + GGHAVK+IGWG ++G YW+ +N W WG
Sbjct: 250 VQASFDVYLDFKAYKTGVYRHVFGPMEGGHAVKMIGWGV-ENGTKYWLCSNSWGEDWGER 308
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG N CGIE DV AGLP
Sbjct: 309 GFFKIVRGENHCGIESDVHAGLP 331
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 119/200 (59%), Gaps = 14/200 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q L +S LL+CC CG GCDGGYP +AWRY+V HG+ + C PY C H G +
Sbjct: 137 VQQLRISAAHLLSCCKD-CGYGCDGGYPDAAWRYYVSHGLASSYCQPY-PFPHCDHHGGK 194
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K K+ +Y ++ + ED E+Y NGP V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-EDYKRELYFNGPFVVA 251
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF YK+GVY+H++GDV+GGHAV+++GWG +G YW +AN W+ WG +G+F
Sbjct: 252 FQVYSDFFAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 310
Query: 191 IKRGSNECGIEEDVVAGLPS 210
I RG +ECGIE AG P+
Sbjct: 311 ILRGKDECGIEHQGYAGSPA 330
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 120/202 (59%), Gaps = 16/202 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------ 66
+ + LS +D+L+CC G GCDGG+P+SAW+YFV GVVT + C PY
Sbjct: 115 KKVELSADDILSCC-TDGGYGCDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCG 173
Query: 67 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
+ C TP C C + + + K Y +AY +++ I EI G
Sbjct: 174 IHKNETFYSNCTQEIDTPDCKTTCQAGYPISYDDDKTYGKTAYSVSNSVHAIQKEIMTYG 233
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV +FTVY+DF HYK+G+YKH++G GGHAV+++GWG G YW++AN WN WG
Sbjct: 234 PVVAAFTVYDDFFHYKTGIYKHVSGAEAGGHAVRILGWG-QQGGVPYWLVANSWNTDWGE 292
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF+I RGS+ECGIE+ VVAG
Sbjct: 293 NGYFRILRGSDECGIEDGVVAG 314
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/193 (44%), Positives = 116/193 (60%), Gaps = 11/193 (5%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
+S DLL+CCG CG GC G P+ A+R++ GVVT C PY C+ C
Sbjct: 128 ISPTDLLSCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRGSGCKPY-PFAPCTALPC 186
Query: 78 EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
+ TP+C C ++ + K++ AY + D I EI NGPVE +F VY+D
Sbjct: 187 TKS-ETPRCSLNCQPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEI-TNGPVEAAFIVYDD 244
Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
F HY+SGVY+H+ G ++GGHAVK+IGWG +G YW++AN W WG +G+FK+ RG +
Sbjct: 245 FNHYRSGVYRHVAGKLVGGHAVKIIGWGI-QNGAPYWLMANSWGPYWGENGFFKMLRGVD 303
Query: 197 ECGIEEDVVAGLP 209
ECGIE +VAG P
Sbjct: 304 ECGIESTIVAGKP 316
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/202 (43%), Positives = 119/202 (58%), Gaps = 18/202 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S D ++CC CG GCDGG+PI A+ ++ + G VT + C PY C
Sbjct: 144 QMHISSIDFVSCCE-SCGYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCG 201
Query: 74 HPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G C TPKC R+C + + + K Y AY + + I EI KNG
Sbjct: 202 HHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNG 261
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV +FTVYEDF++YK G+YKH G GGHA+K+IGWG +D YW++AN W+ WG
Sbjct: 262 PVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGE 320
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF++ RG NECGIE++VVAG
Sbjct: 321 EGYFRMIRGINECGIEQEVVAG 342
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/200 (44%), Positives = 118/200 (59%), Gaps = 13/200 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q L +S L++CC CGDGC GG P SAW Y+V HG+ + C PY C H G +
Sbjct: 137 VQQLRISAAHLMSCCED-CGDGCKGGAPDSAWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K K+ ++Y + + +D E+Y NGP V
Sbjct: 195 GKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNNSYMLLNGEDDYKRELYFNGPFVVD 252
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF YK+GVY+H++GDV+GGHAV+++GWG +G YW +AN W+ WG +G+F
Sbjct: 253 FGVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 311
Query: 191 IKRGSNECGIEEDVVAGLPS 210
I RG+NECGIE AGLP+
Sbjct: 312 ILRGNNECGIESTGYAGLPA 331
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 117/201 (58%), Gaps = 17/201 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
LS DLL CC CG GCDGG+ AWR+F GV T + C+ Y C H
Sbjct: 122 LSEQDLLTCCD-SCGFGCDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAY-SFPKCEHHA 179
Query: 75 ----PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C + TP+CV++C + + + KH+ AY + + I E+ NGP+EV
Sbjct: 180 EGKYPPCGESQETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEV 239
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
SF VYEDF YKSG+Y+H+ G +GGHAVKL+GWG +DG +YW +AN WN WG +GYF
Sbjct: 240 SFFVYEDFLTYKSGIYQHVAGKYLGGHAVKLVGWGV-EDGIEYWKIANSWNEDWGENGYF 298
Query: 190 KIKRGSNECGIEEDVVAGLPS 210
+I G ECGIE + G+P
Sbjct: 299 RIVAGKGECGIEVGPIGGIPK 319
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 118/203 (58%), Gaps = 17/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ + +S D+L+CCG CG GC+GG+PI A+ YF G VT C PY C
Sbjct: 51 KQVHVSATDILSCCGNQCGYGCNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPY-PFHPC 109
Query: 73 SHPG-------CEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C TPKCVRKC + ++ + AY + + EI KN
Sbjct: 110 GHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAEKATQREIMKN 169
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV +FTVYEDF++YK G+YKH G GGHA+K+IGWG + G YW++AN W+ WG
Sbjct: 170 GPVVGAFTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWG-KEGGVPYWLIANSWHNDWG 228
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
+GYF+I GSN CGIEE+VVAG
Sbjct: 229 ENGYFRILCGSNHCGIEENVVAG 251
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 164 bits (414), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 122/205 (59%), Gaps = 20/205 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-------- 72
+ S +DL++CC CG GC+GG+P +AW Y+ G+V+ PY S GC
Sbjct: 142 HFHFSADDLVSCC-HTCGFGCNGGFPGAAWAYWTRKGIVSG--GPYGSSQGCRPYEIAPC 198
Query: 73 ------SHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ P C+ + TP C +C K + ++ KH+ +Y + + +DI EI +N
Sbjct: 199 EHHVNGTRPPCDGEHGKTPSCRHECQKSYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQN 258
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +FTVYED YK GVY+H+ G +GGHA++++GWG ++ YW++AN WN WG
Sbjct: 259 GPVEGAFTVYEDLILYKDGVYQHVHGRELGGHAIRILGWGV-ENKTPYWLIANSWNTDWG 317
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+G+FK+ RG + CGIE + AGLP
Sbjct: 318 NNGFFKMLRGEDHCGIESAIAAGLP 342
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 164 bits (414), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/200 (44%), Positives = 120/200 (60%), Gaps = 14/200 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q L +S LL+CC CG GCDGGYP +AW Y+V HG+ + C PY C H G +
Sbjct: 137 VQQLRISAAHLLSCCKD-CGYGCDGGYPGTAWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K K+ +Y ++ + +D E+Y NGP V+
Sbjct: 195 GKKPPCSKYDFHTPKCNTTCTDKAIPL--IKYRGNHSYGLDGE-DDYKRELYFNGPFVVA 251
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF YK+GVY+H++GDV+GGHAV+++GWG +G YW +AN W+ WG +G+F
Sbjct: 252 FQVYSDFLAYKTGVYRHVSGDVLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHFL 310
Query: 191 IKRGSNECGIEEDVVAGLPS 210
I RG +ECGIE + AGLP+
Sbjct: 311 ILRGKDECGIESEGYAGLPA 330
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 163 bits (413), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 135/212 (63%), Gaps = 16/212 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CG+GC+GGYP +AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGGQCGEGCNGGYPSAAWNFWTKKGLVSGGLYDSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KHY S+Y + ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKSCEPGYSSSYKEDKHYGYSSYSVPGIEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H+TG++MGGHA++++GWGT ++G YW++AN WN WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGT-ENGTPYWLVANSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G+FKI RG + CGIE ++VAG+P + +I
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPRTDQYWAKI 339
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 163 bits (413), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 122/204 (59%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N LS +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY + C
Sbjct: 138 NFHLSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPY-EIEPCE 195
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP+C C ++ ++ K++ +Y I ++ DI EI NGP
Sbjct: 196 HHVNGTRPPCSSG-STPRCQHVCESSYKVDYKKDKNFGSKSYSIKNNVLDIQKEIMNNGP 254
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YKSGVY+H+ G +GGHA++++GWG D+ YW++AN WN WG
Sbjct: 255 VEGAFTVYEDLILYKSGVYEHVHGKELGGHAIRILGWGVWGDEKIPYWLIANSWNTDWGD 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+F+I RG + CGIE + AGLP
Sbjct: 315 NGFFRIVRGKDHCGIESSISAGLP 338
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 122/203 (60%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S D+L+CCG CG GC+ PI A+R+ VVT + C PY +
Sbjct: 137 RVMISDTDILSCCGISCGYGCEV-LPIEAYRWMQRSVVVTGGKYRQKDVCKPYAFYPCGN 195
Query: 74 H-------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P +PTPKC + C +K N+ + K+++ +Y + S+ I EIYKNG
Sbjct: 196 HTNERYYGPCPRGLWPTPKCRKACQRKYNKSYNEDKYFATRSYYLPSNERSIREEIYKNG 255
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV +F VY+DF++Y+ G+Y H G G HAVK++GWG ++G DYW++AN WN WG
Sbjct: 256 PVVAAFKVYQDFSYYRGGIYVHKWGGQTGAHAVKVVGWG-RENGTDYWLIANSWNTDWGE 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
+GYF+I RGSNECGIE +V+G+
Sbjct: 315 NGYFRIARGSNECGIEGQMVSGV 337
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 123/208 (59%), Gaps = 17/208 (8%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY---- 66
+L N+ LS DLLACC CG GC GG+ AW Y+ +G+VT C PY
Sbjct: 134 TLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPP 192
Query: 67 ---FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEI 121
+ G +P C E Y TP+CV +C K + + K + ++Y + I EI
Sbjct: 193 CRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRSVTTIQKEI 252
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
+ GPVE + VY DFA+Y GVYKH TG+++GGHA++L+GWG +DG YW+ AN WN
Sbjct: 253 WMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANSWNP 312
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
SWG G+F+I RGS+ CGIE DV AGLP
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 162 bits (411), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 118/202 (58%), Gaps = 18/202 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ +S D ++CC C GCDGG+PI A+ ++ + G VT + C PY C
Sbjct: 144 QMHISSIDFVSCCE-SCSYGCDGGWPILAFDFYTYEGAVTGGDYGSKDGCRPY-PFHPCG 201
Query: 74 HPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H G C TPKC R+C + + + K Y AY + + I EI KNG
Sbjct: 202 HHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDAYEVPHSVKAIQREIMKNG 261
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV +FTVYEDF++YK G+YKH G GGHA+K+IGWG +D YW++AN W+ WG
Sbjct: 262 PVVGAFTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVEND-VPYWLIANSWHNDWGE 320
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+GYF++ RG NECGIE++VVAG
Sbjct: 321 EGYFRMIRGINECGIEQEVVAG 342
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 162 bits (410), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 93/208 (44%), Positives = 123/208 (59%), Gaps = 17/208 (8%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY---- 66
+L N+ LS DLLACC CG GC GG+ AW Y+ +G+VT C PY
Sbjct: 134 TLVNVQLSATDLLACCT-TCGFGCVGGWGGMAWDYWRDNGIVTGGEYKDSHTCLPYPFPP 192
Query: 67 ---FDSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEI 121
+ G +P C E Y TP+CV +C K + + K + ++Y + I EI
Sbjct: 193 CRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRSVTAIQKEI 252
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
+ GPVE + VY DFA+Y GVYKH TG+++GGHA++L+GWG +DG YW+ AN WN
Sbjct: 253 WMRGPVEATMNVYTDFANYAGGVYKHTTGELLGGHAIRLLGWGVEEDGTPYWLAANSWNP 312
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
SWG G+F+I RGS+ CGIE DV AGLP
Sbjct: 313 SWGEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/214 (43%), Positives = 120/214 (56%), Gaps = 27/214 (12%)
Query: 20 QNLSLSVNDLLACC-GFLCG--DGCDGGYPISAWRYFVHHGVVT----------EECDPY 66
Q L LS D ACC GF CG GC+GG P SAW++F GVVT C PY
Sbjct: 343 QLLVLSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPY 402
Query: 67 --------FDSTGCSHPGC-EPAYPTPKCVRKCVKKN---QLWRNSKHYSISAYRINSDP 114
D +P C + YPTP+C+ +C + N + K + AY + +
Sbjct: 403 EFMPCAHHVDPGASGYPACPDGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL-AGI 461
Query: 115 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGEDYW 173
E+I ++ K G V +F+V+ DF Y GVY H +G MGGHAVK+IGWGT + GEDYW
Sbjct: 462 ENIQRDMMKYGSVTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYW 521
Query: 174 ILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
++AN WN SWG G F+I RG NECGIE +VAG
Sbjct: 522 LIANSWNPSWGEGGLFRILRGVNECGIEGQIVAG 555
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/208 (42%), Positives = 119/208 (57%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC CG GCDGGY + +W Y+V HG+VT + TGC P
Sbjct: 139 QSVELSAIDLISCCKN-CGSGCDGGYFLPSWDYWVSHGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 116/203 (57%), Gaps = 18/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q++ LS DL++CC CG GCDGG+P AW Y+V HG+VT C PY C
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 196
Query: 73 SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C + Y TP+C RKC K + + KHY A + + I EI
Sbjct: 197 EHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPYEHDKHYGGIAINVIKNELAIQKEIMMY 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE ++EDF +YKSG+YK+ TG +G H V++IGWG ++G YW+ AN WN WG
Sbjct: 257 GPVEAYLLIFEDFLNYKSGIYKYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
GYF+I RG NEC IE VVAG
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAG 338
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 115/204 (56%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY + + C
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC C + + KH+ +Y + + DI EI NGP
Sbjct: 195 HHVNGTRPPCAHGGATPKCSHVCQSSYTVDYAKDKHFGSKSYSVRRNVRDIQEEIMTNGP 254
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YK GVY+H G +GGHA++++GWG D+ YW++ N WN WG
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNTDWGD 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+F+I RG + CGIE + AGLP
Sbjct: 315 QGFFRILRGQDHCGIESSISAGLP 338
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 161 bits (408), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 119/225 (52%), Gaps = 19/225 (8%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+ T T D + + +LQ S+S DLL CC CG GC GGYP +AW Y GV T
Sbjct: 119 FAATETFSDRICIASNQTLQT-SISSEDLLECCADYCGMGCKGGYPSAAWGYMKRQGVST 177
Query: 61 -------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSI 105
C PY TG P C P PTP+CV++C + + H++
Sbjct: 178 GGLYGDDTSCKPYIFPPCDHHVTGQYQP-CGPIQPTPQCVKECNSEYTQNTYEKDLHFAS 236
Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWG 164
Y I + + I EI +GPV+ SF V DF YKSGVY ++ GGH+VK+IGWG
Sbjct: 237 QTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGVYIRNPKLKYEGGHSVKIIGWG 296
Query: 165 TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+ YW++AN WN WG G F++ RG NECGIE +VAGLP
Sbjct: 297 -KEGNTPYWLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVAGLP 340
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 113/203 (55%), Gaps = 19/203 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
N SLS DLL+CC CG GC GGYP AW Y+ HG+VT D +GC P C
Sbjct: 136 NKSLSAVDLLSCCKD-CGFGCRGGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKC 192
Query: 78 E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
E YPTP+CV++C + + K + +Y I + IM EI G
Sbjct: 193 EHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRG 252
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE FT+YEDF Y SGVY H G M GHAV+++GWG + YW++AN WN WG
Sbjct: 253 PVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGELGN-VPYWLIANSWNEDWGE 311
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
+GY K RG NECGIE+DV A L
Sbjct: 312 EGYMKFLRGYNECGIEDDVTAVL 334
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 119/202 (58%), Gaps = 21/202 (10%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
LS DL++CC + CG GC+GGYP AW Y+ HG+V+ C PY CSH
Sbjct: 139 LSAVDLVSCCPY-CGYGCEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPY-PFPKCSHLE 196
Query: 75 --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
PG P Y TPKC ++C ++ K S+Y + DIM EI NGPV
Sbjct: 197 ETPGLAPCPRELYATPKCEKQCQAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPV 256
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+ ++EDF YKSG+Y++ +G +MGGH + IGWG ++G YW+ AN WN WG +G
Sbjct: 257 STIYYIFEDFTVYKSGIYQYTSGSLMGGHGI--IGWGV-ENGVKYWLAANSWNEGWGENG 313
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF+I+RG+NECGIE + AGLP
Sbjct: 314 YFRIRRGTNECGIESRINAGLP 335
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 121/204 (59%), Gaps = 19/204 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ + +S D+L+CC CGDGCDGGY I A+++F G VT + C PY C
Sbjct: 140 KQVYVSATDILSCC-HSCGDGCDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPY-PFHPC 197
Query: 73 SHPGCEPAY-------PTPKCVRKCVKKNQL-WRNSKHYSISAYRIN-SDPEDIMAEIYK 123
H G E Y TP+CVRKC + + + + AYR+ + I EI +
Sbjct: 198 GHHGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAYRLPIGSVKAIQKEIMR 257
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPV +F V++DF+ Y+ G+Y H+ G GGHAVK+IGWGT + G YWI+AN W+ W
Sbjct: 258 NGPVVAAFIVFDDFSFYRKGIYAHVAGSPRGGHAVKIIGWGT-EHGVPYWIIANSWHSDW 316
Query: 184 GADGYFKIKRGSNECGIEEDVVAG 207
G DGYF++ RG N+CGIE +VVAG
Sbjct: 317 GEDGYFRMVRGINDCGIETNVVAG 340
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 125/202 (61%), Gaps = 16/202 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVVT-------EECDPY-FDSTG 71
NL L+ DL+ CC CG+GC+GG+ +A++Y+V G+V+ E C PY F+
Sbjct: 141 NLELATEDLMGCCK-DCGNGCNGGFLDGTAFQYWVDAGLVSGAPYNSSEGCKPYPFEP-- 197
Query: 72 CSHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
CS+P GC PKC+ C+ ++ +R K + +AY+I +D I EI NGPV
Sbjct: 198 CSYPFVGCHHEKKNPKCLHHCINGYDRKYRKDKFFGATAYKIPNDARMIQLEIMTNGPVA 257
Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
F V+EDF Y SGVYKH+ G +G HA++++GWGT ++G YW++AN + +WG G+
Sbjct: 258 TGFEVFEDFYFYHSGVYKHVVGKKVGMHAIRIVGWGT-ENGTPYWLIANSYGDTWGDKGF 316
Query: 189 FKIKRGSNECGIEEDVVAGLPS 210
FK+ RGSN GIE V+AGLP
Sbjct: 317 FKMLRGSNHLGIESTVIAGLPQ 338
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/227 (42%), Positives = 143/227 (62%), Gaps = 19/227 (8%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + ++ +VS++ +S DLL CCG CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNGHVSVE---VSAEDLLTCCGGQCGDGCNGGYPAEAWNFWTRKGLVSGGLYE 174
Query: 63 ----CDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KH+ ++Y +
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPACTGEGDTPKCSKTCEPGYSPTYKEDKHFGYTSYSLP 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
++ +IMAEIYKNGPVE +F+VY DF YKSGVY+H+TGD+MGGHA++++GWG ++G
Sbjct: 234 TNEWEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHLTGDMMGGHAIRILGWG-EENGVP 292
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG G+F+I RG + CGIE +VVAG+P + ++I
Sbjct: 293 YWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPRTDQYWEKI 339
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 83/178 (46%), Positives = 111/178 (62%), Gaps = 13/178 (7%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR-KC----VKKN 94
GC+GG+ +A+ + +G++ E+C PY C HPGC +PTPKC + KC K
Sbjct: 143 GCNGGWMSTAFGFMQSNGILGEDCIPY-QMGKCKHPGCS-TWPTPKCNKTKCYPNDTKST 200
Query: 95 QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
+LW ++ S+Y + S+ DI EIY+NGPV SF VYED + Y+SGVY+H+TG G
Sbjct: 201 ELW-----HAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVYQHVTGGFEG 255
Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
HA+K++GWG DG YW + N W WG DG I+RG +ECGIE DVVAG P K
Sbjct: 256 LHAIKVVGWGIL-DGVKYWTIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPKLK 312
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 160 bits (405), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/203 (43%), Positives = 116/203 (57%), Gaps = 18/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q++ LS DL++CC CG GCDGG+P AW Y+V HG+VT C PY C
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 196
Query: 73 SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C + Y TP+C RKC K + + KHY + + + I EI
Sbjct: 197 EHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQKEIMMY 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE ++EDF +YKSG+Y++ TG +G H V++IGWG ++G YW+ AN WN WG
Sbjct: 257 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
GYF+I RG NEC IE VVAG
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAG 338
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 160 bits (405), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 121/205 (59%), Gaps = 16/205 (7%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A++S+ + N+ LS DL++C GCDGGYPI+AW Y GVVT+ C P
Sbjct: 117 SDRLAIASNNSI---NVVLSPQDLVSCDS--TDYGCDGGYPINAWHYMQSLGVVTDTCYP 171
Query: 66 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
Y G S TP C K + +AY++ ++ I +EI NG
Sbjct: 172 YTSGNGDSGTCQITGKKTPACATATFYKAK----------TAYQVANNMAAIQSEILANG 221
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE +F+VY+DF Y SGVY H +G + GGHAVK++GWG D YWI+AN W SWG
Sbjct: 222 PVEAAFSVYDDFFSYTSGVYSHQSGALDGGHAVKIVGWGV-DGTTPYWIVANSWGTSWGQ 280
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
G+F IKRG++ECGIE+ +VAGL +
Sbjct: 281 AGFFWIKRGNDECGIEDGIVAGLAA 305
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 114/204 (55%), Gaps = 14/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S DL+ACC CG GC+GGYP +AW Y+V HG+ + +C PY C H G +
Sbjct: 139 KQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGITSSQCQPY-PFPRCEHRGAQG 196
Query: 80 AYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
P TP+C C K+ K+ +Y + + ED E+Y NGP V F
Sbjct: 197 KKPPCSKYKFVTPQCNATCTDKSVPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRF 253
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
V+ DF YKSGVY+H+ G+ +GG AV+++GWG +G YW +AN W+ WG +GYF I
Sbjct: 254 QVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLI 312
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG NEC IE AG P L
Sbjct: 313 LRGDNECNIEHLGFAGTPDPSQLA 336
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 88/207 (42%), Positives = 118/207 (57%), Gaps = 18/207 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT C PY C
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPY-PFPKC 196
Query: 73 SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H C + Y TP+C + C K N + KHY +Y + S I +I +
Sbjct: 197 DHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN WG
Sbjct: 257 GPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSS 211
GYF+I RG NEC IE ++ AGL S
Sbjct: 316 EKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 89/203 (43%), Positives = 116/203 (57%), Gaps = 18/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q++ LS DL++CC CG GCDGG+P AW Y+V HG+VT C PY C
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 196
Query: 73 SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C + Y TP+C RKC K + + KHY + + + I EI
Sbjct: 197 EHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQNEIMMY 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE ++EDF +YKSG+Y++ TG +G H V++IGWG ++G YW+ AN WN WG
Sbjct: 257 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
GYF+I RG NEC IE VVAG
Sbjct: 316 EKGYFRIVRGRNECSIESVVVAG 338
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 88/205 (42%), Positives = 118/205 (57%), Gaps = 18/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+++ LS DLL+CC CG GC G+P AW Y+V G+VT C PY C
Sbjct: 139 KSVELSAVDLLSCC-IECGLGCQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPY-PFPKC 196
Query: 73 SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C E Y PKC +KC K + + K+Y +Y + + + I EI +
Sbjct: 197 EHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYEKDKYYGKVSYNLLKNEDSIKKEIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE SF V+ DF +YKSG+YKH+TG +G H V++IGWG + YW++AN WN WG
Sbjct: 257 GPVEASFRVHSDFLNYKSGIYKHMTGIDIGSHVVRIIGWGVEKE-TPYWLIANSWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
GYF++ RG +ECGIE V +GLP
Sbjct: 316 EKGYFRMLRGKDECGIESAVTSGLP 340
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 117/203 (57%), Gaps = 18/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q++ LS DL++CC CG GCDGG+P AW Y+V HG+VT C PY C
Sbjct: 113 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 170
Query: 73 SH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C + Y TP+C RKC K + + + KHY + + + I EI
Sbjct: 171 EHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNESAIQKEIMMY 230
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE ++EDF +YKSG+Y++ TG +G H V++IGWG ++G YW+ AN WN WG
Sbjct: 231 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGI-ENGTAYWLAANTWNEDWG 289
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
GYF+I RG NEC +E VVAG
Sbjct: 290 EKGYFRIVRGRNECSVESVVVAG 312
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 91/196 (46%), Positives = 114/196 (58%), Gaps = 11/196 (5%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDG-----GYPISAWRYFVHHGVVTEE-CDPYFDSTGCSH 74
+++S DLLACC CG GCDG I R V V TE+ C PY S
Sbjct: 137 QVNISAEDLLACC-HTCGHGCDGRCHCSSVAILQGRRLVPEPVRTEDGCQPY--SLPPCV 193
Query: 75 PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
P C PTPKC C K + + KH++ + YR+ + I +IYKNGPVE +F V
Sbjct: 194 PNCTHPEPTPKCQHVCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFV 253
Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
Y DF YKSGVY+ MG HA+K++GWGT +DG YW++AN WN WG GYFKI R
Sbjct: 254 YADFPSYKSGVYQQHMIKFMGVHAIKILGWGT-EDGVPYWLVANSWNVGWGDKGYFKILR 312
Query: 194 GSNECGIEEDVVAGLP 209
G +ECGIEE + AG+P
Sbjct: 313 GKDECGIEEVIDAGIP 328
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 114/204 (55%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ G+V+ C PY + C
Sbjct: 138 NFRFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPY-EIAPCE 195
Query: 74 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H C TPKC +C N + KH+ +Y + + DI EI NGP
Sbjct: 196 HHVNGTRAPCNHDSKTPKCQHQCEAGYNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTNGP 255
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
VE +FTVYED YKSGVY+H G +GGHA++++GWG E YW++AN WN WG
Sbjct: 256 VEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGKEEVPYWLIANSWNDDWGD 315
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+F+I RG + CGIE + AGLP
Sbjct: 316 KGFFRILRGEDHCGIESSISAGLP 339
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 159 bits (403), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 88/207 (42%), Positives = 118/207 (57%), Gaps = 18/207 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT C PY C
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPY-PFPKC 196
Query: 73 SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H C + Y TP+C + C K N + KHY +Y + S I +I +
Sbjct: 197 DHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMH 256
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN WG
Sbjct: 257 GPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWG 315
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSS 211
GYF+I RG NEC IE ++ AGL S
Sbjct: 316 EKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 159 bits (403), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 86/193 (44%), Positives = 117/193 (60%), Gaps = 16/193 (8%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
DLL+CC CG GC GG AW+++V G+ + + C PY C PG +
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239
Query: 81 YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
TPKC KC +W++ +HY AY + +D IM EI+ NGPV+ +F Y D
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
YKSG+Y+H+ G + GGHAVKL+GWG ++G YW++AN W R WG +G+FKI RG N
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKIVRGENH 355
Query: 198 CGIEEDVVAGLPS 210
CGIEE++ AGLP+
Sbjct: 356 CGIEENIHAGLPN 368
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 159 bits (402), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 92/198 (46%), Positives = 115/198 (58%), Gaps = 20/198 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+N+ LS D++ C +GC+GG SAW + G V+EEC PY + P C P
Sbjct: 126 ENVQLSFMDMVTCDE--TDNGCEGGDAFSAWNWLRKQGAVSEECLPY------TIPTCPP 177
Query: 80 AYP-------TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
A TP C ++C + L + KH Y +SD E IM EI NGPVE F
Sbjct: 178 AQQPCLNFVNTPSCTKECQSNSSLIYSQDKHKMAKIYSFDSD-EAIMQEIVTNGPVEACF 236
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
TV+EDF YKSGVY H TG +GGH VKL+G+GT +G DY+ NQW SWG +G F I
Sbjct: 237 TVFEDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTL-NGVDYYAANNQWTTSWGDNGTFLI 295
Query: 192 KRGSNECGIEEDVVAGLP 209
KRG +CGI +DVVAGLP
Sbjct: 296 KRG--DCGISDDVVAGLP 311
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 159 bits (402), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 111/204 (54%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
N SLS DLL+CC CG GC GGYP AW Y+ HG+VT D +GC P C
Sbjct: 136 NKSLSAVDLLSCCEN-CGYGCSGGYPAVAWDYWGAHGIVTGGSKE--DPSGCRSYPFPKC 192
Query: 78 E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
E YPTP+CV+ C + K + +Y I S IM EI G
Sbjct: 193 EHHVQGHYPPCPHQYYPTPECVQHCDTPGIDYVKDKTRANMSYNIYSSEILIMKEIMLRG 252
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE FTVYEDF YK GVY H G + HA++++GWG D YW++AN WN WG
Sbjct: 253 PVEAVFTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEEGD-VPYWLIANSWNEDWGE 311
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
GY K RG NECGIE+DV AGLP
Sbjct: 312 KGYMKFLRGLNECGIEDDVTAGLP 335
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 159 bits (402), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 115/204 (56%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY + + C
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC C + + KH+ +Y + + +I EI NGP
Sbjct: 195 HHVNGTRPPCAHGGGTPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGP 254
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YK GVY+H G +GGHA++++GWG D+ YW++ N WN WG
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNTDWGD 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+F+I RG + CGIE + AGLP
Sbjct: 315 HGFFRILRGQDHCGIESSISAGLP 338
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 159 bits (402), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 88/207 (42%), Positives = 122/207 (58%), Gaps = 19/207 (9%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
DLL+CC CG GC GG AW+++V G+ + + C PY C PG +
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239
Query: 81 YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
TPKC KC +W++ +HY AY + +D IM EI+ NGPV+ +F Y D
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
YKSG+Y+H+ G + GGHAVKL+GWG ++G YW++AN W R WG +G+FK+ RG N
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENH 355
Query: 198 CGIEEDVVAGLPSSKNLVKEITSADMF 224
CGIEE++ AGLP N ++ +A F
Sbjct: 356 CGIEENIHAGLP---NFHRQGEAAKYF 379
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 159 bits (402), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 88/207 (42%), Positives = 122/207 (58%), Gaps = 19/207 (9%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
DLL+CC CG GC GG AW+++V G+ + + C PY C PG +
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239
Query: 81 YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
TPKC KC +W++ +HY AY + +D IM EI+ NGPV+ +F Y D
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
YKSG+Y+H+ G + GGHAVKL+GWG ++G YW++AN W R WG +G+FK+ RG N
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENH 355
Query: 198 CGIEEDVVAGLPSSKNLVKEITSADMF 224
CGIEE++ AGLP N ++ +A F
Sbjct: 356 CGIEENIHAGLP---NFHRQGEAAKYF 379
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 115/204 (56%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY + C
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EIAPCE 194
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC C + + KH+ +Y + + DI EI NGP
Sbjct: 195 HHVNGTRPPCGHGGGTPKCSHVCESGYTVDYAKDKHFGSKSYSVKRNVRDIQEEIMTNGP 254
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YK GVY+H G +GGHA++++GWG ++ YW++ N WN WG
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHQHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGD 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+F+I RG + CGIE + AGLP
Sbjct: 315 NGFFRILRGQDHCGIESSISAGLP 338
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 121/204 (59%), Gaps = 20/204 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+ + +S D ++CC CG GC+GG+PI A+ Y+ + GVVT C PY C
Sbjct: 143 KQVHISSIDFVSCCD-SCGFGCEGGWPIDAFEYYSYQGVVTGGDYGSKTGCRPY-PFHPC 200
Query: 73 SHPGCEPAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
H G E Y TP+CV++C K KN +R K + Y + + + I EI +
Sbjct: 201 GHHGNETYYGECPKEESTPECVKQCQKGYKNS-YRRDKTWGEDYYEVENSVKAIQREIMR 259
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPV SFTVY+DF++Y G+YKH G G HA+K+IGWGT + YWI+AN W+ W
Sbjct: 260 SGPVVSSFTVYDDFSYYVKGIYKHTAGKARGSHAIKIIGWGT-EKNVPYWIIANSWHNDW 318
Query: 184 GADGYFKIKRGSNECGIEEDVVAG 207
G G+F++ RG+N CGIEEDVVAG
Sbjct: 319 GEKGFFRMVRGTNHCGIEEDVVAG 342
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAIDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 118/204 (57%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
N LS +DL++CC +CG GC+GG+P +AW Y+ G+V T+ C PY + C
Sbjct: 136 NFHLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCE 193
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP C KC + + K++ +Y + + +I EI NGP
Sbjct: 194 HHVNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGP 252
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
VE +FTVYED YKSGVY+H G +GGHA++++GWG + + YW++ N WN WG
Sbjct: 253 VEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNTDWGD 312
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G+F+I RG + CGIE + AGLP
Sbjct: 313 NGFFRILRGQDHCGIESSISAGLP 336
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 189 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 249 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 308 GFFKILRGQDHCGIESEIVAGMP 330
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/199 (43%), Positives = 117/199 (58%), Gaps = 17/199 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
SV D+L CC CG GCDGG+P +AW YFV GVVT C PY S +HP
Sbjct: 146 FSVEDILTCCD-ECGFGCDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPN 204
Query: 77 CEPAY------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
E Y TP C C K + +++ K +Y + + I +I K+GP+
Sbjct: 205 -ETFYRNCTGVSTPSCKTSCQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVA 263
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+F+VYEDF +YK G+Y++ G GGHAV+++GWG ++ + YWI+AN WN WG DG+F
Sbjct: 264 TFSVYEDFMYYKKGIYRYTHGGYEGGHAVRILGWGVENNVK-YWIIANSWNTDWGEDGFF 322
Query: 190 KIKRGSNECGIEEDVVAGL 208
++ RG N+CGIEE V AGL
Sbjct: 323 RMVRGINDCGIEESVSAGL 341
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 115/204 (56%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY + + C
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC C + + KH+ +Y + + +I EI NGP
Sbjct: 195 HHVNGTRPPCANGSGTPKCSHVCQSSYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGP 254
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YK GVY+H G +GGHA++++GWG ++ YW++ N WN WG
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGNEKIPYWLIGNSWNTDWGD 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+F+I RG + CGIE + AGLP
Sbjct: 315 HGFFRILRGQDHCGIESSISAGLP 338
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 83/197 (42%), Positives = 117/197 (59%), Gaps = 17/197 (8%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
+ ++ S DL++CC +CG GC+GG P AW Y+ H G+V+ + C PY +
Sbjct: 90 ATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIP 147
Query: 71 GCSH--PG----CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C H PG C TPKC + C N ++ K Y Y ++ + I AE++K
Sbjct: 148 PCEHHVPGNRMPCNGDTKTPKCQKNCESSYNVPFKKDKRYGKHVYSVSGHEDHIKAELFK 207
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPVE +FTVY D YK+GVYKH G+ +GGHA+K+IGWG ++ + YW++AN WN W
Sbjct: 208 NGPVEAAFTVYSDLLSYKNGVYKHTEGNALGGHAIKIIGWGVENNNK-YWLIANSWNSDW 266
Query: 184 GADGYFKIKRGSNECGI 200
G +G+FKI RG + CGI
Sbjct: 267 GDNGFFKILRGEDHCGI 283
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/198 (43%), Positives = 123/198 (62%), Gaps = 16/198 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
+S D+L CC GC GG+ + A +++ GVVT + C PY CS C
Sbjct: 133 ISPEDILTCC--TNSHGCQGGFVLEAMKFWKSKGVVTGGDFQGDGCIPY-SYGSCSD--C 187
Query: 78 EPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDP--EDIMAEIYKNGPVEVSFTV 133
A TPKC +C K ++ K+Y SAYR+++ I +EI +NGPVE ++ V
Sbjct: 188 HTAQTTPKCKNECQVKYTKNEYKEDKYYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQV 247
Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
YEDF +YKSGVY++I+G MGGHAVK+IGWG ++ +YW++AN W +G +G+FK++R
Sbjct: 248 YEDFYYYKSGVYEYISGRHMGGHAVKIIGWGV-EENVNYWLIANSWGTGFGENGFFKMRR 306
Query: 194 GSNECGIEEDVVAGLPSS 211
G+NECGIE VVAG+ S
Sbjct: 307 GNNECGIENYVVAGMAKS 324
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/214 (42%), Positives = 119/214 (55%), Gaps = 22/214 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----HP 75
+ + S D+L CCG CG GC GG+PI AW++F + GVV+ PY CS HP
Sbjct: 138 KQVYASDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSG--GPYLGKGCCSPYPLHP 195
Query: 76 -----------GCEPAYPTPKCVRKCVKKNQ-LWRNSKHYSI--SAYRINSDPEDIMAEI 121
C PTP C RKC + ++R K Y Y + I +I
Sbjct: 196 CGRHGNDTFYGNCVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDI 255
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HAVKLIGWGTSDDGEDYWILANQWN 180
+ G V F VYEDF+HY+SG+YKH G GG HAVK+IGWG D+G DYW++AN W+
Sbjct: 256 KERGSVVAVFAVYEDFSHYQSGIYKHTAGRFTGGYHAVKMIGWG-KDNGTDYWLIANSWH 314
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
WG +G+F++ RG N CGIEE V AG+ ++L
Sbjct: 315 DDWGENGFFRMIRGINNCGIEEQVDAGIVDVESL 348
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/205 (41%), Positives = 114/205 (55%), Gaps = 17/205 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ +++S D+L CC CG GC GG+ I AW YFV+ GVV+ C PY C
Sbjct: 133 KQVNISSTDILTCCNPQCGFGCGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPY-PIHPC 191
Query: 73 SHPG-------CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G C TP C +KC +++R K AY + E I EI ++
Sbjct: 192 GHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRH 251
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSW 183
GPV SF VYEDF+ YK+GVYKH G + G HAVK++GWG S YW++AN W+ W
Sbjct: 252 GPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKYWLIANSWHNDW 311
Query: 184 GADGYFKIKRGSNECGIEEDVVAGL 208
G +GYF+ RG N+C IE+ V AG+
Sbjct: 312 GENGYFRFIRGINDCEIEDTVAAGI 336
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/197 (45%), Positives = 112/197 (56%), Gaps = 12/197 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPGCEP 79
S DL++CC CGDGC GG AW Y+V GV + PY GC S+P
Sbjct: 148 TFSFGSFDLISCC-HSCGDGCQGGVLGPAWDYWVQKGVSSG--GPYNSKQGCHSYPFDTC 204
Query: 80 AYP-----TPKCVRKCVKKNQLWRNSK--HYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
P PKC RKC + SK + AY + +D IM EI+ NGPV+ +F
Sbjct: 205 HSPDEDDDAPKCSRKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQ 264
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VY DF YKSGVY+H+TG + GGHA+K++GWG ++G YW+ +N W WG G+FKI
Sbjct: 265 VYLDFKTYKSGVYRHVTGPLEGGHAIKILGWGV-ENGTKYWLCSNSWGEDWGDHGFFKIV 323
Query: 193 RGSNECGIEEDVVAGLP 209
RG N GIE DV AGLP
Sbjct: 324 RGENHLGIETDVHAGLP 340
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 115/204 (56%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY + + C
Sbjct: 137 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 194
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC C + + KH+ +Y + + +I EI NGP
Sbjct: 195 HHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGP 254
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YK GVY+H G +GGHA++++GWG ++ YW++ N WN WG
Sbjct: 255 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGD 314
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+F+I RG + CGIE + AGLP
Sbjct: 315 HGFFRILRGQDHCGIESSISAGLP 338
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 106 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 162
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 163 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 222
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 223 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 281
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 282 GEKGYFRIVRGRNECLIESEIAAGLIKS 309
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 51 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 109
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 110 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 169
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 170 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 228
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 229 GFFKILRGQDHCGIESEIVAGMP 251
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYIEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSGESVFQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAGLIKS 342
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMV 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/203 (42%), Positives = 116/203 (57%), Gaps = 19/203 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+++S DLL CC CG GC GGYP +AW Y+ G+VT + C PY+ C
Sbjct: 75 QVNISAQDLLTCC-HQCGMGCFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPP-CE 132
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C PTPKC++ C K + + K+++ + Y ++SD I EIYKNGP
Sbjct: 133 HHTKGPLPNCTDTKPTPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDETQIKTEIYKNGP 192
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE F+VY DF YKSGVY+ + ++ L GW W++AN WN+ WG
Sbjct: 193 VEADFSVYTDFLAYKSGVYQRHSYELWEARHQNL-GWALKR--RSVWLVANSWNQDWGDK 249
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYFKI+RG+NECGIE D+ AG+P
Sbjct: 250 GYFKIRRGNNECGIENDINAGIP 272
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 115/204 (56%), Gaps = 17/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N S +DL++CC CG GC+GG+P +AW Y+ G+V+ + C PY + + C
Sbjct: 127 NFHFSADDLVSCC-HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY-EISPCE 184
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC C + + KH+ +Y + + +I EI NGP
Sbjct: 185 HHVNGTRPPCAHGGRTPKCSHVCQSGYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGP 244
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGA 185
VE +FTVYED YK GVY+H G +GGHA++++GWG ++ YW++ N WN WG
Sbjct: 245 VEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGD 304
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+F+I RG + CGIE + AGLP
Sbjct: 305 HGFFRILRGQDHCGIESSISAGLP 328
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 157 bits (397), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC I+ ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECSIDSEIAAGLIKS 342
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 113/204 (55%), Gaps = 14/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S DL+ACC CG GC+GGYP +AW Y+V HG+ + +C PY C H G +
Sbjct: 139 KQLRISAADLMACCK-DCGGGCEGGYPDAAWEYYVSHGIASSQCQPY-PFPRCEHRGAQG 196
Query: 80 A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+ TP+C C K K+ +Y + + ED E+Y NGP V F
Sbjct: 197 KKTPCSKYKFVTPQCNATCTDKTIPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRF 253
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
V+ DF YK+GVY+H+ G+ +GG AV+++GWG +G YW +AN W+ WG +GYF I
Sbjct: 254 QVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYFLI 312
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG NEC IE AG P L
Sbjct: 313 LRGDNECNIEHLGFAGTPDPSQLT 336
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG+P AW ++ G+V+ C PY C
Sbjct: 3 NVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 61
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 62 HHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 121
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++ N WN WG +
Sbjct: 122 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDN 180
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 181 GFFKILRGQDHCGIESEIVAGMP 203
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 131/203 (64%), Gaps = 16/203 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GG P AW ++ G+V+ C PY C
Sbjct: 51 NVEVSAEDMLTCCGGECGDGCNGGEPSGAWNFWTKKGLVSGGLYNSHVGCRPY-SIPPCE 109
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KH+ S+Y + ++ ++IMAEIYKNGP
Sbjct: 110 HHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGP 169
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +F+VY DF YKSGVY+H++G++MGGHA++++GWG ++G YW++AN WN WG +
Sbjct: 170 VEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDN 228
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI RG + CGIE ++VAG+P
Sbjct: 229 GFFKILRGQDHCGIESEIVAGMP 251
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 82/198 (41%), Positives = 118/198 (59%), Gaps = 17/198 (8%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
+ ++ S DL++CC +CG GC+GG P AW Y+ H G+V+ + C PY +
Sbjct: 91 ATKHFHFSAEDLVSCCP-ICGLGCNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPY-EIP 148
Query: 71 GCSH--PG----CEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYK 123
C H PG C TPKC + C + ++ K Y Y ++ ++I AE++K
Sbjct: 149 PCEHHVPGNRMPCNGDTKTPKCEKTCESSYTVPFKKDKRYGKHVYSVSGHEDNIKAELFK 208
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPVE +FTVY D YKSGVY+H G+ +GGHA+K++GWG ++G YW++AN WN W
Sbjct: 209 NGPVEGAFTVYSDLLSYKSGVYQHTHGNALGGHAIKILGWGV-ENGSKYWLIANSWNSDW 267
Query: 184 GADGYFKIKRGSNECGIE 201
G +G+ KI RG + CGIE
Sbjct: 268 GDNGFLKILRGEDHCGIE 285
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GP E +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPAEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 116/204 (56%), Gaps = 20/204 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAG 207
G GYF+I RG NEC IE ++ AG
Sbjct: 315 GEKGYFRIVRGRNECSIESEIAAG 338
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 156 bits (395), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 112/189 (59%), Gaps = 18/189 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGC-DGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCE 78
N+ LS DLL+C G GC DGG AWRY GVV C PY +TG
Sbjct: 93 NIILSSEDLLSC--DKAGRGCSDGGRLSEAWRYMQKKGVVANRCKPYTSGATGF------ 144
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P+C+ KC + ++ K Y + Y ++ + + I EI NGPVE +FTVY D
Sbjct: 145 ----IPECMSKCTGEGHAYQ--KFYGLYLYTVSGENQ-IKVEIMTNGPVEAAFTVYSDIV 197
Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
HYKSGVY H +G +GGHAVK++GWG D+ E+YW++AN W WG G+FKIKRGS+EC
Sbjct: 198 HYKSGVYHHTSGGKLGGHAVKVLGWGVEDE-EEYWLVANSWGPDWGDQGFFKIKRGSDEC 256
Query: 199 GIEEDVVAG 207
GIE V+ G
Sbjct: 257 GIESRVLTG 265
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 156 bits (395), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 104/147 (70%), Gaps = 2/147 (1%)
Query: 73 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
S P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F
Sbjct: 128 SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 187
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI
Sbjct: 188 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 246
Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEI 218
RG + CGIE +VVAG+P + ++I
Sbjct: 247 LRGQDHCGIESEVVAGIPRTDQYWEKI 273
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 156 bits (395), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 87/207 (42%), Positives = 121/207 (58%), Gaps = 19/207 (9%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPA 80
DLL+CC CG GC GG AW+++V G+ + + C PY C PG +
Sbjct: 182 DLLSCC-HSCGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHPYPIGE-CRIPGEDED 239
Query: 81 YPTPKCVRKC---VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
TPKC KC +W++ +H AY + +D IM EI+ NGPV+ +F Y D
Sbjct: 240 --TPKCSNKCRSGYNVTDVWQD-RHIGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDL 296
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
YKSG+Y+H+ G + GGHAVKL+GWG ++G YW++AN W R WG +G+FK+ RG N
Sbjct: 297 HAYKSGIYRHVWGPLSGGHAVKLLGWGV-ENGVKYWLVANSWGREWGENGFFKMVRGENH 355
Query: 198 CGIEEDVVAGLPSSKNLVKEITSADMF 224
CGIEE++ AGLP N ++ +A F
Sbjct: 356 CGIEENIHAGLP---NFHRQGEAAKYF 379
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 156 bits (395), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 84/200 (42%), Positives = 116/200 (58%), Gaps = 14/200 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q L +S L++CC CGDGCDGGYP ++W Y+V HG+ + C PY C H G +
Sbjct: 137 VQQLRISAAHLMSCCED-CGDGCDGGYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K K+ +Y ++ + +D E+Y NGP V
Sbjct: 195 GKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVV 251
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF YK+GVY+H++GD +GGHAV+++GWG +G YW +AN W+ WG +G+
Sbjct: 252 FWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLL 310
Query: 191 IKRGSNECGIEEDVVAGLPS 210
RG+NECGIE AG P+
Sbjct: 311 FLRGNNECGIEAAGYAGSPA 330
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 112/189 (59%), Gaps = 12/189 (6%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC-DPYFDSTGCSHPGCEPA 80
++LS L+ C L GC GG+PI+AW Y V G++TE+C PY+ C
Sbjct: 132 VTLSAQQLVDCD--LDNSGCSGGWPINAWNYMVKTGLLTEQCYGPYY----AKQYTCRLT 185
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
T C + K + + Y + A + E I +I NGPVE FT+++DF Y
Sbjct: 186 ANTTDCPWQPGVKARFYHAKSAYKLPAKNV----EAIQTDIMNNGPVEADFTIFQDFYAY 241
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
+SG+Y H TG +GGHA+K++GWGT D+ DYW+ AN W +WG GYFKI+RG++ECGI
Sbjct: 242 RSGIYVHATGKQLGGHAIKILGWGTEDN-VDYWLCANSWGANWGIQGYFKIRRGTDECGI 300
Query: 201 EEDVVAGLP 209
E+ + AGLP
Sbjct: 301 EDGLAAGLP 309
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLGIESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 155 bits (393), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 104/147 (70%), Gaps = 2/147 (1%)
Query: 73 SHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
S P C TPKC + C + ++ KHY ++Y +++ +DIMAEIYKNGPVE +F
Sbjct: 8 SRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 67
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN WG +G+FKI
Sbjct: 68 SVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKI 126
Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEI 218
RG + CGIE +VVAG+P + ++I
Sbjct: 127 LRGQDHCGIESEVVAGIPRTDQYWEKI 153
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 155 bits (393), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 78/174 (44%), Positives = 110/174 (63%), Gaps = 16/174 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 24 SVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPY-SIPPCE 82
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TPKC + C + ++ KHY +Y +++ +DIMAEIYKNGP
Sbjct: 83 HHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGP 142
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
VE +F+VY DF YKSGVY+H+TG++MGGHA++++GWG ++G YW++AN WN
Sbjct: 143 VEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGV-ENGTPYWLVANSWN 195
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVKLSAVDLISCCEN-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/205 (45%), Positives = 119/205 (58%), Gaps = 23/205 (11%)
Query: 22 LSLSVNDLLACCGFLCG---DGCDGGYPISAWRYFVHHGVVTE---ECDPYFDSTGCSH- 74
+ LS ACC G GCDGG P SAWR+F HGVV+E C PY + CSH
Sbjct: 180 VPLSAGHTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELDSGCWPY-NFPECSHH 238
Query: 75 ---PGCEPAY---PTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDIMAEIYKN 124
G EP P+P C C +N ++ S +H++ + ++I EI N
Sbjct: 239 VETKGMEPCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDN 296
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV +FTVYEDF +YKSGVYKH+ G +GGHAVK+IGWGT D E YW++ N WN +WG
Sbjct: 297 GPVAAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGT-DQNEQYWLVMNSWNVNWG 355
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
G FKI G ECGI+ +V AG+P
Sbjct: 356 DQGIFKIAIG--ECGIDSEVTAGIP 378
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 89/200 (44%), Positives = 120/200 (60%), Gaps = 16/200 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY--FDSTGCSH 74
LS D+LACC CG GC GG+ I AW YF + GV T + C PY + S+
Sbjct: 143 LSDTDILACCPN-CGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESY 201
Query: 75 PGC-EPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
C + ++PTPKC + C K ++ + + K+Y+ SAYRI + I EI +NGPV SF
Sbjct: 202 GKCPKDSFPTPKCRKICQYKYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFR 261
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD-DGED--YWILANQWNRSWGA-DGY 188
+Y DF Y+ GVY G +GGHA+K+IGWGT +G D YW++AN W WG +GY
Sbjct: 262 IYPDFGFYEKGVYVTSGGRELGGHAIKIIGWGTEKVNGTDLPYWLIANSWGTDWGENNGY 321
Query: 189 FKIKRGSNECGIEEDVVAGL 208
F+I RG N C IE+ V+AG+
Sbjct: 322 FRILRGQNHCQIEQKVIAGM 341
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 124/212 (58%), Gaps = 14/212 (6%)
Query: 4 TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEEC 63
T ++R ++S+ + + S DLLACC CG GC GGY AW+Y+V G+V+
Sbjct: 115 TMSDRLCIASN---ATKKFEFSAQDLLACCK-ECGHGCGGGYSSRAWQYWVTDGIVSG-- 168
Query: 64 DPYFDSTGCSHPGCEPAY---PTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIM 118
+ S GC HP A+ TP C C K + + K Y +YRI + E I
Sbjct: 169 GDFNTSQGC-HPYSVQAFRDSTTPNCSSFCTNPKYQKNYSEDKRYGARSYRIAKNIEQIQ 227
Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 178
AEI +GPV+ S+ VY+DF Y++GVY+H+ G+V G H+VK++GWG ++G DYW++AN
Sbjct: 228 AEIMTSGPVQASYVVYDDFYSYQNGVYQHVLGNVSGRHSVKILGWG-RENGTDYWLVANS 286
Query: 179 WNRSWGA-DGYFKIKRGSNECGIEEDVVAGLP 209
W R WG G+FK RG N C IE +++ G P
Sbjct: 287 WGRDWGRLGGFFKFLRGENHCDIESNILGGDP 318
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 133/212 (62%), Gaps = 16/212 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S D+L CCG CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 130 NVEVSAEDMLTCCGGQCGDGCNGGYPSGAWNFWTKKGLVSGGLYDSHVGCKPY-SIPPCE 188
Query: 74 H------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP+C + C + ++ KHY S+Y ++SD +I AEIYKNGP
Sbjct: 189 HHVNGSRPACTGEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGP 248
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE +FTVY DF YKSGVY+H TGD+MGGHA++++GWG ++G YW++AN WN WG
Sbjct: 249 VEGAFTVYSDFLMYKSGVYQHTTGDIMGGHAIRILGWG-EENGVPYWLVANSWNTDWGDK 307
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
G+FKI RG + CGIE ++VAG+P + ++I
Sbjct: 308 GFFKILRGQDHCGIESEIVAGIPRTDQYWRQI 339
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/201 (43%), Positives = 112/201 (55%), Gaps = 14/201 (6%)
Query: 21 NLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS----- 73
L +S D++ CC DGC GG P + + G V+ Y + GC
Sbjct: 131 QLRISAADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSG--GEYNSTNGCMSYPLP 188
Query: 74 --HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEV 129
+P C+ Y P C ++C K + L + KHY+ AYRI S E I EI KNGPV
Sbjct: 189 RCNPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVA 248
Query: 130 SFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
SFTVY DF HY SGVYK ++GGHAV++IGWG + YW+++N WN WG G
Sbjct: 249 SFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGL 308
Query: 189 FKIKRGSNECGIEEDVVAGLP 209
FKI RG NECGIEE++ AGLP
Sbjct: 309 FKIWRGKNECGIEEEITAGLP 329
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/184 (44%), Positives = 109/184 (59%), Gaps = 13/184 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ LS DL++C + GCDGG +AW Y H G+VT++C PY G +
Sbjct: 55 NVVLSPQDLVSCNWY--NAGCDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVA------- 105
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
P C + C + + K+ + Y + S E IM EI NGPV+ F+VY+DF Y
Sbjct: 106 ---PSCPKYCNGTSTPIDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSY 162
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSGVY H TG +GGHA+K++GWG ++ + YW++AN W WG +G FKIKRG NECGI
Sbjct: 163 KSGVYTHQTGSFLGGHAIKIVGWGVENNVK-YWLVANSWGPDWGLNGLFKIKRGDNECGI 221
Query: 201 EEDV 204
E DV
Sbjct: 222 EADV 225
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 116/208 (55%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC CG GCDGG +W Y+V HG+VT + TGC P
Sbjct: 106 QSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVSHGIVTGGSKE--NHTGCRPYPFPK 162
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 163 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 222
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+G VE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 223 HGTVEAYLEIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 281
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 282 GEKGYFRIVRGRNECLIESEIAAGLIKS 309
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 78/210 (37%), Positives = 119/210 (56%), Gaps = 20/210 (9%)
Query: 19 LQNLSLSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYF-- 67
+N LS +LL+CC F CG+GC+GG P AW+Y HG+ T C PY
Sbjct: 124 FKNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIP 183
Query: 68 ---DSTG-CSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMA 119
+ G ++P C PTP C +KC + +HY +S ++ + +I +
Sbjct: 184 PCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQS 243
Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
++ NGP++ +F VY+DF Y +G+Y H+TG+ G +V++IGWG G YW+ AN W
Sbjct: 244 DVMLNGPIQATFEVYDDFLQYTTGIYVHLTGNKQGHLSVRIIGWGVW-QGVPYWLCANSW 302
Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
R WG +G F++ RG+NECG+E + V+G+P
Sbjct: 303 GRQWGENGTFRVLRGTNECGLESNCVSGMP 332
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 82/185 (44%), Positives = 111/185 (60%), Gaps = 14/185 (7%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSH 74
+LS L CC + CG+GCDGG P +AW +F+ HG+VT + C PY G
Sbjct: 28 NLSAEQLNTCC-YRCGNGCDGGSPEAAWYFFMRHGIVTGGDYESGDGCQPYSIYPRGKGR 86
Query: 75 PGC-EPAYPTPKC-VRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
C + TP C +R C N + +R HY + Y ++ EDIM +IYKNGPV+ +
Sbjct: 87 NTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAA 146
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF +YKSGVY + G + GGHA+K++GWG DD YW+ AN W+RSWG +G F+
Sbjct: 147 FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGV-DDNTKYWLCANSWSRSWGENGLFR 205
Query: 191 IKRGS 195
I RG+
Sbjct: 206 ILRGN 210
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 108/204 (52%), Gaps = 13/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S LL+CC CGDGC GG+P AWRY+V +G+ + C PY C H G +
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196
Query: 80 A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+ TPKC C K K+ + Y + ED E+Y NGP F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
VY D YKSGVY+H+ GD +GG AVK++GWG +G YW LAN W+ WG GY I
Sbjct: 255 YVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLANSWDTDWGMGGYLLI 313
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG+NEC IE AG P + L
Sbjct: 314 LRGNNECNIEHLGFAGTPEASQLT 337
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 108/204 (52%), Gaps = 13/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S LL+CC CGDGC GG+P AWRY+V +G+ + C PY C H G +
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196
Query: 80 A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+ TPKC C K K+ + Y + ED E+Y NGP F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
VY D YKSGVY+H+ GD +GG AVK++GWG +G YW LAN W+ WG GY I
Sbjct: 255 YVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLANSWDTDWGMGGYLLI 313
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG+NEC IE AG P + L
Sbjct: 314 LRGNNECNIEHLGFAGTPEASQLT 337
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 108/204 (52%), Gaps = 13/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S LL+CC CGDGC GG+P AWRY+V +G+ + C PY C H G +
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196
Query: 80 A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+ TPKC C K K+ + Y + ED E+Y NGP F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKAIPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
VY D YKSGVY+H+ GD +GG AVK++GWG +G YW LAN W+ WG GY I
Sbjct: 255 YVYTDLFAYKSGVYRHVDGDFLGGTAVKVVGWGKL-NGTPYWKLANSWDTDWGMGGYLLI 313
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG+NEC IE AG P + L
Sbjct: 314 LRGNNECNIEHLGFAGTPEASQLT 337
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 154 bits (388), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 110/204 (53%), Gaps = 13/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S LL+CC CGDGC GG+P AWRY+V +G+ + C PY C H G +
Sbjct: 139 KQLRISAAHLLSCCK-DCGDGCKGGFPGFAWRYYVEYGITSSSCQPY-PFPRCEHQGAQG 196
Query: 80 A--------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+ TPKC C K+ K+ + Y + ED E+Y NGP F
Sbjct: 197 NKTPCSKYNFDTPKCNATCTDKSVPL--IKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 254
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
VY D YKSGVY+++ GD +GG AVK++GWG +G YW +AN W+ WG DGY I
Sbjct: 255 YVYTDLFAYKSGVYRNVDGDFLGGTAVKVVGWGKL-NGTPYWKVANSWDTDWGMDGYLLI 313
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG+NEC IE AG P + L
Sbjct: 314 LRGNNECNIEHLGFAGTPETSQLT 337
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 113/201 (56%), Gaps = 22/201 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-----------EECDPYFD 68
+L +S +L+ CC CG+GC+GG+ +AW Y+ G+VT + C PY
Sbjct: 127 MHLLISAANLMECCRN-CGNGCEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPY-P 184
Query: 69 STGCSHP--GCEPAYP-----TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
C H G +PA P TP+CV C + HY SAY + +I E
Sbjct: 185 LPSCEHHINGSKPACPSKIAKTPECVHTCHAGYPTSYEQDLHYGESAYSVRRRVAEIQTE 244
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
I NGPVE +FTVY DF YKSGVYK + +GGHAVK+IGWG +DG YW++AN WN
Sbjct: 245 IMTNGPVEAAFTVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWG-EEDGIPYWLIANSWN 303
Query: 181 RSWGADGYFKIKRGSNECGIE 201
WG GYFKI RG +ECGIE
Sbjct: 304 SDWGDHGYFKIVRGQDECGIE 324
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 153 bits (387), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 114/208 (54%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC CG GCDGG +W Y+V HG+VT + TGC P
Sbjct: 139 QSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + I EI
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 YGPVEAYLQIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTSYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG +EC IE +VAG S
Sbjct: 315 GEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 153 bits (386), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 114/208 (54%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC CG GCDGG +W Y+V HG+VT + TGC P
Sbjct: 139 QSVELSAIDLISCCKN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + I EI
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGEFSYNVIGVESVIQKEIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 YGPVEAYLHIYEDFLNYKSGIYRYTTGQFISGHAVRLIGWGV-ENGTSYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG +EC IE +VAG S
Sbjct: 315 GEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 153 bits (386), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 84/195 (43%), Positives = 106/195 (54%), Gaps = 16/195 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+++ LS DL+ C +GC GG +A ++ G+V+ +C PY + P C P
Sbjct: 117 EDVLLSFQDLVTC--DQSDNGCQGGDAYTAMKFIQKKGIVSNDCLPY------TIPTCAP 168
Query: 80 AYP-------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
A TP+CV KC + + H+ Y +N I EI NGPVE F
Sbjct: 169 AQQPCLNFVDTPQCVEKCSNASYTYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFE 228
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVY+H TG +GGH VK+IGWGT ++ E YWI N W WG G F IK
Sbjct: 229 VYEDFLGYKSGVYQHTTGKDLGGHCVKMIGWGTQNN-ELYWICNNSWTTYWGNQGVFWIK 287
Query: 193 RGSNECGIEEDVVAG 207
G NECGIE DVVA
Sbjct: 288 AGVNECGIESDVVAA 302
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 153 bits (386), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 115/200 (57%), Gaps = 14/200 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+Q L +S L++CC CG GCDGGYP ++W Y+V HG+ + C PY C H G +
Sbjct: 137 VQQLRISAAHLMSCCED-CGYGCDGGYPGTSWEYYVSHGLASSYCQPY-PFPHCGHHGGK 194
Query: 79 PAYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
P TPKC C K K+ +Y ++ + +D E+Y NGP V
Sbjct: 195 GKKPPCSKYHFHTPKCNTTCTDKAIPL--IKYRGNHSYEVHGE-DDYKRELYFNGPFVVV 251
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VY DF YK+GVY+H++GD +GGHAV+++GWG +G YW +AN W+ WG +G+
Sbjct: 252 FWVYSDFLAYKTGVYRHVSGDFLGGHAVRIVGWGKL-NGTPYWKIANSWDTDWGMNGHLL 310
Query: 191 IKRGSNECGIEEDVVAGLPS 210
RG+NECGIE AG P+
Sbjct: 311 FLRGNNECGIEAAGYAGSPA 330
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 152 bits (385), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 86/203 (42%), Positives = 112/203 (55%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ + LS D+LACCG CG GCDGGY AW++ GVVT C PY
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204
Query: 73 SHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
+H G P++P TP C C + + N K + + Y + +D I EI K G
Sbjct: 205 AHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWYWLPNDERTIQLEIMKKG 264
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV +F +YEDF HY GVY H G + GGH++K+IGWG D G YW++AN W+ WG
Sbjct: 265 PVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323
Query: 186 D-GYFKIKRGSNECGIEEDVVAG 207
D GYF++ RG N C IE V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 152 bits (385), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 117/208 (56%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC + CG GCDGG+ +W Y+V G+VT + TGC P
Sbjct: 139 QSVELSAVDLISCCKY-CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + S I +I
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
+GPVE +YEDF +YKSG+Y++ TG + GHAV+LIG G ++G YW+ AN WN W
Sbjct: 256 HGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGCGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG NEC IE ++ AGL S
Sbjct: 315 GEKGYFRIVRGRNECLIESEIAAGLIKS 342
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 152 bits (385), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 109/204 (53%), Gaps = 13/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S DL+ACC CGDGC GG+P AW Y+V +G+ + +C PY C H G +
Sbjct: 138 KQLRISAADLMACCK-QCGDGCKGGFPGFAWLYYVEYGITSSQCQPY-PFPHCEHRGAQG 195
Query: 80 --------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+ TPKC C K+ K+ + Y + ED E+Y NGP F
Sbjct: 196 NKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 253
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
VY D YKSGVY+++ GD +GG AV+++GWG +G YW +AN W+ WG +GY I
Sbjct: 254 FVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLI 312
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG+NEC IE G P L
Sbjct: 313 LRGNNECNIEHLGFTGFPDPSQLT 336
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 152 bits (385), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 95/216 (43%), Positives = 127/216 (58%), Gaps = 30/216 (13%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGC 77
+S D+L CCG CG+GC GG + A +++ +G VT + C PY CS+ C
Sbjct: 123 ISAEDILTCCGKSCGNGCQGGQGLEAMKFWTTYGAVTGGDYKGDGCKPY-SFAPCSN--C 179
Query: 78 EPAYPTPKCVRKCVKKNQL--WRNSKHYS---------------ISAYRINSDPED---I 117
+ TP C KC + ++ KHY SAYR+++ I
Sbjct: 180 VESKTTPSCQSKCQSTYTVTNYKGDKHYGKNEGKVTERHKHLECTSAYRLDTSSNAVPII 239
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
EIY+NGPVEV++TVY+DF HYKSGVY H+TG GGHAVK+IGWGT + G DYW++ N
Sbjct: 240 QNEIYQNGPVEVAYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGT-EKGVDYWLVTN 298
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
W S+G G+FKI+RG+NECGIE +VVAG+ N
Sbjct: 299 SWGTSFGDKGFFKIRRGTNECGIESNVVAGMAKVGN 334
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 114/208 (54%), Gaps = 20/208 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPG 76
Q++ LS DL++CC CG GCDGG +W Y+V HG+VT + TGC P
Sbjct: 139 QSVELSAIDLISCCEN-CGSGCDGGVTGYSWDYWVKHGIVTGGSKE--NHTGCRPYPFPK 195
Query: 77 CE------------PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C+ Y TP+C + C K N + KHY +Y + I EI
Sbjct: 196 CDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVESAIQKEIMM 255
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
GPVE +YEDF +YKSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN W
Sbjct: 256 YGPVEAYLEIYEDFLNYKSGIYRYTTGKYISGHAVRLIGWGV-ENGTAYWLAANTWNEDW 314
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSS 211
G GYF+I RG +EC IE +VAG S
Sbjct: 315 GEKGYFRIVRGRDECLIESFIVAGQIKS 342
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 112/199 (56%), Gaps = 17/199 (8%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
+ +S +LL+CC CG GC+GGYP AW Y++ G+ T + C PY C H
Sbjct: 131 VPVSAENLLSCCDS-CGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPY-SLQPCEH 188
Query: 75 ------PGCEPA-YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C Y TP C KC +++ + + R +I EI NGPV
Sbjct: 189 HTEGNKVQCSTLDYDTPSCKHKCDDSALNYKSELTFGSGSVRNFYSVANIQKEILTNGPV 248
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +F VY DF +YKSGVY+H+ G+ +GGHAV+++GWG + G YW++AN WN WG G
Sbjct: 249 EAAFDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWG-EESGVPYWLVANSWNEDWGDKG 307
Query: 188 YFKIKRGSNECGIEEDVVA 206
FKI+RG+NE G E+ +VA
Sbjct: 308 LFKIRRGNNESGFEDSIVA 326
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 113/203 (55%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ + LS D+LACCG CG GCDGGY AW++ GVVT C PY
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204
Query: 73 SHPGCE----PAYP--TPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
+H G P++P TP C C + + N K + + Y + +D I EI + G
Sbjct: 205 AHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLEIMQKG 264
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV +F +YEDF HY+ GVY H G + GGH++K+IGWG D G YW++AN W+ WG
Sbjct: 265 PVHATFNIYEDFEHYEGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323
Query: 186 D-GYFKIKRGSNECGIEEDVVAG 207
D GYF++ RG N C IE V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 111/202 (54%), Gaps = 23/202 (11%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
+ D+L+CC CG GCDGG P + W Y+V +G+ + SH GC+ +
Sbjct: 181 QFNFGAYDVLSCC-HRCGFGCDGGVPSAVWHYWVENGITS-------GGAFGSHEGCQ-S 231
Query: 81 YP------------TPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
YP TP+C+R C N + KHY AY + D E IM E++ GP
Sbjct: 232 YPFDVCKKSGDSNDTPRCLRFCQPGYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGPA 291
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+ +FT+Y DF YKSGVY+H G +G H+VK++GWG +D + YW+ AN W WG G
Sbjct: 292 QATFTMYTDFVQYKSGVYRHTFGVRVGTHSVKVMGWGVENDVK-YWLCANSWGAQWGDGG 350
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
+FKI RG + E +VVAGLP
Sbjct: 351 FFKIVRGEDHLSFETNVVAGLP 372
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 113/206 (54%), Gaps = 22/206 (10%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
+ LS +LL+CC LCG GC GG+P AW ++ HG+VT Y GC P Y
Sbjct: 164 VRLSAGNLLSCCK-LCGKGCKGGFPGGAWMHWSKHGIVTG--GSYSSDYGCQKYQFFPCY 220
Query: 82 -PTPK----------------CVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
P K C C N+ ++ +Y S YRI +D I EI +
Sbjct: 221 QPRTKGSIKNKCPKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIPNDARAIQLEIME 280
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
NGPV+ + +YEDF HYK GVY+H+ G + HAVK+ GWGT + G YW+ AN W++ W
Sbjct: 281 NGPVQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGT-EGGTPYWLAANPWSKRW 339
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
G G+FKI RGSN IE+ V+AG+P
Sbjct: 340 GNGGFFKILRGSNHAEIEDHVMAGIP 365
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 111/204 (54%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY F
Sbjct: 135 NELLSAEELAFCC-HKCGSGCHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPF 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C L ++ Y+ AY +N + I ++ GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQNLDFKEDHRYTRDAYYLNY--QIIQNDLMTYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E S+ VY+DF +YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASYDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 109/200 (54%), Gaps = 20/200 (10%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH 74
S +DLLACC CG GCDGG P A+ Y+V G+V+ E C PY S +
Sbjct: 133 FEFSADDLLACCT-ACGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNEGCQPYEGSAFLNS 191
Query: 75 PGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSF 131
TPKC KC+ K + KHY Y + + +I EI NGPV
Sbjct: 192 V-------TPKCSTKCLNSKYTTPYAKDKHYGTDFIYMTSKNVAEIQTEIMNNGPVVTHM 244
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG-ADGYFK 190
VYEDF YKSGVY+H++G+ MGGHAVK+IGWGT + G YW++AN W W DG++K
Sbjct: 245 DVYEDFYSYKSGVYQHVSGNSMGGHAVKIIGWGT-EKGVPYWLIANSWGAKWADLDGFYK 303
Query: 191 IKRGSNECGIEEDVVAGLPS 210
I RG N C IE + G P
Sbjct: 304 ILRGKNHCKIETYIYGGTPQ 323
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 110/203 (54%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N LS ++ CC CG GC+GGYPI AW+YF HG+VT E C+PY
Sbjct: 138 NELLSAEEITFCC-HTCGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
D G S +P +C R C L N H Y + I ++ GP+
Sbjct: 197 DEEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPI 255
Query: 128 EVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
E SF VY+DF YKSGVY+ +GGHAVKLIGWG ++G YW++ N WN WG +
Sbjct: 256 EASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGDN 314
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG++ECGI+ AG+P
Sbjct: 315 GLFKIRRGTDECGIDSAATAGVP 337
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/165 (44%), Positives = 103/165 (62%), Gaps = 1/165 (0%)
Query: 47 ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSI 105
+S Y + G + E P + C+ TP CV+KC + ++ + H+
Sbjct: 18 VSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGK 77
Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
SAY I +D + I EIY NGPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG
Sbjct: 78 SAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGV 137
Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
+ YW++AN WN WG+DG+FKI RGS+ECGIE + AGLP+
Sbjct: 138 QNGEIPYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLPA 182
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/201 (42%), Positives = 120/201 (59%), Gaps = 15/201 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTGC 72
++ L+ DL+ CC CG+GC+GG+ ++++Y+V G+V T+ C PY C
Sbjct: 135 DVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPY-PFKPC 192
Query: 73 SHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
+P GC P TP C C + + +R K+Y +AY++ +D I EI NGPVE
Sbjct: 193 LYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVES 251
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F+VY+D YK+GVY+H+ G +G HAV+LIGWG + G YW++AN + WG GYF
Sbjct: 252 GFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYWLIANSYGEDWGEHGYF 310
Query: 190 KIKRGSNECGIEEDVVAGLPS 210
K RGSN GIE V+AGLP
Sbjct: 311 KFLRGSNHLGIESVVIAGLPK 331
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/201 (42%), Positives = 120/201 (59%), Gaps = 15/201 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTGC 72
++ L+ DL+ CC CG+GC+GG+ ++++Y+V G+V T+ C PY C
Sbjct: 135 DVELAAEDLMGCCK-DCGNGCNGGFLDGTSFQYWVDVGLVSGAAYNNTDGCKPY-PFKPC 192
Query: 73 SHP--GCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
+P GC P TP C C + + +R K+Y +AY++ +D I EI NGPVE
Sbjct: 193 LYPFVGCHPE-KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVES 251
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F+VY+D YK+GVY+H+ G +G HAV+LIGWG + G YW++AN + WG GYF
Sbjct: 252 GFSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWG-KERGVPYWLIANSYGEDWGEHGYF 310
Query: 190 KIKRGSNECGIEEDVVAGLPS 210
K RGSN GIE V+AGLP
Sbjct: 311 KFLRGSNHLGIESVVIAGLPK 331
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 135 NELLSAEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C L ++ HY+ AY + I +I GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 111/203 (54%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N LS +L CC CG+GC+GGYPI AW+YF HG+VT E C+PY
Sbjct: 137 NELLSAEELTFCC-HTCGNGCNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPR 195
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
+ G S +P +C R C L N H Y + I ++ GP+
Sbjct: 196 NEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLT-YGSIQKDVMNYGPI 254
Query: 128 EVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
E SF VY+DF YKSGVY+ +GGHAVKLIGWG ++G YW++ N W+ WG +
Sbjct: 255 EASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGIPYWLMVNSWSAQWGDN 313
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG++ECGI+ AG+P
Sbjct: 314 GLFKIRRGTDECGIDSATTAGVP 336
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 108/206 (52%), Gaps = 20/206 (9%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
++ L +S DLLACCG CG GC GG P AW YF G+ + C PY CSH
Sbjct: 137 VRGLRISAADLLACCGD-CGYGCLGGDPDMAWAYFSSEGIASGRCQPY-PFPRCSHYTNS 194
Query: 79 PAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
YP TP C C + +R K YS+S ED E+Y GP
Sbjct: 195 TTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSLSG------EEDFRRELYFRGPF 248
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+ F V+ D YK GVYKH+ G +G HAV+++GWG + G YW +AN WN WG G
Sbjct: 249 QAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYWKIANSWNAEWGDRG 307
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
YF + RG NECGIE+ AG+P+ N
Sbjct: 308 YFFMLRGDNECGIEDSGSAGVPAIPN 333
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 85/195 (43%), Positives = 117/195 (60%), Gaps = 13/195 (6%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-FDSTGCSHP 75
LS D+LACCG CG GC+GGYPI A+ Y + GV + C PY F ++
Sbjct: 141 LSSADILACCGEDCGSGCEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCDGNYG 200
Query: 76 GC--EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSF 131
C E A+ TPKC + C + + + K + +++ + D E I EI+ NGPV +F
Sbjct: 201 PCPKEGAFDTPKCRKICQFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANF 260
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
V+EDF HYK G+YK G +G HA+KLIGWGT ++G DYW++AN +N WG +G F+I
Sbjct: 261 YVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGT-ENGTDYWLVANSYNYDWGENGTFRI 319
Query: 192 KRGSNECGIEEDVVA 206
RG+N C IE V+A
Sbjct: 320 LRGTNHCLIESQVIA 334
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 111/204 (54%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N +S +L CC CG GC+GG P+ AW+YF HGVVT + C PY
Sbjct: 134 NELISAEELTFCC-HRCGFGCNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVK 192
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSI-SAYRINSDPEDIMAEIYKNGP 126
D G + +P P KC R C HY +AY +N D + + GP
Sbjct: 193 DEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKNAYYLNIDT--MQKDTIAYGP 250
Query: 127 VEVSFTVYEDFAHYKSGVYKHIT-GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF +Y+SGVY+ +GGHAVK+IGWG +DG YW++ N W WGA
Sbjct: 251 IEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWG-EEDGTPYWLMVNSWGEQWGA 309
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G FKI RG+NECGIE AG+P
Sbjct: 310 NGMFKILRGTNECGIEGSPTAGVP 333
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 149 bits (377), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 110/204 (53%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N LS ++ CC CG GC GGYPI AW+YF HG+VT E C+PY
Sbjct: 138 NELLSAEEITFCC-HTCGFGCHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPR 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
D G + +P +C R C L N H ++ Y + I ++ GP
Sbjct: 197 DDKGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYG--SIQKDVMTYGP 254
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG ++G YW++ N WN WG
Sbjct: 255 IEASFDVYDDFPSYKSGVYEKTENASYLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGD 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ AG+P
Sbjct: 314 KGLFKIRRGTNECGIDNSTTAGVP 337
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 149 bits (377), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 121/214 (56%), Gaps = 21/214 (9%)
Query: 4 TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
T ++R + SS + S DLL+CC CG C GGY ++A+ +++ GVV+
Sbjct: 117 TMSDRICIHSS---GAKKFFFSAEDLLSCCT-ACGS-CSGGYMMAAFDFYIKQGVVSGGD 171
Query: 61 ----EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE 115
E C PY T +H TP C + C K + + KHY Y +++
Sbjct: 172 LNSNEGCRPY---TADAHDKG----VTPSCTKSCRKGYPTSYSSDKHYGSKDYIVDAGVS 224
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
+I EI NGP+ VSF VY+DF +Y SGVY H++G+ G H VK++GWGT + +DYW++
Sbjct: 225 NIQYEIMTNGPIIVSFKVYQDFYNYGSGVYHHVSGNYTGNHIVKIVGWGTEKE-QDYWLI 283
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
AN W SWG G+FKI RG NECGIE + A LP
Sbjct: 284 ANSWGSSWGEHGFFKILRGKNECGIENNPYAVLP 317
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 149 bits (377), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 108/204 (52%), Gaps = 13/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S DLL+CC CGDGC GG+P AW Y+V +G+ + C PY C H G +
Sbjct: 138 KQLRISAADLLSCCK-QCGDGCKGGFPGFAWLYYVEYGIASSGCQPY-PFPHCEHRGAQG 195
Query: 80 --------AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+ TPKC C K+ K+ + Y + ED E+Y NGP F
Sbjct: 196 NKTPCSKYKFDTPKCNATCTDKSIPL--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVF 253
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
VY D YKSGVY+++ GD +GG AV+++GWG +G YW +AN W+ WG +GY I
Sbjct: 254 FVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLI 312
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG+NEC IE G P L
Sbjct: 313 LRGNNECNIEHLGFTGFPDPSQLT 336
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 118/208 (56%), Gaps = 20/208 (9%)
Query: 21 NLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST- 70
N LS +LL+CC G L CG+GC GG AW+Y+ HG+ T C PY +
Sbjct: 78 NTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPC 137
Query: 71 -----GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAYRINSDPEDIMAEI 121
++P C PTP C +KC KN +HY S ++ + +I +++
Sbjct: 138 GKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSDV 197
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
NGP+E +F VY+DF Y +G+Y H+TG+ G +V+++GWG +G YW+LAN W +
Sbjct: 198 MLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSWGK 256
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
WG +G F+ RG+NECG+E + V+G+P
Sbjct: 257 EWGENGTFRALRGTNECGLEANCVSGMP 284
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 85/194 (43%), Positives = 107/194 (55%), Gaps = 9/194 (4%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPGCE 78
+ + D LACC CDGGY W+Y+V G+ +E PY GC S+P
Sbjct: 130 KQFTFGATDYLACCTDCFK--CDGGYVGKTWQYWVDSGLTSE--GPYKSGQGCNSYPFGS 185
Query: 79 PAY--PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
P P C R C L + Y SAYR+ + IM EIY+NGPV V F V+
Sbjct: 186 YCVNDPLPTCSRTCQAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQNGPVVVQFEVFA 245
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSGVY+H+TG G HAV++IGWG ++G YW++AN W WG G+FK RG
Sbjct: 246 DFYQYKSGVYRHVTGATEGWHAVRVIGWGV-ENGVKYWLVANSWGVRWGDKGFFKFVRGE 304
Query: 196 NECGIEEDVVAGLP 209
N GIE+ V AGLP
Sbjct: 305 NHLGIEDFVYAGLP 318
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 115/203 (56%), Gaps = 17/203 (8%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDST 70
S + +S D+L+CCG CG GC GG+PI A+R+ GVVT + C PY
Sbjct: 135 STIKVMISDTDILSCCGLDCGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYP 194
Query: 71 GCSH-------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIY 122
H P +PTPKC + +K N+ ++ KH++ +Y + ++ I EIY
Sbjct: 195 CGQHKDVPYYGPCPGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIY 254
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
KNGPV +F VYED++ G+Y H G G HA K+IGWG ++G DYW++AN WN
Sbjct: 255 KNGPVVAAFKVYEDYSS-TGGIYVHKWGIQTGAHADKVIGWG-RENGTDYWLIANSWNTD 312
Query: 183 WGADGYFKIKRGSNECGIEEDVV 205
WG DGY++I R ++ C IE +V
Sbjct: 313 WGEDGYYRIVRETDNCEIERQMV 335
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 107/206 (51%), Gaps = 20/206 (9%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
++ L +S DLLACCG CG GC GG P AW YF G+ + C PY CSH
Sbjct: 137 VRGLRISAADLLACCG-DCGYGCLGGDPDMAWAYFSSEGIASGRCQPY-PFPRCSHYTNS 194
Query: 79 PAYP--------TPKCVRKCVKKN---QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
YP TP C C + +R K YS S ED E+Y GP
Sbjct: 195 TTYPQCSALHLWTPTCNPACTDSTISKKKYRGLKSYSFSG------EEDFRRELYFRGPF 248
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+ F V+ D YK GVYKH+ G +G HAV+++GWG + G YW +AN WN WG G
Sbjct: 249 QAVFDVWSDLFAYKHGVYKHVGGAFIGAHAVRIVGWG-NQSGVPYWKIANSWNAEWGDRG 307
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
YF + RG NECGIE+ AG+P+ N
Sbjct: 308 YFFMLRGDNECGIEDSGSAGVPAIPN 333
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 149 bits (376), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 138 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C L ++ HY+ AY + I +I GP
Sbjct: 197 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 254
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 255 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 314 QGLFKIRRGTNECGIDNSTTGGVP 337
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 149 bits (376), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C L ++ HY+ AY + I +I GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 149 bits (376), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 88/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC+GGYPI AW F HG+VT E C PY
Sbjct: 134 NELLSAEELTFCC-HKCGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPL 192
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +P +C R C L + N HY+ AY + I ++ GP
Sbjct: 193 DEYGNNTCHGKPMEKNHRCTRMCYGDQDLDFNNDHHYTRDAYYLTYGT--IQNDVLTYGP 250
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 251 IEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 309
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 310 QGLFKIRRGTNECGIDNSTTGGVP 333
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 149 bits (376), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 108/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 138 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPL 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C L ++ HY+ AY + I +I GP
Sbjct: 197 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDILAYGP 254
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 255 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 314 QGLFKIRRGTNECGIDNSTTGGVP 337
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 109/199 (54%), Gaps = 15/199 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--EECDPY----FDSTGCSH 74
N SLS DLL+CC CG GC G+ AW ++ HG+VT + +P F C H
Sbjct: 101 NKSLSATDLLSCCED-CGLGCGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGH 159
Query: 75 ------PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
P C YPTP+C+++C + + K + +Y + IM EI NGPV
Sbjct: 160 RRKGRYPPCPRHIYPTPECIKQCDEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPV 219
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E SF +Y DF Y GVY H G + HA++++GWG DDG YW++AN WN WG G
Sbjct: 220 EASFGIYADFLEYNGGVYFHCWGGPISRHAIRILGWG-EDDGVPYWLIANSWNEDWGEKG 278
Query: 188 YFKIKRGSNECGIEEDVVA 206
Y + RG NECGIEE+V A
Sbjct: 279 YVRFLRGHNECGIEEEVTA 297
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 113/266 (42%), Gaps = 76/266 (28%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
N SLS DL++CC CG GC GGY AW ++ HG+VT TGC P C
Sbjct: 689 NKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDFWKTHGIVTGGSKE--KPTGCRSYPFPSC 745
Query: 78 E------------PAYPTPKCVRKCVKKNQLWRNSK------------------------ 101
E YPTP+C+++C K + K
Sbjct: 746 EHRGKGQYPPCPHQLYPTPECIKRCDTKEIDYEKDKTRGFDSASSEQLADRHCFHTSNFG 805
Query: 102 ----------------HYSIS-----------------AYRINSDPEDIMAEIYKNGPVE 128
H+SI +Y + + +M EI GPV
Sbjct: 806 EASAQRTLHLTCLNFMHHSIDLLSSRLEKAVLRSTANISYNVYPAEQAVMKEIMLRGPVG 865
Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
VYED YKSGVY H+ G +G H ++++GWG +DG YW++AN WN WG GY
Sbjct: 866 AILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWG-EEDGVPYWLVANSWNEDWGEKGY 924
Query: 189 FKIKRGSNECGIEEDVVAGLPSSKNL 214
++ R NECGI + V AGLP N
Sbjct: 925 MRVLRWRNECGIVDQVTAGLPDLSNF 950
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 111/203 (54%), Gaps = 16/203 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ + LS D+LACCG CG GCDGGY AW++ GVVT C PY
Sbjct: 145 KKVLLSATDILACCGKFCGYGCDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG 204
Query: 73 SHPGCE----PAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
+H G P++P RK + + + N K + + Y + +D I EI + G
Sbjct: 205 AHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDERTIQLEIMQKG 264
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV +F +YEDF HY GVY H G + GGH++K+IGWG D G YW++AN W+ WG
Sbjct: 265 PVHATFNIYEDFEHYNGGVYIHTAGAMEGGHSIKIIGWGV-DKGVKYWLIANSWSTDWGE 323
Query: 186 D-GYFKIKRGSNECGIEEDVVAG 207
D GYF++ RG N C IE V+AG
Sbjct: 324 DGGYFRVVRGINNCDIEGGVLAG 346
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 110/204 (53%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW +F HG+VT E C PY
Sbjct: 135 NELLSAEELAFCC-HKCGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPL 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C +L ++ H++ AY + I ++ GP
Sbjct: 194 DEYGNNTCRGKPAEKNHRCTRMCYGNQELDFKEDHHWTRDAYYLTY--TTIQKDVMAYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF +YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI RG+NECGI+ G+P
Sbjct: 311 QGLFKILRGTNECGIDNSTTGGVP 334
>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
Length = 283
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/184 (42%), Positives = 109/184 (59%), Gaps = 18/184 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
LS DL++C G C+GGY ++W + + G+ TE C PY +G
Sbjct: 113 LSPQDLISCDSNDLG--CNGGYQENSWTWVLTTGITTESCWPYRSGSG----------RI 160
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
P C +CV + L RN+ I+ YR D ++ E+Y NGP++V++ VYEDF +Y G
Sbjct: 161 PSCPHRCVNGSVLQRNT----INNYR-RLDSSELQDELYNNGPIQVTYVVYEDFFYYSKG 215
Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
+YKH++G+ +GGHAV L+GWG +DG YW++ N W WG GYF+I RGSNECGIE
Sbjct: 216 IYKHLSGNKVGGHAVVLMGWGI-EDGVKYWLVQNSWGYEWGEQGYFRILRGSNECGIESS 274
Query: 204 VVAG 207
AG
Sbjct: 275 AYAG 278
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 106/179 (59%), Gaps = 17/179 (9%)
Query: 45 YPISAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKC 90
+P AW Y+V G+VT EE C PY C H P C Y TP+C + C
Sbjct: 163 FPGQAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTC 221
Query: 91 VKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 149
K + + KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+T
Sbjct: 222 QKGYKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVT 281
Query: 150 GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
G ++GGHA+++IGWG + G+ YW++AN WN WG G F++ RG +EC IE VVAGL
Sbjct: 282 GSIVGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 111/206 (53%), Gaps = 23/206 (11%)
Query: 24 LSVNDLLACCG--FLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA- 80
+S DL +CC F CG GCDGGY W Y+ G+VT Y S GC EP
Sbjct: 137 VSAEDLNSCCFGLFACGLGCDGGYVAEPWDYWRTDGIVTG--GAYNSSQGCKDYSLEPCE 194
Query: 81 ---------------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
+ TP+CVR C + + + S + ++ + + EI KNG
Sbjct: 195 HHVEVGSRPQCSSLNFDTPECVRSCYESSLDYTESLTFGQQVSTFTNEKQ-MQLEILKNG 253
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGD-VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
P+E +FTVY DF YKSGVY+ D +GGHA+K++GWG ++G YW++AN WN WG
Sbjct: 254 PIEAAFTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGV-EEGTKYWLIANSWNTDWG 312
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
+GYFK RG + CGIE + A LP+
Sbjct: 313 DNGYFKFLRGVDHCGIESETAASLPA 338
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 105/189 (55%), Gaps = 18/189 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
LS D+++C GCDGG +AW + + G+V + C PY G
Sbjct: 137 LSPQDMVSCD--YNDMGCDGGNLDNAWWWMKNKGIVPDSCMPYVSGGG----------NV 184
Query: 84 PKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P C C N QL+ IS + DI EIY NGPV+ F+VY+DF
Sbjct: 185 PACPSNCNGTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSVYQDFM 244
Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
+YKSGVY H TG +GGHA+K+IGWG + G DYW++AN W+ WG DG FKI RG NEC
Sbjct: 245 NYKSGVYSHKTGSFLGGHAIKIIGWGV-EGGVDYWLVANSWSTDWGIDGTFKILRGHNEC 303
Query: 199 GIEEDVVAG 207
GIE+DV AG
Sbjct: 304 GIEDDVYAG 312
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 104/184 (56%), Gaps = 18/184 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
+S DL++C GC+GG P+ +W + H G+ TEEC PY G
Sbjct: 112 MSPQDLVSC--DKVDHGCNGGSPLFSWEWVKHSGITTEECIPYVSGGG----------RV 159
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
P C +KC + + R +K S+ + + + E+Y GP E +F+VYEDF YKSG
Sbjct: 160 PSCPKKCTNGSAIVR-TKAKSVGLVK----GDKMQNELYSRGPFEAAFSVYEDFKSYKSG 214
Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
VY HITG ++GGHAV ++GWG +DG YW++ N W +WG G+FKI RG NECGIE
Sbjct: 215 VYHHITGKMLGGHAVMVVGWGV-EDGTPYWLIQNSWGTTWGEQGFFKILRGKNECGIETT 273
Query: 204 VVAG 207
G
Sbjct: 274 CFQG 277
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 80/198 (40%), Positives = 114/198 (57%), Gaps = 13/198 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC-SHPG--C 77
S D+L+CC CG GCDGG P + W Y+V +G+ + Y GC S+P C
Sbjct: 185 QFSFGAYDVLSCC-HRCGFGCDGGVPSAVWHYWVENGITSG--GAYESHEGCQSYPFGVC 241
Query: 78 EPA-----YPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+P + C+R+C N + KH+ AY + D + I+ E++ GPV+ SF
Sbjct: 242 KPQEIFAPHVDLICLRQCQPGYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASF 301
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
TVY DF YKSGVY+H G +G H+VK++GWG ++G +W+ AN W WG +G+FKI
Sbjct: 302 TVYTDFIQYKSGVYRHTYGVRVGDHSVKIVGWGV-ENGTKFWLCANSWGAEWGENGFFKI 360
Query: 192 KRGSNECGIEEDVVAGLP 209
RG + +E +VVAGLP
Sbjct: 361 IRGEDHLSVESNVVAGLP 378
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 107/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPL 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +P +C R C L ++ HY+ AY + I ++ GP
Sbjct: 194 DEYGNNTCSGKPTEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYGT--IQNDVLAYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 106/191 (55%), Gaps = 19/191 (9%)
Query: 37 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE------------PAY 81
CG GCDGG+ +W Y+V G+VT + TGC P C+ Y
Sbjct: 142 CGSGCDGGFLGPSWDYWVLRGIVTGGSKE--NHTGCRPYPFPKCDHFVKGKYRACGDKLY 199
Query: 82 PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
TP+C + C K N + KHY +Y + S I +I +GPVE +YEDF +Y
Sbjct: 200 KTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNY 259
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSG+Y++ TG + GHAV+LIGWG ++G YW+ AN WN WG GYF+I RG NEC I
Sbjct: 260 KSGIYRYTTGQFISGHAVRLIGWGV-ENGTAYWLAANTWNEDWGEKGYFRIVRGRNECSI 318
Query: 201 EEDVVAGLPSS 211
E ++ AGL S
Sbjct: 319 ESEIAAGLIKS 329
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/176 (44%), Positives = 106/176 (60%), Gaps = 17/176 (9%)
Query: 48 SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
AW Y+V G+VT EE C PY C H P C Y TP+C + C K
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224
Query: 94 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
+ ++ KHY +Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+TG +
Sbjct: 225 YKTPYKQDKHYGDESYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSI 284
Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+GGHA+++IGWG + G+ YW++AN WN WG G F++ RG +EC IE VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 113/210 (53%), Gaps = 21/210 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N +S +L++CC + CG GC+GG+P +AW + HG+VT + C PY C
Sbjct: 142 NGHISSRELMSCCSY-CGFGCEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPY-PIAPCE 199
Query: 74 H------PGCE--PAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C P PTP C C + L ++ + SAY + + EI+KN
Sbjct: 200 HHMEGSKPNCSASPTEPTPACETTCTHGSSLAYQKDRQKGKSAYLVPVGEKQTQLEIFKN 259
Query: 125 GPVEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
GP+ +F VYEDF YKSGVYK H G HAVK+IGWG +G YW++ N W+ W
Sbjct: 260 GPIVAAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWG-EQNGLPYWLVQNSWDYDW 318
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
G G FKI RG NEC E+ + AGLP K
Sbjct: 319 GDKGLFKIARG-NECDFEKSMTAGLPKYKK 347
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 146 bits (369), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 114/206 (55%), Gaps = 21/206 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N +S +L CC CG GC+GGYP+ AW+YF HGVVT + C PY
Sbjct: 134 NELISAEELTFCC-HRCGFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVK 192
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGP 126
D G + +P KC +KC + + HY AY + + +Y GP
Sbjct: 193 DDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVY--GP 250
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDV--MGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
+E SF VY+DF +Y+SGVY+ TG+ +GGHAVK+IGWG ++G YW++ N W WG
Sbjct: 251 IEASFDVYDDFMNYESGVYQR-TGNASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWG 308
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
G FKI RG++ECGIE AG+PS
Sbjct: 309 DKGMFKILRGTDECGIESSCTAGVPS 334
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 146 bits (369), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 114/204 (55%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
N LS +L CC CG+GC+GGYPI AWRYF GV T E C PY ++
Sbjct: 135 NELLSPEELAFCCK-DCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYN 193
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
G + G +P +C + C K ++ + S Y INS + I +I GPVE
Sbjct: 194 KQGKNTCGGKPMERNHQCPKTCYGKTT--DQKRYKTKSEYVINS-IKTIEQDIKTYGPVE 250
Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
SF VY+DF+ YKSG+Y+ GH+VK+IGWG ++G YW+ N W++ WG G
Sbjct: 251 ASFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWG-QENGTPYWLAVNSWSKFWGDHG 309
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
FKI +G NECGIE V AG+PSS
Sbjct: 310 TFKIIKGKNECGIERAVTAGIPSS 333
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 111/204 (54%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC+GGYPI AW +F HG+VT E C+PY +
Sbjct: 138 NQLLSAEELTFCC-HKCGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
D +G + +P +C R C L + H Y+ +Y + I ++ GP
Sbjct: 197 DESGNNTCAGKPMEANHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYG--SIQKDVLTYGP 254
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
VE SF VY+DF YKSGVY + +GGHA KLIGWG + G YW++ N WN WG
Sbjct: 255 VEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWG-EEYGVPYWLMVNSWNADWGD 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G FKI+RG+NECGI+ G+P
Sbjct: 314 NGLFKIQRGTNECGIDNSTTGGVP 337
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 68/137 (49%), Positives = 93/137 (67%), Gaps = 13/137 (9%)
Query: 77 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
CE Y T ++ KHY ++Y ++ ++IMAEIYKNGPVE +FTV+ D
Sbjct: 2 CEAGYSTS------------YKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSD 49
Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
F YKSGVYKH GDVMGGHA++++GWG ++G YW++AN WN WG +G+FKI RG N
Sbjct: 50 FLTYKSGVYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRGEN 108
Query: 197 ECGIEEDVVAGLPSSKN 213
CGIE ++VAG+P ++
Sbjct: 109 HCGIESEIVAGIPRTQQ 125
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 106/204 (51%), Gaps = 20/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+ LS D+L CC CG C GGY AW Y GVVT E C Y CS
Sbjct: 119 QVRLSAEDVLECCK-DCGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSY-PFPPCS 176
Query: 74 HPGCEPAYP--------TPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKN 124
H G E YP PKC C + + Y S Y++ ++ + I EI +N
Sbjct: 177 H-GIEGQYPQCSTKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMEN 235
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV+ SF VYEDF YKSG+Y H+ G M H VK+IGWG ++GE YW N WN WG
Sbjct: 236 GPVDASFQVYEDFMTYKSGIYHHVEGKFMNLHTVKIIGWG-EENGEAYWKAVNSWNSEWG 294
Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
+G F+I+ G+NEC IE V GL
Sbjct: 295 ENGLFRIRLGTNECTIESQVEGGL 318
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 109/209 (52%), Gaps = 19/209 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGC 77
N SLS DL++CC CG GC GGY AW + HG+VT TGC P C
Sbjct: 136 NKSLSAVDLVSCCT-ECGCGCRGGYSPIAWDLWKTHGIVTGGSKE--KPTGCRSYPFPSC 192
Query: 78 E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
E YPTP+C+++C K + K + +Y + + +M EI G
Sbjct: 193 EHRGKGQYPPCPHQLYPTPECIKRCDTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRG 252
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PV VYED YKSGVY H+ G +G H ++++GWG +DG YW++AN WN WG
Sbjct: 253 PVGAILHVYEDLLDYKSGVYFHVWGGHLGEHGIRILGWG-EEDGVPYWLVANSWNEDWGE 311
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
GY ++ R NECGI + V AGLP N
Sbjct: 312 KGYMRVLRWRNECGIVDQVTAGLPDLSNF 340
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 117/203 (57%), Gaps = 18/203 (8%)
Query: 24 LSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST---- 70
LS +LL+CC G L CG+GC GG P+ AW+Y+ HG+ T C PY +
Sbjct: 138 LSAQELLSCCTGVLSCGEGCAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKT 197
Query: 71 --GCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
++P C PTP C +KC + +HY +S ++ + +I +++ NGP
Sbjct: 198 IGNVTYPPCTNTTLPTPTCEKKCKPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGP 257
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VE + +Y+DF Y +G+Y H+ G+ G +V+++GWG +G YW+LAN W + WG +
Sbjct: 258 VEATMEIYDDFLQYTTGIYVHLAGNKQGHLSVRILGWGMF-EGVPYWLLANSWGKEWGEN 316
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G F++ RG NECG+E + ++G+P
Sbjct: 317 GTFRVLRGVNECGLEANCISGMP 339
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 108/203 (53%), Gaps = 18/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
N LS L CC + CG GC GG PI AW+YF HG+ T E C PY +D
Sbjct: 136 NEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPCYD 194
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
G +P KC R C + + Y + + + + I +I K GPVE
Sbjct: 195 DQGEFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVKSIYVLDSSKTIEQDIRKYGPVE 251
Query: 129 VSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
SF VY+DF YKSG+Y+ +GGH+VKLIGWG +DG YW+L N W++ WG G
Sbjct: 252 ASFDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWG-EEDGIPYWLLVNSWSKFWGEQG 310
Query: 188 YFKIKRGSNECGIEEDVVAGLPS 210
F+I +G NECGIE AG+PS
Sbjct: 311 TFRIIKGRNECGIERSATAGVPS 333
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 107/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPL 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C + C L ++ HY+ AY + I ++ GP
Sbjct: 194 DEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY + +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECG + G+P
Sbjct: 311 QGLFKIRRGTNECGTDNSTTGGVP 334
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 87/204 (42%), Positives = 107/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GG PI AW F HG+VT E C PY
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPL 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C L ++ HY+ AY + I ++ GP
Sbjct: 194 DEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQYDVLAYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 311 QGLFKIRRGTNECGIDNSTTGGVP 334
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 107/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 135 NELLSPEELAFCC-HKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPL 193
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C + C L ++ HY+ AY + I ++ GP
Sbjct: 194 DEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYGT--IQNDVLAYGP 251
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW+L N WN WG
Sbjct: 252 IEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWG-EEYGVPYWLLVNSWNDQWGD 310
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECG + G+P
Sbjct: 311 QGLFKIRRGTNECGTDNSTTGGVP 334
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 145 bits (367), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 83/195 (42%), Positives = 109/195 (55%), Gaps = 17/195 (8%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH------ 74
D+L+CC + CG GCDGG P +A+ + + +GV T C PY H
Sbjct: 146 DILSCC-WNCGMGCDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYF 204
Query: 75 -PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
P + +PTPKC + C +K N +++ K Y AY + ++ IM EI+ NGPV SF+
Sbjct: 205 GPCPKELWPTPKCRKMCQLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFS 264
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
V+ DFA YK GVY G HAVK+IGWG D G YW++AN WN WG +GY +
Sbjct: 265 VFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQD-GLKYWLIANSWNNDWGDEGYVRFL 323
Query: 193 RGSNECGIEEDVVAG 207
RG N CGIE VV G
Sbjct: 324 RGDNHCGIESRVVTG 338
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 145 bits (367), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 79/175 (45%), Positives = 104/175 (59%), Gaps = 17/175 (9%)
Query: 49 AWRYFVHHGVVTEE-------CDPYFDSTGCSHPGCEPAY-------PTPKCVRKCVKKN 94
AW+YF GVVT C PY + C G EP Y TPKC + C +
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPY-EFPPCGRHGKEPYYGECYDTAKTPKCQKTCQRGY 59
Query: 95 -QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
+ ++ KH+ SAYR+ ++ + I +I KNGPV F VYEDFAHYKSG+YKH G +
Sbjct: 60 LKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMT 119
Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
GGHAVK+IGWG + G YW++AN W+ WG G++++ RG N C IEE V AG+
Sbjct: 120 GGHAVKIIGWG-KEKGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGI 173
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 114/204 (55%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
N LS +L CC CG GC GGYPI AW+YF GV T E C PY +D
Sbjct: 135 NQLLSPEELAFCC-MDCGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYD 193
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
G + G +P +C + C K + ++ + + Y INS E I ++ GPVE
Sbjct: 194 EQGKNTCGGKPMERNHQCPKTCYGKTTV--QDRYKTKNEYVINS-IETIEQDLMTYGPVE 250
Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
SF VY+DF+ YKSG+Y+ GGH++K+IGWG ++G YW+ N W++ WG G
Sbjct: 251 ASFDVYDDFSVYKSGIYRKTPKAKYEGGHSIKIIGWG-EENGTPYWLAVNSWSKFWGDHG 309
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
FKI +G NECGIE V AG+PS+
Sbjct: 310 TFKIIKGRNECGIERAVTAGIPST 333
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 80/175 (45%), Positives = 106/175 (60%), Gaps = 18/175 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
+ + LS D+LACC + CG GC+GG+P+ AW+YF GVVT C PY + C
Sbjct: 23 KQVLLSDQDMLACCSW-CGYGCEGGWPMKAWQYFXLEGVVTGGNYRKQGCCRPY-EFPPC 80
Query: 73 SHPGCEPAY-------PTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
G EP Y TPKC + C + + ++ KH+ SAYR+ ++ + I +I KN
Sbjct: 81 GRHGKEPYYGECYDSAKTPKCQKTCQRGYLKPYKEDKHFGKSAYRLPNNVKAIQRDIMKN 140
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
GPV F VYEDFAHYKSG+YKH G + GGHAVK+IGWG + G YW++AN W
Sbjct: 141 GPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWG-KEXGTPYWLIANSW 194
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 78/176 (44%), Positives = 104/176 (59%), Gaps = 17/176 (9%)
Query: 48 SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
AW Y+V G+VT EE C PY C H P C Y TP+C + C K
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224
Query: 94 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
+ + KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+TG +
Sbjct: 225 YKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSI 284
Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+GGHA+++IGWG + G+ YW++AN WN WG G F++ RG +EC IE VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 111/200 (55%), Gaps = 15/200 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N LS + +CC + CG GC GGYPI AWRY+ HG+VT E C PY
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192
Query: 74 HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 130
+ C + KC +KC + +R + Y S Y + D ++ +I GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250
Query: 131 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY+DF YKSGVY K +GGH+VK IGWG + YW++ N WN +WG GYF
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGV-ERNVSYWLMMNSWNSTWGDGGYF 309
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KI+RG+NEC +E+ AG+P
Sbjct: 310 KIRRGTNECQVEDSSTAGVP 329
>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/183 (43%), Positives = 106/183 (57%), Gaps = 18/183 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
+S DL++C GC+GGY AW + HG+ TE+C PY +G
Sbjct: 112 MSPQDLVSC--DTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSG----------RV 159
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
P C KCV + + RN S+S ++N+ + +M E+Y+NGP+ V+FTVY DF +YKSG
Sbjct: 160 PACPAKCVNGSAIVRNK---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSG 214
Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
VY H TG + GGHAV +GWG +D YW+ N W +WG G+FKI RGSN CGIE
Sbjct: 215 VYVHKTGGIAGGHAVLCVGWGV-EDNTPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQ 273
Query: 204 VVA 206
A
Sbjct: 274 SYA 276
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 112/195 (57%), Gaps = 21/195 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
Q + +S D+L+C GC+GGYP A+ ++ GVVT S ++ GC+P
Sbjct: 158 QKVHISAQDILSCATDR-SQGCNGGYPDEAFEHYAQSGVVT-------GSGNSANQGCKP 209
Query: 80 ---------AYPTPKCVRKC--VKKNQLWRNSKHYSISAYRIN-SDPEDIMAEIYKNGPV 127
Y TP+C +KC + + ++ KH+ +S Y + SDP DI EI NGPV
Sbjct: 210 YPFLPHTTVEYSTPECSKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMNNGPV 269
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGAD 186
E + VY DF YKSGVY+ + +GGHAV+++GWG + YW++AN WN WG D
Sbjct: 270 EANMIVYYDFMFYKSGVYQTVFPWPLGGHAVRIVGWGVDGPTKVPYWLVANSWNTDWGED 329
Query: 187 GYFKIKRGSNECGIE 201
GYF+I+RG++E IE
Sbjct: 330 GYFRIRRGTDESYIE 344
>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/167 (45%), Positives = 99/167 (59%), Gaps = 16/167 (9%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC+GGY AW + HGV TE+C PY +G P C KCV + + RN
Sbjct: 126 GCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSG----------RVPACPAKCVNGSAIVRN 175
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
S+S ++N+ + +M E+Y+NGP+ V+FTVY DF +YKSGVY H TG + GGHAV
Sbjct: 176 K---SVSYKKLNA--QQMMEELYENGPISVAFTVYYDFMNYKSGVYVHKTGGIAGGHAVL 230
Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
+GWG D+ YW+ N W +WG G+FKI RGSN CGIE A
Sbjct: 231 CVGWGVEDN-TPYWLCQNSWGPAWGEKGHFKILRGSNHCGIENQSYA 276
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/200 (41%), Positives = 112/200 (56%), Gaps = 18/200 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
LS +LL+CC CG GC+GGYP ++Y+V+ G+ T + C PY P
Sbjct: 343 LSDAELLSCCT-SCGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPY------PIPP 395
Query: 77 CE--PAYPTPKCVRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
C TPKC + C+ L N +HY + Y+ + +M +I GP+ +V
Sbjct: 396 CSNCSETRTPKCSKSCISTYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAGMSV 455
Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
YEDF HYK GVY +G +GGHAV++IGWG D+ YW++AN WN ++G DG FKI+R
Sbjct: 456 YEDFLHYKEGVYTQESGIFLGGHAVRIIGWGEQDN-IPYWLVANSWNTTFGEDGLFKIRR 514
Query: 194 GSNECGIEEDVVAGLPSSKN 213
G +ECGIE V AG K
Sbjct: 515 GFDECGIESYVSAGRAKCKQ 534
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/211 (36%), Positives = 119/211 (56%), Gaps = 21/211 (9%)
Query: 19 LQNLSLSVNDLLACC-GFL-CGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDS 69
+ N LS +LL+CC G L CG+GC GG AW+Y+ HG+ T C PY +
Sbjct: 120 MINTILSAQELLSCCTGVLSCGEGCGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIA 179
Query: 70 T------GCSHPGC-EPAYPTPKCVRKCVKKNQL---WRNSKHYSISAY-RINSDPEDIM 118
++P C PTP C +KC KN +HY S+ ++ + +I
Sbjct: 180 PCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQ 239
Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQ 178
+++ NGP+E +F VY+DF Y +G+Y H+TG+ G +V+++GWG +G YW+LAN
Sbjct: 240 SDVMLNGPIETTFEVYDDFLQYTTGIYVHLTGNKQGHLSVRILGWGMY-EGVPYWLLANS 298
Query: 179 WNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
W + WG +G F+ RG+NECG+E + V+ +P
Sbjct: 299 WGKEWGENGTFRALRGTNECGLEANCVSAMP 329
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 108/204 (52%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 133 NELLSAEELTFCC-HKCGFGCHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPL 191
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C L ++ H++ AY + I ++ GP
Sbjct: 192 DEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKKDHHFTRDAYYLTFGI--IQRDVMAYGP 249
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E S+ VY+DF YKSGVY + +GGHAVKLIGWG + G YW++ N WN WG
Sbjct: 250 IEASYDVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNDQWGD 308
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 309 KGLFKIRRGTNECGIDNSTTGGVP 332
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/200 (39%), Positives = 110/200 (55%), Gaps = 13/200 (6%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVH-------HGVVTEECDPYFDST 70
+ + L +S DLL C GC+GG+P AW + + +G + + C YF
Sbjct: 129 ATKKLLVSSQDLLTCG---TAGGCNGGWPAVAWSDWTNGIVTGGLYGALEQGCKSYFLEG 185
Query: 71 GCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
HP C TP CV +C + + ++ + Y + Y I + E I EI NGPVE
Sbjct: 186 CDDHPNKCRNYVSTPACVEQCDEPSLYYKAQETYGQTPYEIQGE-EQIQYEIMTNGPVEA 244
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+ VY DFA Y+SG+Y+ T + GGHAVK++GWG +DG YW++AN WN WG +G F
Sbjct: 245 TMDVYVDFAQYQSGIYQLTTDEYEGGHAVKILGWGV-EDGVKYWLVANSWNERWGENGLF 303
Query: 190 KIKRGSNECGIEEDVVAGLP 209
+I RG +E GIE + A LP
Sbjct: 304 RIIRGRDEVGIESTIDAALP 323
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 111/204 (54%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N LS +L CC CG GC+GGYPI AW+YF HG+VT + C+PY
Sbjct: 137 NELLSAEELTFCC-HACGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR 195
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
+ G S +P +C R C L + H ++ Y + I ++ GP
Sbjct: 196 NEDGKSSCAGKPKEKNHRCTRMCYGNQDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGP 253
Query: 127 VEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY+ +GGHAVKLIGWG ++G YW++ N WN WG
Sbjct: 254 IEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGTPYWLMVNSWNAQWGD 312
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G FKI+RG++EC I+ AG+P
Sbjct: 313 NGLFKIRRGTDECRIDSATTAGVP 336
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 113/206 (54%), Gaps = 19/206 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS ++ CC CG+GC+GGYPI AW+ F +HG+VT E C+PY +
Sbjct: 136 NQLLSAEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY 194
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
D G + +P KC +KC + N H Y+ Y + I ++ GP
Sbjct: 195 DKDGKNTCSGQPMESNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYGP 252
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF +YKSG+Y K +GGH+VKLIGWG + G YW++ N WN WG
Sbjct: 253 IETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWLMVNSWNADWGD 311
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSS 211
G FKI+RG+NEC ++ G+P +
Sbjct: 312 KGLFKIRRGTNECRVDNSTTGGVPDT 337
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/164 (48%), Positives = 100/164 (60%), Gaps = 16/164 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
QN+ +S DLL+CCGF CG GC+GGYP AW+Y+ G+V+ C PY C
Sbjct: 62 QNVEVSAEDLLSCCGFECGMGCNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPP-C 120
Query: 73 SH------PGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C TPKCV+KC + K Y SAY + S PE IM EIYK+
Sbjct: 121 EHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKD 180
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
GPVE +FTVYEDF YKSGVY+H TG+ +GGHA+K++GWG ++
Sbjct: 181 GPVEGAFTVYEDFLLYKSGVYQHHTGEAVGGHAIKILGWGIENN 224
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/202 (42%), Positives = 115/202 (56%), Gaps = 20/202 (9%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH 74
+ LS +L++CC C GC+ GY SAW Y+V +G+VT E C PY C H
Sbjct: 135 VELSAIELVSCCS-KCAVGCNFGYSESAWYYWVENGLVTGESNGNNSGCLPY-PFPKCDH 192
Query: 75 PGCEPAYPT--------PKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
G +YP P C C + + + KH+ SAY++ + DI EI G
Sbjct: 193 -GSSDSYPMCGYVVYTPPVCNGTCRPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYG 251
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE S +Y+DF YKSGVYKH+TG ++ +V++IGWG ++G YW+ AN WN WG
Sbjct: 252 PVEASIFIYDDFVDYKSGVYKHLTGRLITIQSVRIIGWGI-ENGIPYWLCANSWNEEWGL 310
Query: 186 DGYFKIKRGSNECGIEEDVVAG 207
+G+FKI RGSNEC IE V AG
Sbjct: 311 NGFFKILRGSNECEIEAFVNAG 332
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/224 (37%), Positives = 118/224 (52%), Gaps = 20/224 (8%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
M+ + D L + L LS ++ CC CG GC+GGYPI AW F + G+VT
Sbjct: 119 MATSSAFADRLCVATNADFNEL-LSAEEITFCCS-SCGYGCNGGYPIKAWESFNNRGLVT 176
Query: 61 -------EECDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSIS 106
E C+PY +D+ G + +P +C R C L N H ++
Sbjct: 177 GGDYQSGEGCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYNDDHRFTRD 236
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGT 165
+Y + I ++ + GP+E SF +Y+DF YKSGVY + +GGHAVKLIGWG
Sbjct: 237 SYYLTY--SSIQKDVMRYGPIEASFDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWG- 293
Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+ G YW++ N WN WG +G FKI+RG+NECGI+ G+P
Sbjct: 294 EEHGVLYWLMVNSWNEGWGDNGLFKIRRGTNECGIDNSTTGGVP 337
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 111/204 (54%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS ++ CC CG GC+GGYPI AW F G+VT E C+PY +
Sbjct: 138 NELLSAEEITFCC-HSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
D+ G + +P +C R C L + H Y+ +Y + I ++ GP
Sbjct: 197 DAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTYGP 254
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW++ N WN WG
Sbjct: 255 IEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNADWGD 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G FKI+RG+NECGI+ AG+P
Sbjct: 314 NGLFKIRRGTNECGIDNSTTAGVP 337
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/176 (43%), Positives = 104/176 (59%), Gaps = 17/176 (9%)
Query: 48 SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
AW Y+V G+VT EE C PY C H P C Y TP+C + C K
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224
Query: 94 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
+ + KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+ G +
Sbjct: 225 YKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSI 284
Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+GGHA+++IGWG + G+ YW++AN WN WG +G F++ RG +EC IE VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 111/204 (54%), Gaps = 16/204 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF---DST 70
N LS ++ CC CG GC GGYPI AW+ F HG+VT E C+PY +
Sbjct: 138 NELLSAEEITFCC-HTCGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSND 196
Query: 71 GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEV 129
G S +P C R C + N H Y+ Y + I ++ GP+E
Sbjct: 197 GNSSSSDQPLAINHICRRHCYGNQSIDFNDDHRYTRDYYYLTYG--SIQKDVLTYGPIEA 254
Query: 130 SFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
SF VY+DF YKSGVY K +GGHAVKLIGWG +DG YW++ N WN WG +G+
Sbjct: 255 SFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWG-EEDGTPYWLMVNSWNTQWGDNGF 313
Query: 189 FKIKRGSNECGIEEDVVAGLPSSK 212
FKI+RG+NECG++ AG+P +
Sbjct: 314 FKIRRGTNECGVDNSTTAGVPVTN 337
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/176 (43%), Positives = 104/176 (59%), Gaps = 17/176 (9%)
Query: 48 SAWRYFVHHGVVT---EE----CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKK 93
AW Y+V G+VT EE C PY C H P C Y TP+C + C K
Sbjct: 166 QAWDYWVKRGIVTGGSEENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKG 224
Query: 94 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
+ + KHY Y + S+ + I EI GPVE +F VYEDF +YKSG+Y+H+ G +
Sbjct: 225 YKTPYEQDKHYGDQRYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSGIYRHVAGSI 284
Query: 153 MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+GGHA+++IGWG + G+ YW++AN WN WG +G F++ RG +EC IE VVAGL
Sbjct: 285 VGGHAIRIIGWGV-EKGKPYWLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 114/204 (55%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
N LS +L CC CG GC GGYPI AW+YF GV T E C PY ++
Sbjct: 135 NQLLSPEELAFCCK-DCGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYN 193
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
G + G +P +C + C K + +++ + S Y INS + I ++ GPVE
Sbjct: 194 KQGKNTCGGQPMERNHQCPKTCYGKTTV--QNRYKTKSEYSINS-IKTIEQDLKTYGPVE 250
Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
SF VY+DF+ YKSG+Y+ G H++K+IGWG ++G YW+ N W++ WG G
Sbjct: 251 ASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWG-QENGTTYWLAVNSWSKFWGEHG 309
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
FKI +G NECGIE V AG+PSS
Sbjct: 310 TFKIIKGRNECGIERAVTAGIPSS 333
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 111/204 (54%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS ++ CC CG GC+GGYPI AW F G+VT E C+PY +
Sbjct: 138 NELLSAEEITFCC-HSCGFGCNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
D+ G + +P +C R C L + H Y+ +Y + I ++ GP
Sbjct: 197 DAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG--SIQKDVMTYGP 254
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW++ N WN WG
Sbjct: 255 IEASFDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNADWGD 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
+G FKI+RG+NECGI+ AG+P
Sbjct: 314 NGLFKIRRGTNECGIDNSTTAGVP 337
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 114/206 (55%), Gaps = 23/206 (11%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS ++ CC CG GC+GGYPI AW F HG+VT E C+PY +
Sbjct: 138 NQLLSAEEITFCC-HKCGYGCNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPY 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAY--RINSDPEDIMAEIYKN 124
D +G + +P +C R C L + H ++ +Y I S +D+M
Sbjct: 197 DESGNNTCSGKPMEQNHRCTRMCYGDQDLDFDDDHRHTRDSYYLTIGSIQKDVMTY---- 252
Query: 125 GPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
GP+E SF VY+DF YKSGVY + +GGHAVKLIGWG + G YW++ N WN W
Sbjct: 253 GPIEASFDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWG-EEYGTPYWLMMNSWNADW 311
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLP 209
G +G FKI+RG+NECG++ AG+P
Sbjct: 312 GDEGLFKIRRGTNECGVDNSTTAGVP 337
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 113/221 (51%), Gaps = 33/221 (14%)
Query: 2 SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVT 60
S ++R A++S+ V N LS DL++C GD GC GGY AW Y +G+VT
Sbjct: 126 SEVLSDRFAIASNGTV---NKILSPEDLVSCDK---GDMGCQGGYLDKAWDYLKTNGIVT 179
Query: 61 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
E C PY G + P C CV K Y S Y + EDIM E
Sbjct: 180 ESCFPYAAQKGVA----------PSCRISCVDGEPY----KKYKASDYYQLTTEEDIMKE 225
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM-GGHAVKLIGWGTS------DDGEDYW 173
IY NGPVE F VY F YKSGVY H D+M GGHA+K++GWG YW
Sbjct: 226 IYLNGPVEAGFRVYTSFMSYKSGVYHHRILDIMEGGHAIKIVGWGVEPPKRFWQKPTKYW 285
Query: 174 ILANQWNRSWGADGYFKIKRGSN-----ECGIEEDVVAGLP 209
I AN W WG +G+FKI+RG N ECGIE+ V AG P
Sbjct: 286 ICANSWTADWGMNGFFKIRRGKNRFGQSECGIEDQVFAGHP 326
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 113/206 (54%), Gaps = 21/206 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N +S +L CC C GC+GGYP+ AW+YF HGVVT + C PY
Sbjct: 134 NELISAEELTFCC-HRCVFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPCVK 192
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGP 126
D G + +P KC +KC + + HY AY + + +Y GP
Sbjct: 193 DDEGHNSCSGQPTERNHKCSKKCYGDDTIDYKKNHYKTKDAYYLKNTTMQKDTMVY--GP 250
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDV--MGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
+E SF VY+DF +Y+SGVY+ TG+ +GGHAVK+IGWG ++G YW++ N W WG
Sbjct: 251 IEASFDVYDDFMNYESGVYQR-TGNASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWG 308
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPS 210
G FKI RG++ECGIE AG+PS
Sbjct: 309 DKGMFKILRGTDECGIESSCTAGVPS 334
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/203 (41%), Positives = 113/203 (55%), Gaps = 21/203 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE------CDPY-------- 66
L +S D+++CC LCG GCDGG+PI A+ YF G VT E C PY
Sbjct: 144 QLHISSIDIVSCCK-LCGYGCDGGWPIEAFDYFSRQGAVTGETTSKDGCRPYPFHPLWTY 202
Query: 67 -FDSTGCSHPG-CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
D+ G G C+ + + V++ V +N R + RI + + N
Sbjct: 203 GNDTVGRRMSGRCKHSKTVGEGVKR-VTRNHTRRTG--LTARRLRITEFCQSHSEGDHGN 259
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV FTVYEDF++YK G+Y HI G G HA+K+IGWG ++G YW++AN W+ WG
Sbjct: 260 GPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIKIIGWGV-ENGLPYWLIANSWHDDWG 318
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
G F+I RG NECGIE++VVAG
Sbjct: 319 EQGLFRIVRGINECGIEQEVVAG 341
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/196 (38%), Positives = 110/196 (56%), Gaps = 5/196 (2%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ S ++++CC CG GC GG+ ++Y+V +G+ + Y GC
Sbjct: 133 KKFIFSAEEVVSCCT-ACGGGCRGGFLNEPYKYWVTNGIPSG--GDYGSKLGCKPYTAAV 189
Query: 80 AYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
+ TP+C + CV + W ++ SAY++N I EI NGPV VYEDF
Sbjct: 190 SGETPQCQKACVSGYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFY 249
Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
Y +G+Y+H +G +GGHAVK+IGWG+ +D YWI AN W +G DG+F+I RGSN
Sbjct: 250 SYGTGIYQHTSGSFVGGHAVKIIGWGSEND-VPYWIAANSWGTGFGEDGFFRILRGSNCA 308
Query: 199 GIEEDVVAGLPSSKNL 214
GIE +VAG P++ +
Sbjct: 309 GIESYIVAGYPNTSEV 324
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 116/226 (51%), Gaps = 20/226 (8%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
MS + D L + L LS ++ CC CGDGC GGYPI AW+ + HG+VT
Sbjct: 56 MSTSSAFSDRLCVATNGDFNQL-LSAEEITFCC-HTCGDGCSGGYPIRAWKRYKKHGLVT 113
Query: 61 -------EECDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSIS 106
E C+PY D G + +P +C R C L + H Y+
Sbjct: 114 GGNYKSGEGCEPYRVPPCPNDDQGNNTCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRD 173
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGT 165
Y + I ++ GP+E SF VY+DF YKSG+Y K +GGH+VKLIGWG
Sbjct: 174 HYYLTY--RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWG- 230
Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
+ G YW++ N WN WG G FKI+RG+NECG++ G+P++
Sbjct: 231 EEYGVLYWLMVNSWNADWGDKGLFKIRRGTNECGVDNSTTGGVPAT 276
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 117/218 (53%), Gaps = 33/218 (15%)
Query: 21 NLSLSVNDLLACCG---FLCGDGCDGGYPISAWRYFVHHGVVT-------------EECD 64
N LS D+LACC F GC GG PI++W + +G+V+ + C
Sbjct: 151 NQLLSAADMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCW 210
Query: 65 PYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINS 112
PY + C+H P + Y TP C C K + +HY+ S + R S
Sbjct: 211 PY-NFPKCAHHQKESDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS 269
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
I EI NGP +F+VYEDF YKSGVYKH +G +GGHAV++IGWGT + G DY
Sbjct: 270 T-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDY 327
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
W++ N WN WG G FKI +G +CGI++ ++AG P+
Sbjct: 328 WLVMNSWNEEWGDHGTFKIVQG--DCGIDDMILAGTPA 363
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 81/205 (39%), Positives = 109/205 (53%), Gaps = 19/205 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N +S +L CC CG GC+GG P+ AW+YF HGVVT + C PY
Sbjct: 134 NELISAEELTFCC-HTCGFGCNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPCVR 192
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGP 126
D G + +P KC +KC + HY AY +++ +Y GP
Sbjct: 193 DDEGHNSCSGQPTERNHKCSKKCYGDETINYKKNHYKTKDAYYLSNTTMQKDTMVY--GP 250
Query: 127 VEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF Y+SGVY+ +GGHAVK+IGWG ++G YW++ N W WG
Sbjct: 251 IEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGV-EEGTPYWLMVNSWGEQWGD 309
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPS 210
G FKI RG++ECG+E AG+PS
Sbjct: 310 KGMFKILRGTDECGVESSCTAGVPS 334
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 112/216 (51%), Gaps = 37/216 (17%)
Query: 23 SLSVNDLLACCGFLC----GDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
+S DLL+CCG C GCDGGYP AW+Y G+VT C PY
Sbjct: 123 QISAEDLLSCCGINCELDGNGGCDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPY-SFPP 181
Query: 72 CSH-------PGCEPAY-----PTPKCVRKCVKKNQLWRNSKHYSI-------SAYRINS 112
CSH CE + TP C +KC + S+ Y + + Y++
Sbjct: 182 CSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQF-----SRTYDVDKIRSRENPYKLIK 236
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
D E I EIY NGPV+ FTV++DF +YKSGVY+ TG G HAVK+IGWGT ++G Y
Sbjct: 237 DQEQIKNEIYLNGPVQAVFTVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGT-ENGVPY 295
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
W N WN WG +G FKI RG N IE +V A +
Sbjct: 296 WEAINSWNDGWGINGKFKILRGFNHLDIEGEVYASI 331
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 142 bits (359), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 109/204 (53%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N LS ++ CC CG GC+GGYPI AW+ F G+VT E C+PY
Sbjct: 136 NELLSAEEITFCC-HTCGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPN 194
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
D G + +P +C R C L + H Y+ Y + I ++ GP
Sbjct: 195 DDQGNNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYLTYG--SIQKDVMTYGP 252
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY K +GGHAVKLIGWG + G YW++ N WN WG
Sbjct: 253 IEASFDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWG-EEYGVPYWLMVNSWNEDWGD 311
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G+FKI+RG+NECG++ AG+P
Sbjct: 312 HGFFKIQRGTNECGVDNSTTAGVP 335
>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
protease B3; Flags: Precursor
gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
Length = 299
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/168 (44%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 140 CDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDLPHLY 190
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
K Y + D IM + GP++ +FTVY DF +Y+SGVY+H G V GGHAV +
Sbjct: 191 KATKAVDYGL--DAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVEGGHAVDM 248
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT DDG DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 249 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 296
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 82/200 (41%), Positives = 110/200 (55%), Gaps = 15/200 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N LS + +CC + CG GC GGYPI AWRY+ HG+VT E C PY
Sbjct: 134 NQLLSAEHVTSCC-YRCGLGCQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPCTG 192
Query: 74 HPGCE-PAYPTPKCVRKCVKKNQL-WRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVS 130
+ C + KC +KC + +R + Y S Y + D ++ +I GP+E S
Sbjct: 193 NNSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD--NMQNDIMTYGPIESS 250
Query: 131 FTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VY+DF YKSGVY K +GGH+VK IGWG + YW++ N WN +WG G F
Sbjct: 251 FDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-VSYWLMMNSWNNTWGDGGNF 309
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KI+RG+NEC +E+ AG+P
Sbjct: 310 KIRRGTNECQVEDSSTAGMP 329
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 59/113 (52%), Positives = 89/113 (78%), Gaps = 1/113 (0%)
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
++ KH+ S+Y + ++ ++IMAEIYKNGPVE +F+VY DF YKSGVY+H++G++MGGH
Sbjct: 6 YKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGH 65
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
A++++GWG ++G YW++ N WN WG +G+FKI RG + CGIE ++VAG+P
Sbjct: 66 AIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 109/204 (53%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC GGYPI AW F HG+VT E C PY
Sbjct: 133 NELLSAEELTFCC-HTCGYGCHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPL 191
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
D G + +PA +C R C +++ ++ ++ AY + I ++ GP
Sbjct: 192 DEYGNNTCRGKPAEKNHRCTRMCYGDQDRDFKEDHRFTRDAYYLTYGT--IQKDVMTYGP 249
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E S+ VY+DF YKSGVY + +GGHAVKLIGWG + G YW++ N WN WG
Sbjct: 250 IEASYEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWG-EEYGVPYWLMVNSWNDQWGD 308
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECGI+ G+P
Sbjct: 309 RGLFKIRRGTNECGIDNSTTGGVP 332
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 89/139 (64%), Gaps = 3/139 (2%)
Query: 74 HPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPE-DIMAEIYKNGPVEVSF 131
+P C+ Y P C ++C K + L + KHY+ AYRI S E I EI KNGPV SF
Sbjct: 121 NPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASF 180
Query: 132 TVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
TVY DF HY SGVYK ++GGHAV++IGWG + YW+++N WN WG G FK
Sbjct: 181 TVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGTYPYWLVSNSWNERWGDQGLFK 240
Query: 191 IKRGSNECGIEEDVVAGLP 209
I RG NECGIEE++ AGLP
Sbjct: 241 IWRGKNECGIEEEITAGLP 259
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 107/203 (52%), Gaps = 18/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
N LS L CC + CG GC GG PI AW+YF G+ T E C PY +D
Sbjct: 136 NEQLSAEKLTFCC-WTCGLGCQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPCYD 194
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
G +P KC R C + + Y + + + + I +I GPVE
Sbjct: 195 DQGEFLCQGKPTEHNHKCPRACYGNSTV---ENRYKVESIYVLDSFKTIEQDIRTYGPVE 251
Query: 129 VSFTVYEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
SF VY+DF YKSG+Y+ + +GGH+VKLIGWG +DG YW+L N W++ WG G
Sbjct: 252 ASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWG-EEDGIPYWLLVNSWSKFWGEQG 310
Query: 188 YFKIKRGSNECGIEEDVVAGLPS 210
F+I +G NECGIE AG+PS
Sbjct: 311 TFRIIKGRNECGIERSATAGIPS 333
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/210 (40%), Positives = 112/210 (53%), Gaps = 18/210 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S +LL+CC C GC G AW ++V HG+V+ E C PY C
Sbjct: 100 KHFHFSALNLLSCCD-SCEKGCLGCDHHLAWDHWVKHGIVSGGSYGSKEGCQPYHLPP-C 157
Query: 73 SH------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIM-AEIYKN 124
H C PTP C R C ++ + + H+ Y + E I+ EI+ N
Sbjct: 158 EHHRAGPRRNCTKYGPTPSCARVCQPDYKISYEDDLHFGKQWYALAPHNEKIIRTEIFHN 217
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSW 183
GPVE + YEDF Y+SG+Y HI G + HAVK+IGWGT YW++AN +N W
Sbjct: 218 GPVEATMAAYEDFYTYESGIYHHIEGTFVCDHAVKIIGWGTDKKTNTPYWLVANSFNTDW 277
Query: 184 GADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
G G+FKIKRG NECGIE + AG+P+ KN
Sbjct: 278 GEYGFFKIKRGVNECGIENKITAGIPAYKN 307
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 113/205 (55%), Gaps = 16/205 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF---- 67
+ N LS D+L+CC +CG C GGYP +AW Y+ G+V+ + C PY
Sbjct: 137 VMNFRLSGLDMLSCCA-ICGFACQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEPC 195
Query: 68 -DSTGCSHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
S S P C +C C ++ ++ K+++ Y I++D +I EI NG
Sbjct: 196 DHSGNGSRPVCTVGGGV-RCQHLCEPSYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNG 254
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWG 184
PV+ TVYEDF YK+GVY H+ G+ +G HAV+++GWG YW++AN W WG
Sbjct: 255 PVQAILTVYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSDWG 314
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+G+F I RG N C IE ++AGLP
Sbjct: 315 DNGFFHIFRGENHCDIEGYIMAGLP 339
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/179 (39%), Positives = 105/179 (58%), Gaps = 18/179 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
+N S +L++CC + CG GC+GG+P +AW Y+ G+V+ PY + GC
Sbjct: 77 KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSNMGCIPYEIAP 133
Query: 73 -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ C+ TPKCV+KC ++ + H SAY +++D + I EIY N
Sbjct: 134 CEHHVNGTRGPCKEGGKTPKCVKKCEDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTN 193
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
GPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW++AN WN W
Sbjct: 194 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWNTDW 252
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 104/171 (60%), Gaps = 16/171 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL CCG CGDGC+GGYP AW ++ G+V+ C PY C
Sbjct: 26 NVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPP-CE 84
Query: 74 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P TP+C + C + ++ KH+ ++Y +++ ++IMAEIYKNGP
Sbjct: 85 HHVNGSRPPMHGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGP 144
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
VE +FTV+ DF YKSGVYKH GD+MGGHA++++GWG ++G YW+ AN
Sbjct: 145 VEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGV-ENGVPYWLAAN 194
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 107/202 (52%), Gaps = 32/202 (15%)
Query: 25 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 84
S D++ C + GCDGG+P +Y + +G+ E CDPY +
Sbjct: 242 SPQDIVDCSAY--SQGCDGGFPFLVGKYAMDYGLTVESCDPY------------QGHDLG 287
Query: 85 KCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
KC +C V + Q +S +Y + Y NS +M EIY+NGP+ + F VY D +YK G
Sbjct: 288 KCSNQCPVNRQQRLHSSNYYFVGGYYGNSHELSMMHEIYQNGPLAIGFEVYPDLRNYKHG 347
Query: 144 VYKHITGDVMGG----------------HAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
VYKH+T + + HAV ++GWG ++G YW + N W+ +WG +G
Sbjct: 348 VYKHVTAEELKAQGLSEDEMIPHFEVVNHAVLMVGWGV-ENGTPYWKIKNSWSTTWGDNG 406
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YFKI RGS+ECG+E D AG+P
Sbjct: 407 YFKILRGSDECGVESDAEAGIP 428
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 115/194 (59%), Gaps = 18/194 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
S DLL+CC CGD C GGY +SA ++++ G+V+ E C PY T +H
Sbjct: 136 FSPEDLLSCCT-SCGD-CGGGYMMSALDFYINEGIVSGGDVNSNEGCRPY---TADAHDQ 190
Query: 77 CEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
+ TP C + C + + KHY + Y ++S + I E+ NGP+ V+F V++
Sbjct: 191 GQ----TPACTKSCRNGYSTSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPIIVNFEVFQ 246
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF +Y SGVY+H++G+ +G H VK++GWG ++G YW++AN W SWG G+FK+ RG
Sbjct: 247 DFYNYVSGVYRHVSGESVGFHVVKIVGWGV-ENGVPYWLIANSWGSSWGDHGFFKMLRGQ 305
Query: 196 NECGIEEDVVAGLP 209
NECGIE A +P
Sbjct: 306 NECGIENYPYAVMP 319
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/185 (44%), Positives = 105/185 (56%), Gaps = 20/185 (10%)
Query: 24 LSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
LSV DL++C GD GC+GG + ++ V +GV TEEC PY G
Sbjct: 112 LSVQDLVSCDK---GDSGCNGGSGPLSSKWLVSNGVTTEECLPYVSGNG----------R 158
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
P C KC +Q+ R K+ Y + ++I E+ KNGPV FTVY DF +YKS
Sbjct: 159 VPACAAKCSNGSQIIR-YKYEKAETYTV----QNIQEELMKNGPVYFRFTVYSDFMNYKS 213
Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
GVY+H +G GGHAV LIGWG +DG YW+L N W +WG G+FKI RG NECG E+
Sbjct: 214 GVYQHKSGYQEGGHAVLLIGWGV-EDGVPYWLLQNSWGPAWGEKGHFKIIRGKNECGCEQ 272
Query: 203 DVVAG 207
AG
Sbjct: 273 GFYAG 277
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 98/171 (57%), Gaps = 12/171 (7%)
Query: 50 WRYFVHHGVVT-------EECDPYFDSTGCSHPGCEPAYP-TPKCVRKCVKKNQLWRN-- 99
W Y+V GV + + C PY C P E YP P C +C + +
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPS-EGDYPDEPNCSTRCNAGYNVTEDLR 199
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
+ + AY I +D IM +I+ NGPV+ F YED +Y GVY+H +G + GGHAVK
Sbjct: 200 DRRFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVYRHQSGRLKGGHAVK 259
Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
LIGWG +DG YW++AN W R WG DG+FK+ RG N CGIEE+V AGLPS
Sbjct: 260 LIGWGV-EDGTKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLPS 309
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/172 (43%), Positives = 100/172 (58%), Gaps = 17/172 (9%)
Query: 52 YFVHHGVVT-------EECDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQL- 96
Y V G+VT C PY C H P C Y TP+C +KC K +
Sbjct: 9 YLVKRGIVTGGSKENHTGCQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTP 67
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
+ K+Y Y + S+ + I EI NGPVE +F VYEDF +YKSG+Y+H+TG ++GGH
Sbjct: 68 YEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGH 127
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
A+++IGWG + YW++AN WN WG G F+I RG +EC IE +VVAGL
Sbjct: 128 AIRIIGWGV-EKRTPYWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGL 178
>gi|38639319|gb|AAR25797.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 218
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 61/72 (84%), Positives = 65/72 (90%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
++SLSVNDLLACC FLCG GCDGGYPI+AWRYF GVVTEECDPYFD+TGCSHPGCEP
Sbjct: 146 SISLSVNDLLACCSFLCGSGCDGGYPIAAWRYFKRSGVVTEECDPYFDTTGCSHPGCEPL 205
Query: 81 YPTPKCVRKCVK 92
YPTPKC RKCVK
Sbjct: 206 YPTPKCHRKCVK 217
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 109/204 (53%), Gaps = 19/204 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS +L CC CG GC+GGYPI AW F HG+VT E C+PY
Sbjct: 138 NEFLSPEELTFCC-HTCGYGCNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRH 196
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
+ G + +P +C R C L + H Y+ +Y + I ++ GP
Sbjct: 197 HAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFDDDHRYTRDSYYLTYG--SIQKDVMNYGP 254
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF YKSGVY + +GGHAVKLIGWG + G YW++ N WN WG
Sbjct: 255 IEASFDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGWG-EESGVPYWLMVNSWNTDWGD 313
Query: 186 DGYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NECG++ AG+P
Sbjct: 314 KGLFKIQRGTNECGVDNSTTAGVP 337
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 108/199 (54%), Gaps = 19/199 (9%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDSTGCSH 74
+L CC CG GC GGYPI AW+ F +HG+VT E C+PY +D G +
Sbjct: 1 ELTFCC-HTCGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNNT 59
Query: 75 PGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+P +C R C +L + H Y+ Y + I ++ GP+E SF V
Sbjct: 60 CAGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG--SIQKDVMTYGPIEASFDV 117
Query: 134 YEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
Y DF YKSG+Y+ +GGHAVKLIGWG G YW++ N WN WG +G FKI+
Sbjct: 118 YSDFPSYKSGIYERTENATYLGGHAVKLIGWG-EQYGIPYWLMVNSWNEDWGDNGLFKIR 176
Query: 193 RGSNECGIEEDVVAGLPSS 211
RG+NECG++ AG+P +
Sbjct: 177 RGTNECGVDNSTTAGVPVT 195
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 106/206 (51%), Gaps = 30/206 (14%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDST----------- 70
+ LS ++L+C GCDGG+ +AWRY +GV+ C PY
Sbjct: 236 VQLSAQNILSCTRR--QQGCDGGHLDAAWRYMHKNGVLDANCYPYIQQRDTCKVQRHRGR 293
Query: 71 GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
GC+PA+ V ++ + YS+S DIMAEIY +GPV+ +
Sbjct: 294 SLKAYGCQPAHG--------VNRDNFYTVGPAYSLSR------EADIMAEIYHSGPVQAT 339
Query: 131 FTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
TVY DF Y SGVY+H G G H+VKL+GWG +G YWI AN W WG G
Sbjct: 340 MTVYRDFFSYSSGVYQHTAANRGAATGFHSVKLVGWGEEHNGVKYWIAANSWGPWWGERG 399
Query: 188 YFKIKRGSNECGIEEDVVAGLPSSKN 213
YF+I RGSNECGIEE V+A P N
Sbjct: 400 YFRILRGSNECGIEEYVLASWPHVYN 425
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 85/218 (38%), Positives = 116/218 (53%), Gaps = 33/218 (15%)
Query: 21 NLSLSVNDLLACCG---FLCGDGCDGGYPISAWRYFVHHGVVT-------------EECD 64
N LS ++LACC F GC GG PI++W + +G+V+ + C
Sbjct: 30 NQLLSAANMLACCNIGHFCLSFGCSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCW 89
Query: 65 PYFDSTGCSH--------PGCEPAYPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINS 112
PY C+H P + Y TP C C K + +HY+ S + R S
Sbjct: 90 PY-SFPKCAHHQDGSDYKPCAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS 148
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
I EI NGP +F+VYEDF YKSGVYKH +G +GGHAV++IGWGT + G DY
Sbjct: 149 T-SSIKKEIMTNGPTSAAFSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDY 206
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
W++ N WN WG G FKI +G +CGI++ ++AG P+
Sbjct: 207 WLVMNSWNEEWGDHGTFKIVQG--DCGIDDTILAGTPA 242
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 114/199 (57%), Gaps = 13/199 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTGC 72
N++L+ DL+ CC CG+GC+GG+ ++++Y+V G+V T+ C PY C
Sbjct: 137 NVALAAEDLMGCC-VDCGNGCNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCKPY-PFKPC 194
Query: 73 SHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
+P + +PKC C ++ + K + AY + D I EI NGPVE
Sbjct: 195 EYPFNDCHVEISPKCTHHCRDGVDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVEAG 254
Query: 131 FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFK 190
F VYED YKSGVY+H+ G+ +G HAV++IGWG D G YW++AN + WG GYFK
Sbjct: 255 FDVYEDVLLYKSGVYRHVYGEQIGKHAVRIIGWG-RDGGIPYWLIANSYGDDWGDHGYFK 313
Query: 191 IKRGSNECGIEEDVVAGLP 209
RGSN GIE ++ GLP
Sbjct: 314 FVRGSNHLGIESKIITGLP 332
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 109/204 (53%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
N LS ++L+CC CG GC GGYP A+ Y +G+ T + C PY C
Sbjct: 144 NRILSDTEVLSCCFGSCGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDACQPY-AFYPCG 202
Query: 74 HPGCEPAY--------PTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ EP Y PTP C R C + + K ++ Y I + +I EI
Sbjct: 203 NHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTR 262
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV ++ VY DF +YK GVY H G+V G HAVK+IGWG +D YW++AN WN WG
Sbjct: 263 GPVVATYKVYRDFDYYKKGVYIHREGEVTGLHAVKIIGWGKGND-VPYWLVANSWNTDWG 321
Query: 185 ADGYFKIKRGSNECGIEEDVVAGL 208
+GYF+I RG++ C IE +V G+
Sbjct: 322 DNGYFRIVRGTDNCEIERQMVGGI 345
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 106/203 (52%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N LS +L CC LCG C GGYPI AW YF HG+VT E C PY
Sbjct: 140 NELLSAEELTFCC-HLCGFACHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPCFS 198
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
+ G + +P +C R C ++ + H Y + I ++ GP+
Sbjct: 199 EEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYA-SIQKDVMTYGPI 257
Query: 128 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
E S VY+DF YKSGVY K +GGHAVKLIGWG +DG YW++ N W+ WG
Sbjct: 258 EASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWG-EEDGVPYWLMVNSWSEMWGDK 316
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NEC ++ + AG+P
Sbjct: 317 GLFKIRRGTNECSVDNSMTAGVP 339
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 111/205 (54%), Gaps = 28/205 (13%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------FDST 70
LS +L+CC +LCGDGC GG +W ++ HG+V+ E C PY T
Sbjct: 106 LSAQQILSCC-YLCGDGCSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTET 164
Query: 71 GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK------HYSISAYRINSDPEDIMAEIYKN 124
+ TP+C +C + R K HY + AY M EIY+N
Sbjct: 165 AVENACSNKTLFTPECKVQCYNPDYGTRYVKDNHQGTHYRVPAYTA-------MKEIYEN 217
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GP+ SF +Y+DF +Y+SGVY + +G + AVK++GWG ++G YW+ AN +N WG
Sbjct: 218 GPITASFYMYQDFVNYQSGVYAYNSGKYVTTQAVKILGWG-EENGTPYWLAANSFNTYWG 276
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLP 209
+G+ KI RG+NEC IEE + AGLP
Sbjct: 277 DNGFVKILRGANECYIEEFMYAGLP 301
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 104/191 (54%), Gaps = 11/191 (5%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-CDPYFDSTGCSHP--GCEPA 80
LS ++ AC F GC GG P SAW + G+ T E P S + P +
Sbjct: 196 LSAGEMNACTLFF---GCGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDI 252
Query: 81 YPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
YPTP CV +C K R+ +H+ + + + D I +GPV SFTVYEDF
Sbjct: 253 YPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTDGPVSASFTVYEDFL 312
Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
YKSGVYKH +G +GGHAVK+IGWG G+ YW+ N WN WG G FKI G+ C
Sbjct: 313 AYKSGVYKHTSGSYLGGHAVKIIGWG-EKSGQAYWLAVNSWNEDWGDKGLFKIALGN--C 369
Query: 199 GIEEDVVAGLP 209
GI++D++ G P
Sbjct: 370 GIDDDLLGGTP 380
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 106/203 (52%), Gaps = 17/203 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYF------ 67
N LS +L CC LCG C GGYPI AW YF HG+VT E C PY
Sbjct: 140 NELLSAEELTFCC-HLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPCFS 198
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
+ G + +P +C R C ++ + H Y + I ++ GP+
Sbjct: 199 EEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTYA-SIQKDVMTYGPI 257
Query: 128 EVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
E S VY+DF YKSGVY K +GGHAVKLIGWG +DG YW++ N W+ WG
Sbjct: 258 EASMEVYDDFPSYKSGVYEKSENATYLGGHAVKLIGWG-EEDGVPYWLMVNSWSEMWGDK 316
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G FKI+RG+NEC ++ + AG+P
Sbjct: 317 GLFKIRRGTNECSVDNSMTAGVP 339
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 65/119 (54%), Positives = 85/119 (71%), Gaps = 1/119 (0%)
Query: 94 NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
N + N K Y YR+ S+ E IM E+ ++GPVEV F VY DF +YKSGVY+H++G ++
Sbjct: 3 NVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVYQHVSGALL 62
Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
GGHAV+L+GWG ++ YW++AN WN WG +GYFKI RG NECGIE DV AG+P K
Sbjct: 63 GGHAVRLLGWGEENN-VPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKIK 120
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/222 (38%), Positives = 120/222 (54%), Gaps = 39/222 (17%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
++ S+ D+L+CCG+ CG+GC+GG AW Y+ G+V+ + C PY C
Sbjct: 133 EHFYFSIKDVLSCCGY-CGNGCEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPY-TIPPC 190
Query: 73 SH---------------PGCE--PAYP--------TPKCVRKCVKKNQL-WRNSKHYSIS 106
+H P C+ P P TP+C +KC K ++ + KH S
Sbjct: 191 NHLVWGEIEQCKNIPMTPKCKNIPVIPEQCKYIPITPECEKKCNKNYKVCYSKDKHRGKS 250
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
YR+ +I EIY+ GPV FTVYEDF +YK G+Y + +G +G H+VK+IGWG
Sbjct: 251 VYRVKKS--EIFKEIYEYGPVTSYFTVYEDFLNYKEGIYNYTSGQKLGLHSVKIIGWG-E 307
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKR-GSNECGIEEDVVAG 207
+ G YW+ AN +N WG G+FKI R G CGI ++VVAG
Sbjct: 308 ERGIKYWLAANSFNTDWGDKGFFKIIREGVGSCGISDNVVAG 349
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 107/200 (53%), Gaps = 13/200 (6%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPA 80
+ LS ++L+C GC+GG+ +AWRY GVV E C PY C P +
Sbjct: 191 IQLSPQNILSCTRR--QQGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDACKIPHNSRS 248
Query: 81 YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
C V +++L+ YS++ + DIMAEI+ +GPV+ + TVY DF
Sbjct: 249 LRNNGCRSYSGVDRDELYTVGPAYSLN------NETDIMAEIFMSGPVQATLTVYRDFFS 302
Query: 140 YKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
Y G+Y+H G +G H+VKLIGWG DG YWI N W WG G F+I RGSN
Sbjct: 303 YSGGIYRHTAASRGSPVGFHSVKLIGWGEEHDGNKYWIATNSWGTWWGEHGNFRILRGSN 362
Query: 197 ECGIEEDVVAGLPSSKNLVK 216
ECGIEE V+A P+ N K
Sbjct: 363 ECGIEEYVLAAWPNVYNYFK 382
>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
Length = 282
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 107/204 (52%), Gaps = 21/204 (10%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
SV T D LS + +S DL++C GC+GGY AW + HGV
Sbjct: 92 FSVAETMGDRLS---IIGCGRGHMSPQDLVSC--DTTDMGCNGGYMDKAWAWTKSHGVTN 146
Query: 61 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
EEC PY G P C KCV + + R +K S + + + + E
Sbjct: 147 EECMPYQSGGG----------RVPACPAKCVNGSTIVR-TKSQSFTHFTAS----QMQQE 191
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
+Y+NGP+ V+FTVY DF +YKSGVY H TG V GGHAV IGWG D+ YW+ N W
Sbjct: 192 LYENGPLSVAFTVYYDFMNYKSGVYVHKTGGVAGGHAVLCIGWGVEDN-TPYWLCQNSWG 250
Query: 181 RSWGADGYFKIKRGSNECGIEEDV 204
+WG G+FKI RGSN CGIE V
Sbjct: 251 PAWGEKGHFKILRGSNHCGIENQV 274
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/176 (39%), Positives = 104/176 (59%), Gaps = 18/176 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
+N S +L++CC + CG GC+GG+P +AW Y+ G+V+ PY + GC
Sbjct: 75 KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEIAP 131
Query: 73 -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ C+ TP CV+KC + ++ + H+ SAY I +D + I EIY N
Sbjct: 132 CEHHVNGTRGPCKEGGKTPTCVKKCEEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTN 191
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
GPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW++AN WN
Sbjct: 192 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 247
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 107/202 (52%), Gaps = 18/202 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
N LS D+ CC CG GC+GGYPI AW+YF GV T E C PY FD
Sbjct: 135 NELLSPEDVAFCCQ-NCGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFD 193
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
G + +P +C + C + K Y + + + P + ++ K GP+E
Sbjct: 194 QKGKNTCAGKPLERNHQCPKTCYGSTTV---QKRYKVKNEYVLNSPNTMEQDLIKYGPIE 250
Query: 129 VSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
SF +++D + YKSG+Y+ + GH++K+IGWG ++G YW+ N W++ WG G
Sbjct: 251 ASFNLFDDLSAYKSGIYQKTPKAKFLSGHSIKIIGWG-KENGVPYWLAVNSWSKFWGEQG 309
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
F+I +G NECGIE AG+P
Sbjct: 310 TFRIIKGRNECGIERSATAGIP 331
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 106/202 (52%), Gaps = 30/202 (14%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-----------FDST 70
+ LS ++L+C GC+GG+ +AWRY GV+ E+C PY +S
Sbjct: 236 VQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKIQRHNSR 293
Query: 71 GCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
GC+PAY V ++ L+ YS+S DIMAEIY +GPV+ +
Sbjct: 294 SLKANGCQPAYG--------VNRDSLYTVGPAYSLSR------EADIMAEIYHSGPVQAT 339
Query: 131 FTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
+Y DF Y G+Y+ G G H+VKL+GWG DG YWI AN W WG G
Sbjct: 340 MRIYRDFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHDGVKYWIAANSWGPWWGEHG 399
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
YF+I RGSNECGIEE V+A P
Sbjct: 400 YFRILRGSNECGIEEYVLASWP 421
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 111/208 (53%), Gaps = 17/208 (8%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
L + S ++ ACC CG+ C GG +A+ ++V G V+ E C PY
Sbjct: 124 LVDFRFSSENVAACCT-ECGNACYGGDEDTAFTHWVTKGFVSGGRHNSNEGCQPY-SVEE 181
Query: 72 CSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
C H P CE P C C ++ + + Y + AY + D I EI N
Sbjct: 182 CEHHIEGPRPPCEGDMPELVCSETCHEEYGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTN 241
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GPV +F VY+DF YKSGVY+H TG + G HAV++IGWG ++G YW++AN WN WG
Sbjct: 242 GPVTAAFAVYDDFLSYKSGVYQHETGLLDGYHAVRVIGWG-EEEGTPYWLVANSWNTDWG 300
Query: 185 ADGYFKIKRGSNECGIEEDVVAGLPSSK 212
+G FKI RGS+EC E D+ A SSK
Sbjct: 301 DNGLFKILRGSDECEFEGDMAAATYSSK 328
>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
Length = 236
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 100/169 (59%), Gaps = 15/169 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GCDGGY +AW + G+ +++C PY G V C K Q +
Sbjct: 78 GCDGGYLNNAWAFLAGTGIPSDKCAPYTSQNGD--------------VAACPSKCQDGSS 123
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
K Y + +D IM ++ +NGPV+ +F+VY DF YKSGVY H++G ++GGHA+K
Sbjct: 124 VKLYKAKNPQQLNDIPSIMEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGGHAIK 183
Query: 160 LIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
++GWG S + YWI+AN W SWG +G+F I RGS+ECGIE++V +G
Sbjct: 184 MVGWGVDSATNKPYWIIANSWGPSWGLNGFFWILRGSDECGIEDNVWSG 232
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/193 (40%), Positives = 105/193 (54%), Gaps = 10/193 (5%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+N+ LS ++L+C GC+GG+ +AWRY GVV E C PY C+
Sbjct: 234 ENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYTQH----RDTCKI 287
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
+ C K + R+S + AY +N + DIMAEI+ +GPV+ + V DF
Sbjct: 288 RHSRSLKANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDFFA 346
Query: 140 YKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
Y GVY+ + G H+VKL+GWG +GE YWI AN W WG GYF+I RGSN
Sbjct: 347 YSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSN 406
Query: 197 ECGIEEDVVAGLP 209
ECGIEE V+A P
Sbjct: 407 ECGIEEYVLASWP 419
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 106/205 (51%), Gaps = 29/205 (14%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF---DSTGCSHP--- 75
+ LS ++L+C GC+GG+ +AWRY GVV E C PY DS H
Sbjct: 236 VQLSPQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDETCYPYTQRRDSCKIRHNSRS 293
Query: 76 ----GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
GC PAY V ++ L+ YS+ DIMAEIY +GPV+ +
Sbjct: 294 LKANGCRPAYG--------VNRDSLYTVGPAYSLKG------ETDIMAEIYHSGPVQATM 339
Query: 132 TVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
VY DF Y GVY+ G G H+VK++GWG DG YWI AN W WG GY
Sbjct: 340 RVYRDFFSYSGGVYRQTAANRGAPTGFHSVKIVGWGEEHDGVKYWIAANSWGPWWGEHGY 399
Query: 189 FKIKRGSNECGIEEDVVAGLPSSKN 213
F+I RGSNECGIEE V+A P+ N
Sbjct: 400 FRILRGSNECGIEEYVLASWPNVYN 424
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 73/160 (45%), Positives = 96/160 (60%), Gaps = 16/160 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CCG CG GC+GGYP AW ++ G+V+ C PY C
Sbjct: 63 NVEISAEDLLSCCGMECGFGCNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPP-CE 121
Query: 74 H------PGCEPAY-PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
H P C TPKCV +C + KH+ ++Y ++S+ DI EIYKNG
Sbjct: 122 HHVNGSRPSCTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNG 181
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
PVE +FTVYEDF YKSGVYKH+TGD +GGHA++++GWG
Sbjct: 182 PVEGAFTVYEDFLQYKSGVYKHVTGDAVGGHAIRILGWGV 221
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/186 (40%), Positives = 102/186 (54%), Gaps = 16/186 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+++ LS DL+ C +GC+GG P +A++Y +GVVT C PY + P C P
Sbjct: 117 ESVQLSFQDLITCDN--QDNGCEGGDPYTAYKYVQKNGVVTSNCQPY------TIPTCPP 168
Query: 80 AYP-------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
A TP C KC + ++ H+ + Y + + I EI NGPVE F
Sbjct: 169 AQQPCMNFVNTPPCSAKCANSSVNFQQDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFE 228
Query: 133 VYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
VYEDF YKSGVY H +G +GGH +K++G+G S +G YWI N W SWG +G F I+
Sbjct: 229 VYEDFLGYKSGVYTHKSGKDLGGHCIKIVGFGVS-NGTPYWICNNSWTTSWGNNGIFWIE 287
Query: 193 RGSNEC 198
G NEC
Sbjct: 288 AGKNEC 293
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 97/161 (60%), Gaps = 15/161 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
N+ +S DLL+CC CG+GC+GGYP AW ++ + G+V+ C PY S C
Sbjct: 45 NVEVSAEDLLSCCKLECGNGCNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISP-CE 103
Query: 74 H------PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP+C R+C + + KHY +++Y I SD +IM EIYKNGP
Sbjct: 104 HHVNGSRPKCSGEIETPRCSRRCEAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGP 163
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
VE + V++DF YKSGVY+H TG +GGHA+K++GWG +
Sbjct: 164 VEAALEVFKDFLLYKSGVYQHKTGGSIGGHAIKILGWGEEN 204
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/181 (40%), Positives = 102/181 (56%), Gaps = 21/181 (11%)
Query: 45 YPISAWRYFVHHGVVT-------EECDPY------FDSTGCSHPGCEPAYPTPKCVR-KC 90
+P AW+Y +G+ T E C PY ++ CS + TP+C + +C
Sbjct: 160 HPEKAWKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENED----TPQCYKDQC 215
Query: 91 VKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 148
N + +Y+ Y + PE IM+E++KNGPV + VY+DF YK G+Y++
Sbjct: 216 TNNNYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIYQYT 275
Query: 149 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
TG + G HAVK++GWG DDG DYW+ AN W SWG G FKI+RG NECGIE + GL
Sbjct: 276 TGGLKGDHAVKIMGWG-EDDGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRITGGL 334
Query: 209 P 209
P
Sbjct: 335 P 335
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/191 (41%), Positives = 105/191 (54%), Gaps = 15/191 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ LS +++C G +GC+GG+ + WR+ V G V+E C PY S G + P C
Sbjct: 173 NVDLSPQFMVSCSG--QNNGCNGGFFDATWRFLVSVGTVSEACVPYV-SFGGAVPACN-- 227
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
V+ C Q S Y + R DIMA++ NGP++V+ VY DF Y
Sbjct: 228 ------VKSCGVPGQ---KSPFYRAGSARKLEGMLDIMADLKANGPIQVAMGVYRDFYSY 278
Query: 141 KSGVYKHITGDVMGGHAVKLIGWG-TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECG 199
KSGVY H++G +GGHAVK++GWG S YWI AN W WG GYF I RG ECG
Sbjct: 279 KSGVYHHVSGRYVGGHAVKIVGWGYDSASKLPYWICANSWGEDWGIKGYFWILRGRGECG 338
Query: 200 IEEDVVAGLPS 210
I + V +G P+
Sbjct: 339 IGKMVWSGKPA 349
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 78/209 (37%), Positives = 116/209 (55%), Gaps = 16/209 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
++ +S D L C LCGDGC+GG P W ++ G+V+ C + C
Sbjct: 147 HVEVSAEDKLTC---LCGDGCNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCK 203
Query: 74 HPGCEPAY----PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
H Y +PKC C + Q ++ KHY S+Y I+ +DIM IYKN VE
Sbjct: 204 HHIHGXPYVXTGDSPKCSMTC-EPGQTYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEE 262
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+F+VY DF YK Y+ +TG++ GGHA+ ++G ++ YW++AN WNR WG +G+F
Sbjct: 263 AFSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKV-ENSTSYWLVANXWNRDWGDNGFF 321
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
KI RG + GIE +VVA +P ++ ++I
Sbjct: 322 KILRGQDHYGIESEVVAEIPHTEQYWEKI 350
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 105/203 (51%), Gaps = 17/203 (8%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGC 72
+ + +S D+L+CCG CG GC G P A+ Y + GV + C PY C
Sbjct: 143 KQVYVSETDILSCCGQRCGSGCTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPY-PFYPC 201
Query: 73 SHPGCEPAY--------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ P Y PTP C + C + N S + + E I EI+ N
Sbjct: 202 GYHAHLPYYGPCPDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVLTGEEKIKREIFNN 261
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG 184
GP+ ++TVYEDFA+YK+G+Y G G HAVK+IGWG ++G YW++AN WN WG
Sbjct: 262 GPLVATYTVYEDFAYYKNGIYMTGLGRATGAHAVKIIGWG-EENGVKYWLIANSWNTDWG 320
Query: 185 ADGYFKIKRGSNECGIEEDVVAG 207
+G+F++ RG+N C IE G
Sbjct: 321 ENGFFRMLRGTNLCDIELSATGG 343
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 122/228 (53%), Gaps = 32/228 (14%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECD 64
T+R ++S+ V+ LS D+ +C GD GC+GG P S + Y+ G+V +
Sbjct: 26 TDRMCIASNGTVTTH---LSAQDVTSCDKL--GDMGCNGGIPSSVYSYWALSGIV--DGG 78
Query: 65 PYFDSTGC---------------SHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR 109
Y D +GC +P C PKC RKC +++ W +K Y
Sbjct: 79 NYGDKSGCWSYQLEPCAHHVNSSKYPACPDEVRAPKCARKCESEDKDWTKAKVKGEKGYS 138
Query: 110 INSDPE-------DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK-HITGDVMGGHAVKLI 161
+ E + A+IY+NGP+ F V +DF YKSGVY+ + +GGHA+K++
Sbjct: 139 VCQQGELEGTCAIKMAADIYQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIM 198
Query: 162 GWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
G+GT +DG+DYW++AN WN WG DGYFKI RG N C IE+ V+ G P
Sbjct: 199 GFGT-EDGKDYWLVANSWNEDWGDDGYFKIIRGKNACQIEDPVINGGP 245
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 78/193 (40%), Positives = 108/193 (55%), Gaps = 12/193 (6%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
S DL+ CC CG C GGY AW+Y+ G+V+ Y S GC P + +
Sbjct: 125 FEFSPEDLINCCE-TCGKKCKGGYSYYAWKYYTSTGLVSG--GDYNTSRGC-QPYSKSNF 180
Query: 82 ---PTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY-KNGPVEVSFTVYE 135
+P+C + C K + N +H+ Y I + I EI + GPV F VYE
Sbjct: 181 NDGVSPECSKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRGGPVMAGFDVYE 240
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA-DGYFKIKRG 194
DF Y+ GVY H +G ++G HAVK+IGWGT ++G YW++AN W + WGA G FKI+RG
Sbjct: 241 DFKLYREGVYVHTSGALLGSHAVKIIGWGT-ENGWAYWLVANSWGKDWGALGGVFKIRRG 299
Query: 195 SNECGIEEDVVAG 207
+NEC IE+ ++ G
Sbjct: 300 TNECKIEQSIITG 312
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 112/208 (53%), Gaps = 31/208 (14%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH--- 74
DLL C F GC GG P AWR+F + GVVT + C PY + C H
Sbjct: 301 DLLHCLSF----GCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSE 355
Query: 75 ---PGCEPAYP-TPKCVRKC-----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
P CE P PKC + C K + +++ H++ SAY + + I E+ +NG
Sbjct: 356 GPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENG 414
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+ +F VYEDF YK GVY H+TG MGGHAVK+IG+G ++DG DYW+ N WN WG
Sbjct: 415 TLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGD 473
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKN 213
G FKI+ G E GI+++ G P N
Sbjct: 474 KGTFKIEMG--EAGIDKEFCGGEPKVPN 499
>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 298
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 71/168 (42%), Positives = 96/168 (57%), Gaps = 12/168 (7%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
CDGG+ S WR+ G T EC PY T + C PT KC +L S
Sbjct: 140 CDGGWLQSVWRFLTKTGTTTNECVPYQSGTTGARGTC----PT-----KCADGGEL---S 187
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
+ A D + IM + GP++ +FTVY DF +Y+ GVY+H++G V GGHAV++
Sbjct: 188 TVKAKKAVDYGLDCDLIMKALVTGGPLQTAFTVYSDFMYYEGGVYQHMSGRVEGGHAVEM 247
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT + DYWI+ N W WG DGYF+I R +NECGIEE V+ G+
Sbjct: 248 VGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVMGGI 295
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 112/208 (53%), Gaps = 31/208 (14%)
Query: 28 DLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----------EECDPYFDSTGCSH--- 74
DLL C F GC GG P AWR+F + GVVT + C PY + C H
Sbjct: 301 DLLHCLSF----GCSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSE 355
Query: 75 ---PGCEPAYP-TPKCVRKC-----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
P CE P PKC + C K + +++ H++ SAY + + I E+ +NG
Sbjct: 356 GPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGR-DQIKRELMENG 414
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+ +F VYEDF YK GVY H+TG MGGHAVK+IG+G ++DG DYW+ N WN WG
Sbjct: 415 TLTGAFLVYEDFLLYKEGVYHHVTGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGD 473
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSKN 213
G FKI+ G E GI+++ G P N
Sbjct: 474 KGTFKIEMG--EAGIDKEFCGGEPKVPN 499
>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
Length = 283
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 77/186 (41%), Positives = 102/186 (54%), Gaps = 18/186 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
++ DL++C F DGCDGG+ AW + +G+ TEEC PY G P
Sbjct: 112 IAPEDLVSCDIF--DDGCDGGFIDMAWDWCQENGLTTEECIPYKAGEGVPSP-------- 161
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
C C + ++R I +YR D +DI EIY+ GPV + F VY DF YKSG
Sbjct: 162 --CPETCEDGSAIYRTP----IESYRY-IDADDIQGEIYEYGPVSMGFIVYSDFMSYKSG 214
Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
VY H G + GGHAV ++GWG D+ YW++ N W WG +G+FKI RGS+ C E +
Sbjct: 215 VYVHQAGYIEGGHAVLIVGWGVEDE-VPYWLVQNSWGTDWGENGFFKILRGSDHCECESN 273
Query: 204 VVAGLP 209
V AG P
Sbjct: 274 VTAGYP 279
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 119/208 (57%), Gaps = 14/208 (6%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R ++++ V +Q LS DL+ CC + CG+ C GGY AW YF+ G+V+
Sbjct: 111 SDRLCIATNGKVKIQ---LSPEDLIDCCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GD 164
Query: 66 YFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
Y STGC P E Y TP C C K + + KH+ S Y I + I EI
Sbjct: 165 YNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEI 223
Query: 122 YKNG-PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
G PV +F VY DF Y+ GVY + +G + G AVK+IGWGT ++G YW+ AN W
Sbjct: 224 LSGGGPVVAAFDVYGDFKIYRDGVYIYTSGALFGRTAVKIIGWGT-ENGWAYWLAANSWG 282
Query: 181 RSWGA-DGYFKIKRGSNECGIEEDVVAG 207
+ WGA G+FKI+RG+NECG EE ++AG
Sbjct: 283 KDWGALGGFFKIRRGTNECGFEESIIAG 310
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 80/195 (41%), Positives = 106/195 (54%), Gaps = 13/195 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+N+ LS ++L+C GC+GG+ +AWRY GVV E C PY H
Sbjct: 234 ENVQLSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYPYT-----QHRDTCK 286
Query: 80 AYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
+ +R C K + R+S + AY +N + DIMAEI+ +GPV+ + V DF
Sbjct: 287 IRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNREA-DIMAEIFHSGPVQATMRVNRDF 345
Query: 138 AHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
Y GVY+ + G H+VKL+GWG +GE YWI AN W WG GYF+I RG
Sbjct: 346 FAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRG 405
Query: 195 SNECGIEEDVVAGLP 209
SNECGIEE V+A P
Sbjct: 406 SNECGIEEYVLASWP 420
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 115/200 (57%), Gaps = 13/200 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGY-PISAWRYFVHHGVV-------TEECDPYFDSTG 71
+N++++ DL+ CC CG+GC+GG+ ++++Y+V G+V TE C PY
Sbjct: 141 RNVAIAAEDLMGCCA-DCGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPY-PFKP 198
Query: 72 CSHPGCE-PAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C +P + +PKC C ++ + K + AY + D I EI NGPVE
Sbjct: 199 CLYPFTDCHREESPKCKHHCQHGVDKRYARDKVFGSVAYSVPRDERVIRYEIMTNGPVEG 258
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
F VYED YKSGVY+H+ G+ +G HAV++IGWG + G YW+++N + WG GYF
Sbjct: 259 GFDVYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWG-REGGIPYWLISNSYGEDWGDHGYF 317
Query: 190 KIKRGSNECGIEEDVVAGLP 209
KI RG N GIE V+ GLP
Sbjct: 318 KIVRGINHLGIESKVITGLP 337
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 116/212 (54%), Gaps = 31/212 (14%)
Query: 22 LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
+ LS +CC + C GC+GG P AWR+F GVVT C PY +
Sbjct: 329 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EV 387
Query: 70 TGCSH------PGCEPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
C+H P C+ TPKC + C ++ + H + SAY + S +
Sbjct: 388 PFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-D 446
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
D+ ++ +GPV +F VYEDF YKSGVYKH++G +GGHA+K+IGWGT ++GE+YW
Sbjct: 447 DVKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHA 505
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
N WN WG G FKI G +CGI+ ++VAG
Sbjct: 506 VNSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 535
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 116/212 (54%), Gaps = 31/212 (14%)
Query: 22 LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
+ LS +CC + C GC+GG P AWR+F GVVT C PY +
Sbjct: 326 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EV 384
Query: 70 TGCSH------PGCEPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
C+H P C+ TPKC + C ++ + H + SAY + S +
Sbjct: 385 PFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-D 443
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
D+ ++ +GPV +F VYEDF YKSGVYKH++G +GGHA+K+IGWGT ++GE+YW
Sbjct: 444 DVKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHA 502
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
N WN WG G FKI G +CGI+ ++VAG
Sbjct: 503 VNSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 116/212 (54%), Gaps = 31/212 (14%)
Query: 22 LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
+ LS +CC + C GC+GG P AWR+F GVVT C PY +
Sbjct: 326 MPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPY-EV 384
Query: 70 TGCSH------PGCEPAY---PTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
C+H P C+ TPKC + C ++ + H + SAY + S +
Sbjct: 385 PFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-D 443
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
D+ ++ +GPV +F VYEDF YKSGVYKH++G +GGHA+K+IGWGT ++GE+YW
Sbjct: 444 DVKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGT-ENGEEYWHA 502
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
N WN WG G FKI G +CGI+ ++VAG
Sbjct: 503 VNSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/203 (37%), Positives = 110/203 (54%), Gaps = 27/203 (13%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDSTGCS 73
N LS +DL++CC +CG GC+GG+P +AW Y+ G+V T+ C PY + C
Sbjct: 136 NFHLSADDLVSCC-HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPY-EIAPCE 193
Query: 74 H------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
H P C TP C KC + + K++ +Y + + +I EI NGP
Sbjct: 194 HHVNGTRPPCSHG-STPSCQHKCQASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGP 252
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA 185
VE +FTVYED YKSGVY+H G +GGHA++++GWG + + YW++ N WN WG
Sbjct: 253 VEGAFTVYEDLILYKSGVYQHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNTDWGD 312
Query: 186 DGYFKIKRGSNECGIEEDVVAGL 208
+ + CGIE + AGL
Sbjct: 313 N---------DHCGIESSISAGL 326
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 111/201 (55%), Gaps = 13/201 (6%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---CDPYF-----DSTGCS 73
+ LS +L++C G C G+ +W Y++ +G+VT + C PY + S
Sbjct: 135 VQLSAIELISCSKNKLG--CQIGFSEFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSSNS 192
Query: 74 HPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
+P C Y P C + C + ++ KHY Y + + DI EI NGPVE
Sbjct: 193 YPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGI 252
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
V+ DF +YKSGVY+HITG ++ H+V++IGWG +D YW+ AN WN WG +GYFKI
Sbjct: 253 FVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIEND-IPYWLCANSWNEDWGLNGYFKI 311
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
RGSNEC IE V AG +K
Sbjct: 312 LRGSNECEIESFVNAGKVDNK 332
>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 298
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 95/168 (56%), Gaps = 12/168 (7%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
CDGG+ S WR+ V G T+EC PY G A T C KC ++L
Sbjct: 140 CDGGWLPSVWRFLVKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSEL---P 187
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
+ + A D + IM + GP++ +FTVY DF +Y+ GVY+H+ G GGHAV++
Sbjct: 188 IYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYQGGVYQHVYGRAEGGHAVEM 247
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT + DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 248 VGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 60/128 (46%), Positives = 83/128 (64%), Gaps = 2/128 (1%)
Query: 83 TPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
TPKC++ C + + K Y +Y + I EI NGPVE +FTVYED YK
Sbjct: 41 TPKCIKHCQASYTVAYEQDKSYGAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYK 100
Query: 142 SGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
GVY+H+TG ++GGHA++++GWG +D YW++AN WN WG +G+FKI RGS+ CGIE
Sbjct: 101 DGVYQHVTGKMLGGHAIRILGWGVEND-VPYWLIANSWNTDWGNNGFFKILRGSDHCGIE 159
Query: 202 EDVVAGLP 209
+ AG+P
Sbjct: 160 SQISAGIP 167
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/217 (39%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A+ S ++Q LS ++L+C GC+GG+ +AWRY GVV E C P
Sbjct: 225 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYP 279
Query: 66 Y----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
Y +S GC P+ R+S + AY +N +
Sbjct: 280 YTQHRDTCKIRHNSRSLKANGCRPSANVD-------------RDSFYTVGPAYTLNKE-S 325
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDY 172
DIMAEIY +GPV+ + VY DF Y SGVY+ G G H+VKL+GWG +G+ Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
WI AN W WG GYF+I RGSNECGIE+ V+A P
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 101/176 (57%), Gaps = 18/176 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
+N S +L++CC + CG GC+GG+P +AW Y+ G+V+ PY GC
Sbjct: 73 KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWHYWKTKGIVSG--GPYGSKMGCIPYEIAP 129
Query: 73 -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ C+ TP CV+KC ++ + H SAY + +D + I EIY N
Sbjct: 130 CEHHVNGTRGPCKEGGKTPACVKKCEDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTN 189
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
GPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG + YW++AN WN
Sbjct: 190 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQNGEIPYWLVANSWN 245
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/217 (39%), Positives = 113/217 (52%), Gaps = 32/217 (14%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A+ S ++Q LS ++L+C GC+GG+ +AWRY GVV E C P
Sbjct: 225 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYP 279
Query: 66 Y----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
Y +S GC P+ R+S + AY +N +
Sbjct: 280 YTQHRDTCKIRHNSRSLKANGCRPSANVD-------------RDSFYTVGPAYTLNKE-S 325
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDY 172
DIMAEIY +GPV+ + VY DF Y SGVY+ G G H+VKL+GWG +G+ Y
Sbjct: 326 DIMAEIYHSGPVQATMRVYRDFFSYSSGVYRQTAANRGAPTGFHSVKLVGWGEEHNGDKY 385
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
WI AN W WG GYF+I RGSNECGIE+ V+A P
Sbjct: 386 WIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWP 422
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 109/198 (55%), Gaps = 13/198 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE---CDPYF-----DSTG 71
+ LS +L++C G C G+ +W Y++ +G+VT + C PY +
Sbjct: 50 MKVQLSAIELISCSKNKLG--CQIGFSEFSWDYWLKNGLVTGDPTGCLPYPFPKCDHRSS 107
Query: 72 CSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
S+P C Y P C + C + ++ KHY Y + + DI EI NGPVE
Sbjct: 108 NSYPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEA 167
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
V+ DF +YKSGVY+HITG ++ H+V++IGWG +D YW+ AN WN WG +GYF
Sbjct: 168 GIFVHSDFLNYKSGVYRHITGQLVTIHSVRIIGWGIEND-IPYWLCANSWNEDWGLNGYF 226
Query: 190 KIKRGSNECGIEEDVVAG 207
KI RGSNEC IE V AG
Sbjct: 227 KILRGSNECEIESFVNAG 244
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/208 (38%), Positives = 110/208 (52%), Gaps = 16/208 (7%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+S T D + +Q + +S D+LACCG CG GC+GG AW Y GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184
Query: 61 ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
+E C PY +H G C + ++ TP C + C + + K Y S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
Y ++ D + I E+ KNGPV+ +F YEDF+ Y G+Y H G G HAVK++GWG
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV- 303
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
++G YW +AN W+ WG DGYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGEDGYFRILRG 331
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 112/207 (54%), Gaps = 25/207 (12%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
LS ++ AC L GC GG+P SAW + G+ T + C PY D
Sbjct: 194 LSAGEMNACAPSLKDPGCRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPY-DFP 252
Query: 71 GCSHPGCEPAYPT-PKCVR---KCVKKNQ----LWRNSKHYSISAYRINSDPEDIMAEIY 122
C+H +P YP PK R +CV K + ++ + +++ + + + +D I
Sbjct: 253 PCAHFFKDPKYPACPKFARVNLRCVSKLRHMMVVYFSDRYFMVESVPYHFSADDAKNAIR 312
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
+GPV +F VYEDF YKSGVYKH +G ++G HAVK+IGWG D GE YW++ N WN
Sbjct: 313 TDGPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHAVKIIGWG-EDGGEAYWLVVNSWNEG 371
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGLP 209
WG G FKI G +CGI+ +++ G P
Sbjct: 372 WGDHGLFKIALG--DCGIDNELLGGTP 396
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 117/232 (50%), Gaps = 31/232 (13%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
VT D L + L LS ++ AC GCDGGYP SAW + G+ T
Sbjct: 92 FGVTEAFNDRLCVKSNGTFTEL-LSAGEMNACAPSY---GCDGGYPDSAWSWVHDEGIAT 147
Query: 61 -------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC--VKKNQLW 97
+ C PY D C+H P C + +Y TP CV +C K +
Sbjct: 148 GGDYVARGNLTKGDGCWPY-DFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYSTSL 206
Query: 98 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
+N +HY + + + I +GPV S+ VYEDF YKSGVYKH +G +GGHA
Sbjct: 207 KNDRHYMLESSPYQYSVNNAKNAIRTDGPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHA 266
Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
VK+IGWG ++GE YW++ N WN WG G FKI G+ C I++D++ G P
Sbjct: 267 VKIIGWG-EENGEAYWLVVNSWNEDWGDHGLFKIALGN--CQIDDDLLGGTP 315
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/213 (38%), Positives = 113/213 (53%), Gaps = 25/213 (11%)
Query: 21 NLSLSVNDLLACCGFLCG---DGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDST 70
N LS +++ACC GC GG ++AW + HG+ TE C PY +
Sbjct: 110 NQLLSAGEMIACCNSTHSWQPRGCKGGMILNAWSFLKTHGIATEGSMSAADGCWPY-NFP 168
Query: 71 GCSH--------PGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAE 120
C+H P + Y TP C+ +C K +H++ + + ++I E
Sbjct: 169 KCAHHQKKSKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKE 228
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
I NGP +F+VYEDF YKSGVYKH G +MG H+V++IGWGT + G DYW++ N WN
Sbjct: 229 IMTNGPTSATFSVYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGT-EKGVDYWLVMNSWN 287
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
WG G FKI +G +CGI +D V G P + N
Sbjct: 288 EGWGDHGTFKIAQG--DCGI-DDAVLGSPPAMN 317
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 115/204 (56%), Gaps = 14/204 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ L +S DL+ACC CG GC+GGYP +AW Y+V +G+ + +C PY C H G +
Sbjct: 139 KQLRISAADLMACCT-GCGGGCEGGYPDAAWEYYVSNGITSSQCQPY-PFPRCEHRGAQG 196
Query: 80 AYP--------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
P TP C C K+ K+ +Y + + ED E+Y NGP V F
Sbjct: 197 KKPPCSKYNFDTPTCNATCTDKSVPL--IKYRGNHSYEVRGE-EDYKRELYFNGPFVVRF 253
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
V+ DF YKSGVY+H+ G+ +GG AV+++GWG +G YW +AN W+ WG +GYF I
Sbjct: 254 QVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKM-NGTPYWKVANSWDTDWGMNGYFLI 312
Query: 192 KRGSNECGIEEDVVAGLPSSKNLV 215
RG+NEC IE AG P + L
Sbjct: 313 LRGNNECNIEHLGFAGTPDTSQLT 336
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 85/232 (36%), Positives = 117/232 (50%), Gaps = 31/232 (13%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
VT D L + + L LS ++ AC GC+GG+P SAW + G+ T
Sbjct: 53 FGVTEAFNDRLCIKSHGTFTEL-LSAGEMNACAP---SHGCNGGFPNSAWSWVHDKGIAT 108
Query: 61 -------------EECDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC--VKKNQLW 97
+ C PY D C+H P C + +Y TP C +C K
Sbjct: 109 GGDYVAEDDMTKDDGCWPY-DFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTL 167
Query: 98 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
R+ +H+ + + D I +GPV SFTVYEDF YKSGVYKH +G+ +GGHA
Sbjct: 168 RDDRHFMVESSPYQYSVNDAKNAIRTDGPVSASFTVYEDFLAYKSGVYKHTSGEYLGGHA 227
Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
VK+IGWG + G+ YW++ N WN WG G FKI G+ CGI++ ++ G P
Sbjct: 228 VKIIGWG-EESGQAYWLVVNSWNEDWGDHGLFKIALGN--CGIDDYLLGGTP 276
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 107/208 (51%), Gaps = 29/208 (13%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDS 69
+ + LS ++L+C GCDGG+ +AWRY GVV E C PY +S
Sbjct: 234 ETVQLSAQNILSCTRRQ--QGCDGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKIRHNS 291
Query: 70 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
GCE TP V R++ + AY +N + DIMAEI+ +GPV+
Sbjct: 292 RSLRANGCE----TPVNVD---------RDTFYTVGPAYSLNREA-DIMAEIFNSGPVQA 337
Query: 130 SFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
+ V DF Y GVY+ + G H+VKL+GWG +GE YWI AN W WG
Sbjct: 338 TMRVNRDFFSYSRGVYRQTAANREAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEK 397
Query: 187 GYFKIKRGSNECGIEEDVVAGLPSSKNL 214
GYF+I RGSNECGIEE V+A P N
Sbjct: 398 GYFRILRGSNECGIEEYVLASWPYVYNF 425
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 109/208 (52%), Gaps = 28/208 (13%)
Query: 23 SLSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
+LS +L++C GDG CDGG AW ++ G+VT E C PY +
Sbjct: 117 NLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-KNRP 170
Query: 72 CSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAE 120
C H G C T C +KCV KN + + H + Y + ++ + I E
Sbjct: 171 CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQE 230
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
I +GPV VYE+F YK G+YK TG+++G H VKLIGWG DG +YW+ N WN
Sbjct: 231 IMTHGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWN 290
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGL 208
+WG DG FKI RG N C IE V+AG+
Sbjct: 291 SNWGNDGLFKILRGYNFCSIELLVMAGI 318
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 110/208 (52%), Gaps = 16/208 (7%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+S T D + +Q + +S D+LACCG CG GC+GG AW Y GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184
Query: 61 ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
+E C PY +H G C + ++ TP C + C + + K Y S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
Y ++ D + I E+ KNGPV+ +F YEDF+ Y G+Y H G G HAVK++GWG
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV- 303
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
++G YW +AN W+ WG +GYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGENGYFRILRG 331
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 100/172 (58%), Gaps = 17/172 (9%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H
Sbjct: 44 VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100
Query: 75 ----PGCEPAYPTPKCVRKCV-KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
C Y TP C C KK L + + S I S E E+ NGP EV
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSC----ILSGEESFKRELLLNGPFEV 156
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
SF+VY DF Y GVYKH+TG +GGHAV+++GWG +GE YW +AN WN
Sbjct: 157 SFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 71/152 (46%), Positives = 90/152 (59%), Gaps = 20/152 (13%)
Query: 63 CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
C+PY C H P C Y TP+C C K R Y+ +R
Sbjct: 162 CEPY-PFPKCEHFTKGQYPPCGSKIYKTPRCKTTCQK-----RYKTSYAQDKHRA----- 210
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
I EI K GPVE SFTVYEDF +YKSG+YKHITG+ +GGHA+++IGWG ++ YW++
Sbjct: 211 -IQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGV-ENKTPYWLI 268
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
AN WN WG +GYF+I RG +EC IE +V AG
Sbjct: 269 ANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 110/208 (52%), Gaps = 16/208 (7%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+S T D + +Q + +S D+LACCG CG GC+GG AW Y GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGSECGRGCNGGMDHKAWEYVKEFGVVT 184
Query: 61 ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
+E C PY +H G C + ++ TP C + C + + K Y S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
Y ++ D + I E+ KNGPV+ +F YEDF+ Y G+Y H G G HAVK++GWG
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAAFITYEDFSFYTKGIYVHTRGRQRGAHAVKVVGWGV- 303
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
++G YW +AN W+ WG +GYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGENGYFRILRG 331
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 112/217 (51%), Gaps = 32/217 (14%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A+ S ++Q LS ++L+C GC+GG+ +AWRY GVV E C P
Sbjct: 223 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDESCYP 277
Query: 66 Y----------FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPE 115
Y +S GC+ Y R++ + AY +N +
Sbjct: 278 YTQQRDTCKIRHNSRSLRANGCQTPYNVD-------------RDTFYTVGPAYSLNREA- 323
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGEDY 172
DIMAEI+ +GPV+ + V DF Y GVY+ + M G H+VKL+GWG +GE Y
Sbjct: 324 DIMAEIFHSGPVQATMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEHNGEKY 383
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
WI AN W WG GYF+I RGSNECGIEE V+A P
Sbjct: 384 WIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 420
>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 298
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 93/168 (55%), Gaps = 12/168 (7%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
CDGG+ S WR+ G T+EC PY G A T C KC + L
Sbjct: 140 CDGGWLPSVWRFLTKTGTTTDECVPY-------QSGSTGARGT--CPTKCADGSDL---P 187
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
+ + A D + IM + GP++ +FTVY DF +Y+ GVY+H G V GGHAV++
Sbjct: 188 IYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDFMYYEGGVYQHTYGRVEGGHAVEM 247
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT + DYWI+ N W WG DGYF+I R +NECGIEE V+ G
Sbjct: 248 VGYGTDEYDVDYWIIRNSWGPDWGEDGYFRIIRMTNECGIEEQVIGGF 295
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 109/204 (53%), Gaps = 18/204 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----FD 68
N LS +L CC CG GC GG P+ AW YF GV T E C PY +
Sbjct: 135 NQLLSPEELTFCCK-DCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRN 193
Query: 69 STGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVE 128
G + +P +C + C K + +++ + S Y INS + I +I GPVE
Sbjct: 194 KQGENICDEQPMERNHQCPKTCYGKTTV--QNRYKTKSEYYINS-IKTIEQDIKTYGPVE 250
Query: 129 VSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
SF Y+D + YKSG+Y K GGH++K+IGWG +DG YW+ N W++ WG G
Sbjct: 251 ASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWG-QEDGTPYWLAVNSWSKFWGDHG 309
Query: 188 YFKIKRGSNECGIEEDVVAGLPSS 211
FKI +G NECGIE V AG+PSS
Sbjct: 310 TFKIIKGRNECGIERAVTAGIPSS 333
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 82/213 (38%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 21 NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVTE-------------- 61
N LS D L+CC L CGDG CDG +P +++ HG+ T
Sbjct: 141 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYSI 200
Query: 62 -ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPED 116
CD + + S P P Y TP C C N W + KH+ + Y + D
Sbjct: 201 YPCDKNYPNGTTSVPC--PGYHTPPCEDHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTD 257
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI NGPV SF +YEDF YKSG+Y H GD GG K+IGWG D+G YW+
Sbjct: 258 IQTEIMTNGPVIASFIIYEDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCV 316
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+QW +G +G+ +I RG NE IE V+A LP
Sbjct: 317 HQWGTDFGENGFVRILRGVNEVNIEHQVLAALP 349
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 255 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGCAMASR 313
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YRI+S+ +IM EI +NGPV+ V
Sbjct: 314 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRISSNETEIMKEIMQNGPVQAIMQV 368
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYKSG+Y+H+ + HAVKL+GWGT E +WI AN W +
Sbjct: 369 HEDFFHYKSGIYRHVASTHGESENYRKLRTHAVKLLGWGTLRGAQGRKEKFWIAANSWGK 428
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 429 SWGENGYFRILRGVNESDIEKLIIAA 454
>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
Length = 311
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 106/191 (55%), Gaps = 19/191 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LS L++C GC+GG P AW Y HG+ T+ C PY G +
Sbjct: 129 NVTLSPQALVSC-DIEFNQGCNGGIPQMAWEYLELHGIPTDSCFPYTSGNGTA------- 180
Query: 81 YPTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P C ++C K QL++ K +++ + S I A ++ GP+E + VY+DF
Sbjct: 181 ---PDCQKECSDGSKYQLYKG-KTFTL---KTCSSVAAIQANVFAYGPIEGTMDVYQDFM 233
Query: 139 HYKSGVYKHITGD-VMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
Y SGVY G ++GGHA+K++GWGT S G DYWI+ N W WG +G+F I+RG+N
Sbjct: 234 SYTSGVYVMTPGSKLLGGHAIKIVGWGTDSTSGLDYWIVQNSWGSDWGMNGFFWIQRGTN 293
Query: 197 ECGIEEDVVAG 207
CGI+ D AG
Sbjct: 294 MCGIDRDASAG 304
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 110/208 (52%), Gaps = 16/208 (7%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+S T D + +Q + +S D+LACCG CG GC+GG AW Y GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184
Query: 61 ----EE---CDPYFDSTGCSHPG----C--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
+E C PY +H G C + ++ TP C + C + + K Y S
Sbjct: 185 GGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKS 244
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
Y ++ D + I E+ KNGPV+ + YEDF+ Y+ G+Y H G G HAVK++GWG
Sbjct: 245 VYILDEDEKAIQREMMKNGPVQAASITYEDFSFYRRGIYVHTRGRQRGAHAVKVVGWGV- 303
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRG 194
++G YW +AN W+ WG DGYF+I RG
Sbjct: 304 ENGTKYWNVANSWSTDWGEDGYFRILRG 331
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 108/208 (51%), Gaps = 28/208 (13%)
Query: 23 SLSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
+LS +L++C GDG CDGG AW ++ G+VT E C PY +
Sbjct: 117 NLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPY-KNRP 170
Query: 72 CSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAE 120
C H G C T C +KCV KN + + H + Y + ++ + I E
Sbjct: 171 CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQE 230
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
I GPV VYE+F YK G+YK TG+++G H VKLIGWG DG +YW+ N WN
Sbjct: 231 IMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWN 290
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGL 208
+WG DG FKI RG N C IE V+AG+
Sbjct: 291 SNWGNDGLFKILRGYNFCSIELLVMAGI 318
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/203 (38%), Positives = 106/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----------FDS 69
+ + LS ++L+C GCDGG+ +AWR+ GVV + C PY +S
Sbjct: 233 EAVRLSAQNILSCTRRQ--QGCDGGHLDAAWRFLHKKGVVDDSCYPYTQQRDTCKIRHNS 290
Query: 70 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
GC P+ + R+S + AY +N + DIMAEIY +GPV+
Sbjct: 291 RSLKANGCRPS-------------PNVDRDSFYTVGPAYTLNRE-GDIMAEIYHSGPVQA 336
Query: 130 SFTVYEDFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
+ VY DF Y G+Y+ G G H+VKL+GWG +G+ YWI AN W WG
Sbjct: 337 TMRVYRDFFSYSGGIYRQTAANRGAPQGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGER 396
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
GYF+I RGSNECGIEE V+A P
Sbjct: 397 GYFRILRGSNECGIEEYVLASWP 419
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/194 (41%), Positives = 103/194 (53%), Gaps = 17/194 (8%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+S T D L + L LS D+LACCG CG GC+GGY AW Y + GV +
Sbjct: 5 VSAAETMSDRLCVQTNGRKKTL-LSDTDILACCGDFCGYGCNGGYSARAWLYARNSGVCS 63
Query: 61 ----EE---CDPY------FDSTGCSHPGC-EPAYPTPKCVRKC-VKKNQLWRNSKHYSI 105
+E C PY + + C + Y TP C + C + + K Y+
Sbjct: 64 GGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKIYAX 123
Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
AYR++SD I AEI+ GPV+ SF YEDFAHYKSG+Y H G GGHAVK+IGWG
Sbjct: 124 DAYRVSSDEAAIRAEIFARGPVQASFATYEDFAHYKSGIYVHTAGKRRGGHAVKIIGWGV 183
Query: 166 SDDGEDYWILANQW 179
++G WI+AN W
Sbjct: 184 -ENGTKXWIVANSW 196
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 74/174 (42%), Positives = 98/174 (56%), Gaps = 21/174 (12%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H
Sbjct: 44 VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100
Query: 75 ----PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C Y TP C C K +R + Y +S E E+ NGP
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPF 154
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
EVSF+VY DF Y GVYKH+ G +GGHAV+++GWG +GE YW +AN WN
Sbjct: 155 EVSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 106/201 (52%), Gaps = 22/201 (10%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTG---CSHPGCE 78
+ LS ++L+C GC+GG+ +AWRY GV+ E C PY S G H G
Sbjct: 238 VQLSPQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVLDESCYPYTQSRGTCKVRHSGSL 295
Query: 79 PAY---PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
A+ P P V ++ L+ YS+S DI AEI+ +GPV+ + VY
Sbjct: 296 KAHGCRPAPG-----VDRDSLYTVGPAYSLSR------EADIKAEIFHSGPVQATMRVYR 344
Query: 136 DFAHYKSGVYKHIT---GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
DF Y G+Y+ G G H+VKL+GWG +G+ YWI AN W WG GYF+I
Sbjct: 345 DFFSYSGGIYRQTAANRGAPTGFHSVKLVGWGEEHNGDKYWIAANSWGPWWGERGYFRIL 404
Query: 193 RGSNECGIEEDVVAGLPSSKN 213
RGSNECGIE+ V+A P N
Sbjct: 405 RGSNECGIEDYVLASWPYVYN 425
>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
Length = 297
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/172 (43%), Positives = 98/172 (56%), Gaps = 19/172 (11%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC+GGY AW + HGVV + C PY +G + P C KC +
Sbjct: 141 GCNGGYMDMAWEFLDQHGVVADSCFPYSAGSGFA----------PACASKCADGSA---- 186
Query: 100 SKHYSI--SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
K YS + R + E I +EI +GPVE +FTVY DF +Y+SGVY T DV GGHA
Sbjct: 187 EKKYSCVHGSIRQSQGVEQIKSEIVAHGPVEGAFTVYTDFFNYQSGVYTPTTSDVAGGHA 246
Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+K++G+G ++G YW+ AN W SWG G+FKIK+G ECGIE+ V + P
Sbjct: 247 IKILGFGV-ENGTPYWLCANSWGPSWGMQGFFKIKQG--ECGIEDQVFSCDP 295
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 101/167 (60%), Gaps = 17/167 (10%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH-- 74
LS +L++CC CG GC+GG+P SAW Y+ + G+VT + C PY + C H
Sbjct: 148 LSAENLVSCCSS-CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPY-EFPPCEHHT 205
Query: 75 ----PGCEPAYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ TP C R C N + N K Y YR+ S+ E IM E+ ++GPVEV
Sbjct: 206 LGPLPVCDGDVETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEV 265
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
F VY DF +YKSGVY+H++G ++GGHAV+L+GWG ++ YW++A
Sbjct: 266 DFEVYADFPNYKSGVYQHVSGALLGGHAVRLLGWG-EENNVPYWLIA 311
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKV 380
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 159 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 217
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 218 SDGRGKRHATKPCPNNFEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 272
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 273 HEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGWGTLKGAQGRKEKFWIAANSWGK 332
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 333 SWGENGYFRILRGVNESDIEKLIIAA 358
>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
Length = 294
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/203 (38%), Positives = 113/203 (55%), Gaps = 19/203 (9%)
Query: 9 DALSSSPYVSLQNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYF 67
+A S ++ +++ LS DL++C D GC+GGY AW Y HG T+ C PY
Sbjct: 109 EAFSDRFAINGKDVILSPEDLVSC---DTNDYGCNGGYMDVAWEYLADHGAATDSCFPYS 165
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
+G + P C KC + + R + ++ R + I +EI +GPV
Sbjct: 166 AGSGFA----------PACSDKCADGSAMQRFK--CAPNSVRQSKGVAQIQSEIVSHGPV 213
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADG 187
E +FTVY DF +Y+SGVY T DV GGHA+K++G+G ++G YW+ AN W +WG G
Sbjct: 214 EGAFTVYTDFFNYQSGVYTPTTTDVAGGHAIKILGYGV-ENGTPYWLCANSWGPAWGMSG 272
Query: 188 YFKIKRGSNECGIEEDVVAGLPS 210
+FKIK+G ECGIE+ V + P
Sbjct: 273 FFKIKQG--ECGIEDQVFSCDPQ 293
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 94/179 (52%), Gaps = 12/179 (6%)
Query: 45 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQL 96
+P AWRY+V +G+ + C PY C H G + + TPKC C K+
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSI- 219
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
K+ + Y + ED E+Y NGP F VY D YKSGVY+H+ GD +GG
Sbjct: 220 -PLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGGT 278
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
AVK++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 83/216 (38%), Positives = 113/216 (52%), Gaps = 16/216 (7%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A+ S ++Q LS ++L+C GC+GG+ +AWRY GVV E C P
Sbjct: 223 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYP 277
Query: 66 YFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
Y H + +R C + R++ + AY +N + DIMAEI+
Sbjct: 278 YTQ-----HRDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFH 331
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDGEDYWILANQWN 180
+GPV+ + V DF Y GVY+ + G H+VKL+GWG +GE YWI AN W
Sbjct: 332 SGPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWG 391
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVK 216
WG GYF+I RGSNECGIEE V+A P N K
Sbjct: 392 SWWGEHGYFRILRGSNECGIEEYVLASWPYVYNYYK 427
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNSGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMKV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/215 (37%), Positives = 107/215 (49%), Gaps = 26/215 (12%)
Query: 21 NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVTEE-------CDPYFD 68
N LS D L+CC L CGDG CDG +P +++ HG+ T C PY
Sbjct: 142 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYNDQFGCKPYSI 201
Query: 69 -------STGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDI 117
+ G + C P Y TP C C N W + KH+ + Y + DI
Sbjct: 202 YPCDKKYANGTTSVPC-PGYHTPTCEEHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTDI 259
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
EI NGPV SF +Y+DF YK+G+Y H GD GG K+IGWG D+G YW+ +
Sbjct: 260 QIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCVH 318
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
QW +G +G+ + RG NE IE V+A LP S+
Sbjct: 319 QWGTDFGENGFVRFLRGVNEVNIEHQVLAALPDSE 353
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/174 (42%), Positives = 98/174 (56%), Gaps = 21/174 (12%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H
Sbjct: 44 VRDLRISAGDLMSCCD-VCGYGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100
Query: 75 ----PGCEPAYPTPKCVRKCVKKNQ---LWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C Y TP C C K +R + Y +S E E+ NGP
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKVPLIKYRGNTSYLLSG------EESFKRELLLNGPF 154
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
EVSF+VY DF Y GVYKH+ G +GGHAV+++GWG +GE YW +AN WN
Sbjct: 155 EVSFSVYADFLAYTGGVYKHVAGIFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/186 (38%), Positives = 102/186 (54%), Gaps = 18/186 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
+S DL++C GC+GGY W + G+ TE+C PY +G
Sbjct: 106 MSPQDLVSCESN--NMGCEGGYADRVWNWIQKKGITTEQCLPYVSGSG----------RV 153
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
P C KC + + R+ + S NS + +M E+ NGPV F V+EDF +YKSG
Sbjct: 154 PTCPSKCKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFLNYKSG 208
Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
+Y+H TG G H V L+GWGT ++G YW+L N W WG G+F+I+RG+N+C I+E
Sbjct: 209 IYQHKTGKSKGWHHVMLMGWGT-ENGVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEI 267
Query: 204 VVAGLP 209
+GLP
Sbjct: 268 FYSGLP 273
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 76/174 (43%), Positives = 100/174 (57%), Gaps = 21/174 (12%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSH--- 74
+++L +S DL++CC +CG GC+GGYP AW Y+ HG+V+E C PY F S C+H
Sbjct: 44 VRDLRISAGDLMSCCD-VCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPS--CAHHVN 100
Query: 75 ----PGCEPAYPTPKCVRKCV-KKNQL--WRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C Y TP C C KK L +R + Y +S E E+ NGP
Sbjct: 101 SSDLSPCSGEYDTPTCNSTCTDKKIPLIKYRGNTSYVLSG------EEPFKRELILNGPF 154
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
EVSF+VY DF Y GVYKH+ G +GGHAV+++GWG +GE YW +AN WN
Sbjct: 155 EVSFSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWGEL-NGEPYWKIANSWNH 207
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 106/213 (49%), Gaps = 28/213 (13%)
Query: 21 NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT--------------- 60
N LS D L+CC L CGDG CDG +P +++ HG+ T
Sbjct: 144 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTI 203
Query: 61 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPED 116
CD + + S P P Y TP C +C N W + KH+ + Y + D
Sbjct: 204 YPCDKKYPNGTTSVPC--PGYHTPVCEERCTS-NITWPISYKQDKHFGKAHYNVGKKMTD 260
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI +NGPV SF +Y+DF YKSG+Y H GD GG K+IGWG D+G YW+
Sbjct: 261 IQTEIMRNGPVIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCV 319
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+QW +G +G+ +I RG NE IE V+A P
Sbjct: 320 HQWGTDFGENGFVRILRGVNEVNIEHQVLAAQP 352
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 81/229 (35%), Positives = 114/229 (49%), Gaps = 18/229 (7%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+S T D + LQ + LS D+L+CCG +CGDGC+GGY AW + GVVT
Sbjct: 125 VSAASTMSDRICVQTKGKLQTI-LSDTDILSCCGRMCGDGCEGGYDHLAWEWVQRFGVVT 183
Query: 61 E-------ECDPY-FDSTGCSHP---GC--EPAYPTPKCVRKC-VKKNQLWRNSKHYSIS 106
C PY F G H C + ++ TP C C + + K + S
Sbjct: 184 GGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACKPYCQFGYGKRYEKDKFFVKS 243
Query: 107 AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTS 166
Y +++D + I E+ KNGPV+ +F YEDF+ YK G+Y H+ G G HAVKLIGWG
Sbjct: 244 TYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYKGGIYVHVKGRERGAHAVKLIGWGV- 302
Query: 167 DDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
++G YW +AN W+ WG + S + +V +NL+
Sbjct: 303 ENGTKYWTVANSWHDDWGGKRFLPYSTWSESLRVR--IVCRFRRIQNLI 349
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 109/207 (52%), Gaps = 28/207 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F + ++ GC A
Sbjct: 264 NLSPQNLISCCP-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKNQNATNHGCAMASR 322
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 323 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 377
Query: 134 YEDFAHYKSGVYKHITGDV---------MGGHAVKLIGWGT----SDDGEDYWILANQWN 180
+EDF HYK+G+Y+HIT + HAVKL GWGT E +WI AN W
Sbjct: 378 HEDFFHYKTGIYRHITKKANEESGKYRKLQTHAVKLTGWGTLKGAQGRKEKFWIAANSWG 437
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAG 207
+SWG +GYF+I RG NE IE+ ++A
Sbjct: 438 KSWGENGYFRILRGVNESDIEKLIIAA 464
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 85/232 (36%), Positives = 117/232 (50%), Gaps = 26/232 (11%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+SVT D + + ++ L S L++CC CG+GC GGY +AWRY + G+VT
Sbjct: 127 ISVTSAMNDRICIASQGNITAL-YSPQKLVSCCE-DCGNGCSGGYTAAAWRYILKKGIVT 184
Query: 61 -------EECDPYF-----DSTGCSHP----------GCEPAYPTPKCVRKCVKKNQLWR 98
E C P+ ST + P G +PA TPKC C +
Sbjct: 185 GGDYGSNEGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPA-TTPKCDLSCYNARHEGK 243
Query: 99 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 158
+ D + K+GP V+ VYEDF YKSGVY H+TGD +G +V
Sbjct: 244 YLDDIIKAKKVFTFDGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYHHVTGDYLGLLSV 303
Query: 159 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
++IGWG + G+ +W+LAN W SWG G+FKI+R NEC IE AG+P+
Sbjct: 304 RMIGWGL-EGGQAFWLLANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPN 354
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/182 (42%), Positives = 95/182 (52%), Gaps = 19/182 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS------- 73
N SLS DLL+CC CG GC GGYP AW Y+ HG+VT D +GC
Sbjct: 136 NKSLSAVDLLSCCK-DCGFGCRGGYPAVAWDYWRTHGIVTGGSKE--DPSGCRSYPFPKC 192
Query: 74 -------HPGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
+P C YPTP+CV+ C + K + +Y I + IM EI G
Sbjct: 193 DHHVQGHYPPCPRQIYPTPECVQDCDTPELGYLEDKTRANISYNIYASEISIMKEIMLRG 252
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
PVE FTVYEDF YKS VY H G M GHA++++GWG D YW++AN WN WG
Sbjct: 253 PVEAVFTVYEDFLQYKSRVYFHAWGAPMSGHAIRILGWGEEGD-VPYWLIANSWNEDWGE 311
Query: 186 DG 187
G
Sbjct: 312 KG 313
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 109/194 (56%), Gaps = 15/194 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCE 78
+N+ LS LL+C GC GG+ AW + HG+V E+C PY S T C
Sbjct: 237 ENMVLSPQTLLSC-NVRAQQGCHGGHIDVAWNFARGHGLVDEKCFPYKASVTRC------ 289
Query: 79 PAYPTPKCVRK-CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
P P ++ C+ + R + Y + S +DIM +I ++GPV+ TVY+DF
Sbjct: 290 PFRPRGNLIQDGCMPLVK--RRTSRYKLGPPAKLSHEKDIMYDIMESGPVQAVMTVYQDF 347
Query: 138 AHYKSGVYK---HITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
HY+ GVY+ H ++ G H+V++IGWG D G+ YW++AN W R WG +GYF+I RG
Sbjct: 348 FHYRDGVYRRSYHGNNELKGFHSVRIIGWG-EDRGDRYWVVANSWGRQWGENGYFRIARG 406
Query: 195 SNECGIEEDVVAGL 208
SNE IE VV GL
Sbjct: 407 SNEADIESFVVTGL 420
>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 76/189 (40%), Positives = 104/189 (55%), Gaps = 17/189 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
NL LS D+++C GC GGY AW+Y GV ++ C+PY S G +P+
Sbjct: 126 NLVLSPQDMVSC--DTSNFGCFGGYLDQAWQYLEQQGVSSDSCEPYK-----SGNGDQPS 178
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
PT + +KK + S + A E + I ++GPVE FTVY+DF +Y
Sbjct: 179 CPTKCSNGQAIKKYKCKAGSTKQAKGA-------EATKSLIQESGPVETGFTVYQDFYNY 231
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
SGVY H+TGD GGHAVK++GWG E+YWI+AN W WG GYF I++G + GI
Sbjct: 232 NSGVYHHVTGDAEGGHAVKILGWG-KQGLENYWIVANSWGEDWGEKGYFNIRQG--DSGI 288
Query: 201 EEDVVAGLP 209
+E +P
Sbjct: 289 DEATFGCIP 297
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 70/179 (39%), Positives = 95/179 (53%), Gaps = 12/179 (6%)
Query: 45 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--------YPTPKCVRKCVKKNQL 96
+P AWRY+V +G+ + C PY C H G + + TPKC C K+
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTDKSIP 220
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
K+ + Y + ED E+Y NGP F VY D YKSGVY+++ GD++GG
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDILGGQ 278
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
AV+++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 279 AVRIVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPETSQLT 336
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 72/186 (38%), Positives = 102/186 (54%), Gaps = 18/186 (9%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
+S DL++C GC+GGY W + G+ TE+C PY +G
Sbjct: 104 MSPQDLVSCESN--NMGCNGGYADRVWNWIQKKGITTEQCIPYVSGSG----------RV 151
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
P C KC + + R+ + S NS + +M E+ NGPV F V+EDF +Y+SG
Sbjct: 152 PTCPSKCKNGSNIVRS---FVSSWGSFNS--KTVMDEVANNGPVYACFEVFEDFYNYRSG 206
Query: 144 VYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
VY+H TG G H V L+GWGT ++G YW+L N W WG G+F+I+RG+N+C I+E
Sbjct: 207 VYQHKTGRSQGWHHVMLMGWGT-ENGVPYWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEI 265
Query: 204 VVAGLP 209
+GLP
Sbjct: 266 FYSGLP 271
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 80/209 (38%), Positives = 112/209 (53%), Gaps = 16/209 (7%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A+ S ++Q LS ++L+C GC+GG+ +AWRY GVV E C P
Sbjct: 223 SDRFAIQSKGKEAVQ---LSAQNILSCTRRQ--QGCEGGHLDAAWRYLHKKGVVDENCYP 277
Query: 66 YFDSTGCSHPGCEPAYPTPKCVRK--CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
Y H + +R C + R++ + AY +N + DIMAEI+
Sbjct: 278 YT-----QHRDTCKIRHNSRSLRANGCQTPVNVDRDTLYTVGPAYSLNREA-DIMAEIFH 331
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGEDYWILANQWN 180
+GPV+ + V DF Y GVY+ + + G H+VKL+GWG +GE YWI AN W
Sbjct: 332 SGPVQATMRVNRDFFAYSGGVYRETAANRKALTGFHSVKLVGWGEEHNGEKYWIAANSWG 391
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAGLP 209
WG GYF+I RGSNECGIE+ V+A P
Sbjct: 392 SWWGEHGYFRILRGSNECGIEDYVLASWP 420
>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 75/192 (39%), Positives = 100/192 (52%), Gaps = 23/192 (11%)
Query: 21 NLSLSVNDLLAC-CGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
NL LS D+L+C C C GGY +AW+Y GV ++ C+PY G
Sbjct: 126 NLVLSPQDMLSCDASNFC---CFGGYLDTAWQYLEQQGVGSDSCEPYKSGNG-------- 174
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISA--YRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
P C KC + K Y A + E + I ++GPVE FT+YEDF
Sbjct: 175 --DQPSCPSKCSNGQAI----KKYKCKAGSTKQAKGAEATKSLIQQSGPVETGFTIYEDF 228
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
+Y SG+Y H+TG MGGHAVK++GWG E+YWI+AN W WG GYF I++G +
Sbjct: 229 LNYNSGIYHHVTGGNMGGHAVKILGWGKQGL-ENYWIVANSWGEDWGEKGYFNIRQG--D 285
Query: 198 CGIEEDVVAGLP 209
GI+E +P
Sbjct: 286 SGIDEATFGCIP 297
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 80/202 (39%), Positives = 109/202 (53%), Gaps = 27/202 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-----DSTGCSHPGC 77
+LSV +L++C GC+GG SAWRY HGVV+ C P F + +G +H
Sbjct: 272 NLSVQNLISC-DTRNQHGCNGGNIDSAWRYLKTHGVVSYACYPSFWKKHLEPSGENHCYV 330
Query: 78 EPAY-------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVS 130
Y P P + K N+L+R + HY R++S +IM EI GPV+
Sbjct: 331 SSEYGKNYTNGPCPNALEK---SNRLYRCASHY-----RVSSKETNIMKEIMDKGPVQAI 382
Query: 131 FTVYEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGTSDDG----EDYWILANQWNRSWG 184
VYEDF YK G+Y+H G H+VKL+GWG D + +WI AN W +SWG
Sbjct: 383 MKVYEDFFLYKEGIYRHSQKAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSWGKSWG 442
Query: 185 ADGYFKIKRGSNECGIEEDVVA 206
+GYF+I RG NEC IE+ ++A
Sbjct: 443 ENGYFRILRGQNECDIEKLILA 464
>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 300
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C+GG+ +AW++ G T+EC PY + C PT KC + +
Sbjct: 141 CNGGWLPNAWKFLTKTGTTTDECVPYQSGSTTLRGTC----PT-----KCADGSSKVHLT 191
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
S Y + D +M + GP++V+F VY DF +Y+SGVY+H G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTTGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEM 249
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT DDG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 250 VGYGTDDDGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 86/143 (60%), Gaps = 14/143 (9%)
Query: 77 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDI-MAEIYKNGPVEVSFTVYE 135
CEP Y ++ KHY S+Y ++ KNGPVE +FTVY
Sbjct: 5 CEPGYSPS------------YKEDKHYGCSSYSVSRGARRRSWQRSSKNGPVEAAFTVYS 52
Query: 136 DFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
DF YKSGVY+H+ GD+MGGHAV+++GWG ++G YW++ N WN WG +G+FKI RG
Sbjct: 53 DFLQYKSGVYQHVAGDMMGGHAVRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQ 111
Query: 196 NECGIEEDVVAGLPSSKNLVKEI 218
+ CGIE ++VAG+P + K I
Sbjct: 112 DHCGIESEIVAGIPCTDQYWKRI 134
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 105/206 (50%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F + GC A
Sbjct: 267 NLSPQNLISCCS-KNRPGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATSNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSANKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 103/213 (48%), Gaps = 28/213 (13%)
Query: 21 NLSLSVNDLLACCGFL---CGDG--CDGGYPISAWRYFVHHGVVT--------------- 60
N LS D L+CC L CGDG CDG +P +++ HG+ T
Sbjct: 144 NWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYEDQFGCKPYSI 203
Query: 61 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPED 116
CD + + S P P Y TP C C N W + KH+ + Y + D
Sbjct: 204 YPCDKKYPNGTTSVPC--PGYHTPTCEEHCTS-NITWPIAYKQDKHFGKAHYNVGKKMTD 260
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI NGPV SF +Y+DF YKSG+Y H GD GG K+IGWG D G YW+
Sbjct: 261 IQTEIMTNGPVIASFVIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DSGVPYWLCV 319
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+QW +G +G+ + RG NE IE V+A LP
Sbjct: 320 HQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 352
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 34/221 (15%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+HIT HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
SWG +GYF+I RG NE IE+ ++A ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNDGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+T HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFHYKTGIYRHVTRTNEEASKYRKFQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 72/189 (38%), Positives = 104/189 (55%), Gaps = 17/189 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ LS D+++C GCDGGY AW+Y GV ++ C+PY ++G +
Sbjct: 126 NVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYLEKKGVASDSCEPYKSASGTA------- 176
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
P C KC Q + K + S + N + I ++GPVE FTVY DF +Y
Sbjct: 177 ---PSCPSKCAN-GQAIKKYKCQAGSTKQANGAAA-TKSLIQQSGPVETGFTVYADFFNY 231
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSG+Y H++G GGHAVK++GWG E+YWI+AN W SWG G+F I++G + GI
Sbjct: 232 KSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYWIVANSWGESWGEKGFFNIRQG--DSGI 288
Query: 201 EEDVVAGLP 209
++ +P
Sbjct: 289 DQATFGCIP 297
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+HIT + HAVKL GWGT E +WI AN W
Sbjct: 381 HEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGI 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 108/206 (52%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F ++ GC A
Sbjct: 263 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNYGCAMASR 321
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 322 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 376
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+T + HA+KL GWGT E +WI AN W +
Sbjct: 377 HEDFFHYKTGIYRHVTRTNEESSKYRKLQTHAIKLTGWGTLKGARGQKEKFWIAANSWGK 436
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 437 SWGENGYFRILRGVNESDIEKLIIAA 462
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 106/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY- 81
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 82 --------PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRDATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANFWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ V+A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLVIAA 466
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 107/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC GG AW Y G+V+ C P F ++ GC+ A
Sbjct: 259 NLSPQNLISCC-VKNRHGCKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMASR 317
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 318 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 372
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYKSG+Y+HI + HAVKL GWG E +WI AN W +
Sbjct: 373 HEDFFHYKSGIYRHINNLKDESEKYRNLRTHAVKLTGWGVLRGAQGKKEKFWIAANSWGK 432
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 433 SWGENGYFRILRGVNESDIEKLIIAA 458
>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
Length = 309
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/189 (38%), Positives = 104/189 (55%), Gaps = 17/189 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ LS D+++C GCDGGY AW+Y GV ++ C+PY ++G +
Sbjct: 126 NVVLSPQDMVSC--DTNNYGCDGGYLNLAWQYLEKKGVASDSCEPYKSASGTA------- 176
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
P C KC Q + K + S + N + I ++GPVE FTVY DF +Y
Sbjct: 177 ---PSCPSKC-SNGQAIKKYKCKAGSTKQANGAAA-TKSLIQQSGPVETGFTVYADFFNY 231
Query: 141 KSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGI 200
KSG+Y H++G GGHAVK++GWG E+YWI+AN W SWG G+F I++G + GI
Sbjct: 232 KSGIYHHVSGGAEGGHAVKILGWGKQGS-ENYWIVANSWGESWGEKGFFNIRQG--DSGI 288
Query: 201 EEDVVAGLP 209
++ +P
Sbjct: 289 DQATFGCIP 297
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 34/221 (15%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCAKK-RHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+HIT HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAHGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
SWG +GYF+I RG NE IE+ ++A ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 105/206 (50%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC G AW Y G+V+ C P+ ++ C A
Sbjct: 216 NLSPQNLISCCA-KNRHGCSSGSIDRAWWYLRKRGLVSHACYPFLKDQNTTNNACAMASR 274
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI NGPV+ V
Sbjct: 275 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIIHNGPVQAIMQV 329
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYKSG+Y+H+T + HAVKL GWGT E +WI+AN W
Sbjct: 330 HEDFFHYKSGIYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRGAQGRKEKFWIVANSWGN 389
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 390 SWGENGYFRILRGVNESDIEKLIIAA 415
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 87/217 (40%), Positives = 119/217 (54%), Gaps = 23/217 (10%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R ++++ V +Q LS DL+ CC + CG+ C GGY AW YF+ G+V+
Sbjct: 111 SDRLCIATNGKVKIQ---LSPEDLIDCCHY-CGNQCKGGYTYYAWNYFMLTGLVSG--GD 164
Query: 66 YFDSTGCSHPGCEPAY--PTPKCVRKCV--KKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
Y STGC P E Y TP C C K + + KH+ S Y I + I EI
Sbjct: 165 YNTSTGC-QPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEI 223
Query: 122 YKNG-PVEVSFTVYEDFAHYK---------SGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
G PV +F VY DF Y+ GVY + +G + G AVK+IGWGT ++G
Sbjct: 224 LSGGGPVVAAFDVYGDFKIYRDGEQHDTILEGVYIYTSGALFGRTAVKIIGWGT-ENGWA 282
Query: 172 YWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 207
YW+ AN W + WGA G+FKI+RG+NECG EE ++AG
Sbjct: 283 YWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIAG 319
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 109/207 (52%), Gaps = 26/207 (12%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFD----STGCSHP- 75
++LS +L++CC GC GG AW Y G+V+ C P F + GC+
Sbjct: 265 TVNLSPQNLISCC-LKHRYGCSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGCAMAS 323
Query: 76 ---GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
G + T C K N++++ S YR++S+ IM EI KNGPV+
Sbjct: 324 RSDGRGKRHATTPCPNNIEKSNRIYQCS-----PPYRVSSNETQIMKEIMKNGPVQAIMQ 378
Query: 133 VYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGT----SDDGEDYWILANQWN 180
V+EDF +YK+G+Y+H+T + + HAVKL GWGT E +WI AN W
Sbjct: 379 VHEDFFYYKTGIYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRGAKGRKEKFWIAANSWG 438
Query: 181 RSWGADGYFKIKRGSNECGIEEDVVAG 207
+SWG +GYF+I RG NE IE+ ++A
Sbjct: 439 KSWGENGYFRILRGVNESDIEKLIIAA 465
>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
Length = 158
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 94/169 (55%), Gaps = 13/169 (7%)
Query: 43 GGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH 102
GG+ ++ WR+ G +E+C PY S G + P C ++ C + S
Sbjct: 1 GGFLVATWRFLAAVGTASEQCVPYV-SFGGAVPACN--------IKSCAVSGE---KSPF 48
Query: 103 YSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIG 162
Y + + R D+MA++ NGP++ + VY+DF YKSGVY H++G ++G HA+K++G
Sbjct: 49 YKVKSARKLKGMVDMMADLKANGPLQATMIVYKDFFSYKSGVYHHVSGRMVGAHAIKIVG 108
Query: 163 WGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
WG S YWI AN W WG DGYF I RG ECG+ + V +G P+
Sbjct: 109 WGVDSASKLPYWICANSWGEDWGLDGYFWIARGRGECGLGKTVWSGKPA 157
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/212 (38%), Positives = 115/212 (54%), Gaps = 31/212 (14%)
Query: 22 LSLSVNDLLACCGFL-CGD-GCDGGYPISAWRYFVHHGVVT----------EECDPYFDS 69
+ LS +CC + C GC+GG P AWR+F GVVT C PY +
Sbjct: 220 MPLSTQHTTSCCNAIHCASFGCNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPY-EI 278
Query: 70 TGCSH------PGCEP---AYPTPKCVRKCVKKN-----QLWRNSKHYSISAYRINSDPE 115
C+H P C+ TPKC + C + + H + S+Y + S +
Sbjct: 279 PFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDVHKASSSYSLRSR-D 337
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWIL 175
+ ++ +G V +F VYEDF +YKSGVYKH+ G +GGHA+K+IGWGT +DGE+YW
Sbjct: 338 AVKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGT-EDGEEYWHA 396
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
N WN WG G+FKI+ G +CG++ ++VAG
Sbjct: 397 VNSWNTYWGDSGHFKIEMG--QCGVDNEMVAG 426
>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
Length = 289
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 95/154 (61%), Gaps = 14/154 (9%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GCDGGY +AW + G+ +++CDPY ++G G P T K K
Sbjct: 148 GCDGGYLNNAWAFLAGTGIPSDKCDPY--TSGNGDVGSCPTSCTDGSAIKLYK------- 198
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
+K S++ S +DI +I NGPV+ +F+VY+DF YKSGVY+H++G + GGHA+K
Sbjct: 199 AKSSSVAQL---SSIDDIQKDIQANGPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIK 255
Query: 160 LIGWGTSDDGED--YWILANQWNRSWGADGYFKI 191
++GWG + DG+D YWI+AN WN +WG +G+F I
Sbjct: 256 IVGWGVTSDGKDTPYWIVANSWNTNWGQEGFFWI 289
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/175 (41%), Positives = 100/175 (57%), Gaps = 9/175 (5%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS-HP---GCEPAYPTPKCVRKCVKKNQL 96
CDGGY + Y+V +G+ + PY GC +P + KC R+C L
Sbjct: 115 CDGGYVHKTFDYWVKYGLTSG--GPYHSGQGCKPYPFGGATQDVNIVLKCDRQCQAGYPL 172
Query: 97 -WRNSKHYSISAYRINSDPEDIM-AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
+ + S+Y + E+ M AEIY+NGP+ SF VY DF Y+SGVY+H+TG G
Sbjct: 173 TYSQDLKHGASSYILPWGDENAMKAEIYQNGPIVTSFDVYGDFFQYRSGVYRHVTGAYKG 232
Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
HAV++IGWG ++G YW+ AN WN WG +G+FKI RG N G+E+ AGLP
Sbjct: 233 SHAVRVIGWGV-ENGVKYWLCANSWNERWGENGFFKIVRGENHVGVEDISYAGLP 286
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 78/203 (38%), Positives = 107/203 (52%), Gaps = 22/203 (10%)
Query: 25 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPGC 77
S +L+CC CGDGC+GGY +AW+Y++ G+VT E C P+ C+H
Sbjct: 141 SPQKMLSCCDD-CGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPP-CNHTVM 198
Query: 78 EPAYP----------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMA-EIYKNGP 126
+ P TP+C C N K S RI+ ++ E+ K+GP
Sbjct: 199 DERSPSYMCGKYKSETPQCTLNCYNPNYSKPFLKDIS-KGIRIDWHCSGMIRNELKKHGP 257
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
VYEDF YKSG+Y+H+TG ++G VK+IGWG G YW+ AN W SWG
Sbjct: 258 ATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVY-RGVQYWLAANSWGTSWGDK 316
Query: 187 GYFKIKRGSNECGIEEDVVAGLP 209
G+FKI+RG NEC E+ ++G P
Sbjct: 317 GFFKIRRGYNECLFEDYFISGRP 339
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 101/187 (54%), Gaps = 18/187 (9%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
++SV DL++C C+GG A Y V G+ TE C Y +G
Sbjct: 111 AMSVQDLVSC--DKTDSACNGGDMKKAQEYLVKTGITTEACVKYVSGSG----------R 158
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
P C KC +Q+ R Y + +++ + +P +IM + + GP+ F VY DF +Y+S
Sbjct: 159 VPACPSKCDNGSQIIR----YKLQSWK-SVEPSEIMQALMEYGPLSCGFMVYSDFMNYRS 213
Query: 143 GVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEE 202
GVY+H +G GGHAV L GWG ++G YW++ N W +WG G+FKI RGSN C IE
Sbjct: 214 GVYQHKSGYFEGGHAVLLCGWGV-ENGLPYWLVQNSWGPAWGEKGFFKILRGSNHCEIES 272
Query: 203 DVVAGLP 209
V G+P
Sbjct: 273 YVTLGVP 279
>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 300
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C+GG+ + W++ G T+EC PY + C PT KC + +
Sbjct: 141 CNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLA 191
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
S Y + D +M + +GP++V+F VY DF +Y+SGVY+H G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTSGPLQVAFLVYSDFMYYESGVYQHTYGYMEGGHAVEM 249
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT DDG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 250 VGYGTDDDGVDYWIIRNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 98/170 (57%), Gaps = 17/170 (10%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW-- 97
GC GG ++AWRY G+ + C PY + C +KC +++ +
Sbjct: 131 GCGGGIEVNAWRYIDLRGLPLDSCQPY-----------DGNITKYNCSKKCTNESETYEA 179
Query: 98 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHA 157
+ ++++S++ Y + E++ I GPV S VY D +YKSG+Y H G+ +G HA
Sbjct: 180 QFTEYWSVARY---ASIEEMQIGIMTEGPVTTSLKVYSDLMYYKSGIYTHTKGEFLGHHA 236
Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
V++IGWGT +G DYWI++N WN +WG +G F IKRG NEC IE+ V AG
Sbjct: 237 VEIIGWGTK-NGIDYWIISNSWNTTWGMNGLFLIKRGVNECHIEDYVCAG 285
>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
Length = 812
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 77/184 (41%), Positives = 107/184 (58%), Gaps = 21/184 (11%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
LS DL++C GC+GG +AW Y + G+VT+ C PY G +
Sbjct: 390 LSPEDLVSCD--RVDQGCNGGNLGTAWTYLKNTGIVTDACFPYTAGGGDA---------- 437
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
PKC C K W +K+ + SAY +N E++ EI +GP++V+F VY+ F YKSG
Sbjct: 438 PKCETSC-KDGSSW--TKYKAASAYAVNG-VENMQKEIMTHGPIQVAFNVYKSFMSYKSG 493
Query: 144 VYKHITGDVM--GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIE 201
VY ++M GGHAVK++GWGT + G+DYW++AN WN SWG +GYFKI G+ +
Sbjct: 494 VYAKKWYELMPEGGHAVKIVGWGT-EGGKDYWLVANSWNTSWGDEGYFKIAVGAESISL- 551
Query: 202 EDVV 205
DVV
Sbjct: 552 -DVV 554
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/180 (40%), Positives = 100/180 (55%), Gaps = 17/180 (9%)
Query: 45 YPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVK 92
YPI AW+YF GV T E C PY ++ G + G +P +C + C
Sbjct: 158 YPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYG 217
Query: 93 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGD 151
K + +++ + S Y INS + I +I GPVE SF VY+D + YKSG+Y+
Sbjct: 218 KTTV--QNRYKTKSEYVINSI-KTIERDIMTYGPVEASFDVYDDLSAYKSGIYRKTPKAK 274
Query: 152 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
GGH++K+IGWG +G YW+ N W++ WG G FKI +G NECGIE V AG+PSS
Sbjct: 275 YQGGHSIKIIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/218 (38%), Positives = 114/218 (52%), Gaps = 41/218 (18%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
LS ++ AC GC GG + AW++ GVVT + C PY D
Sbjct: 193 LSPGNVAACSK---TSGCHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPY-DIP 248
Query: 71 GCSH-------PGC-EPAYPTPKCVRKCVKK--NQLWRNSKHY----SISAYRINSDPED 116
C+H P C + Y P C C K + +H+ S+SA R +
Sbjct: 249 PCAHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALR---SIDA 305
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI NGPV S+ VY+DF YKSGVYK + + +GGHAVK+IGW GEDYW++
Sbjct: 306 IKKEIMTNGPVSASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGW-----GEDYWLVV 360
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
N WN++WG +G FKI G +CGIE++V+AG P + +L
Sbjct: 361 NSWNKNWGDNGMFKI--GCGQCGIEDNVLAGTPMTSSL 396
>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
Length = 247
Score = 127 bits (319), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 81/208 (38%), Positives = 108/208 (51%), Gaps = 23/208 (11%)
Query: 2 SVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE 61
S T ++R ++S ++ LS DL+AC G+ GC+GG AW Y + G V +
Sbjct: 61 SETLSDRICIASDKKT---DVILSPEDLVACDGW--NMGCNGGILPWAWSYLTNTGAVED 115
Query: 62 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEI 121
C PY G P C +KC + K S + S + I AEI
Sbjct: 116 SCFPYSSDKG----------AVPTCAKKCQNDKDSFTKYKCKKNSVVQA-SGVDKIKAEI 164
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
KNGP+E FTVYEDF +Y+SGVY H TG+ +GGHAVK++G+ G+ YWI AN W+
Sbjct: 165 SKNGPMETGFTVYEDFMNYESGVYHHTTGNQLGGHAVKIVGY-----GDGYWICANSWSE 219
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLP 209
WG G+F I G ECGI+ A P
Sbjct: 220 KWGEKGFFNI--GFGECGIDSAAYACTP 245
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 74/179 (41%), Positives = 101/179 (56%), Gaps = 24/179 (13%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSH 74
+ +S D+++CC + CG GC+GG+PI AW+Y V GVVT +EC ++ C +
Sbjct: 24 QVLISAQDIVSCCTW-CGAGCEGGWPIEAWKYGVTEGVVTGGNFGRKECCRSYEIHPCGY 82
Query: 75 PGCEPAY-------PTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDIMAEIYK 123
G EP Y TP C ++C ++NS K Y SAY + + I +I +
Sbjct: 83 HGNEPFYGHCHSMARTPPCKKRC---RPGYKNSYMMDKRYGTSAYELPNSVXAIQRDIME 139
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG---TSDDGEDYWILANQW 179
NGPV F VYEDF +YKSG+Y+H G GGHAVK+IGWG T + YWI+AN W
Sbjct: 140 NGPVVAGFDVYEDFKYYKSGIYRHTAGKXTGGHAVKVIGWGEEXTENGTIPYWIIANSW 198
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 115/219 (52%), Gaps = 23/219 (10%)
Query: 10 ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S +S+Q++ SLS +LL+C GC GG AW Y GVV+E C
Sbjct: 253 AAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWYLRRRGVVSEPCY 311
Query: 65 PY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISAYRINSDPEDIMA 119
P+ ++ G S P + + R+ NQ + +++ Y S AYR+ S +DIM
Sbjct: 312 PFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMK 371
Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVKLIGWGTSD--DG 169
E+Y+NGPV+ V+EDF YKSG+Y+ G H+VK+ GWG DG
Sbjct: 372 ELYENGPVQAIMEVHEDFFMYKSGIYRRTPVTEREPEHHRRHGTHSVKITGWGEERGRDG 431
Query: 170 E--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
+ YW+ AN W R WG DGYF+I RG NEC IE +V
Sbjct: 432 QTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 470
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 105/206 (50%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE---- 78
+LS +L++CC GC+ G AW Y G+V+ C P F S+ C
Sbjct: 265 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNISNNTCAMTSK 323
Query: 79 -----PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 324 ADGRGKRHATRPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 378
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF HYK+G+Y+H+ + HAVKL GWGT E +WI AN W +
Sbjct: 379 HEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGARGQKEKFWIAANSWGK 438
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 439 SWGENGYFRILRGVNESDIEKLIIAA 464
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 72/180 (40%), Positives = 101/180 (56%), Gaps = 17/180 (9%)
Query: 45 YPISAWRYFVHHGVVT-------EECDPY-----FDSTGCSHPGCEPAYPTPKCVRKCVK 92
YPI AW+YF GV T E C PY ++ G + G +P +C + C
Sbjct: 158 YPIKAWKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQGKNTCGGQPMERNHQCPKTCYG 217
Query: 93 KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGD 151
K + +++ + S Y +NS + I ++ GPVE SF VY+DF+ YKSG+Y+
Sbjct: 218 KTTV--QNRYKTKSEYVMNSI-KTIEQDLKTYGPVEASFDVYDDFSVYKSGIYRKTPKAK 274
Query: 152 VMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSS 211
GGH++K+IGWG +G YW+ N W++ WG G FKI +G NECGIE V AG+PSS
Sbjct: 275 YQGGHSIKIIGWG-QQNGTPYWLAVNSWSKFWGEHGTFKIIKGRNECGIERAVTAGIPSS 333
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 106/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLSKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNVEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +W+ AN W +
Sbjct: 381 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWVAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAA 466
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/172 (42%), Positives = 98/172 (56%), Gaps = 20/172 (11%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE-------ECDPYFDSTGC 72
+ + +S D+++CC + CG GC GG+ I AW YF GVVT C PY + C
Sbjct: 23 KQVLISDQDIVSCCTW-CGYGCQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPY-EIHPC 80
Query: 73 SHPGCEPAY-------PTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ EP Y TP+C R+C + + + + KHY +AY++ E I EI +N
Sbjct: 81 GYHKDEPYYGECDDLADTPRCKRRCQLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRN 140
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED---YW 173
GPV FTVYEDFAHYK G+YKH +G GGHAVK+IGWG+ G + YW
Sbjct: 141 GPVVAGFTVYEDFAHYKGGIYKHTSGKKTGGHAVKVIGWGSEQKGSEKIPYW 192
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 126 bits (317), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 112/221 (50%), Gaps = 34/221 (15%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCAKK-RRGCNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATTPCPNSIEKSNRIYQCS-----PPYRVSSNETEIMREIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+HIT HAVKL GWGT E +WI AN W +
Sbjct: 381 HEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEITSAD 222
SWG +GYF+I RG NE IE+ ++A ++TSAD
Sbjct: 441 SWGENGYFRILRGVNESDIEKLIIAAW-------GQLTSAD 474
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 126 bits (317), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 106/212 (50%), Gaps = 36/212 (16%)
Query: 23 SLSVNDLLACCGFLCGD----GCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
+LS +L++C GD GCDGG AW + + G+VT E C PY +
Sbjct: 117 NLSAQNLMSC-----GDDEKLGCDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPY-KNRP 170
Query: 72 CSHPG------CEPAYPTPK--CVRKCVKKN-------QLWRNSKHYSISAYRINSDPED 116
C H G C T C KCV KN L++ S Y S ++ +
Sbjct: 171 CDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW----TNVKQ 226
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI GPV VYE+F YK GVYK G+++G H VKLIGWG + G +YW+
Sbjct: 227 IQQEIMTYGPVTAFMYVYENFMGYKEGVYKSTAGELIGYHHVKLIGWGVDEAGIEYWLAM 286
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
N WN +WG DG FKI RG N C IE V+AGL
Sbjct: 287 NSWNSNWGNDGLFKILRGYNFCSIELLVMAGL 318
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 86/225 (38%), Positives = 122/225 (54%), Gaps = 27/225 (12%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLL CC CG GC+GGYP +AW ++ G+V+
Sbjct: 117 SDRLCIHSNGKVSVE---ISSEDLLTCCDS-CGMGCNGGYPSAAWDFWTDVGLVSGGLYD 172
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY G P TP+C+ +C ++ KHY S+Y +
Sbjct: 173 SHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVP 232
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
SD E I +EIYKNGPVE +FTVYEDF YK+GVY+H+TG +GGHA+K S GE+
Sbjct: 233 SDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVGGHAIK------SWLGEE 286
Query: 172 YWILAN--QWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
L + WG D GS+ CGIE ++VAG+P +++
Sbjct: 287 VCSLLALCHSDTDWG-DMVSLSSAGSDHCGIESEIVAGIPITQSF 330
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 89/160 (55%), Gaps = 17/160 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCS 73
+++S +D+L+CCG CG+GC+GGYPI AW+Y+V G+ T C PY C
Sbjct: 24 QVTISADDVLSCCGKKCGNGCEGGYPIEAWKYWVKTGICTGGSYESQSGCKPY-PIPPCG 82
Query: 74 H--------PGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P Y TP C KC+ + + + KHY SAY + I EI N
Sbjct: 83 HHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDKHYGTSAYNVAKTVAGIQKEIMTN 142
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
GPVE ++TVYEDF Y GVY H G +GGHAV+++GWG
Sbjct: 143 GPVEAAYTVYEDFYQYTGGVYTHTGGAEVGGHAVRILGWG 182
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 112/214 (52%), Gaps = 27/214 (12%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A+ SS + +LS LL+C GC GG+ AW + GVV+ +C P
Sbjct: 215 SDRLAIQSSGETGM---TLSPQHLLSC-NTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYP 270
Query: 66 YF----DSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
Y D G C PG P+ C + N+L H+S YRI ++ +I E
Sbjct: 271 YTSGDQDKKGVCMMPGKLPS----DCPTGRERNNEL-----HHSTPPYRIAANEREIQVE 321
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHI---TGDVMGGHA-----VKLIGWGTSDDGEDY 172
I +NGPV+ SF V EDF Y SGVY+H + D HA VKL+GWG ++G Y
Sbjct: 322 IMENGPVQASFEVKEDFFMYGSGVYRHTPIASNDAEQYHASEWHSVKLLGWGV-ENGIKY 380
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
W+ AN W WG DGYFKI RG NEC IE VVA
Sbjct: 381 WLGANSWGTKWGEDGYFKILRGENECNIESYVVA 414
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ C A
Sbjct: 271 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNDCAMASR 329
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 330 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 384
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
++DF HYK G+Y+H+T + HA+KL GWGT E +WI AN W +
Sbjct: 385 HDDFFHYKKGIYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRGAQGRKEKFWIAANSWGK 444
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 445 SWGENGYFRILRGVNESDIEKLIIAA 470
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 73/191 (38%), Positives = 103/191 (53%), Gaps = 24/191 (12%)
Query: 34 GFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCSH--------PGCE 78
G +C DGC G P +AW + +G+ TE C PY + C H P E
Sbjct: 152 GHVCCDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPY-NFPKCGHHQQDSKYQPCPE 210
Query: 79 PAYPTPKCVRKCVKKN--QLWRNSKHYS--ISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
Y TP C+ +C KN +H++ S Y++ ++I EI NGP +F++Y
Sbjct: 211 KNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGT-DNIKKEIMTNGPTSAAFSMY 269
Query: 135 EDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
+DF Y+SGVYKH +G +MG H V++IGWGT G DYW++ N WN WG G FKI +G
Sbjct: 270 DDFLSYESGVYKHTSGTLMGEHGVEIIGWGTK-QGVDYWLVMNSWNEGWGVHGTFKIAQG 328
Query: 195 SNECGIEEDVV 205
+CGI + +
Sbjct: 329 --DCGINDMAI 337
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 126 bits (316), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 97/172 (56%), Gaps = 10/172 (5%)
Query: 48 SAWRYFVHHGVVTEECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKKNQL-WRNS 100
S + Y+ G+ T PY D + C C TP C C + +
Sbjct: 346 SPFNYWKKMGIATG--GPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYDIPISDD 403
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
K Y+ Y ++S+ +IM EIY +GPV F VYEDF +Y SG+Y+ T MGGHA+++
Sbjct: 404 KFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIYQQTTYVAMGGHAIRI 463
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
IGWG ++G YW++AN WN ++G G+F+I+RG+NEC IE +V G+P +
Sbjct: 464 IGWG-EENGIPYWLIANSWNTTFGEKGFFRIRRGTNECRIESEVYTGIPKLR 514
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 11/131 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKK 93
GC G +A+ Y+ G+VT + C + + C+ C P PKC R C
Sbjct: 69 GCRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPCTM--CRPYMLAPKCQRTCQAS 126
Query: 94 NQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
L + K+Y S Y +N D DIM EIY+ GPV F VY DF +Y SG + I G+
Sbjct: 127 YNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQF--ICGNK 184
Query: 153 MGGHAVKLIGW 163
L W
Sbjct: 185 RCEEEENLTSW 195
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 126 bits (316), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 105/206 (50%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCT-KNRHGCNSGSVDRAWWYLRKRGLVSHACYPLFKDQNANNNGCAMASR 325
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N +++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 326 SDGRGKRHATKPCPNNIEKSNVIYQCS-----PPYRVSSNETEIMKEIMQNGPVQAIMQV 380
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGTSDDG----EDYWILANQWNR 181
+EDF HYK+G+Y+H+ + HAVKL GWG E +W+ AN W +
Sbjct: 381 HEDFFHYKTGIYRHVIRTSEESEKYQKLRTHAVKLTGWGMMKGAKGRKEKFWVAANSWGK 440
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG DGYF+I RG NE IE+ ++A
Sbjct: 441 SWGEDGYFRILRGVNESDIEKLIIAA 466
>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
protease B2; Flags: Precursor
gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
Length = 300
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C+GG+ + W++ G T+EC PY + C PT KC + +
Sbjct: 141 CNGGWLPNVWKFLTKTGTTTDECVPYKSGSTTLRGTC----PT-----KCADGSSKVHLA 191
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
S Y + D +M + +GP++V+F V+ DF +Y+SGVY+H G + GGHAV++
Sbjct: 192 TATSYKDYGL--DIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGYMEGGHAVEM 249
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+G+GT DDG DYWI+ N W WG DGYF++ RG N+C IEE AG
Sbjct: 250 VGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDCSIEEQAYAGF 297
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC GG AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCARK-RHGCGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATN-GCAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATTPCPNHIEKSNRIYQCS-----PPYRVSSNETQIMKEIMQNGPVQAIMKV 379
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF YK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 380 HEDFFSYKTGIYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKGARGKKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYFKI RG NE IE+ ++A
Sbjct: 440 SWGENGYFKILRGVNESDIEKLIIAA 465
>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
(Silurana) tropicalis]
Length = 494
Score = 125 bits (315), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 81/215 (37%), Positives = 112/215 (52%), Gaps = 20/215 (9%)
Query: 10 ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S +S+Q++ SLS +LL+C GC GG AW Y GVV+E C
Sbjct: 268 AAVASDRISIQSMGHMTQSLSPQNLLSC-DTRNQHGCRGGRVDGAWWYLRRRGVVSEPCY 326
Query: 65 PY--FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHY-SISAYRINSDPEDIMA 119
P+ ++ G S P + + R+ NQ + +++ Y S AYR+ S +DIM
Sbjct: 327 PFTSLNTNGHSAPCMMQSRSMGRGKRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMK 386
Query: 120 EIYKNGPVEVSFTVYEDFAHYKSGVYKHIT--------GDVMGGHAVKLIGWGTSDDGED 171
E+Y+NGPV+ V+EDF YKSG+Y+H G H+VK+ G G
Sbjct: 387 ELYENGPVQAIMEVHEDFFMYKSGIYRHTPVTEREPEHHRRHGTHSVKITG-GRDGQTHK 445
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
YW+ AN W R WG DGYF+I RG NEC IE +V
Sbjct: 446 YWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 480
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 125 bits (315), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 55/104 (52%), Positives = 76/104 (73%), Gaps = 1/104 (0%)
Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
S+Y + DIM EI KNGPV+ F ++EDF YKSG+Y + TG ++GGHA+++IGWG
Sbjct: 10 SSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV 69
Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
++G YW++AN WN WG GYF+++RG+NECGIE + AGLP
Sbjct: 70 -ENGVKYWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/244 (30%), Positives = 116/244 (47%), Gaps = 54/244 (22%)
Query: 19 LQNLSLSVNDLLACCG--FLCGDG------------------------------------ 40
+ N LS +LL+CC F CG+G
Sbjct: 129 MINTVLSAQELLSCCTGVFSCGEGDSEHWQFRNSKFRKPRCQKFNKEILEARRNLETREK 188
Query: 41 CDGGYPISAWRYFVHHGVVTEE-------CDPYFDST------GCSHPGC-EPAYPTPKC 86
C GG AW+Y+ HG+ T C PY S + PGC TP C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248
Query: 87 VRKCVKKNQLWRNS-KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY 145
+KC + + +HY +S ++ + +I +++ NGP+ + VY+DF Y +G+Y
Sbjct: 249 EKKCKSGYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGIY 308
Query: 146 KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV 205
H+TG+ G +V+++GWG +G YW+LAN W + WG +G F++ RG NECG+E + V
Sbjct: 309 VHLTGNKQGHLSVRILGWGMY-EGVPYWLLANSWGKQWGENGTFRVLRGVNECGLEANCV 367
Query: 206 AGLP 209
+G+P
Sbjct: 368 SGMP 371
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 255 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 312
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S +IM EI +NGPV+ V
Sbjct: 313 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 367
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 368 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 427
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 428 SWGENGYFRILRGVNESDIEKLIIAA 453
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 104/201 (51%), Gaps = 21/201 (10%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
+LS +L++C GC+GG AWRY HGVV+ C P F + Y
Sbjct: 272 NLSAQNLISC-DTRNQHGCNGGSIDGAWRYLKTHGVVSYACYPSFWNKHLGPSAENQCYV 330
Query: 83 TPK---------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ + C K N+L+R + HY R++S DIM EI GPV+ V
Sbjct: 331 SNEYGKNHTNGPCPNAFEKSNRLYRCASHY-----RVSSKETDIMKEIKDRGPVQAIMKV 385
Query: 134 YEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGTSDDG----EDYWILANQWNRSWGADG 187
YEDF YK G+Y+H G H+VKL+GWG D + +WI AN W +SWG +G
Sbjct: 386 YEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALPDKNGQKQKFWIAANSWGKSWGENG 445
Query: 188 YFKIKRGSNECGIEEDVVAGL 208
YF+I RG NEC IE+ ++A L
Sbjct: 446 YFRILRGQNECDIEKLILATL 466
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/152 (44%), Positives = 91/152 (59%), Gaps = 11/152 (7%)
Query: 62 ECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS----KHYSISAYRINSDPEDI 117
EC + D+ G C+ P+P C C +N ++ S +H++ + ++I
Sbjct: 229 ECSHHVDTKGME--PCKGNSPSPVCSTTC--RNHHFKPSFESDRHFTEDEGYSLDEVDEI 284
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
EI NGPV +FTVYEDF +YKSGVYKH+ G +GGHAVK+IGWG D E YW++ N
Sbjct: 285 KREIIDNGPVAAAFTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGI-DQNEQYWLVMN 343
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
WN +WG G FKI G ECGI+ +V AG+P
Sbjct: 344 SWNVNWGDQGIFKIAIG--ECGIDSEVTAGIP 373
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S +IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 379
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 380 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S +IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 379
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 380 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 94/185 (50%), Gaps = 21/185 (11%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPGCEPAYPTPKCVRKCVKKNQ 95
GC GG AW Y +G+V+ C P F T C A + ++ C +
Sbjct: 290 GCKGGSITGAWSYLKKYGLVSHACYPLFWNNLHQTSCEMSSVFDAEGKRQAIQPCPNR-- 347
Query: 96 LWRNSKHYSISA--YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI----- 148
W S H YRI+S DIM EI +NGPV+ VY+DF YKSG+YKHI
Sbjct: 348 -WEPSNHIYQCGLPYRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSGIYKHIWSLEG 406
Query: 149 ---TGDVMGGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSNECGIE 201
H++K++GWGT D E +WI AN W SWG +GYF+I RG NEC IE
Sbjct: 407 KTQNRHQKKPHSIKIVGWGTLRDAEGQRQKFWIAANSWGNSWGENGYFRILRGQNECDIE 466
Query: 202 EDVVA 206
+ V+A
Sbjct: 467 KTVIA 471
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 106/206 (51%), Gaps = 28/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW Y G+V+ C P F ++ GC A
Sbjct: 267 NLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANN-GCAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S +IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATKPCPNNIEKSNRIYQCS-----PPYRVSSSETEIMKEIMQNGPVQAIMQV 379
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
EDF HYK+G+Y+H+T + HAVKL GWGT E +WI AN W +
Sbjct: 380 REDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 57/92 (61%), Positives = 71/92 (77%), Gaps = 1/92 (1%)
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILAN 177
MAEI K GPVE +FTVY DF YKSGVY+H TG+ +GGHA+K++GWG ++DG DYW++AN
Sbjct: 1 MAEIQKYGPVEGAFTVYADFPSYKSGVYQHETGEALGGHAIKILGWG-NEDGHDYWLVAN 59
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
WN WG G+FKI RG +ECGIE + AG P
Sbjct: 60 SWNEDWGDQGFFKILRGVDECGIESQITAGSP 91
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/163 (39%), Positives = 97/163 (59%), Gaps = 18/163 (11%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC------- 72
+N S +L++CC + CG GC+GG+P +AW Y+ G+V+ PY + GC
Sbjct: 77 KNFHFSAENLVSCC-WTCGFGCNGGFPGAAWNYWKTKGIVSG--GPYGSNMGCIPYEVAP 133
Query: 73 -------SHPGCEPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKN 124
+ C+ TPKCV+KC ++ + H+ SAY +++D + I EIY N
Sbjct: 134 CEHHVNGTRGPCKEGGKTPKCVKKCEDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTN 193
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
GPVE +FTVYEDF Y++GVYKH+ G +GGHA++++GWG +
Sbjct: 194 GPVEGAFTVYEDFIAYRAGVYKHVAGKALGGHAIRILGWGVQN 236
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 106/195 (54%), Gaps = 13/195 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
QN++LS L+C GC+GGY AW Y GVV+EEC PY T C
Sbjct: 127 QNVALSAQQFLSCNQHR-QKGCEGGYLDRAWWYIRKFGVVSEECYPYISGTTRKPEICYM 185
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
R+C + NS+ Y + +YR++S +DIM+EI NGPV+ +F V+ DF
Sbjct: 186 QKSKHANGRQCPSGHP---NSRVYRTTPSYRVSSREQDIMSEILTNGPVQATFRVHGDF- 241
Query: 139 HYKSGVYKHITG---DVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGYFKIK 192
+ +GVYKH+ ++ G H+V+L+GWG ++ YWI AN W +WG +G F+I
Sbjct: 242 -FIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYSTGIPVKYWIAANSWGTNWGENGTFRIL 300
Query: 193 RGSNECGIEEDVVAG 207
RG N C IE V+
Sbjct: 301 RGENHCEIESFVIGA 315
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 72/200 (36%), Positives = 101/200 (50%), Gaps = 15/200 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+ + LS DL++C C GG+P WR+ +++G V+EEC PY ++ C
Sbjct: 246 VDKVELSPQDLMSCLNGGRRVVCQGGHPDRGWRFLLNYGGVSEECYPYEGVHSSANATCR 305
Query: 79 -PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
P P +C KH+S YR+ ++ EDIM EIY NGPV+ V EDF
Sbjct: 306 IPRRRDPIEDARCPTGRT---EQKHFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDF 362
Query: 138 AHYKSGVYKHI--------TGDVMGGHAVKLIGWGTSDDGE---DYWILANQWNRSWGAD 186
Y+SGVY+H G H+V+++GWG YW+ AN W WG +
Sbjct: 363 FLYRSGVYRHTRIAESLRPQYSRSGWHSVRILGWGVDRSQYRPIKYWLCANSWGHGWGEN 422
Query: 187 GYFKIKRGSNECGIEEDVVA 206
GYF+I RG +E IE V+A
Sbjct: 423 GYFRIVRGEDESQIESFVLA 442
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 57/102 (55%), Positives = 71/102 (69%), Gaps = 1/102 (0%)
Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
SAY + I EI NGPV FT+YED YKSGVY+H G ++GGHA+K+IGWGT
Sbjct: 113 SAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT 172
Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
+G YW++AN W WG +G+FKI+RG NECGIE +VVAG
Sbjct: 173 -QNGIPYWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAG 213
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/201 (37%), Positives = 102/201 (50%), Gaps = 21/201 (10%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
+LSV +L++C GC GG AWRY HGVV+ C P F P Y
Sbjct: 272 NLSVQNLISC-DTKNQHGCGGGNIEGAWRYLKTHGVVSYACYPSFWKHSLDSPSENHCYV 330
Query: 83 TPK---------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ + C N+L+R + HY RI+S DIM EI GPV+ V
Sbjct: 331 SSEYGKNHTNGPCPNALEDSNRLYRCASHY-----RISSKETDIMEEIMAKGPVQAIMKV 385
Query: 134 YEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADG 187
YEDF YK G+Y+H G H+VKL+GWG+ + + +WI AN W + WG +G
Sbjct: 386 YEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENG 445
Query: 188 YFKIKRGSNECGIEEDVVAGL 208
YF+I RG NEC IE+ ++ L
Sbjct: 446 YFRILRGQNECDIEKLILTTL 466
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 102/189 (53%), Gaps = 18/189 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ LS DL+ C + GC+GG P + Y G+V++ C PY G +H C P
Sbjct: 50 NVVLSPQDLVTCSWY--SFGCNGGIPGLVFDYIHKDGLVSDACFPYLSYDGNTHVKC-PD 106
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPED-------IMAEIYKNGPVEVSFTV 133
+ C K + +++ KH++ Y + ED I EI +GPV F V
Sbjct: 107 F----CYNN---KTKSFKSDKHFADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMV 159
Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
Y DF YKSGVY+H TG G HAVK+IGWGT ++G DYW++AN W ++G G+FKI R
Sbjct: 160 YSDFTVYKSGVYRHQTGSFEGIHAVKIIGWGT-ENGVDYWLIANSWGTTFGLQGFFKIVR 218
Query: 194 GSNECGIEE 202
G +EE
Sbjct: 219 GGKFIHLEE 227
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/201 (37%), Positives = 102/201 (50%), Gaps = 21/201 (10%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
+LSV +L++C GC+GG AWRY HGVV+ C P F P Y
Sbjct: 272 NLSVQNLISC-DTGNQRGCNGGSIDGAWRYLTTHGVVSYACYPSFWKHHLDSPSENQCYV 330
Query: 83 TPK---------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ + C N+L+R HY R++S DIM EI GPV+ V
Sbjct: 331 SSEYGKNHTNGPCPNALEDSNRLYRCGSHY-----RVSSKETDIMEEIMAKGPVQAIMKV 385
Query: 134 YEDFAHYKSGVYKH--ITGDVMGGHAVKLIGWGT----SDDGEDYWILANQWNRSWGADG 187
YEDF YK G+Y+H G H+VKL+GWG+ + + +WI AN W + WG +G
Sbjct: 386 YEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENG 445
Query: 188 YFKIKRGSNECGIEEDVVAGL 208
YF+I RG NEC IE+ ++ L
Sbjct: 446 YFRILRGQNECDIEKLILTTL 466
>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
Length = 349
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 99/177 (55%), Gaps = 22/177 (12%)
Query: 39 DGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-- 96
+GC+GG +A+++ G+V++ C PY G P C C +
Sbjct: 189 NGCNGGEFPTAFQFVETTGLVSDGCVPYQSGNGF----------VPPCPNSCANGEDINV 238
Query: 97 ---WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM 153
+NS+++ ++ D + + A I NGPV F VY DF +Y+SG YKH+ G ++
Sbjct: 239 RYRTKNSRNFDVN------DMKSVQASILANGPVISGFKVYRDFYNYRSG-YKHVAGGLV 291
Query: 154 GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
GGHA+K++GWG + YWI+AN W+ WG +GYF I RG+NEC IEE++ +P+
Sbjct: 292 GGHAIKVVGWGVTQSNVPYWIVANSWSDEWGMNGYFWILRGTNECSIEENMWETIPA 348
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/170 (42%), Positives = 93/170 (54%), Gaps = 14/170 (8%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSH---- 74
+++L +S DLL+CC CG GC+GG P AW Y+V G+V+E C PY C+H
Sbjct: 44 VRDLRISAGDLLSCCN-ACGLGCNGGDPDWAWLYYVETGIVSEFCQPY-PFPPCAHHVNS 101
Query: 75 ---PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSF 131
C Y TP C C + S S S ED E++ GP EV+F
Sbjct: 102 THYTPCSVEYDTPFCNITCTNTIPPIKYKGRISYSL----SGEEDYKRELFLYGPFEVAF 157
Query: 132 TVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
TVYEDF Y GVYKH +G+ +GGHAV+L+GWG +G YW +AN WN
Sbjct: 158 TVYEDFVAYSDGVYKHFSGNALGGHAVRLVGWGNL-NGTPYWKIANSWNH 206
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 68/166 (40%), Positives = 92/166 (55%), Gaps = 19/166 (11%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSHPG 76
L+ D L+CC + CG GC GGYP AW Y++ G+VT C P+ T C H G
Sbjct: 116 LAAADPLSCCTY-CGQGCRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWM-FTKCDHVG 173
Query: 77 -------CEP-AYPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
C YP P C R C N+ + K Y S+Y + IM EI KNGPV
Sbjct: 174 DSRKYSRCPHYTYPKPPCARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPV 233
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
EV+F +++DF Y+SG+Y H+ G +G HAV++IGWG ++G +YW
Sbjct: 234 EVTFAIFQDFGVYRSGIYHHVAGKFIGRHAVRMIGWGV-ENGVNYW 278
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 114/231 (49%), Gaps = 46/231 (19%)
Query: 10 ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S +S+Q++ LS +L++C G GC GG AW Y GVVTE+C
Sbjct: 259 AAVASDRISIQSMGHMTPRLSPQNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTEDCY 317
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKN-----------------QLWRNSKHYSISA 107
PY +P + TP V +C+ ++ Q + N + S
Sbjct: 318 PY-----------QPPHQTPAEVGRCMMQSRSVGRGKRQATQRCPNTQNYHNDIYQSTPP 366
Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVK 159
YR++S+ ++IM EI NGPV+ V+EDF YK+G+YKH G H+V+
Sbjct: 367 YRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIYKHTDVSFTKPPQYRKHGTHSVR 426
Query: 160 LIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
+ GWG + YWI AN W ++WG +GYF+I RG NEC IE V+
Sbjct: 427 ITGWGEDRNVDGTSRKYWIAANSWGKNWGENGYFRIVRGENECEIETFVIG 477
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 112/208 (53%), Gaps = 20/208 (9%)
Query: 16 YVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHP 75
Y Q LS +L +CC CG GC+GG+P+ A++Y+ GV T PY +GC
Sbjct: 139 YKGEQQPFLSDEELTSCCT-SCGYGCNGGFPLLAFKYWNEIGVPTG--GPYGSKSGCKPF 195
Query: 76 GCEP------AYPTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPE---DIMAEIYKN 124
P A TP C KC+ K +L ++ ++Y S Y I S + I EI +
Sbjct: 196 SIAPPTSSSTAAQTPLCQLKCISDYKRKLDKD-RYYGESYYLITSSNQPVKTIQREIMDH 254
Query: 125 GPVEVSFTVYEDFAHYKSGVY---KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNR 181
GPV + ++E F +YKSGVY K +G HAVKLIGWG YW++ N WN
Sbjct: 255 GPVVAAMEIFESFLYYKSGVYSANKRNDDPSLGLHAVKLIGWGEQKR-IPYWLVVNSWNT 313
Query: 182 SWGADGYFKIKRGSNECGIEE-DVVAGL 208
++G G FKI+RG+NECGIE V AGL
Sbjct: 314 TFGEQGLFKIRRGTNECGIENLHVTAGL 341
>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
Length = 351
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/192 (37%), Positives = 101/192 (52%), Gaps = 24/192 (12%)
Query: 21 NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
N LS D++ C F GC+GGY ++A Y ++ GV E C PY D T
Sbjct: 168 NEELSPQDMVDCSHDNF----GCEGGYLMNALDYLMNEGVTKESCTPYKDKTN------- 216
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHY-SISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
KC C K + + KHY R+ ++ E I ++ +NGP+ V TVYEDF
Sbjct: 217 ------KCQYTCQNKTEEFH--KHYCKPGTLRVLTNEEQIKRDLMQNGPLMVGLTVYEDF 268
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
+Y +G YK + G+++GGHAVKL+GW T+ G+ W++ NQWN WG G+ I NE
Sbjct: 269 INYATGDYKFVAGEIVGGHAVKLMGWRTTQKGQTSWLIQNQWNDDWGEQGFGYIL--ENE 326
Query: 198 CGIEEDVVAGLP 209
GI+ V P
Sbjct: 327 VGIDSIGVGCTP 338
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 92/156 (58%), Gaps = 15/156 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY-----F 67
Q+ LS DL++CC CGDGC GG+P AW Y+V G+VT C PY
Sbjct: 139 QSAELSALDLISCCED-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCE 197
Query: 68 DSTGCSHPGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNG 125
T +P C Y TP+C +KC K + + KHY +Y + S+ + I EI NG
Sbjct: 198 HHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKHYGEESYNVISNEKAIQKEIMMNG 257
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 161
PVE +F VYEDF +YKSG+Y+H+TG ++GGHA+++I
Sbjct: 258 PVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRII 293
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F ++ C A
Sbjct: 266 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKEQSTNNNSCAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YRI+S+ +IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATRPCPNSFEKSNRIYQCS-----PPYRISSNETEIMREIIQNGPVQAIMQV 379
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+H+ + HAVKL GWGT E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGWGTLRGAQGKKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 96/166 (57%), Gaps = 19/166 (11%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
LS D+++CC + CG GC+GG P +W Y+ GVVT C PY CSH
Sbjct: 116 LSAIDIVSCCAY-CGYGCNGGIPAMSWDYWTREGVVTGGTLENPTGCLPY-PFPKCSHGV 173
Query: 75 --PGCEPA----YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
PG P YPTPKC +KC N+ + K S+Y + DIM EI KNGPV
Sbjct: 174 VTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPV 233
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
+ F ++EDF YKSG+Y + TG ++GGHA+++IGWG ++G +YW
Sbjct: 234 DGIFYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGV-ENGVNYW 278
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 103/200 (51%), Gaps = 18/200 (9%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGCEPA 80
SLS +LL+C GC GG AW Y GVVT+EC P+ DS + P +
Sbjct: 252 SLSPQNLLSC-DTRNQRGCSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHS 310
Query: 81 YPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
T + R+ + Q N + S AYR+ ++IM E+ +NGPV+ V+EDF
Sbjct: 311 RSTGRGKRQATARCPNPQTHANDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDF 370
Query: 138 AHYKSGVYKHIT--------GDVMGGHAVKLIGWGTSD--DGE--DYWILANQWNRSWGA 185
YKSG+Y+H G H+VK+ GWG DG+ YW AN W R+WG
Sbjct: 371 FLYKSGIYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQLPDGQVQKYWTAANSWGRAWGE 430
Query: 186 DGYFKIKRGSNECGIEEDVV 205
DG+F+I RG NEC +E VV
Sbjct: 431 DGHFRIARGVNECEVESFVV 450
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 72/179 (40%), Positives = 92/179 (51%), Gaps = 18/179 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT------EECDPYFDSTGCS 73
+ +++S D+L CC + CG GC GG+PI AW Y G VT + C C
Sbjct: 143 KQVNISATDILTCC-YKCGYGCQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCG 201
Query: 74 HPGCEPAY-------PTPKCVRKCVK--KNQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H G E Y TPKC C KN + + K AY + + + I EI KN
Sbjct: 202 HHGNETYYGECGGRARTPKCRTSCTPGYKNS-YSDDKIRGKDAYELPNSVKAIQREIMKN 260
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSW 183
GPV +FTVY DF++YK G+YKH G G HAVK+IGWG D YWI+ N W+ W
Sbjct: 261 GPVVAAFTVYADFSYYKKGIYKHTAGRARGSHAVKVIGWGEEGD-VPYWIVKNSWHNDW 318
>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
Length = 354
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 83/170 (48%), Gaps = 14/170 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
C GGY AW + G + C PY G PA KC Q +
Sbjct: 197 ACQGGYLKYAWSFLERTGTTVDSCIPYASGRATFSSGTCPA--------KCKVSTQ---S 245
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
Y R S +I A I G V+ FT+Y DF Y+SGVYKH++ +GGHAV
Sbjct: 246 MTMYKAKNSRYISGVNNIKAAIMSYGSVQSGFTIYRDFMSYRSGVYKHVSTTTLGGHAVA 305
Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
LIGWG + G +YW+ N W +WG GYFKI +G ECGIE V AG P
Sbjct: 306 LIGWGV-ESGTNYWLAVNSWGSNWGMSGYFKIAQG--ECGIENQVYAGEP 352
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F ++ C A
Sbjct: 266 NLSPQNLISCCAKK-RHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATKPCPNSFEKSNRIYQCS-----PPYRVSSNETEIMREIIRNGPVQAIMQV 379
Query: 134 YEDFAHYKSGVYKHITGD--------VMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+H+ + HAVKL GWGT E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVISTNEESEKYRKLRSHAVKLTGWGTLRGAGGKKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465
>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
Length = 349
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/192 (36%), Positives = 102/192 (53%), Gaps = 24/192 (12%)
Query: 21 NLSLSVNDLLACC--GFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGC 77
N LS DL++C F GC GG + + ++ G+V+E+C PY + T C
Sbjct: 173 NEDLSPQDLVSCSYENF----GCSGGQLTESVDFLIYEGIVSEKCKPYMNQDTYCKFKCQ 228
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
P K C +K+ L I SD E+I E+ NGP+ V +VYED
Sbjct: 229 NDKQPYTKYF--CEQKSML-------------ILSDIEEIQLELMTNGPMMVGLSVYEDL 273
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
+YK GVY++ TG+ +GGHA+K+IGWG ++ GE +W NQW + WG GY IK G E
Sbjct: 274 MNYKEGVYEYTTGNQVGGHAIKIIGWGHTEKGELFWKCQNQWGKDWGMGGYINIKAG--E 331
Query: 198 CGIEEDVVAGLP 209
G++ V+ +P
Sbjct: 332 LGMDTMVLGCMP 343
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/179 (37%), Positives = 90/179 (50%), Gaps = 12/179 (6%)
Query: 45 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQL 96
+P AWRY+V +G+ + C PY C H G + + TP+C C K
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTDKTIP 220
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
K+ AY + E+ E+Y NGP VY D YKSGVY+++ G MG
Sbjct: 221 L--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
AVK++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/211 (36%), Positives = 101/211 (47%), Gaps = 38/211 (18%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ LS LL+C GC+GGY AW Y GVV+E C PY +S PG
Sbjct: 232 NIPLSAQQLLSCNQHR-QRGCEGGYLDRAWWYIRKLGVVSELCYPY-ESGATQQPG---- 285
Query: 81 YPTPKCVRKCVKKNQLWRNSKH------------YSISA-YRINSDPEDIMAEIYKNGPV 127
+C +R H Y ++ YR++S +DIM EI NGPV
Sbjct: 286 --------ECRIPKSAYRTGAHIDCPSGAADPSVYRMTPPYRVSSREQDIMTEIITNGPV 337
Query: 128 EVSFTVYEDFAHYKSGVYKHI--------TGDVMGGHAVKLIGWG---TSDDGEDYWILA 176
+ +F VYEDF Y GVY+H+ V G H+V++IGWG ++ YW+ A
Sbjct: 338 QATFLVYEDFFMYSGGVYQHLDLHEHKEEERKVQGYHSVRIIGWGEDYSTGPQVKYWLAA 397
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
N W WG DG F+I RG N C IE V+
Sbjct: 398 NSWGNEWGEDGLFRILRGENHCEIESFVIGA 428
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F ++ C A
Sbjct: 266 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATKPCPNSFEKSNRIYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQV 379
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+H+ + HAVKL GWGT E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 27/206 (13%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +L++CC GC+ G AW + G+V+ C P F ++ C A
Sbjct: 266 NLSPQNLISCCA-KNRHGCNSGSIDRAWWFLRKRGLVSHACYPLFKDQNTTNNICAMASR 324
Query: 81 -------YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
+ T C K N++++ S YR++S+ +IM EI +NGPV+ V
Sbjct: 325 SDGRGKRHATKPCPNSFEKSNRIYQCS-----PPYRVSSNETEIMREIIQNGPVQAIMQV 379
Query: 134 YEDFAHYKSGVYKHITG--------DVMGGHAVKLIGWGT----SDDGEDYWILANQWNR 181
+EDF +YK+G+Y+H+ + HAVKL GWGT E +WI AN W +
Sbjct: 380 HEDFFYYKTGIYRHVVSTNEEPEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGK 439
Query: 182 SWGADGYFKIKRGSNECGIEEDVVAG 207
SWG +GYF+I RG NE IE+ ++A
Sbjct: 440 SWGENGYFRILRGVNESDIEKLIIAA 465
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 104/200 (52%), Gaps = 12/200 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ ++LS LL+C C+GGY AW Y G+V E+C PY ++ C
Sbjct: 126 EKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRI 180
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
C + R SK+ AYR+ ++ DIM EI +GPV+ + VY DF
Sbjct: 181 PRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFT 239
Query: 140 YKSGVYKHI---TGDVMGGHAVKLIGWGT--SDDG-EDYWILANQWNRSWGADGYFKIKR 193
YK G+Y+H T D G H+V+++GWG S +G + YW +AN W WG +GYF+I R
Sbjct: 240 YKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILR 299
Query: 194 GSNECGIEEDVVAGLPSSKN 213
GSNEC IE V+ +N
Sbjct: 300 GSNECEIESFVLGTWAEVEN 319
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/203 (37%), Positives = 99/203 (48%), Gaps = 31/203 (15%)
Query: 39 DGCDGGYPISAWRYFVHHGVVTEE------------CDPYFDSTGCSHPGCE-------- 78
DGCDGG I+ W Y G VT C +F + C H G
Sbjct: 90 DGCDGGQIITPWTYVAKAGAVTGGQYNGTGPFGAGLCADWF-APHCHHHGPRGDDPYPAE 148
Query: 79 -----PAYPTPKCVRKC----VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P+ +P+ + C + + KH + S IMA I + GPVE
Sbjct: 149 GDAGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVET 208
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
+FTVYEDF +Y G+Y H+TG+ GGHAVK +GWG ++G YW +AN WN WG GYF
Sbjct: 209 AFTVYEDFENYAGGIYHHVTGEEAGGHAVKFVGWGV-ENGTKYWKVANSWNPYWGEAGYF 267
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I RGSNE GIE+ V +K
Sbjct: 268 RILRGSNEGGIEDQVTGSHADAK 290
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 112/231 (48%), Gaps = 46/231 (19%)
Query: 10 ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S +S+Q++ LS +L++C G GC GG AW Y GVVTE+C
Sbjct: 40 AAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRLDGAWWYLRRRGVVTEDCY 98
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----------------WRNSKHYSISA 107
PY P TP + +C+ +++ ++N + S
Sbjct: 99 PY-----------RPPQQTPAELSRCMMQSRSVGRGKRQATQRCPNTNNYQNDIYQSTPP 147
Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVK 159
YR+++ ++IM EI NGPV+ V+EDF Y SG+YKH G H+VK
Sbjct: 148 YRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSGIYKHTDVSFTKPPHYRKHGTHSVK 207
Query: 160 LIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
+ GWG DG YWI AN W ++WG +GYF+I RG NEC IE V+
Sbjct: 208 ITGWGEERNFDGTTRKYWIAANSWGKNWGENGYFRIARGENECEIEAFVIG 258
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/179 (37%), Positives = 90/179 (50%), Gaps = 12/179 (6%)
Query: 45 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQL 96
+P AWRY+V +G+ + C PY C H G + + TP+C C K
Sbjct: 162 FPGFAWRYYVEYGIASSYCQPY-PFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTDKTIP 220
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
K+ AY + E+ E+Y NGP VY D YKSGVY+++ G MG
Sbjct: 221 L--IKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDGSYMGVT 278
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
AVK++GWG +G YW +AN W+ WG DGY I RG+NEC IE AG P + L
Sbjct: 279 AVKVVGWG-KLNGTPYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPDTSQLT 336
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 88/157 (56%), Gaps = 14/157 (8%)
Query: 63 CDPYFDSTGCSH-------PGCEPA-YPTPKCVRKC--VKKNQLWRNSKHYSISAYRINS 112
C PY D C+H P C YPTP CV +C K R+ +H+ + + +
Sbjct: 3 CWPY-DFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
D I +GPV SFTVYEDF Y+SGVYKH +G +GGHAVK+IGWG G+ Y
Sbjct: 62 SVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK-SGQAY 120
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
W+ N WN WG G FKI G+ CGI++D++ G P
Sbjct: 121 WLAVNSWNEDWGDHGLFKIALGN--CGIDDDLLGGTP 155
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 112/236 (47%), Gaps = 49/236 (20%)
Query: 21 NLSLSVNDLLACCGFL--CGD-GCDGGYPISAWRYFVHHGVVT-------------EECD 64
N LS ++LACC + C GC GG +AW + HG+VT + C
Sbjct: 110 NQLLSAGEMLACCNSVHSCNSHGCQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCW 169
Query: 65 PY------FDSTGCSHPGC---------------------EPAYPTPKCVRKCV--KKNQ 95
PY D + C + Y TP C+ +C K
Sbjct: 170 PYSFPKCAHDQEDSKYEPCPEVRVPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGT 229
Query: 96 LWRNSKHYSISAYR-INSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
+H++ A + ++I EI NGP SF+ YEDF+ YKSGVYKH +G +G
Sbjct: 230 PRDKDRHFTARALPYLFEGTDNIKKEIMTNGPTSASFSTYEDFSSYKSGVYKHTSGGYLG 289
Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPS 210
H+V++IGWGT + G DYW++ N WN WG G FKI +G +CGI++ V LP+
Sbjct: 290 DHSVEIIGWGT-EKGVDYWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPA 342
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 97/178 (54%), Gaps = 17/178 (9%)
Query: 18 SLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVV-------TEECDPYFDST 70
S + +S +D+L+CCG CG GC GG+ I A+++ C P S
Sbjct: 21 STIRVMISDSDILSCCGISCGYGCQGGWSIEAYKWMQRERCCYRWENTDRRVCKPVRPSI 80
Query: 71 GCSHPGCEPAY--------PTPKCVRKCVKKN-QLWRNSKHYSISAYRINSDPEDIMAEI 121
+ +P Y PTPKC + C +K + ++ KH++ AY + ++ I EI
Sbjct: 81 RVGNHPNDPYYGPCPGGLWPTPKCRKTCQRKYYKSYQEDKHFATRAYYLPNNERSIRQEI 140
Query: 122 YKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
YKNGPV +F VY+DF++YK G+Y H G G HAVK++GWG ++ DYW++AN W
Sbjct: 141 YKNGPVVAAFRVYQDFSYYKKGIYVHKWGGQTGAHAVKVVGWG-RENATDYWLIANSW 197
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/182 (39%), Positives = 98/182 (53%), Gaps = 19/182 (10%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------F 67
N LS ++ CC CG+GC+GGYPI AW+ F +HG+VT E C+PY +
Sbjct: 78 NQLLSAEEITFCC-HKCGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY 136
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPEDIMAEIYKNGP 126
D G + +P P KC +KC + N H Y+ Y + I ++ GP
Sbjct: 137 DKDGKNTCSGQPMEPNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY--RGIQKDVINYGP 194
Query: 127 VEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGA 185
+E SF VY+DF +YKSG+Y K +GGH+VKLIGWG + G YW++ N WN WG
Sbjct: 195 IEASFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWG-EEYGVLYWLMVNSWNADWGD 253
Query: 186 DG 187
G
Sbjct: 254 KG 255
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 114/231 (49%), Gaps = 46/231 (19%)
Query: 10 ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S +S+Q++ LS +L++C G GC GG AW + GVVTE+C
Sbjct: 237 AAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCTGGRIDGAWWFLRRRGVVTEDCY 295
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----------------WRNSKHYSISA 107
PY P TP + +C+ +++ ++N + S
Sbjct: 296 PY-----------RPPQQTPAELGRCMMQSRSVGRGKRQATQRCPNTNNYQNDIYQSTPP 344
Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVK 159
YR++++ ++IM EI NGPV+ V+EDF YKSG+YKH G H+VK
Sbjct: 345 YRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIYKHTDVSFTKPPQYRKHGTHSVK 404
Query: 160 LIGWGTSD--DG--EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
+ GWG DG YWI AN W ++WG +GYF+I RG NEC IE V+
Sbjct: 405 ITGWGEERNVDGAKRKYWIAANSWGKNWGEEGYFRIARGENECEIEAFVIG 455
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/170 (41%), Positives = 95/170 (55%), Gaps = 20/170 (11%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTGCS 73
++LS DLL+CC CG GC+GG P+SAW+++V G+VT C PY C
Sbjct: 24 QVTLSAADLLSCC-RSCGFGCNGGDPLSAWKFWVKEGIVTGSNHSTNAGCKPY-PFPACE 81
Query: 74 H--------PGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
H P +PTPKC + C + ++ K++ SAY + + E I EI
Sbjct: 82 HHSNKTHYDPCKHDLFPTPKCEKSCQATFGERTYKEDKYFGRSAYGVKNHMEAIQKEIIT 141
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
GPVEV+F VYEDF +Y G+Y H G + GGHAVK+IGWG D+G YW
Sbjct: 142 YGPVEVAFEVYEDFLNYAGGIYVHQGGALGGGHAVKMIGWGI-DNGVPYW 190
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 103/200 (51%), Gaps = 17/200 (8%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA--- 80
LS+ +LLAC GC+GG+ AW Y GVV EEC PY C+
Sbjct: 236 LSMQNLLAC-NNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVPRRG 294
Query: 81 -YPTPKCV------RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
T KC RK + ++ R S AYRI +DIM EI ++GPV+ + V
Sbjct: 295 NLATMKCQLVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFEDDIMNEILQHGPVQATMRV 354
Query: 134 YEDFAHYKSGVYKHITGDVM---GGHAVKLIGWGTSDDGED---YWILANQWNRSWGADG 187
+ DF Y+ GVY++ + G H+V+++GWG + YW++AN W R WG DG
Sbjct: 355 HPDFFLYRGGVYRYSGTNSQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGRLWGEDG 414
Query: 188 YFKIKRGSNECGIEEDVVAG 207
YF+I RG NE IE+ V+A
Sbjct: 415 YFRIVRGENESDIEKFVLAA 434
>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
Length = 568
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/194 (35%), Positives = 106/194 (54%), Gaps = 26/194 (13%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
LS ++++C + GC+GG+P + +Y G+V EEC PY AY
Sbjct: 389 LSPQEIVSCSEY--SQGCEGGFPYLIGGKYAQDFGLVEEECFPY------------QAYD 434
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
+P +KC + + S+++ + + + + E+ +NGP+ V+F VY+DF HY++
Sbjct: 435 SPCTPKKCSR----YYTSEYHYVGGFYGGCNEALMKHELIQNGPLTVAFEVYDDFIHYRT 490
Query: 143 GVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGS 195
G+Y H + HAV L+G+GT + GEDYWI+ N W SWG +GYF+I RG+
Sbjct: 491 GIYHHTGLRDNFNPFELTNHAVLLVGYGTDEKTGEDYWIVKNSWGTSWGENGYFRILRGT 550
Query: 196 NECGIEEDVVAGLP 209
+EC IE VA P
Sbjct: 551 DECAIESIAVAATP 564
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 104/207 (50%), Gaps = 15/207 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ + L+ +++C GC GG+ +AW Y G V EEC PY + H C+
Sbjct: 232 ETVQLAPQQIVSCVRR--SQGCSGGHLDTAWSYLRKVGTVNEECYPYISA----HNVCKI 285
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
C ++ R + + A+ +N++ DIM EI K+GPV+ V+ DF
Sbjct: 286 RPSDTLITANCELPMKVDRTNMYKMGPAFSLNNE-TDIMLEIKKHGPVQAIMRVHRDFFS 344
Query: 140 YKSGVYKHITGDV-----MGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKI 191
YKSG+Y+H G H+V+LIGWG G + YWI N W WG +G F+I
Sbjct: 345 YKSGIYRHSAASTSADQRAGYHSVRLIGWGEERHGYEVTKYWIAVNSWGTWWGENGRFRI 404
Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEI 218
RGSNEC IE V+A LP VK++
Sbjct: 405 LRGSNECEIESYVLASLPYVHQQVKDL 431
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 45/224 (20%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
LS ++ AC GCDGG P AW + + G+ T + C PY D
Sbjct: 196 LSAGEMNACAPSF---GCDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPY-DFP 251
Query: 71 GCSH-------PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
C+H P C + +Y TP C +C K R+ +H+ + + D
Sbjct: 252 PCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNA 311
Query: 121 IYKNGPV---------------EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
I +GPV SF VYEDF Y+SGVYKH +G +GGHAVK+IGWG
Sbjct: 312 IRTDGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWG- 370
Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+ G+ YW++ N WN WG +G FKI G+ C I++D++ G P
Sbjct: 371 EETGQAYWLVVNSWNEDWGDNGLFKIALGN--CEIDDDLLGGTP 412
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 104/200 (52%), Gaps = 12/200 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ ++LS LL+C C+GGY AW Y G+V E+C PY ++ C
Sbjct: 252 EKVTLSAQHLLSC-DRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPY----SATNEKCRI 306
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
C + R SK+ AYR+ ++ DIM EI +GPV+ + VY DF
Sbjct: 307 PRRGDLVTANCQLPTNVDRRSKYKVAPAYRVGNE-TDIMYEILHSGPVQATMKVYHDFFT 365
Query: 140 YKSGVYKHI---TGDVMGGHAVKLIGWGT--SDDG-EDYWILANQWNRSWGADGYFKIKR 193
YK G+Y+H T D G H+V+++GWG S +G + YW +AN W WG +GYF+I R
Sbjct: 366 YKRGIYRHSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILR 425
Query: 194 GSNECGIEEDVVAGLPSSKN 213
GSNEC IE V+ +N
Sbjct: 426 GSNECEIESFVLGTWAEVEN 445
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 76/203 (37%), Positives = 107/203 (52%), Gaps = 22/203 (10%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA-- 80
+LS +LL+C GC+GG AW + GVVT+EC P F + +H PA
Sbjct: 304 ALSPQNLLSC-NTRHQQGCNGGRIDGAWWFLRRRGVVTDECYP-FSNQETNHSPNAPACM 361
Query: 81 ---YPTPKCVRKCVKKNQLWR---NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
T + R+ + + R N + S AYR++S+ ++IM E+ +NGPV+ V+
Sbjct: 362 MHSRSTGRGKRQAIARCPNPRSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQAILEVH 421
Query: 135 EDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTSD--DG--EDYWILANQWNRS 182
EDF Y++G+Y+H G H+VK+ GWG DG + YWI AN W +
Sbjct: 422 EDFFMYRTGIYRHTAVAAGKPEQYRRHGTHSVKITGWGEEQMPDGSNQKYWIAANSWGKD 481
Query: 183 WGADGYFKIKRGSNECGIEEDVV 205
WG GYF+I RG NEC IE VV
Sbjct: 482 WGEHGYFRITRGENECEIETFVV 504
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 90/179 (50%), Gaps = 12/179 (6%)
Query: 45 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP--------AYPTPKCVRKCVKKNQL 96
+P AW Y+V +G+ + C PY C H G + + TPKC C K+
Sbjct: 162 FPGFAWLYYVEYGIASSGCQPY-PFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDKSIP 220
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH 156
K+ + Y + ED E+Y NGP F VY D YKSGVY+++ GD +GG
Sbjct: 221 L--VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFLGGQ 278
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLV 215
AV+++GWG +G YW +AN W+ WG +GY I G+NEC IE G P L
Sbjct: 279 AVRIVGWGKL-NGTPYWKVANSWDTDWGMNGYMLILGGNNECNIEHLGFTGFPDPSQLT 336
>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 94/186 (50%), Gaps = 9/186 (4%)
Query: 37 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 96
C GCDGGYP A+R+ G+ E C Y G C V +C +
Sbjct: 176 CSLGCDGGYPDGAFRFMQDEGITPELCVKYVSKDGTDPLECSDVQTM---VSECTATSNA 232
Query: 97 WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYK--HITGDVMG 154
N Y +SD E I +I ++GPV S+ V+EDF Y SGVY D +G
Sbjct: 233 TVNGDR---CYYHSSSDIETIQRDIMQHGPVLASYEVFEDFGEYDSGVYTCPDDGSDSIG 289
Query: 155 GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNL 214
HAV ++GWG +D YW++ N W +G DGYFKI RG+NEC IE +V L +++ +
Sbjct: 290 WHAVIIVGWGV-EDNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSLVNTEGV 348
Query: 215 VKEITS 220
V TS
Sbjct: 349 VFASTS 354
>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 105
Score = 119 bits (299), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 54/102 (52%), Positives = 73/102 (71%), Gaps = 1/102 (0%)
Query: 106 SAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGT 165
AY++ + + I +I KNGPV ++TVYEDFAHY+SG+YKH G G HAVK+IGWG
Sbjct: 2 KAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWG- 60
Query: 166 SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
+ G YWI+AN W+ WG +G+F++ RGSN+CG EE + AG
Sbjct: 61 EEKGTPYWIVANSWHDDWGENGFFRMHRGSNDCGFEERMAAG 102
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 93/171 (54%), Gaps = 19/171 (11%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
+ LS DL++CC + CG+GC GG P +AW Y+ +G+VT C PY
Sbjct: 111 MMQPELSAIDLVSCCSY-CGNGCQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPY-PFPQ 168
Query: 72 CSHPGCEPA--------YPTPKCVRKC-VKKNQLWRNSKHYSISAYRINSDPEDIMAEIY 122
C HPG YPTP C C ++ + K Y ++Y ++ IM EI
Sbjct: 169 CRHPGSRSQLNPCPGYIYPTPSCYPYCQAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIM 228
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYW 173
KNGPVE F VY DFA YKSG+Y H++G G HA+++IGWG ++G +YW
Sbjct: 229 KNGPVEAGFIVYTDFAVYKSGIYHHVSGRYAGKHAIRIIGWGV-ENGVNYW 278
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 119 bits (298), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 106/213 (49%), Gaps = 15/213 (7%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
+ L+ +++C GC GG+ +AW Y G V +EC PY + C+
Sbjct: 235 VQLAPQQIISCVRR--SQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQN----ACKIRP 288
Query: 82 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
C ++ R + + A+ +N++ DIM EI K+GPV+ V+ DF YK
Sbjct: 289 SDTLITANCDLPTKVDRTNMYKMGPAFSLNNE-TDIMIEIKKHGPVQAILRVHRDFFSYK 347
Query: 142 SGVYKHIT----GDVMGG-HAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKIKR 193
SG+Y+H GD G H+V+LIGWG +G + YW+ N W R WG +G F+I R
Sbjct: 348 SGIYRHSAASSAGDERAGYHSVRLIGWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVR 407
Query: 194 GSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
G NEC IE V+A LP VK + ++
Sbjct: 408 GQNECEIESYVLASLPYVHQQVKPMRQVGELQE 440
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 89/157 (56%), Gaps = 14/157 (8%)
Query: 63 CDPYFDSTGCSH-------PGC-EPAYPTPKCVRKC--VKKNQLWRNSKHYSISAYRINS 112
C PY D C+H P C + +Y TP CV +C K +N +HY + +
Sbjct: 373 CWPY-DFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRHYMLESSPYQY 431
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
+ I +GP+ S+ VYEDF YKSGVYKH +G +GGHAVK+IGWG ++GE Y
Sbjct: 432 SVNNAKNAIRTDGPISASYLVYEDFLAYKSGVYKHTSGSYLGGHAVKIIGWG-EENGEAY 490
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
W++ N WN WG G FKI G+ C I++D++ G P
Sbjct: 491 WLVVNSWNEDWGDQGLFKIALGN--CEIDDDLLGGTP 525
>gi|308159555|gb|EFO62082.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 305
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 105/186 (56%), Gaps = 17/186 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGC 77
Q +SLSV +++C G+ GC GG S+W + GVV +C PY TG S
Sbjct: 128 QAVSLSVQHMVSCDN---GEAGCLGGEFESSWAFLETEGVVKSDCLPYTSGETGNSG--- 181
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
+C C + L ++ HY ++ ++ +IM + +GPV+ F V+EDF
Sbjct: 182 -------ECPMMC-QDGTLVEDAFHYKAASASPLNNYNEIMVSLLADGPVQTGFYVHEDF 233
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
+Y G+Y + G +GGHAV ++G+G+ +D DYWI+ N W WG +GYF+I RG+NE
Sbjct: 234 LYYVGGIYHKVYGSSLGGHAVLIVGYGSMND-HDYWIVRNSWGPDWGENGYFRILRGTNE 292
Query: 198 CGIEED 203
CGIE++
Sbjct: 293 CGIEKN 298
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/172 (39%), Positives = 89/172 (51%), Gaps = 16/172 (9%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTG-CSHPGCEPAYPTPKCVRKCVKKNQLWR 98
GC GG S W + HG T EC PY D+ S P C C +++ R
Sbjct: 143 GCAGGLSFSVWTFLTEHGTTTLECVPYTDANKDISSP----------CPDACADGSEI-R 191
Query: 99 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 158
K Y N IM + +GPV+ S VY DF +Y+SGVY+H+ G + HAV
Sbjct: 192 LVKADGCLDYSGNVTA--IMQALANDGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHAV 249
Query: 159 KLIGWGTSDDGED--YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
++IG+G +DD + YWI+ N WG +GYF I RGSNEC IE V +GL
Sbjct: 250 EIIGYGAADDEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 115/233 (49%), Gaps = 50/233 (21%)
Query: 10 ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S +S+Q++ LS +L++C G GC GG AW Y GVVTE C
Sbjct: 234 AAVASDRISIQSMGHMTPQLSPQNLISCDTRNQG-GCAGGRIDGAWWYLRRRGVVTENCY 292
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL-----------------WRNSKHYSISA 107
PY +P P V +C+ +++ + N + S
Sbjct: 293 PY-----------QPPQQAPAEVGRCMMQSRAVGRGKRQATQRCPNTYNYHNDIYQSTPP 341
Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV----------MGGHA 157
Y+++S+ ++IM EI +NGPV+ V+EDF YK+G+YKH DV G H+
Sbjct: 342 YKLSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNGIYKHT--DVSSTKPPQYRKHGTHS 399
Query: 158 VKLIGWGTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
V++ GWG D YWI AN W ++WG +G+F+I RG+NEC IE V+
Sbjct: 400 VRITGWGEDKDYDGTPRKYWIAANSWGKNWGENGFFRIARGANECEIEAFVIG 452
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/160 (43%), Positives = 91/160 (56%), Gaps = 14/160 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT----EE---CDPY------F 67
N SLS DLL+CC CG GCDGG+P AW ++ HG+VT EE C PY
Sbjct: 19 NKSLSAVDLLSCCK-DCGYGCDGGFPPMAWDFWKTHGIVTGGSKEEPAGCRPYPFPKCQH 77
Query: 68 DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
S G P YPTPKCV+ C ++ K + ++Y ++ IM EI NGPV
Sbjct: 78 HSQGHYPPCPRRIYPTPKCVKHCDTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPV 137
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
E +F V+EDF YKSG+Y H G +GGHA++++GWG +
Sbjct: 138 EATFEVHEDFPEYKSGIYFHAWGGSVGGHAIRILGWGEEN 177
>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 303
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 91/169 (53%), Gaps = 15/169 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GCDGG W + G T EC Y D P C C +Q+
Sbjct: 144 GCDGGDFWPTWSFLTLTGATTAECVKYIDY---------PNIVASPCPAVCDDGSQI--- 191
Query: 100 SKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGHA 157
+ Y Y +++ + + IM + GPV+ VY D ++Y+SGVYKH G + +G HA
Sbjct: 192 -QLYKAHGYGQVSKNVQAIMHMLATGGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHA 250
Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
++++G+GT+DDG DYWI+ N W WG +GYF+I RG NEC IE+++ A
Sbjct: 251 LEMVGYGTTDDGTDYWIIRNSWGADWGENGYFRIVRGVNECRIEDEIYA 299
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 102/202 (50%), Gaps = 27/202 (13%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
+S T D + +Q + +S D+LACCG CG GC+GG AW Y GVVT
Sbjct: 126 VSAAETMSDRICVQSKGRVQKM-ISDVDILACCGRECGRGCNGGMDHKAWEYVKEFGVVT 184
Query: 61 ----EE---CDPYFDSTGCSHPGCE-----------PAYPTPKCVRKC-VKKNQLWRNSK 101
+E C PY HP CE ++ TP C + C + + K
Sbjct: 185 GGRYQEKGVCKPYH-----LHP-CEITGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDK 238
Query: 102 HYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI 161
Y S Y ++ D + I E+ KNGPV+ +FT YEDF+ Y+ G+Y H G G HAVK++
Sbjct: 239 SYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYEDFSFYRKGIYVHSYGRQRGAHAVKVV 298
Query: 162 GWGTSDDGEDYWILANQWNRSW 183
GWG ++G YW +AN W+ W
Sbjct: 299 GWGV-ENGTKYWNVANSWSTDW 319
>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
Length = 238
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 100/202 (49%), Gaps = 19/202 (9%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+ + LS DLL+C C GG+ WR+ H V+E+C PY + C
Sbjct: 13 VDKVELSPQDLLSCLNGGRRVTCQGGHVDRGWRFLGRHAGVSEDCYPYESGYSNASTTCR 72
Query: 79 PA---YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
A PT + +++Q K++S YR+ ++ EDIM EIY NGPV+ V E
Sbjct: 73 IARRRVPTEDPICPTGRQDQ-----KYFSTPPYRVPANEEDIMQEIYANGPVQALMLVKE 127
Query: 136 DFAHYKSGVYKHIT--------GDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWG 184
DF Y SGVYKH H+V+++GWG T + YW+ AN W WG
Sbjct: 128 DFFLYSSGVYKHTRLAHNLPPEYQKSDWHSVRILGWGVDRTQYRPQKYWLCANSWGSGWG 187
Query: 185 ADGYFKIKRGSNECGIEEDVVA 206
+GYF+I RG +E IE V+A
Sbjct: 188 ENGYFRIVRGEDESQIESFVLA 209
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 108/227 (47%), Gaps = 62/227 (27%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTE---- 61
++R + ++ +VS++ +S DLL CCG +CGDGC+GGYP AW ++ G+V+
Sbjct: 118 SDRICIHTNAHVSVE---VSAEDLLTCCGIMCGDGCNGGYPAGAWNFWTRKGLVSGGLYD 174
Query: 62 ---ECDPYFDSTGCSH------PGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY C H P C TPKC + C + ++ KHY ++Y ++
Sbjct: 175 SHVGCRPY-SIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVS 233
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED 171
+ +DIMAEIYKN G
Sbjct: 234 NSEKDIMAEIYKN--------------------------------------------GTP 249
Query: 172 YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKNLVKEI 218
YW++AN WN WG +G+FKI RG + CGIE +VVAG+P + ++I
Sbjct: 250 YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTDQYWEKI 296
>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 134
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/134 (46%), Positives = 82/134 (61%), Gaps = 8/134 (5%)
Query: 81 YPTPKCVRKC--VKKNQLWRNSKHYSISAY--RINSDPEDIMAEIYKNGPVEVSFTVYED 136
Y TP C C K + +HY+ S + R S I EI NGP +F+VYED
Sbjct: 3 YDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGS-TSSIKKEIMTNGPTSAAFSVYED 61
Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
F YKSGVYKH +G +GGHAV++IGWGT + G DYW++ N WN WG G FKI +G
Sbjct: 62 FLSYKSGVYKHTSGGFLGGHAVEIIGWGT-EKGVDYWLVMNSWNEEWGDHGTFKIVQG-- 118
Query: 197 ECGIEEDVVAGLPS 210
+CGI++ ++AG P+
Sbjct: 119 DCGIDDMILAGTPA 132
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 88/154 (57%), Gaps = 10/154 (6%)
Query: 63 CDPYFDSTGCSH------PGC-EPAYPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDP 114
C PY C H P C Y TP+C + C K + + K + + + ++
Sbjct: 188 CQPY-PFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPFEQDKPFGEGSSNVQNNE 246
Query: 115 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWI 174
+ +I GPVE +F VYEDF + KSG+ +H+TG ++GGH +++IGWG + G YW+
Sbjct: 247 KVFQRDIMMYGPVEAAFDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGV-EKGNPYWL 305
Query: 175 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+AN WN WG +G F++ RG +EC IE VVAGL
Sbjct: 306 IANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
Length = 183
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 68/167 (40%), Positives = 96/167 (57%), Gaps = 21/167 (12%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPY------- 66
N+ LS DLL+CC CG GC GG+ AW Y+ +G+VT C PY
Sbjct: 19 NVQLSARDLLSCC-TSCGFGCVGGWIGDAWDYWRDNGIVTGGDYQDKSTCLPYPFPPSHH 77
Query: 67 FDSTGCS---HPGCEPAYPTPKCVRKCVKKNQ-LWRNSKHYSISAYRINSDPEDIMAEIY 122
S G +P + YPTP CV KC + + K +++S+Y+I+ + +I EI
Sbjct: 78 LVSKGTPFEIYP--QTLYPTPPCVSKCQEGYPGEYEKDKIFALSSYKIDRNATEIQKEIL 135
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDG 169
NGPVE VY DF +YK+GVY+H TG+++GGHA++L+GWG + DG
Sbjct: 136 INGPVEAGMNVYADFPNYKTGVYQHTTGEILGGHAIRLLGWGKTKDG 182
>gi|253744204|gb|EET00443.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 309
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 18/195 (9%)
Query: 17 VSLQNLSLSVNDLLACCGFLCGDGCDG--GYPISAWRYFVHHGVVTEECDPYFDSTGCSH 74
V + S +L+C +GC G + +W + G+ E C Y D +
Sbjct: 119 VDQEATRYSAQYILSCA---TTNGCLAFPGQGVVSWDFIATTGIPLESCVKYTD-----Y 170
Query: 75 PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYR-INSDPEDIMAEIYKNGPVEVSFTV 133
E +YP P C + L Y Y + +PE + I GP++ FTV
Sbjct: 171 DKTESSYPCPSL---CNDNSSL----VLYKSDGYEGVGFNPEKLRRAIALRGPMQAMFTV 223
Query: 134 YEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
YEDFA+Y G+Y H+ G G +V+++G+GTSD+G+DYWI+ N W +WG DGYF+I R
Sbjct: 224 YEDFAYYLEGIYSHVYGGTAGYLSVEIVGYGTSDEGQDYWIVKNYWGSNWGEDGYFRIVR 283
Query: 194 GSNECGIEEDVVAGL 208
G NEC IEE V +
Sbjct: 284 GQNECQIEEAVYGAI 298
>gi|321476473|gb|EFX87434.1| hypothetical protein DAPPUDRAFT_221708 [Daphnia pulex]
Length = 464
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/214 (36%), Positives = 108/214 (50%), Gaps = 32/214 (14%)
Query: 11 LSSSPYVSLQN---LSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 66
L S V+ +N ++LS D+++C + GC+GG+P + A +Y HGVV EEC PY
Sbjct: 266 LESRLRVATKNQVQVNLSPQDIVSCSAY--SQGCEGGFPYLIAGKYAQDHGVVAEECYPY 323
Query: 67 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
TG C A KC R V +K+ + Y + E + + ++GP
Sbjct: 324 ---TG-RDSACSAA---KKCQRSYV--------AKYRYVGGYYGACNEELMKMSLVESGP 368
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDV----------MGGHAVKLIGWGT-SDDGEDYWIL 175
+ VSF VY DF HY GVY G + HAV L+G+GT S E YWI+
Sbjct: 369 LSVSFEVYSDFMHYAGGVYHRTDGLFNKINEFNPFELTNHAVLLVGYGTDSQTKEKYWIV 428
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
N W WG DG+F+I+RG +ECGIE V P
Sbjct: 429 KNSWGTKWGEDGFFRIRRGVDECGIESIAVEVTP 462
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/245 (32%), Positives = 111/245 (45%), Gaps = 66/245 (26%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------------EECDPYFDST 70
LS ++ AC GC+GG+P SAW + G+ T + C PY D
Sbjct: 591 LSAGEMNACAP---SHGCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPY-DFP 646
Query: 71 GCSH-------PGC----------------------EPAYPTPKCVRKC--VKKNQLWRN 99
C+H P C + +Y TP C +C K R+
Sbjct: 647 PCAHHINDTKYPECPKVSCSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRD 706
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPV---------------EVSFTVYEDFAHYKSGV 144
+H+ + + D I +GPV SF+VYEDF YKSGV
Sbjct: 707 DRHFMLESSPYQYSVNDAKNAIRTDGPVGPIYFCDPNVNFDQVSASFSVYEDFLAYKSGV 766
Query: 145 YKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 204
YKH +G+ +GGHAVK+IGWG + G+ YWI+ N WN WG G FKI G+ CGI++++
Sbjct: 767 YKHTSGEYLGGHAVKIIGWG-EESGQAYWIVVNSWNEDWGDHGLFKIALGN--CGIDDNL 823
Query: 205 VAGLP 209
+ G P
Sbjct: 824 LGGTP 828
>gi|417401357|gb|JAA47568.1| Putative dipeptidyl peptidase 1 [Desmodus rotundus]
Length = 463
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 105/204 (51%), Gaps = 31/204 (15%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCDGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C+ K +R S+++ + + + + E+ NGP+ V+F VY D
Sbjct: 331 -----------CMLKEDCFRYYTSEYHYVGGFYGGCNEALMKLELVHNGPMAVAFEVYND 379
Query: 137 FAHYKSGVYKHITGDV-------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGY 188
F HY+ G+Y H TG + HAV L+G+GT G DYWI+ N W +WG DGY
Sbjct: 380 FLHYQEGIYHH-TGLTDPFNPFELTNHAVLLVGYGTDPATGMDYWIVKNSWGTAWGEDGY 438
Query: 189 FKIKRGSNECGIEEDVVAGLPSSK 212
F+I+RG++EC IE VA P K
Sbjct: 439 FRIRRGTDECAIESIAVAATPIPK 462
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 83/221 (37%), Positives = 108/221 (48%), Gaps = 34/221 (15%)
Query: 21 NLSLSVNDLLACCGFL--C-GDGCDGGYPISAWRYFVHHGVVT-------------EECD 64
N LS +LLACC C GC GG AW + HG+ T + C
Sbjct: 134 NQLLSAGELLACCNLAHSCEARGCKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCW 193
Query: 65 PYFDSTGCSH--------PGCEPAYPTPKCVRKCV--KKNQLWRNSKHYSISA--YRINS 112
PY + C+H P + +Y TP C+ +C K +H++ A Y N
Sbjct: 194 PY-NFPRCAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWFNG 252
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDY 172
I EI K+GP SF YEDF YKSGVYK+ +G + H V+LIGWGT + G DY
Sbjct: 253 I-RSIKKEIMKHGPTSASFFTYEDFFSYKSGVYKYTSGAYVEFHTVELIGWGT-EKGVDY 310
Query: 173 WILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLPSSKN 213
W+ N WN W G FKI +G +CGI D+V G P++ N
Sbjct: 311 WLAKNDWNEEWADLGTFKIAQG--DCGI-NDLVLGAPAALN 348
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 110/211 (52%), Gaps = 14/211 (6%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
++ ++R A+ S+ + + LS LL+C GC GG+ AW + HG+V
Sbjct: 217 IATVASDRFAIQSN---GAERMVLSPQVLLSC-NIRRQQGCRGGHIDVAWNFARGHGLVD 272
Query: 61 EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAE 120
EEC PY +T P P + + R S+ Y + + DIM +
Sbjct: 273 EECFPYKAATTSC-----PFRPKANLIEDGCRPPVRQRTSR-YKVGPPGKLATENDIMYD 326
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHIT-GD--VMGGHAVKLIGWGTSDDGEDYWILAN 177
I ++GPV TV++DF HY G+Y+ GD + G H+V+++GWG D G+ YW++AN
Sbjct: 327 IMESGPVHAVMTVHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWG-EDRGDKYWVVAN 385
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
W WG +GYF+I RGSNE GIE VV L
Sbjct: 386 SWGCDWGENGYFRIARGSNESGIESFVVTVL 416
>gi|361069783|gb|AEW09203.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153583|gb|AFG58928.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153585|gb|AFG58929.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153587|gb|AFG58930.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153589|gb|AFG58931.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153591|gb|AFG58932.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153593|gb|AFG58933.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153595|gb|AFG58934.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153597|gb|AFG58935.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153599|gb|AFG58936.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153601|gb|AFG58937.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153603|gb|AFG58938.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153605|gb|AFG58939.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153607|gb|AFG58940.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
gi|383153609|gb|AFG58941.1| Pinus taeda anonymous locus CL4685Contig1_03 genomic sequence
Length = 68
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 54/69 (78%), Positives = 61/69 (88%), Gaps = 1/69 (1%)
Query: 137 FAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
FAHYKSGVYK+I GD+MGGHAVKL+GWGT + G DYW++AN WN +WG DGYFKI RGSN
Sbjct: 1 FAHYKSGVYKYIKGDLMGGHAVKLVGWGT-EGGTDYWLVANSWNTAWGEDGYFKIARGSN 59
Query: 197 ECGIEEDVV 205
ECGIEEDVV
Sbjct: 60 ECGIEEDVV 68
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 88/156 (56%), Gaps = 17/156 (10%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPYFDSTG 71
++ + +S D+++CC + CG GCDGG+PI AW++F GVVT C PY + T
Sbjct: 22 VKQVLISAQDMVSCCSY-CGYGCDGGWPIKAWQFFAREGVVTGGNYGRQGCCRPY-EITP 79
Query: 72 CSHPGCEPAY-------PTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYK 123
C H G EP Y TP+C RKC ++ K Y AY++ + + I EI
Sbjct: 80 CGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKRYGRKAYQLPNSVKAIQREIMM 139
Query: 124 NGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
+GPV +TVYEDF++Y G+YKH G GGHAVK
Sbjct: 140 HGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAVK 175
>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain 1; AltName: Full=Dipeptidyl peptidase I
heavy chain 1; Contains: RecName: Full=Dipeptidyl
peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
peptidase I heavy chain 2; Contains: RecName:
Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
Full=Dipeptidyl peptidase I heavy chain 3; Contains:
RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
AltName: Full=Dipeptidyl peptidase I heavy chain 4;
Contains: RecName: Full=Dipeptidyl peptidase 1 light
chain; AltName: Full=Dipeptidyl peptidase I light chain;
Flags: Precursor
gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
Length = 435
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 106/201 (52%), Gaps = 26/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY G P C+
Sbjct: 252 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CK 305
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P C R + +S++Y + + + + E+ ++GP+ V+F VY+DF
Sbjct: 306 PN----DCFR--------YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFF 353
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY+ G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF+I
Sbjct: 354 HYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRI 413
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA P K
Sbjct: 414 RRGTDECAIESIAVAATPIPK 434
>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
protease B1; Flags: Precursor
Length = 303
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ +S S L++C L GCDGG W + G T EC Y D G
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
A P P QL++ + +S S P IM + GP++ VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231
Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
Y+SGVYKH G + +G HA++++G+GT+DDG DYWI+ N W WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291
Query: 199 GIEEDVVA 206
IE+++ A
Sbjct: 292 RIEDEIYA 299
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 104/188 (55%), Gaps = 20/188 (10%)
Query: 4 TRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT--- 60
+ ++R + S +S++ LS +LL+CC CG GC+GG P AW Y+ G+VT
Sbjct: 128 SMSDRICIHSKGRISIE---LSAVNLLSCCS-RCGFGCNGGIPGMAWDYWKDEGIVTGGS 183
Query: 61 ----EECDPY------FDSTGCSHPGCEPAY-PTPKCVRKCVKKNQL-WRNSKHYSISAY 108
C PY ST +H CE Y TP+C + C + + N K+Y S+Y
Sbjct: 184 NETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSY 243
Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
+ SD IM EI NGPVE +F VY+DF +YK+GVYK++TG ++GGHA++ I W
Sbjct: 244 YVTSDEVSIMKEILLNGPVEATFYVYDDFLNYKTGVYKYVTGSLLGGHAIR-ITWLGCIH 302
Query: 169 GEDYWILA 176
E Y IL
Sbjct: 303 IESYTILV 310
>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 303
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ +S S L++C L GCDGG W + G T EC Y D G
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
A P P QL++ + +S S P IM + GP++ VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231
Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
Y+SGVYKH G + +G HA++++G+GT+DDG DYWI+ N W WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291
Query: 199 GIEEDVVA 206
IE+++ A
Sbjct: 292 RIEDEIYA 299
>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
Length = 303
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 15/188 (7%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ +S S L++C L GCDGG W + G T EC Y D G
Sbjct: 126 EAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------GHTV 177
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
A P P QL++ + +S S P IM + GP++ VY D ++
Sbjct: 178 ASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYADLSY 231
Query: 140 YKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNEC 198
Y+SGVYKH G + +G HA++++G+GT+DDG DYWI+ N W WG +GYF+I RG NEC
Sbjct: 232 YESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGVNEC 291
Query: 199 GIEEDVVA 206
IE+++ A
Sbjct: 292 RIEDEIYA 299
>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
Length = 459
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 106/201 (52%), Gaps = 26/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY G P C+
Sbjct: 276 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---AGSDSP-CK 329
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P C R + +S++Y + + + + E+ ++GP+ V+F VY+DF
Sbjct: 330 PN----DCFR--------YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFF 377
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY+ G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF+I
Sbjct: 378 HYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRI 437
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA P K
Sbjct: 438 RRGTDECAIESIAVAATPIPK 458
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 76/215 (35%), Positives = 104/215 (48%), Gaps = 15/215 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ + L+ +LAC GC GG+ +AW+Y GVV EEC PY + +
Sbjct: 234 EMVQLAPQQMLACVRR--QQGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCKISNDD 291
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
T C VK N R + A+ +N++ DIMAEI G V+ VY DF
Sbjct: 292 TLITANCELP-VKVN---RTLMYKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFS 346
Query: 140 YKSGVYKHITG-----DVMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKI 191
Y+SG+Y+H + H+V+LIGWG G D YWI N W + WG +G F+I
Sbjct: 347 YRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINSWGQWWGENGRFRI 406
Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
RGSNEC IE V+A P V+ I ++
Sbjct: 407 LRGSNECDIESYVLASNPYVHEHVQAIRKVGELQE 441
>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
Length = 269
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 100/191 (52%), Gaps = 15/191 (7%)
Query: 17 VSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG 76
+ + +S S L++C L GCDGG W + G T EC Y D G
Sbjct: 89 IDKEAVSYSQQHLISCS--LENFGCDGGDFQPTWSFLTFTGATTAECVKYVDY------G 140
Query: 77 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
A P P QL++ + +S S P IM + GP++ VY D
Sbjct: 141 HTVASPCPAVCDDG-SPIQLYKAHGYGQVS----KSVPA-IMGMLVAGGPLQTMIVVYAD 194
Query: 137 FAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
++Y+SGVYKH G + +G HA++++G+GT+DDG DYWI+ N W WG +GYF+I RG
Sbjct: 195 LSYYESGVYKHTYGTINLGFHALEIVGYGTTDDGTDYWIIKNSWGPDWGENGYFRIVRGV 254
Query: 196 NECGIEEDVVA 206
NEC IE+++ A
Sbjct: 255 NECRIEDEIYA 265
>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
Length = 463
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSSQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 305
Score = 116 bits (290), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 103/187 (55%), Gaps = 17/187 (9%)
Query: 20 QNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGC 77
Q +SLSV +++C G+ GC GG S+W + G V +C PY TG S
Sbjct: 128 QAVSLSVQHMVSCDS---GEAGCQGGEFESSWAFLETEGAVKSDCLPYTSGETGKSG--- 181
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
+C C + ++ HY ++ S+ +IM + +GPV+ F V+EDF
Sbjct: 182 -------ECPTTCQDGTPV-ESAFHYKAASASRLSNYNEIMVSLLADGPVQTGFYVHEDF 233
Query: 138 AHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNE 197
+Y G+Y + G +GGHAV ++G+G+ ++ DYWI+ N W WG +GYF+I RG+NE
Sbjct: 234 LYYVGGIYHKVYGTSLGGHAVLIVGYGSMNN-HDYWIVRNSWGSDWGENGYFRILRGTNE 292
Query: 198 CGIEEDV 204
CGIE++
Sbjct: 293 CGIEKNA 299
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 107/220 (48%), Gaps = 45/220 (20%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
+N +S +++CC +LCG GCDGG +W Y+ HG V+ + C PY
Sbjct: 110 KNPIMSAQQIISCC-YLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPY------ 162
Query: 73 SHPGCE------PAYP--------TPKCVRKCVKKNQLWR------NSKHYSISAYRINS 112
+ P C+ P + TP C +KC N K+Y +S Y
Sbjct: 163 TIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDIYKGKYYKLSPYMA-- 220
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITG---DVMGGHAVKLIGWGTSDDG 169
M +I+ NGP+ F +Y D YKSGVY++ D H+VK+ GWG ++G
Sbjct: 221 -----MKDIFDNGPITTQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWG-EENG 274
Query: 170 EDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
YW++AN + WG +G FKI RG++ C +E + AGLP
Sbjct: 275 VPYWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYAGLP 314
>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
Length = 463
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 72/201 (35%), Positives = 105/201 (52%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y G+V E C PY TG P C+
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCDGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP-CK 332
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P CVR + +S+++ + + + + E+ +GP+ V+F VY DF
Sbjct: 333 PK---EDCVR--------YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYNDFL 381
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY+ G+Y H + HAV L+G+GT G DYWI+ N W WG DGYF+I
Sbjct: 382 HYRKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDPVSGMDYWIVKNSWGIGWGEDGYFRI 441
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA P K
Sbjct: 442 RRGTDECAIESIAVAATPIPK 462
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 96/177 (54%), Gaps = 22/177 (12%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCS---HPGCE 78
LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT Y D TGC +P CE
Sbjct: 25 LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQDKTGCKPYPYPPCE 82
Query: 79 -----------PA--YPTPKCVRKCVKKN--QLWRNSKHY-SISAYRINSDPEDIMAEIY 122
P+ YPT + K + + H+ +I + + I I
Sbjct: 83 HHVNGTHYKPCPSNMYPTGQNANALGKLDIALTYHKDLHFRTILHTPASKEAAGIPKGIK 142
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQW 179
+G + TV+EDF HY GVY H G +GGHAVK++GWG D+G YW++AN W
Sbjct: 143 THGQLRGGITVFEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLIANSW 198
>gi|443687066|gb|ELT90166.1| hypothetical protein CAPTEDRAFT_138389 [Capitella teleta]
Length = 446
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 96/193 (49%), Gaps = 27/193 (13%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
S D++ CC + GCDGG+P + +Y G+V E CDPY
Sbjct: 272 FSPQDIVDCCQY--SQGCDGGFPYLVGGKYAEDFGLVDESCDPYVGED------------ 317
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
RKC + R + Y + E M + GP+ VSF VY+DF HYKS
Sbjct: 318 -----RKCKSTSCSRRYATRYRYVGGYYGACNEQEMKLALQRGPLSVSFMVYDDFMHYKS 372
Query: 143 GVYKH--ITGDV----MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
GVY+H +T + HAV L+G+G +D+G YWI+ N W + WG +GYF+I RG++
Sbjct: 373 GVYRHSGLTDKYNPFEITNHAVLLVGYG-ADEGTKYWIVKNSWGKGWGEEGYFRILRGAD 431
Query: 197 ECGIEEDVVAGLP 209
EC IE V P
Sbjct: 432 ECAIESIAVETFP 444
>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
Length = 463
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
Length = 455
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 100/201 (49%), Gaps = 26/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
Q S +++C + GCDGG+P +Y G+V E+C PY TG P P
Sbjct: 272 QQPVFSPQQVVSCSQY--SQGCDGGFPYLIGKYIQDFGIVEEDCFPY---TGSDSPCNLP 326
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
A KC K + S ++ + + +M E+ KNGP+ V+ VY DF +
Sbjct: 327 A--------KCTK----YYASDYHYVGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMN 374
Query: 140 YKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
YK G+Y H TG + HAV L+G+G GE YWI+ N W WG +G+F+I
Sbjct: 375 YKEGIYHH-TGLRDANNPFELTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRI 433
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA P K
Sbjct: 434 RRGTDECAIESIAVAATPIPK 454
>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
Length = 463
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
Length = 462
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/197 (37%), Positives = 100/197 (50%), Gaps = 26/197 (13%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
LS +++C + GCDGG+P + A +Y GVV E C PY G P C P
Sbjct: 283 LSTQQIVSCSEY--SQGCDGGFPYLIAGKYVQDFGVVEENCFPYL---GHDSP-CSPK-- 334
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
C R V S ++ + + + + E+ +NGP+ V+F VY DF HY+
Sbjct: 335 --NCTRYYV--------SDYHYVGGFYGACNEALMKLELVENGPMAVAFEVYNDFIHYQK 384
Query: 143 GVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYFKIKRGS 195
GVY H + HAV L+G+GT + GE YWI+ N W WG DGYF+I RG+
Sbjct: 385 GVYHHTGLRDSFNPFEITNHAVLLVGYGTDEKTGEHYWIVKNSWGSYWGEDGYFRILRGT 444
Query: 196 NECGIEEDVVAGLPSSK 212
+ECGIE V+ P K
Sbjct: 445 DECGIESIAVSATPIPK 461
>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
Length = 463
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
Length = 193
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 100/195 (51%), Gaps = 26/195 (13%)
Query: 30 LACCGFL---CGDG--CDGGYPISAWRYFVHHGVVTEE-------CDPYF----DST--- 70
L+CC L CGDG CDG +P +++ HG+ T C PY D T
Sbjct: 2 LSCCVGLMSICGDGWGCDGSWPKDILKWWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPN 61
Query: 71 GCSHPGCEPAYPTPKCVRKCVKKNQLW----RNSKHYSISAYRINSDPEDIMAEIYKNGP 126
G + C P Y TP C +C N W + KH+ + Y + DI EI +NGP
Sbjct: 62 GTTSVPC-PGYHTPVCEERCTS-NITWPISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGP 119
Query: 127 VEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGAD 186
V SF +Y+DF YKSG+Y H GD GG K+IGWG D+G YW+ +QW +G +
Sbjct: 120 VIASFIIYDDFWDYKSGIYVHTAGDQEGGMDTKIIGWGV-DNGVPYWLCVHQWGTDFGEN 178
Query: 187 GYFKIKRGSNECGIE 201
G+ +I RG NE IE
Sbjct: 179 GFMRILRGVNEVHIE 193
>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
Length = 466
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 101/194 (52%), Gaps = 24/194 (12%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
+S D+++C + GC GG+P + A +Y G+V E C PY G P E
Sbjct: 287 MSPQDVVSCSEY--AQGCAGGFPYLIAGKYGEDFGLVEESCFPY---NGKDEPCKETK-- 339
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
KC R + +Y + + + +M E+ KNGP+ +SF VY DF HYK
Sbjct: 340 -SKCRRHST--------TNYYYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKG 390
Query: 143 GVYKHI-TGD-----VMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 195
G+Y+H GD + HAV L+G+GT G+DYWI+ N W WG +G+F+I RG
Sbjct: 391 GIYQHTGLGDSYNPWQITNHAVLLVGYGTDQKSGKDYWIVKNSWGTKWGENGFFRILRGV 450
Query: 196 NECGIEEDVVAGLP 209
+EC IE + VA P
Sbjct: 451 DECSIENEAVAVTP 464
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 90/176 (51%), Gaps = 17/176 (9%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY---PTPKCVRKCVKKNQLW 97
C GGY +W++F++ G+ E C PY + Y +C C + L
Sbjct: 154 CQGGYGYYSWKFFMNTGIPLESCVPYTKDS--------LVYGNTTNAQCRSTCTDGSPL- 204
Query: 98 RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV-MGGH 156
+ + SAY I S + EI NGPVE F VY DF YKSG+Y+ G +GGH
Sbjct: 205 --KLYKAASAYYIYSPITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGH 262
Query: 157 AVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN--ECGIEEDVVAGLPS 210
AVK++GW + +G YWI NQW SWG GYF I RG++ C + ++AG S
Sbjct: 263 AVKVLGWASDSNGTPYWIAQNQWGTSWGMGGYFYIYRGNSTLNCKFDNYMIAGTVS 318
>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
Length = 463
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 106/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLSVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 112/211 (53%), Gaps = 14/211 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGC 77
L + LS LL+C GC GG+ AW + G+V + C P+ + T C P
Sbjct: 236 LTKVDLSPQHLLSCNKGQ--RGCQGGHLSRAWTFIRKFGLVDDYCYPWTGTPTKCKIPK- 292
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
P + + + L R+ + AY+I D +DIM EI ++GPV+ + VY+DF
Sbjct: 293 RPNFDALSSICPPSLGSNL-RSELYRVGPAYKIQ-DEKDIMEEIMQSGPVQATMKVYQDF 350
Query: 138 AHYKSGVYKHITGDV----MGGHAVKLIGWGTSDD--GE--DYWILANQWNRSWGADGYF 189
YKSGVY + G H+VK++GWG + G+ YW+ AN W + WG +G+F
Sbjct: 351 FSYKSGVYTKSNTERESSNFGYHSVKILGWGEETNIYGQPIKYWLAANSWGQQWGENGFF 410
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNLVKEITS 220
KI+RG+NEC IEE V+A + + +EI +
Sbjct: 411 KIRRGTNECEIEEFVLAAWAETNDPSREIIT 441
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 96/203 (47%), Gaps = 24/203 (11%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF----DSTGCSHPG 76
N LS LL+C GC GGY AW + G V+ C PY + T
Sbjct: 245 NPRLSEQHLLSC-NIRGQRGCSGGYLDRAWYHLRRAGAVSRACYPYHSGLDEDTIMQKLR 303
Query: 77 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C AY + +C + V + + S YRI + DIM EIY+NGPV+ +F V D
Sbjct: 304 CRVAYGSSQCPERGVTSDL------YLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKND 357
Query: 137 FAHYKSGVYKHIT---------GDVMGGHAVKLIGWGTSD----DGEDYWILANQWNRSW 183
F Y GVY+++ D G H+VK++GWG + YW+ N W R+W
Sbjct: 358 FFVYNRGVYRNVKQEFTASQSDSDQAGWHSVKIVGWGIDRSDWYNPIKYWLCTNSWGRNW 417
Query: 184 GADGYFKIKRGSNECGIEEDVVA 206
G G F+I RG NEC IE V+
Sbjct: 418 GEQGMFRIVRGVNECEIESFVLG 440
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 102/211 (48%), Gaps = 36/211 (17%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
+LS +LL+C GC GG AW + G+V+ C P+ + H G PA P
Sbjct: 252 ALSPQNLLSC-NTHNQHGCRGGRLDGAWWFLRRRGLVSNNCYPFSEG---DHNGAAPAAP 307
Query: 83 ---------------TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
T C N +++ + YR++S +DIM E+ +NGPV
Sbjct: 308 CMMHSRHMGRGKRQATAHCPNSRTHANHIYQ-----ATPPYRLSSHEKDIMKELMENGPV 362
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWIL 175
+ V+EDF YKSG+YKH + G H+VK+ GWG DG+ YW
Sbjct: 363 QALLEVHEDFFLYKSGIYKHTPASLGKPERYRQHGTHSVKITGWGEEIQPDGQKVKYWTA 422
Query: 176 ANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
AN W +WG +GYF+I RG+NEC IE VV
Sbjct: 423 ANSWGPTWGENGYFRIVRGANECDIESFVVG 453
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 92/183 (50%), Gaps = 21/183 (11%)
Query: 37 CGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPG-CEPAYPTPKCVRKCVKKNQ 95
C +GC GG+ A ++ G+V++EC Y S S P C+ P
Sbjct: 117 CNNGCKGGFVGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSPI------------ 164
Query: 96 LWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 155
N+ Y ++ R +D EI NGPV +F +Y DF +K VY + +
Sbjct: 165 --SNTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVES 222
Query: 156 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVV------AGLP 209
HAV+++GWGT+ DG DYWI AN W WG GYFKI+RGS+E EE + A +P
Sbjct: 223 HAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTADTASVP 282
Query: 210 SSK 212
+S+
Sbjct: 283 TSQ 285
>gi|6562770|emb|CAB62589.1| putative cathepsin B-like protease [Pisum sativum]
Length = 206
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 49/57 (85%), Positives = 52/57 (91%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 77
++ LSVNDLLACCGFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD GCSHPGC
Sbjct: 150 DVPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 206
>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
Length = 301
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/175 (34%), Positives = 91/175 (52%), Gaps = 15/175 (8%)
Query: 30 LACCGFLCGDG-CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVR 88
+ C F GDG C+GG+ + W++ GV +C YF C+
Sbjct: 133 VVSCDF--GDGACNGGWLSNVWKFLTKTGVPKLDCLKYFSGMTGDRE---------SCIT 181
Query: 89 KCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI 148
C + + + I+ D + +M + +GP++V+F VY DF +Y SGVY+H+
Sbjct: 182 HCTDGSPVELYQASHVIN---YGMDLDRMMEALVYDGPLQVAFVVYSDFGYYSSGVYQHV 238
Query: 149 TGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
G + GGHAV+++G+G + G YWI+ N W WG GYF+I R NECGIEE
Sbjct: 239 NGMMEGGHAVEMVGYGIDESGLKYWIIRNSWGPDWGEGGYFRIIRRVNECGIEEQ 293
>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
Length = 463
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
Length = 323
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 103/190 (54%), Gaps = 18/190 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N++LS L+AC + GC+GG P AW Y G+ T EC PY G
Sbjct: 143 NVTLSPQALVAC-DDIGNQGCNGGVPQLAWEYMEWKGLPTFECYPYTAGNGTDG------ 195
Query: 81 YPTPKCVRKCVKKNQL-WRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
C R+C + + + +K +S++ + I EI GPV + VY+DF
Sbjct: 196 ----TCQRQCADGSAMTYYRAKPFSMTTC---NSVACIQNEIITYGPVVGTMMVYQDFMS 248
Query: 140 YKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGA-DGYFKIKRGSN 196
Y SGVY + T +++GGHA++++GWGT + DYWI+ N W+ +WG DGYF I+RG+N
Sbjct: 249 YSSGVYVYDGTAELLGGHAIEIVGWGTDATSKLDYWIVKNSWSAAWGGLDGYFWIQRGTN 308
Query: 197 ECGIEEDVVA 206
CGI+ D A
Sbjct: 309 MCGIDHDASA 318
>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
Length = 463
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
Length = 463
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
leucogenys]
Length = 463
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 105/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY+ G+Y H + HAV L+G+GT S G DYWI+ N W WG DGYF
Sbjct: 380 FLHYEKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 103/194 (53%), Gaps = 13/194 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+++ LS LL+C GC GGY AW + G+V +EC P+ TG + C
Sbjct: 250 EDVELSAQHLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPW---TG-RNDQCRL 304
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
+ V C K R + AYR+ ++ DIM EI +GPV+ + VY+DF
Sbjct: 305 RKRSNLNVAGCRKPPNPLRQELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFV 363
Query: 140 YKSGVYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIK 192
YK+GVY+H + G H++++IGWG YW++AN W R WG +G F+I+
Sbjct: 364 YKNGVYRHSRSAELHDSGYHSMRIIGWGEEPSYRGPPLKYWLVANSWGRHWGENGLFRIQ 423
Query: 193 RGSNECGIEEDVVA 206
RG+NEC IE V+A
Sbjct: 424 RGTNECEIESYVLA 437
>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
Length = 463
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 106/203 (52%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P ++A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLTAGKYAQDFGLVEEACFPY---TGNDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY++G+Y H + HAV L+G+GT S G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYQNGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I RG++EC IE VA P K
Sbjct: 440 RIHRGTDECAIESIAVAATPIPK 462
>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
Length = 469
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 101/202 (50%), Gaps = 27/202 (13%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYF-DSTGCSHPGC 77
Q LS +++C + GCDGG+P + A +Y GVV E+C PY T C
Sbjct: 285 QTPILSTQQIVSCSEY--SQGCDGGFPYLIAGKYTQDFGVVEEDCFPYTARDTQC----- 337
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
P +C R + S + + + + + E+ ++GP+ V+F VY DF
Sbjct: 338 ---VPKKECPR--------YYASDYQYVGGFYGGCNEALMKLELVRHGPMAVAFEVYNDF 386
Query: 138 AHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFK 190
HY+ GVY H + HAV L+G+GT G DYWI+ N W +WG DGYF+
Sbjct: 387 LHYREGVYHHTGLRDPFNPFELTNHAVLLVGYGTDPATGLDYWIVKNSWGTAWGEDGYFR 446
Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
I+RGS+EC IE VA P +
Sbjct: 447 IRRGSDECAIESIAVAATPIPR 468
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 90/164 (54%), Gaps = 17/164 (10%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGC 72
Q++ LS DL++CC CG GCDGG+P AW Y+V HG+VT C PY C
Sbjct: 61 QSVELSAIDLISCCEN-CGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPY-PFPKC 118
Query: 73 SH------PGC-EPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRINSDPEDIMAEIYKN 124
H P C + Y TP+C RKC K + + KHY + + + I EI
Sbjct: 119 EHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNESAIQKEIMMY 178
Query: 125 GPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDD 168
GPVE ++EDF +YKSG+Y++ TG +G H V++IGWG ++
Sbjct: 179 GPVEAYLLIFEDFLNYKSGIYRYTTGSFVGEHYVRIIGWGIENE 222
>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 84/170 (49%), Gaps = 15/170 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC GG W + G T EC Y D C PT C +Q+
Sbjct: 144 GCSGGDFFPTWSFLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI--- 191
Query: 100 SKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HA 157
+ Y Y +++ IM + GPV+ VY D +Y GVY+H G + G HA
Sbjct: 192 -QFYKAHGYGQVSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHA 250
Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
++++G+GT+DDG DYW + N W WG DGYF+I RG NEC IE+++ A
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
Length = 464
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 105/205 (51%), Gaps = 29/205 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
+I+RG++EC IE VA P K L
Sbjct: 440 RIRRGTDECAIESIAVAATPIPKLL 464
>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
Length = 464
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 105/205 (51%), Gaps = 29/205 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSKNL 214
+I+RG++EC IE VA P K L
Sbjct: 440 RIRRGTDECAIESIAVAATPIPKLL 464
>gi|253742315|gb|EES99155.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 303
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 84/170 (49%), Gaps = 15/170 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC GG W + G T EC Y D C PT C +Q+
Sbjct: 144 GCSGGDFFPTWSFLTQTGATTAECVKYVDYGSSVAAAC----PT-----TCDDGSQI--- 191
Query: 100 SKHYSISAY-RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG-HA 157
+ Y Y +++ IM + GPV+ VY D +Y GVY+H G + G HA
Sbjct: 192 -QFYKAHGYGQLSKSVPAIMQMLVSGGPVQTMIVVYADLLYYAGGVYRHTYGPISNGLHA 250
Query: 158 VKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
++++G+GT+DDG DYW + N W WG DGYF+I RG NEC IE+++ A
Sbjct: 251 LEMVGYGTTDDGTDYWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300
>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
Length = 463
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 106/201 (52%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY T P C
Sbjct: 279 QTPVLSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TATDSP-C- 331
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K + C + + +S+++ + + + + E+ +GPV VSF VY+DF
Sbjct: 332 ------KVKKDCFR----YYSSEYHYVGGFYGGCNEALMKLELVNHGPVVVSFEVYDDFI 381
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY G+Y H + HAV L+G+GT S G DYWI+ N W+ +WG DGYF+I
Sbjct: 382 HYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGLDYWIVKNSWSATWGEDGYFRI 441
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++ECGIE + P K
Sbjct: 442 RRGTDECGIESIALTATPIPK 462
>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
Length = 316
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 132 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 183
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 184 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 232
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 233 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 292
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 293 RIRRGTDECAIESIAVAATPIPK 315
>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
Length = 345
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 104/185 (56%), Gaps = 22/185 (11%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+ S D+++C L C+GGY S+ +Y GVV+E+C Y + G S
Sbjct: 168 NMQFSRQDMVSCD--LGNAACNGGYLSSSVQYLQTEGVVSEQCLAYASADGNS------- 218
Query: 81 YPTPKCVRKCVKKNQLWRN--SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P+C +C K+ ++ K+ S+ +I + EDI EIY NGPV V F VY+DF+
Sbjct: 219 --VPRCNYRCDDKSLEYKKYGCKYNSM---KILTTYEDIKEEIYTNGPVMVGFVVYDDFS 273
Query: 139 HYKSGVYKHITGDVM--GGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSN 196
Y +G+Y+ +T D + GGHAV L GWG D+G YWI NQW +WG G+F+I G
Sbjct: 274 SYSTGIYE-VTPDSVEEGGHAVTLNGWGY-DNGRLYWIGQNQWQNTWGESGFFRIYAG-- 329
Query: 197 ECGIE 201
E GI+
Sbjct: 330 EAGID 334
>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
Length = 463
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y GVV E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSKY--AQGCEGGFPYLIAGKYAQDFGVVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY+ G+Y H + HAV L+G+GT S G YWI+ N W SWG DGYF
Sbjct: 380 FLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGIHYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 85/159 (53%), Gaps = 18/159 (11%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGC---SHPGC 77
N SLS DLL+CC CG GC GGYP AW Y+ HG+VT D +GC P C
Sbjct: 19 NKSLSAVDLLSCCEN-CGFGCRGGYPAVAWDYWKTHGIVTGGSKE--DPSGCRSYPFPKC 75
Query: 78 E------------PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
E YPTP+CV++C + + K + +Y I + IM EI G
Sbjct: 76 EHHVQGHYPPCPRELYPTPECVQQCDTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRG 135
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
PVE FT+YEDF Y SGVY H G M GHAV+++GWG
Sbjct: 136 PVEAIFTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWG 174
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 90/155 (58%), Gaps = 16/155 (10%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
+S DLL+CC CG GC GG+P AW +++ +G+VT C Y CSH
Sbjct: 22 ISATDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGGSKENPSGCRSY-PFPRCSHHG 79
Query: 75 ----PGC-EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C + + TP CV C K + + K ++ S+Y + S+ IM EI +NGPVE
Sbjct: 80 KGKYPPCPKTIFDTPNCVDHCDKPDIDYAADKTHAKSSYNVQSNERVIMKEIMRNGPVEA 139
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
+F VYEDF YKSG+Y H G ++GGHA++++GWG
Sbjct: 140 AFMVYEDFIEYKSGIYFHSHGKLLGGHAIRMLGWG 174
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/224 (36%), Positives = 107/224 (47%), Gaps = 31/224 (13%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ ++ LS +LL+C GC GG AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSMGHMTPVLSPQNLLSCDTHH-QQGCQGGRLDGAWWFLRRRGVVSDHCY 293
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVK-KNQLWR---------NSKHYSISAYRINSDP 114
P+ +G PA P R + K Q R N + AYR+ SD
Sbjct: 294 PF---SGHEQAEAGPATPCMMHSRAMGRGKRQATRRCPNSHDDANEIYQVTPAYRLGSDE 350
Query: 115 EDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWGTS 166
++IM E+ +NGPV+ VYEDF YKSG+Y H + G H+VK+ GWG
Sbjct: 351 KEIMKELMENGPVQALMEVYEDFFLYKSGIYSHTLVSMGRPEQYRRHGTHSVKITGWGEE 410
Query: 167 --DDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W SWG GYF+I RGSNEC IE V+
Sbjct: 411 MLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFVLG 454
>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
Length = 463
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|1582221|prf||2118248A prepro-cathepsin C
Length = 463
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
C
Length = 441
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 255 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 306
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 307 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 355
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 356 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 415
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 416 RIRRGTDECAIESIAVAATPIPK 438
>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
Length = 446
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 262 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 313
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 314 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 362
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 363 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 422
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 423 RIRRGTDECAIESIAVAATPIPK 445
>gi|6562768|emb|CAB62588.1| putative cathepsin B-like protease [Pisum sativum]
Length = 166
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 49/57 (85%), Positives = 52/57 (91%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC 77
++ LSVNDLLACCGFLCG GCDGGYPISAW+YF HHGVVTEECDPYFD GCSHPGC
Sbjct: 110 DVPLSVNDLLACCGFLCGSGCDGGYPISAWKYFAHHGVVTEECDPYFDQIGCSHPGC 166
>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
Length = 463
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
Length = 463
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
Length = 463
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|426252217|ref|XP_004019812.1| PREDICTED: dipeptidyl peptidase 1, partial [Ovis aries]
Length = 455
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 105/201 (52%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E+C PY TG P C
Sbjct: 271 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP-C- 323
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K C + + +S+++ + + + + E+ GP+ V+F VY DF
Sbjct: 324 ------KLKEGCFR----YYSSEYHYVGGFYGGCNEALMKLELVHRGPMAVAFEVYNDFL 373
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY+ GVY H + HAV L+G+GT + G DYWI+ N W SWG DGYF+I
Sbjct: 374 HYRQGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGEDGYFRI 433
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A P K
Sbjct: 434 RRGTDECAIESIALAATPIPK 454
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 100/199 (50%), Gaps = 14/199 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N SLS LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 306 NASLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 363
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF
Sbjct: 364 PKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 423
Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
Y GVY+H + G H+V+++GWG ++ YW+ AN W WG DGY
Sbjct: 424 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 483
Query: 189 FKIKRGSNECGIEEDVVAG 207
FKI RG N C IE V+
Sbjct: 484 FKILRGENHCEIESFVIGA 502
>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
Length = 461
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 103/201 (51%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY +
Sbjct: 277 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPYMGAD-------F 327
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C R + +S ++ + + + + E+ +GP+ V+F VY+DF
Sbjct: 328 PCKPKKDCFR--------YYSSDYHYVGGFYGGCNEALMKLELVHHGPIAVAFQVYDDFF 379
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY++G+Y H + HAV L+G+GT + G DYWI+ N W WG +GYF+I
Sbjct: 380 HYRTGIYYHTGLRDPFNPFELTNHAVLLVGYGTDTASGMDYWIVKNSWGAGWGENGYFRI 439
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA P K
Sbjct: 440 RRGTDECAIESIAVAATPVPK 460
>gi|290975817|ref|XP_002670638.1| predicted protein [Naegleria gruberi]
gi|284084199|gb|EFC37894.1| predicted protein [Naegleria gruberi]
Length = 528
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 62/198 (31%), Positives = 99/198 (50%), Gaps = 28/198 (14%)
Query: 25 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 84
S ++++C + GCDGG+ ++ G++ E+CDPY TG H
Sbjct: 347 SPENIISCSFY--SQGCDGGFAYLISKWGEDFGIIAEQCDPY---TGTPH---------- 391
Query: 85 KC-VRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
KC + + Q W N ++ Y E++ ++ K GP+ VS VY D +Y SG
Sbjct: 392 KCNLNQACSTRQYWTNYRY--TGGYYGAVTVENMQLDVLKYGPLSVSMEVYNDLFNYHSG 449
Query: 144 VYKHITGDVMG----------GHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
+Y+H++ + H V ++GWG ++ GE YWI+ N W S+G DGYF I R
Sbjct: 450 IYRHVSSSKLTSPVPNPFELTNHVVLIVGWGENEKGEKYWIVKNSWGTSFGMDGYFLIAR 509
Query: 194 GSNECGIEEDVVAGLPSS 211
G +EC IE + + +P+
Sbjct: 510 GVDECAIESENASAIPTQ 527
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 100/209 (47%), Gaps = 34/209 (16%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
LS +LL+C L GC GG+ AW + GVV++ C P+ A P
Sbjct: 255 LSPQNLLSC-DTLHQQGCRGGHLDGAWWFLRRRGVVSDHCYPFSGREQAE------AGPA 307
Query: 84 PKCV--------------RKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ R+C + N + AYR+ SD ++IM E+ +NGPV+
Sbjct: 308 PPCMMHSRAMGRGKRQATRRCPNSHTD-ANDIYQVTPAYRLGSDEKEIMKELMENGPVQA 366
Query: 130 SFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSDDGE--DYWILAN 177
V+EDF YK G+Y H + G H+VK+ GWG T DG YW AN
Sbjct: 367 LMEVHEDFFLYKGGIYSHTPLSMARPEQYRRHGTHSVKITGWGEETLPDGRTLKYWTAAN 426
Query: 178 QWNRSWGADGYFKIKRGSNECGIEEDVVA 206
W SWG G+F+I RGSNEC IE V+
Sbjct: 427 SWGPSWGERGHFRILRGSNECDIESFVLG 455
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 72/198 (36%), Positives = 100/198 (50%), Gaps = 14/198 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N SLS LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 246 NASLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 303
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF
Sbjct: 304 PKRDYTDRRGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 363
Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
Y GVY+H + G H+V+++GWG ++ YW+ AN W WG DGY
Sbjct: 364 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDGY 423
Query: 189 FKIKRGSNECGIEEDVVA 206
FKI RG N C IE V+
Sbjct: 424 FKILRGDNHCEIESFVIG 441
>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
Length = 463
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 103/203 (50%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCAGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CTVKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYF 189
F HY+ G+Y H + HAV L+G+GT G DYWI+ N W SWG DGYF
Sbjct: 380 FLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 71/195 (36%), Positives = 103/195 (52%), Gaps = 30/195 (15%)
Query: 20 QNLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
+N+ +S LL+C L G GC+GG A+ + HG+V+E+C PY
Sbjct: 232 ENVRMSSQTLLSC--HLKGQRGCNGGNLDIAFDFVKTHGLVSEQCFPY------------ 277
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
V + ++ + + Y + S EDIM +I +GP TVY+DF
Sbjct: 278 ---------EGAVTQCRIGNDCRRYRVGVPFSISKEEDIMYDIMTSGPALGIMTVYQDFF 328
Query: 139 HYKSGVYKHIT-GDVM--GGHAVKLIGWGTSDDGED-YWILANQWNRSWGADGYFKIKRG 194
HY+ G+Y+H GD + G H+V+++GWG +D ED YWI+AN W SWG GYF+I RG
Sbjct: 329 HYREGIYRHTRHGDQLMRGLHSVRIVGWG--EDAEDKYWIVANSWGTSWGEKGYFRIARG 386
Query: 195 SNECGIEEDVVAGLP 209
+ GIE V+ LP
Sbjct: 387 HSGTGIESSVLTVLP 401
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 112/227 (49%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG+ SAW + GVV++ C
Sbjct: 114 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCY 172
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P F G + G P P+C+ R+ + +Q+ N + AYR+
Sbjct: 173 P-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLG 226
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S ++IM E+ +NGPV+ V+EDF Y++G+Y H + G H+VK+ GW
Sbjct: 227 SSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGW 286
Query: 164 GTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 287 GEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 333
>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
Length = 455
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 97/202 (48%), Gaps = 28/202 (13%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF--DSTGCSHPGC 77
Q+ LS +++C + GCDGG+P +Y G+V E C PY DS C
Sbjct: 272 QSPVLSPQQVVSCSEY--SQGCDGGFPYLTGKYVQDFGIVDESCFPYMGKDSPCGISQSC 329
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
Y +++ + + +M E+ KNGP+ V+ VY DF
Sbjct: 330 RRGYA-----------------AEYKYVGGFYGGCSEAAMMVELVKNGPMAVALEVYSDF 372
Query: 138 AHYKSGVYKH--ITGDV----MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFK 190
YK G+Y H +T V + HAV L+G+G G+ YWI+ N W SWG DGYF+
Sbjct: 373 MSYKGGIYHHTGLTDHVNPFELTNHAVLLVGYGRCHMTGQKYWIVKNSWGSSWGEDGYFR 432
Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
I+RGS+EC IE VA P K
Sbjct: 433 IRRGSDECAIESIAVAASPIPK 454
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 109/227 (48%), Gaps = 38/227 (16%)
Query: 10 ALSSSPYVSLQNL-----SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S +S+Q++ LS +L++C DGC GG AW + GVVT++C
Sbjct: 232 AAVASDRISIQSMGHMTPQLSPQNLISC-DTRHQDGCAGGRIDGAWWFMRRRGVVTQDCY 290
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKC-------------VKKNQLWRNSKHYSISAYRIN 111
P+ P + A +C+ + + + N + S YR++
Sbjct: 291 PF-------SPPEQSAVEVARCMMQSRAVGRGKRQATAHCPNSHSYHNDIYQSTPPYRLS 343
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGW 163
++ +IM EI NGPV+ V+EDF YKSG+++H + H+V++ GW
Sbjct: 344 TNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGW 403
Query: 164 GTSDD----GEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G D YWI AN W ++WG DGYF+I RG NEC IE V+
Sbjct: 404 GEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIETFVIG 450
>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
Length = 462
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 107/201 (53%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY TG P C
Sbjct: 278 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEESCFPY---TGTDAP-C- 330
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K + C++ + +S+++ + + + + E+ +GP+ V+F VY+DF
Sbjct: 331 ------KMKKDCIR----YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
HY+ G+Y+H + HAV L+G+GT G DYWI+ N W SWG DG+F+I
Sbjct: 381 HYQKGIYQHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGFFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG +EC IE +A P K
Sbjct: 441 RRGIDECSIESIAMAATPIPK 461
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 103/203 (50%), Gaps = 20/203 (9%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY----FDSTGCSHPGCE 78
+LS +LL+C GC GG AW + G+V+ C P+ D+T + P
Sbjct: 251 ALSPQNLLSC-DTHNQKGCRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMM 309
Query: 79 PAYPTPKCVRKCVK---KNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
+ + R+ ++ N + + YR++SD +DIM E+ +NGPV+ V+E
Sbjct: 310 HSRSMGRGKRQATAHCPNSRAHANHIYQATPPYRLSSDEKDIMKELMENGPVQALMEVHE 369
Query: 136 DFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSDDGE--DYWILANQWNRSW 183
DF YKSG+YKH + G H+VK+ GWG DG+ YW AN W +W
Sbjct: 370 DFFLYKSGIYKHTPASLGKPARYRQHGTHSVKITGWGEERQPDGQRLKYWTAANSWGPTW 429
Query: 184 GADGYFKIKRGSNECGIEEDVVA 206
G G+F+I RG+NEC IE VV
Sbjct: 430 GEKGHFRILRGANECDIESFVVG 452
>gi|290984292|ref|XP_002674861.1| cathepsin C [Naegleria gruberi]
gi|284088454|gb|EFC42117.1| cathepsin C [Naegleria gruberi]
Length = 569
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 101/202 (50%), Gaps = 30/202 (14%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
L+V D+++C + C GG P + R+ +V E C PY S +
Sbjct: 374 LAVQDIVSCSPY--AQKCHGGIPYAVGRHLRDFNLVPESCFPYKGSENVA---------- 421
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
C KC + + +K+ +S Y S+ ++M EIY++GP+ S+ +Y DF +Y G
Sbjct: 422 --CSSKCKNPEYIVKVTKYRYVSDYYGGSNYANMMKEIYEHGPISASYLIYPDFKYYSKG 479
Query: 144 VYKH-----------ITGDVMG----GHAVKLIGWGTS-DDGEDYWILANQWNRSWGADG 187
+YKH I ++ G H+V + GWG GE YW + N W+ SWG +G
Sbjct: 480 IYKHSGKGYPMKTDRINREMNGWEPTTHSVVITGWGEDPKTGEKYWNVLNSWSESWGENG 539
Query: 188 YFKIKRGSNECGIEEDVVAGLP 209
F+IKRG++EC IE + VA P
Sbjct: 540 RFRIKRGNDECAIEAEGVAFYP 561
>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
niloticus]
Length = 455
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 100/199 (50%), Gaps = 28/199 (14%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAY 81
+LS +++C + GCDGG+P +Y G+V E C PY +T C P
Sbjct: 275 TLSPQQVVSCSEY--SQGCDGGFPYLIGKYTQDFGIVDESCFPYVGQNTPCGVP------ 326
Query: 82 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
+K Q +++ + + +M E+ KNGP+ V+F VY DF +YK
Sbjct: 327 ----------QKCQRIYAAEYNYVGGFYGGCSEAAMMLELVKNGPMAVAFEVYPDFMNYK 376
Query: 142 SGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKR 193
G+Y H TG + HAV L+G+G G++YWI+ N W WG +GYF+I+R
Sbjct: 377 EGIYHH-TGLADPFNPFELTNHAVLLVGYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRR 435
Query: 194 GSNECGIEEDVVAGLPSSK 212
G++EC IE VA P K
Sbjct: 436 GNDECAIESIAVAANPIPK 454
>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
Length = 460
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 103/203 (50%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y GVV E C PY TG P
Sbjct: 276 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGVVEEACFPY---TGTDSP--- 327
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 328 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 376
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY G+Y H + HAV L+G+GT S G YWI+ N W SWG DGYF
Sbjct: 377 FLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGIHYWIVKNSWGTSWGEDGYF 436
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 437 RIRRGTDECAIESIAVAATPIPK 459
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 97/195 (49%), Gaps = 17/195 (8%)
Query: 24 LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-EECDP---YFDSTGC--- 72
LSV +CC G GC GG + + +HG+VT +E P + GC
Sbjct: 91 LSVGYFTSCCNPANGCPKAKGCQGGNLLEGLNFLKNHGIVTGDEFKPAGQLSSADGCWPY 150
Query: 73 SHPGCEPA-YPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P C+ A Y +P C KC K + H + S R+ + P++I EI+ NGPV
Sbjct: 151 PFPKCKHAGYSSPACQTKCTNKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEIFTNGPVIG 210
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYF 189
++YED YK+GVY H TG G H +K+IGWG + G+DYW+ N WN WG G
Sbjct: 211 MLSIYEDIRVYKAGVYVHQTGSFQGIHTLKIIGWGV-ESGQDYWLAVNSWNEEWGDHGMI 269
Query: 190 KIKRGSNECGIEEDV 204
K+ G GIE V
Sbjct: 270 KLAVGRT--GIENSV 282
>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
niloticus]
Length = 461
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 100/199 (50%), Gaps = 28/199 (14%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAY 81
+LS +++C + GCDGG+P +Y G+V E C PY +T C P
Sbjct: 281 TLSPQQVVSCSEY--SQGCDGGFPYLIGKYTQDFGIVDESCFPYVGQNTPCGVP------ 332
Query: 82 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
+K Q +++ + + +M E+ KNGP+ V+F VY DF +YK
Sbjct: 333 ----------QKCQRIYAAEYNYVGGFYGGCSEAAMMLELVKNGPMAVAFEVYPDFMNYK 382
Query: 142 SGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKR 193
G+Y H TG + HAV L+G+G G++YWI+ N W WG +GYF+I+R
Sbjct: 383 EGIYHH-TGLADPFNPFELTNHAVLLVGYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRR 441
Query: 194 GSNECGIEEDVVAGLPSSK 212
G++EC IE VA P K
Sbjct: 442 GNDECAIESIAVAANPIPK 460
>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
Length = 463
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ GP+ V+F VY+D
Sbjct: 331 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY+ GVY H + HAV L+G+GT + G DYWI+ N W SWG +GYF
Sbjct: 380 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE +A P K
Sbjct: 440 RIRRGTDECAIESIALAATPIPK 462
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 101/201 (50%), Gaps = 20/201 (9%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGC--- 77
N SLS LL+C GC+GGY AW Y GVV + C PY C
Sbjct: 250 NASLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIP 308
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYED 136
+ Y + +R C +Q +S + ++ Y+++S EDI E+ NGPV+ +F V+ED
Sbjct: 309 KRDYTNRQGLR-CPSGDQ---DSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHED 364
Query: 137 FAHYKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGA 185
F Y GVY+H + G H+V+++GWG ++ YW+ AN W WG
Sbjct: 365 FFMYAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGE 424
Query: 186 DGYFKIKRGSNECGIEEDVVA 206
DGYFKI RG N C IE V+
Sbjct: 425 DGYFKILRGENHCEIESFVIG 445
>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
gigas]
Length = 464
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/163 (39%), Positives = 81/163 (49%), Gaps = 15/163 (9%)
Query: 49 AWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAY 108
AW + G++TEEC PY S G C T C N Y Y
Sbjct: 269 AWWFVKRRGIITEECYPYTASDG----ECLDGETT------CPNANSSTAKIVLYVTPPY 318
Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGH-AVKLIGWG--- 164
R+ D EDI AEIY+NGPV+ +F V DF Y+SGVY+H D+ +V++IGWG
Sbjct: 319 RVRQDEEDIKAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIGWGEKT 378
Query: 165 -TSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
YWI N W WG G F+I RG N GIEE+V+A
Sbjct: 379 NKKGKKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEENVLA 421
>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
Length = 356
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 105/201 (52%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y GVV E C PY TG P C
Sbjct: 172 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGVVEEGCFPY---TGTDSP-C- 224
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K + C + + +S +Y + + + I E+ +GP+ V+F VY DF
Sbjct: 225 ------KLKKDCFR----YYSSDYYYVGGFYGGCNEALIKLELVHHGPMAVAFEVYNDFL 274
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY G+Y H + HAV L+G+GT S G+DYWI+ N W SWG DGYF+I
Sbjct: 275 HYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRI 334
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A P K
Sbjct: 335 RRGTDECAIESIAMAATPIPK 355
>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
Length = 463
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ GP+ V+F VY+D
Sbjct: 331 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY+ GVY H + HAV L+G+GT + G DYWI+ N W SWG +GYF
Sbjct: 380 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE +A P K
Sbjct: 440 RIRRGTDECAIESIALAATPIPK 462
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 94/200 (47%), Gaps = 13/200 (6%)
Query: 21 NLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N LS +L+ C G G G + W Y HG+V+ Y + GC P
Sbjct: 136 NQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPP 191
Query: 80 AYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
P C +C N + H +S Y EDI E+ GPV V F V
Sbjct: 192 IGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRV 251
Query: 134 YEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
Y+DF YKSGVY + + H KLIGWG ++G DYW+L N W WG +G FKIK
Sbjct: 252 YDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWLLVNSWGNEWGQNGLFKIK 310
Query: 193 RGSNECGIEEDVVAGLPSSK 212
RG+NE +E+ V AG P K
Sbjct: 311 RGTNEVHVEDYVYAGEPEIK 330
>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
Length = 458
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 105/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 274 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 325
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ GP+ V+F VY+D
Sbjct: 326 -----------CRLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD 374
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HY+ GVY H + HAV L+G+GT + G DYWI+ N W SWG +GYF
Sbjct: 375 FLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYF 434
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE +A P K
Sbjct: 435 RIRRGTDECAIESIALAATPIPK 457
>gi|6449324|gb|AAF08932.1|AF195117_1 tubulointerstitial nephritis antigen isoform TIN2 [Homo sapiens]
Length = 333
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/171 (37%), Positives = 89/171 (52%), Gaps = 26/171 (15%)
Query: 58 VVTEECDPYFDSTGCSHPGCEPAY---------PTPKCVRKCVKKNQLWRNSKHYSISAY 108
+V+ C P F ++ GC A T C K N++++ S Y
Sbjct: 158 LVSHACYPLFKDQNATNNGCAMASRSDGRGKRDATKPCPNNVEKSNRIYQCS-----PPY 212
Query: 109 RINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKL 160
R++S+ +IM EI +NGPV+ V EDF HYK+G+Y+H+T + HAVKL
Sbjct: 213 RVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKL 272
Query: 161 IGWGT----SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
GWGT E +WI AN W +SWG +GYF+I RG NE IE+ V+A
Sbjct: 273 TGWGTRRGAQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 323
>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
Length = 463
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 103/203 (50%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 279 QTPILSPQEVVSCSQH--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + HAV L+G+GT S G DYWI+ N W WG +GYF
Sbjct: 380 FLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
Length = 463
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 104/203 (51%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E+C PY TG P
Sbjct: 279 QTPILSPQEIVSCSQY--AQGCNGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSP--- 330
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 331 -----------CKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 379
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 189
F HY G+Y H + HAV L+G+GT G DYWI+ N W SWG +GYF
Sbjct: 380 FLHYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDPATGVDYWIVKNSWGTSWGENGYF 439
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE VA P K
Sbjct: 440 RIRRGTDECAIESIAVAATPIPK 462
>gi|159114116|ref|XP_001707283.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157435387|gb|EDO79609.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 332
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 88/171 (51%), Gaps = 14/171 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC GG + W + HG T EC Y D+ C PA + VK +
Sbjct: 169 GCAGGTSFNVWTFLTEHGTTTLECVRYTDADKDLSSPC-PALCDDGSEIQLVKADGCLDY 227
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
S + + IM + +GPV+ +VY DF +Y+ GVYKH+ G + HAV+
Sbjct: 228 SGNVTA-----------IMQTLANDGPVQAVMSVYRDFLYYRGGVYKHVYGIQISSHAVE 276
Query: 160 LIGWGTSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGL 208
+IG+GT+DD E YWI+ N +WG +GYF I RGSNEC IE V +GL
Sbjct: 277 IIGYGTTDDEERIPYWIVKNSLGPNWGEEGYFNIVRGSNECDIESAVYSGL 327
>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
Length = 478
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 105/201 (52%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y GVV E C PY TG P C
Sbjct: 294 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGVVEEGCFPY---TGTDSP-C- 346
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K + C + + +S +Y + + + I E+ +GP+ V+F VY DF
Sbjct: 347 ------KLKKDCFR----YYSSDYYYVGGFYGGCNEALIKLELVHHGPMAVAFEVYNDFL 396
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY G+Y H + HAV L+G+GT S G+DYWI+ N W SWG DGYF+I
Sbjct: 397 HYHDGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRI 456
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A P K
Sbjct: 457 RRGTDECAIESIAMAATPIPK 477
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 99/190 (52%), Gaps = 13/190 (6%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
LS LL+C GC GGY AW + G+V +EC P+ + C+ +
Sbjct: 254 LSAQQLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWSGK----NDQCKLRKRS 308
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
C K + R + AYR+ ++ DIM EI +GPV+ + VY+DF YKSG
Sbjct: 309 TLKAAGCRKPSHPLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFIYKSG 367
Query: 144 VYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIKRGSN 196
+Y+H + G H+V++IGWG YW++AN W +WG +G FKI++G+N
Sbjct: 368 IYRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVANSWGYNWGDNGLFKIQKGTN 427
Query: 197 ECGIEEDVVA 206
EC IE V+A
Sbjct: 428 ECEIESYVLA 437
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 204 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWFLRRRGVVSDHCY 262
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDI 117
P+ D G + + P + R+ + NQ+ N + AYR+ S+ ++I
Sbjct: 263 PFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVTPAYRLGSNEKEI 322
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF Y+SG+Y H + G H+VK+ GWG T
Sbjct: 323 MKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLP 382
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 383 DGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 423
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 112/227 (49%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG+ SAW + GVV++ C
Sbjct: 234 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DKRNQQGCQGGHLDSAWWFLRRRGVVSDHCY 292
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P F G + G P P+C+ R+ + +Q+ N + AYR+
Sbjct: 293 P-FSGQGRTETG-----PAPRCMMHSRAMGRGKRQATARCPNHQVHANDIYQVTPAYRLG 346
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S ++IM E+ +NGPV+ V+EDF Y++G+Y H + G H+VK+ GW
Sbjct: 347 SSEKEIMKELMENGPVQALMEVHEDFFLYQNGIYSHTPVSLGRPERYRRHGTHSVKITGW 406
Query: 164 GTSD--DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 407 GEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 453
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 110/221 (49%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCHGGRLDGAWWFLRRRGVVSDHCY 293
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK---NQLWRNSKHYSISAYRINSDPEDI 117
P+ D G + + P + R+ + NQ+ N + AYR+ S+ ++I
Sbjct: 294 PFSGQERDKAGPAPLCMMHSRPMGRGKRQATARCPNNQVQANDIYQVTPAYRLGSNEKEI 353
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF Y+SG+Y H + G H+VK+ GWG T
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLQRPEGYRRHGTHSVKITGWGEETLP 413
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 414 DGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454
>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
Length = 458
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/213 (33%), Positives = 105/213 (49%), Gaps = 32/213 (15%)
Query: 8 RDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 66
R LS P +S Q + ++C + GC+GG+P + A +Y +G+V E PY
Sbjct: 269 RSQLSQKPILSPQQV-------VSCSNY--SQGCEGGFPYLIAGKYVSDYGIVEESDLPY 319
Query: 67 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
TG P C K Q + ++++ + + + + E+ GP
Sbjct: 320 ---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGGCNEAYMKLELVLGGP 364
Query: 127 VEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQW 179
+ V+F VY+DF HY+SGVY H + HAV L+G+GT GE YWI+ N W
Sbjct: 365 LSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSW 424
Query: 180 NRSWGADGYFKIKRGSNECGIEEDVVAGLPSSK 212
SWG GYF+I+RG++EC IE V+ P K
Sbjct: 425 GESWGEKGYFRIRRGTDECAIESIAVSAEPIIK 457
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 99/191 (51%), Gaps = 14/191 (7%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
LS LL+C GCDGGY AW + G+V E+C P+ + C+ T
Sbjct: 248 LSAQHLLSC-NKKGQRGCDGGYLDRAWLFMRKFGLVDEQCYPWKGV----YEQCKLQKRT 302
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
C R + AYR+ ++ DIM EI +GPV+ + VY+DF Y+SG
Sbjct: 303 NLEAAGCRAPANPLRKELYKVGPAYRLGNE-TDIMREILTSGPVQATMKVYQDFFSYESG 361
Query: 144 VYKHITGDVM---GGHAVKLIGWG---TSDDGE--DYWILANQWNRSWGADGYFKIKRGS 195
+Y H + G H+V++IGWG ++D G YW++ N W + WG +G F+I+RG
Sbjct: 362 IYMHTPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWLVVNSWGQEWGENGLFRIRRGI 421
Query: 196 NECGIEEDVVA 206
NEC IE VVA
Sbjct: 422 NECDIESFVVA 432
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 98/194 (50%), Gaps = 13/194 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
++ LS LL+C GC GGY AW + G+V ++C P+ G C+
Sbjct: 308 EDAELSAQHLLSC-NNRGQQGCRGGYLDRAWLFMRKFGLVDKDCYPWTGKNG----QCKL 362
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
C K R + AYR+ ++ DIM EI +GPV+ + VY+DF
Sbjct: 363 RKRNNLQAAGCRKPPNPLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFV 421
Query: 140 YKSGVYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIK 192
YK+G+Y+H + G H+V++IGWG YW++ N W +WG +G FKI+
Sbjct: 422 YKNGIYRHSQSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVVNSWGYNWGENGLFKIQ 481
Query: 193 RGSNECGIEEDVVA 206
RG+NEC IE V+A
Sbjct: 482 RGTNECEIESYVLA 495
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 94/200 (47%), Gaps = 13/200 (6%)
Query: 21 NLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N LS +L+ C G G G + W Y HG+V+ Y + GC P
Sbjct: 136 NQLLSTEELIFCGGIKTKQSGAVRGDDV--WEYLKSHGLVS--GGKYNTNDGCQPSKIPP 191
Query: 80 AYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
P C +C N + H +S Y EDI E+ GPV V F V
Sbjct: 192 IGNIPTHLYNHTCEERCYGNNTIHYYHDHVKVSHYYNIKSNEDIQKEVQTYGPVSVKFRV 251
Query: 134 YEDFAHYKSGVYKHITGDV-MGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
Y+DF YKSGVY + + H KLIGWG ++G DYW+L N W WG +G FKIK
Sbjct: 252 YDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGV-ENGVDYWLLVNFWGNEWGQNGLFKIK 310
Query: 193 RGSNECGIEEDVVAGLPSSK 212
RG+NE +E+ V AG P K
Sbjct: 311 RGTNEVHVEDYVYAGEPEIK 330
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 105/218 (48%), Gaps = 21/218 (9%)
Query: 1 MSVTR--TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 58
+S TR ++R AL S ++ LS LL+C C GGY AW Y G+
Sbjct: 231 ISATRVASDRFALMSK---GADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLYMRKFGL 286
Query: 59 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 118
V E+C P+ + C+ T C R + AYR+ ++ DIM
Sbjct: 287 VDEDCYPWEGTNA----QCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGNE-TDIM 341
Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGE----- 170
EI +GPV+ + VY+DF Y+SG+YKH G H+V++IGWG
Sbjct: 342 YEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHHNL 401
Query: 171 --DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
YW++ N W + WG G F+I+RG+NEC IE VVA
Sbjct: 402 PIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
Length = 466
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 104/201 (51%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q+ LS ++++C + GC+GG+P + A +Y G+V E C PY TG P C
Sbjct: 282 QSPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP-C- 334
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K C++ + S+++ + + + + E+ +GP+ V+F VY+DF
Sbjct: 335 ------KMKEDCIR----YYTSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFL 384
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
HY G+Y H + HAV L+G+GT G DYWI+ N W SWG GYF+I
Sbjct: 385 HYNQGIYHHTGLKDPFNPFELTNHAVLLVGYGTDPKTGLDYWIVKNSWGTSWGEQGYFRI 444
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A P K
Sbjct: 445 RRGTDECAIESIAMAATPIPK 465
>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
Cysteine Protease Of The Papain Family
Length = 438
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 254 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 304
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S++Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 305 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 356
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G DYWI+ N W WG GYF+I
Sbjct: 357 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 416
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A +P K
Sbjct: 417 RRGTDECAIESIAMAAIPIPK 437
>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
Length = 462
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S++Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G DYWI+ N W WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A +P K
Sbjct: 441 RRGTDECAIESIAMAAIPIPK 461
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 98/203 (48%), Gaps = 20/203 (9%)
Query: 23 SLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP 82
SLS +LL+C GC+GG AW + G+V+++C P + P + P
Sbjct: 107 SLSPQNLLSC-NTRHQQGCNGGRLDRAWSFLRRRGLVSDKCYPLASQNSIAEPCRMYSRP 165
Query: 83 TPKCVRKCV-------KKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYE 135
+ R+ + + N + S YR++S+ +DIM EI +NGPV+ V+E
Sbjct: 166 MGRGKRQATGPCPNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALMEVHE 225
Query: 136 DFAHYKSGVYKHITGD--------VMGGHAVKLIGWG--TSDDGE--DYWILANQWNRSW 183
DF YK G+Y+H G H+VK+ GWG +G +W AN W +W
Sbjct: 226 DFFLYKDGIYRHTPASNGKPPQFRRQGTHSVKITGWGEELQPNGRRVKFWRAANSWGPTW 285
Query: 184 GADGYFKIKRGSNECGIEEDVVA 206
G G F+I RG NEC IE VV
Sbjct: 286 GEGGSFRILRGCNECDIESFVVG 308
>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
Peptide, 462 aa]
Length = 462
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S++Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPKENCLR--------YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G DYWI+ N W WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE +A +P K
Sbjct: 441 RRGTDECAIESIAMAAIPIPK 461
>gi|253747738|gb|EET02294.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 305
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 87/166 (52%), Gaps = 13/166 (7%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFD-STGCSHPGCEPAYPTPKCVRKCVKKNQLWR 98
GC GG ++W + G + +C PY TG S +C C + L
Sbjct: 146 GCQGGGFNTSWAFLETEGAIMRDCLPYVSGETGLS----------GECPTTC-QDGTLLN 194
Query: 99 NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAV 158
++ HY + + +IM + GPV+ F V+EDF +Y G+Y G +GGHAV
Sbjct: 195 DTIHYKAVSASHLKNYNEIMTSLLNEGPVQTGFYVHEDFLYYVGGIYHKTYGSSIGGHAV 254
Query: 159 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDV 204
++G+G+ ++ DYWI+ N W WG +GYF+I RG+NECGIE +
Sbjct: 255 LIVGYGSMNN-HDYWIVRNSWGSDWGENGYFRILRGTNECGIENNA 299
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 62/158 (39%), Positives = 89/158 (56%), Gaps = 16/158 (10%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT-------EECDPYFDSTGCSH-- 74
+S DLL+CC CG GC GG+P AW +++ +G+VT C Y C+H
Sbjct: 22 ISSTDLLSCCE-SCGFGCHGGFPPRAWDFWMENGLVTGGSKENPSGCRSY-PFPKCNHHG 79
Query: 75 -----PGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEV 129
P E +PTP C + C + K + S+Y + + + IM EI +NGPVE
Sbjct: 80 KGPDAPCPEKIFPTPACNKTCDTPEVNYILDKTKAKSSYNVPNSEKAIMKEIMQNGPVEA 139
Query: 130 SFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
+F VYEDF HY+SGVY H G ++GGHA++++GWG +
Sbjct: 140 AFEVYEDFLHYESGVYFHSFGRMIGGHAIRMLGWGEEN 177
>gi|290987261|ref|XP_002676341.1| predicted protein [Naegleria gruberi]
gi|284089943|gb|EFC43597.1| predicted protein [Naegleria gruberi]
Length = 218
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 28/180 (15%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC GY +A+++ + G+VTE C P+ G P C +KC+ N
Sbjct: 54 GCSYGYFDTAFQFVENQGIVTENCFPFVSGEGNY---------IPPCPKKCLAYNPF--- 101
Query: 100 SKHYSISAYRINSD----PEDIMA---EIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV 152
+ +++N+ P+DI I G + S +Y DF Y+ GVY+H+ G+
Sbjct: 102 ------TLFKVNNSRAFLPQDIQGMQLSIMNGGSLAASLDIYRDFVQYRGGVYRHLVGNY 155
Query: 153 MGGHAVKLIGWGTSDDGE---DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
M H+V+++GWG + + YWI N W WG G+F I RGSNEC IE DV P
Sbjct: 156 MFTHSVRIVGWGITSPQQGSIPYWICGNNWTEEWGMQGWFWILRGSNECNIELDVWETTP 215
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 68/193 (35%), Positives = 100/193 (51%), Gaps = 15/193 (7%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
+ LS LL+C GC GG+ AW + G+V E C P+ ST C
Sbjct: 249 VELSAQHLLSC-NNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKASTE----TCRLRK 303
Query: 82 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
T C R + AYR+ ++ DIM EI +GPV+ + VY+DF Y+
Sbjct: 304 RTDLRSAGCAPPPNPLRTELYKVGPAYRLANE-TDIMQEILTSGPVQATMRVYQDFFSYE 362
Query: 142 SGVYKH-ITGDVMGG--HAVKLIGWG------TSDDGEDYWILANQWNRSWGADGYFKIK 192
SGVYKH +T ++ H+V++IGWG + + YW++AN W + WG +G F+I+
Sbjct: 363 SGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQ 422
Query: 193 RGSNECGIEEDVV 205
+G+NEC IE V+
Sbjct: 423 KGTNECEIESFVL 435
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 106/218 (48%), Gaps = 21/218 (9%)
Query: 1 MSVTR--TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGV 58
+S TR ++R AL S ++ LS LL+C C GGY AW Y G+
Sbjct: 231 ISTTRVASDRFALMSK---GADSVLLSAQHLLSC-NNRGQQACSGGYLDRAWLYMRKFGL 286
Query: 59 VTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIM 118
V E+C P+ + + C+ T C R + AYR+ ++ DIM
Sbjct: 287 VDEDCYPWEGT----NVQCKLRKRTDLKTAGCRPPVNPLRTELYKVGPAYRLGNE-TDIM 341
Query: 119 AEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD---VMGGHAVKLIGWGTSDDGE----- 170
EI +GPV+ + VY+DF Y+SG+YKH G H+V++IGWG
Sbjct: 342 YEILTSGPVQATMKVYQDFFSYESGIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYRNL 401
Query: 171 --DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
YW++ N W + WG G F+I+RG+NEC IE VVA
Sbjct: 402 PIKYWLVVNSWGQQWGESGLFRIQRGTNECDIESFVVA 439
>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
Length = 483
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 100/201 (49%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 299 QTPILSPQEVVSCSMY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 349
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + S +Y + + + + E+ ++GP+ V+F V +DF
Sbjct: 350 PCKPKENCLR--------YYTSGYYYVGGFYGGCNEALMKLELVQHGPMAVAFEVQDDFL 401
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G D G DYW + N W WG GYF+I
Sbjct: 402 HYHSGIYHHTGLRDPFNPFELTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRI 461
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 462 RRGTDECAIESIAVAAIPIPK 482
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 69/194 (35%), Positives = 100/194 (51%), Gaps = 13/194 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ + LS LL+C GC GGY AW + G+V EEC P+ TG + C
Sbjct: 250 ETVELSAQHLLSC-NNRGQQGCKGGYLDRAWLFMRKFGLVDEECYPW---TG-RNDQCRL 304
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
+ C R + AYR+ ++ DIM EI +GPV+ + VY+DF
Sbjct: 305 RKRSNLKTAGCQNPPNSLRTELYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVYQDFFV 363
Query: 140 YKSGVYKHITGDVM---GGHAVKLIGWGTSDDGE----DYWILANQWNRSWGADGYFKIK 192
Y+SGVY+H + G H+V++IGWG YW++AN W +WG +G F+I+
Sbjct: 364 YQSGVYRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYWLVANSWGHNWGENGLFRIQ 423
Query: 193 RGSNECGIEEDVVA 206
+G+NEC IE V+A
Sbjct: 424 KGTNECEIESYVLA 437
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 100/198 (50%), Gaps = 14/198 (7%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N +LS LL+C GC+GGY AW Y GVV + C PY S PG
Sbjct: 232 NSTLSSQQLLSCNQHR-QKGCEGGYLDRAWWYIRKLGVVGDHCYPYV-SGQSREPGHCLI 289
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
R+ ++ ++S + ++ Y+++S EDI E+ NGPV+ +F V+EDF
Sbjct: 290 PKRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM 349
Query: 140 YKSGVYKH--------ITGDVMGGHAVKLIGWG---TSDDGEDYWILANQWNRSWGADGY 188
Y GVY+H + G H+V+++GWG ++ YW+ AN W WG DGY
Sbjct: 350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGY 409
Query: 189 FKIKRGSNECGIEEDVVA 206
FK+ RG N C IE V+
Sbjct: 410 FKVLRGENHCEIESFVIG 427
>gi|354498051|ref|XP_003511129.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1-like
[Cricetulus griseus]
Length = 470
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 100/201 (49%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY +
Sbjct: 286 QTPILSPQEVVSCSMY--AQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDA------- 336
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + S +Y + + + + E+ ++GP+ V+F V +DF
Sbjct: 337 PCKPKENCLR--------YYTSGYYYVGGFYGGCNEALMKLELVQHGPMAVAFEVQDDFL 388
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G D G DYW + N W WG GYF+I
Sbjct: 389 HYHSGIYHHTGLRDPFNPFELTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRI 448
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 449 RRGTDECAIESIAVAAIPIPK 469
>gi|17933077|gb|AAL48195.1| cathepsin C [Homo sapiens]
Length = 463
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 95/185 (51%), Gaps = 27/185 (14%)
Query: 38 GDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQL 96
GC+GG+P + A +Y G+V E C PY TG P C K
Sbjct: 295 AQGCEGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSP--------------CKMKEDC 337
Query: 97 WR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKH------I 148
+R +S+++ + + + + E+ +GP+ V+F VY+DF HYK G+Y H
Sbjct: 338 FRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPF 397
Query: 149 TGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
+ HAV L+G+GT S G DYWI+ N W WG +GYF+I+RG++EC IE VA
Sbjct: 398 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAA 457
Query: 208 LPSSK 212
P K
Sbjct: 458 TPIPK 462
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 69/215 (32%), Positives = 102/215 (47%), Gaps = 15/215 (6%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
+ + L+ LLAC C GG+ +AW+Y GVV +EC PY + C+
Sbjct: 343 EQVQLAPQQLLACVRR--QQACSGGHLDTAWQYLRRVGVVNDECYPYIAAKN----QCKI 396
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
C + R + + AY +N++ DIM EI + G V+ VY DF
Sbjct: 397 NDGDTLVSANCELPANVNRTAMYRMGPAYSLNNE-TDIMTEIKERGTVQAILRVYRDFFS 455
Query: 140 YKSGVYKHITG-----DVMGGHAVKLIGWGTSDDGED---YWILANQWNRSWGADGYFKI 191
Y++G+Y+H + H+V+LIGWG G D YWI N W WG +G F+I
Sbjct: 456 YQNGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWWGENGRFRI 515
Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEITSADMFED 226
RG+NEC IE V+A P V+ + + ++
Sbjct: 516 LRGTNECEIESYVLASNPYVHQHVQTVRNVGDLQE 550
>gi|437323|gb|AAB00354.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 133
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 65/159 (40%), Positives = 80/159 (50%), Gaps = 51/159 (32%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAY 81
LS+S +D+ ACCG +CG+GC+GGYPI AWR++V G VT Y D TGC Y
Sbjct: 25 LSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTG--GSYQDKTGCK------PY 76
Query: 82 PTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYK 141
P P P EV+FTVYEDF HY
Sbjct: 77 PYP-----------------------------------------PFEVAFTVYEDFEHYS 95
Query: 142 SGVYKHITGDVM-GGHAVKLIGWGTSDDGEDYWILANQW 179
GVY H G + GGHAVK++GWG D+G YW++AN W
Sbjct: 96 GGVYVHTAGASLGGGHAVKMLGWGV-DNGTPYWLIANSW 133
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 108/227 (47%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 196 AAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY 254
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
P+ S G + A P P C+ + R N + AYR+
Sbjct: 255 PF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 308
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H + G H+VK+ GW
Sbjct: 309 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 368
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 369 GEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 415
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 99/197 (50%), Gaps = 14/197 (7%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
++ + LS L++C GC GGY AW + GVV E+C P+ C
Sbjct: 280 IEKVQLSGQHLISC-NNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWLSG---RSDKCR 335
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSIS-AYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
C ++N ++ Y + AYR+ ++ DIM EI +GPV+ + V+ DF
Sbjct: 336 IPRRGKLSDAGCQRRNSYNLRNEMYKVGPAYRLGNE-TDIMQEILTSGPVQATMRVHRDF 394
Query: 138 AHYKSGVYKH---ITGDVMGGHAVKLIGWGTSDDGED-----YWILANQWNRSWGADGYF 189
HY+SG+Y H G H+V+++GWG + +W +AN W R WG DGYF
Sbjct: 395 FHYESGIYVHSRPFDTRQSGYHSVRIVGWGEEPSPYNGKPIKFWRVANSWGRDWGEDGYF 454
Query: 190 KIKRGSNECGIEEDVVA 206
+I RG+NEC IE V+
Sbjct: 455 RIVRGNNECEIESFVLG 471
>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 228
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 74/208 (35%), Positives = 97/208 (46%), Gaps = 31/208 (14%)
Query: 24 LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------------EECDPY 66
LS+ L +CC G +GC G + +HG+VT + C PY
Sbjct: 18 LSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEKLGNDDGCWPY 77
Query: 67 FDSTGCSH-PGCEPAYP-------TPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPED 116
C+H PG E YP P C C K + H + S R+ PE
Sbjct: 78 -PFPKCNHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEK 136
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI+ NGPV T+YEDF +YKSGVY H TG ++ H +KLIGWG + G++YW+
Sbjct: 137 IKQEIFDNGPVAAMMTLYEDFRYYKSGVYVHKTGQLLAAHTLKLIGWGV-ESGQEYWLAM 195
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDV 204
N WN WG G K+ G G+E V
Sbjct: 196 NAWNEEWGDHGMIKLAVGKT--GLEHQV 221
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 92/195 (47%), Gaps = 29/195 (14%)
Query: 24 LSVNDLLACCGFLCG----DGCDGGYPISAWRYFVHHGVVT-------EE------CDPY 66
LS+ L +CC G +GC G + +HG+VT EE C PY
Sbjct: 199 LSLGYLTSCCNRANGCPKSNGCMFGSVPEGLNFMKNHGLVTGGEYKPPEELGNDDGCWPY 258
Query: 67 FDSTGCSH-PGCEPAYP-------TPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPED 116
C+H PG E YP P C C K + H + S R+ PE
Sbjct: 259 -PFPKCNHVPGLESKYPRCAQVRDLPACATTCPNKAYGTSMQKDTHRAKSWGRLPIGPEK 317
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI+ NGPV T+YEDF YKSGVY H TG ++ H +KLIGWG + G++YW+
Sbjct: 318 IKQEIFDNGPVAAMMTLYEDFRFYKSGVYVHKTGQMLAAHTLKLIGWGV-ESGQEYWLAV 376
Query: 177 NQWNRSWGADGYFKI 191
N WN WG G K+
Sbjct: 377 NAWNEEWGDHGMIKL 391
>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
Length = 488
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 98/192 (51%), Gaps = 27/192 (14%)
Query: 25 SVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTP 84
S D++ C + GCDGG+ +Y +G+ E CDPY G +
Sbjct: 310 SPQDIVECSAY--SQGCDGGFMYLVSKYAEDYGLAEESCDPY--------KGVDSVCKKD 359
Query: 85 KCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGV 144
+C ++ N + + + +A +++M E+Y GP+ ++F VY+DF +YK GV
Sbjct: 360 QCPKRAYGTNYAYTGGFYGATNA-------KNMMYELYHGGPLAIAFEVYDDFFNYKGGV 412
Query: 145 YKHIT---------GDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGS 195
Y H T G HAV L+GWG ++G YW++ N W SWG +G+FKIKRG+
Sbjct: 413 YTHSTALKTKIAEPGWEETNHAVLLVGWG-EENGVPYWLVKNSWGTSWGINGFFKIKRGT 471
Query: 196 NECGIEEDVVAG 207
+EC E + V+
Sbjct: 472 DECDCESEAVSA 483
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 70/187 (37%), Positives = 94/187 (50%), Gaps = 28/187 (14%)
Query: 23 SLSVNDLLACCGFLCGDG----CDGGYPISAWRYFVHHGVVT-------EECDPYFDSTG 71
+LS +L++C GDG CDGG AW ++ G+VT E C PY +
Sbjct: 79 NLSAQNLMSC-----GDGEKMGCDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRP- 132
Query: 72 CSHPG------CEPAYPTPK--CVRKCVKKNQL--WRNSKHYSISAYRIN-SDPEDIMAE 120
C H G C T C +KCV KN + + H + Y + ++ + I E
Sbjct: 133 CDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTNVKQIQQE 192
Query: 121 IYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWN 180
I GPV VYE+F YK G+YK TG+++G H VKLIGWG DG +YW+ N WN
Sbjct: 193 IMTYGPVTAFMYVYENFMGYKEGIYKSTTGELIGYHHVKLIGWGVDGDGTEYWLAMNSWN 252
Query: 181 RSWGADG 187
+WG DG
Sbjct: 253 SNWGNDG 259
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 94/164 (57%), Gaps = 20/164 (12%)
Query: 45 YPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYS 104
Y +AW Y+++ G+ + Y S GC P E ++ + +CVK
Sbjct: 150 YIKNAWDYYINEGIAS--GGDYNSSEGC-QPYSESSFQYAE-ASECVK------------ 193
Query: 105 ISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWG 164
Y + ++ I EI NGPV + V+EDFA +KSGVY + +G +G H+VK+IGWG
Sbjct: 194 --FYTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYYYKSGKFVGRHSVKVIGWG 251
Query: 165 TSDDGEDYWILANQWNRSWGA-DGYFKIKRGSNECGIEEDVVAG 207
T ++G YW++AN W WG G+FK++RG+NEC IE+++ AG
Sbjct: 252 T-EEGIPYWLIANSWGSEWGELGGFFKMRRGTNECWIEQEMTAG 294
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 111/227 (48%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 130 AAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCHGGRLDGAWWFLRRRGVVSDHCY 188
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P+ S G + A P P C+ R+ + + + N + AYR+
Sbjct: 189 PF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 242
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H + G H+VK+ GW
Sbjct: 243 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 302
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 303 GEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 349
>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 96
Score = 109 bits (272), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 49/91 (53%), Positives = 64/91 (70%), Gaps = 1/91 (1%)
Query: 117 IMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILA 176
I EI K GPVE +F VYEDF +YKSG+YKHITG + HA+++IGWG ++ YW++
Sbjct: 3 IQKEIMKYGPVEANFIVYEDFLNYKSGIYKHITGKLFSWHAIRIIGWG-EENNTPYWLIP 61
Query: 177 NQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
N WN WG +G F+I RG +EC IE +V AG
Sbjct: 62 NSWNEDWGENGNFRILRGRHECSIESEVTAG 92
>gi|37905530|gb|AAO64478.1| cathepsin C precursor [Fundulus heteroclitus]
Length = 450
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 94/197 (47%), Gaps = 26/197 (13%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGCEPAYP 82
LS +++C + GCDGG+P +Y G+V E C PY + + C P
Sbjct: 271 LSPQQVVSCSEY--SQGCDGGFPYLIGKYVQDFGIVDESCFPYIAADSPCGVP------- 321
Query: 83 TPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKS 142
C R +++ + + + E+ KNGP+ V+ VY DF HYK
Sbjct: 322 -QNCGRM--------YTAEYRYVGGFYGGCSETAMKLELVKNGPMAVALEVYPDFMHYKE 372
Query: 143 GVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKIKRGS 195
G+Y H + + HAV L+G+G G+ YWI+ N W WG DGYF+I+RGS
Sbjct: 373 GIYHHTGFRDSVNPFELTNHAVLLVGYGRCHKTGQKYWIVKNSWGSGWGEDGYFRIRRGS 432
Query: 196 NECGIEEDVVAGLPSSK 212
+EC IE VA P K
Sbjct: 433 DECAIESIAVAAKPIPK 449
>gi|74199074|dbj|BAE30750.1| unnamed protein product [Mus musculus]
Length = 447
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY
Sbjct: 263 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 313
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S +Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 314 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 365
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G +YWI+ N W +WG GYF+I
Sbjct: 366 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 425
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 426 RRGTDECAIESIAVAAIPIPK 446
>gi|348565723|ref|XP_003468652.1| PREDICTED: dipeptidyl peptidase 1-like [Cavia porcellus]
Length = 463
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 105/202 (51%), Gaps = 27/202 (13%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G+V E C PY G P C
Sbjct: 279 QTPILSPQEIVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEESCFPY---KGIDVP-C- 331
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K + CV+ + S+++ + + + + E+ ++GP+ V+F VY+DF
Sbjct: 332 ------KVKKDCVR----YYTSEYHYVGGFYGGCNEALMKLELVQHGPMAVAFEVYDDFL 381
Query: 139 HYKSGVYKHITGDV-------MGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFK 190
HY G+Y H TG + HAV L+G+GT G DYWI+ N W WG DGYF+
Sbjct: 382 HYHKGIY-HRTGLRDPFNPFELTNHAVLLVGYGTDPVSGRDYWIVKNSWGTGWGEDGYFR 440
Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
I RG++EC IE +A P K
Sbjct: 441 ILRGTDECAIESIAMAATPIPK 462
>gi|160707990|ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus]
gi|3023454|sp|P97821.1|CATC_MOUSE RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
peptidase I; Short=DPP-I; Short=DPPI; AltName:
Full=Dipeptidyl transferase; Contains: RecName:
Full=Dipeptidyl peptidase 1 exclusion domain chain;
AltName: Full=Dipeptidyl peptidase I exclusion domain
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
chain; Contains: RecName: Full=Dipeptidyl peptidase 1
light chain; AltName: Full=Dipeptidyl peptidase I light
chain; Flags: Precursor
gi|1881656|gb|AAB49457.1| preprodipeptidyl peptidase I [Mus musculus]
gi|7609786|gb|AAB58400.3| dipeptidyl peptidase I precursor [Mus musculus]
gi|45219895|gb|AAH67063.1| Cathepsin C [Mus musculus]
gi|74147157|dbj|BAE27487.1| unnamed protein product [Mus musculus]
gi|74178079|dbj|BAE29829.1| unnamed protein product [Mus musculus]
gi|148674849|gb|EDL06796.1| cathepsin C, isoform CRA_b [Mus musculus]
Length = 462
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S +Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G +YWI+ N W +WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461
>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
Length = 541
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 104/215 (48%), Gaps = 17/215 (7%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDP 65
++R A+ S + ++ LS L++C F +G G W Y GVV+ C P
Sbjct: 324 SDRLAIQSKNFTVVE---LSPQHLVSC--FSSHEG-RGERLDRTWWYLRKKGVVSTVCYP 377
Query: 66 YFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNG 125
S G C N + N + + YR++S+ E+IM EI++NG
Sbjct: 378 ESRSKSTQGIGSCGLVAHSSGAHICPNGNVISSNEIYKTSPVYRVSSNEENIMKEIFENG 437
Query: 126 PVEVSFTVYEDFAHYKSGVYKHITGDVM--------GGHAVKLIGWG---TSDDGEDYWI 174
PV+ V DF YKSGVY D + H+VK+IGWG + + YWI
Sbjct: 438 PVQAVMRVQPDFFVYKSGVYSSTAIDNIVVEQVKDNTYHSVKIIGWGEKKSKTNSGKYWI 497
Query: 175 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+ N W +WG GYF+I++G NECGIEE ++A P
Sbjct: 498 VQNSWGANWGEGGYFRIRKGVNECGIEEMILAAWP 532
>gi|74204274|dbj|BAE39895.1| unnamed protein product [Mus musculus]
Length = 462
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S +Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G +YWI+ N W +WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461
>gi|291236490|ref|XP_002738176.1| PREDICTED: cathepsin C-like [Saccoglossus kowalevskii]
Length = 438
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 94/196 (47%), Gaps = 24/196 (12%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N+++S D++ CC + GC GG+P +Y G V E C PY G P
Sbjct: 258 NITISPQDVVQCCNY--SQGCSGGFPYLVSKYSEDFGFVEETCLPYTAQDG-------PC 308
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
KC R +K+ + + + + E+ KNGP+ V+F VY+DF Y
Sbjct: 309 VSEIKCKRH--------YGTKYRYVGDFYGGCNEALMKIELVKNGPMAVAFMVYDDFMSY 360
Query: 141 KSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYFKIKR 193
+ G+Y H + HAV L+G+G D E +WI+ N W WG +GYF+I+R
Sbjct: 361 QGGIYHHTGLQDKFNPFEITNHAVLLVGYGYDHDTKEKFWIVKNSWGTGWGEEGYFRIRR 420
Query: 194 GSNECGIEEDVVAGLP 209
G++EC IE V P
Sbjct: 421 GNDECSIESIAVESTP 436
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 108/227 (47%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 302 AAVASDRVSIHSLGHMSPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY 360
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
P+ S G + A P P C+ + R N + AYR+
Sbjct: 361 PF------SGHGRDEAVPAPPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 414
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H + G H+VK+ GW
Sbjct: 415 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 474
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 475 GEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 521
>gi|74191569|dbj|BAE30359.1| unnamed protein product [Mus musculus]
Length = 462
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S +Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G +YWI+ N W +WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 99/193 (51%), Gaps = 13/193 (6%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDS-TGCSHPGC--EPA 80
LS LL+C L GC GG+ AW + G++TEEC P+ + C+ P E
Sbjct: 246 LSPQHLLSC-NNLNQQGCQGGHLTRAWNWIRKFGLITEECYPWQGRMSTCAVPKKKKETM 304
Query: 81 YPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHY 140
P VR ++ + H YR+ ++ E IM EI +GPV+ V DF Y
Sbjct: 305 AQCPSRVRS--NNDRTTKTRLHRVGPVYRVATE-EGIMHEILTSGPVQAVMKVSRDFFMY 361
Query: 141 KSGVYK---HITGDVMGGHAVKLIGWGTSDDG---EDYWILANQWNRSWGADGYFKIKRG 194
KSGVYK +G G H+V+++GWG G YWI +N W WG +GYF+I +G
Sbjct: 362 KSGVYKCSNLASGSRTGYHSVRIVGWGEEYQGGKIVKYWIASNSWGSWWGENGYFRILKG 421
Query: 195 SNECGIEEDVVAG 207
+EC IE+ V+A
Sbjct: 422 VDECEIEDFVIAA 434
>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
Length = 470
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 98/200 (49%), Gaps = 29/200 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY--FDSTGCSHPG 76
Q LS ++++C + GC+GG+P + A +Y GVV EEC PY DS+
Sbjct: 285 QQFVLSPQEIVSCGKY--SQGCEGGFPYLIAGKYAEDFGVVLEECYPYEGKDSSCKDTSR 342
Query: 77 CEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C Y T + + + + E + E+ KNGP+ V+F VY D
Sbjct: 343 CGRGYAT-----------------NYRYVGGFYGGCNEELMQLELVKNGPMAVAFEVYSD 385
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYF 189
F HYK GVY+H + HAV L+G+G + G +W + N W WG +G+F
Sbjct: 386 FMHYKGGVYEHTGLSDPFNPFEITNHAVLLVGYGRDPETGAKFWTVKNSWGEKWGEEGFF 445
Query: 190 KIKRGSNECGIEEDVVAGLP 209
+I+RG++EC IE VA P
Sbjct: 446 RIRRGTDECAIESIAVAADP 465
>gi|74212565|dbj|BAE31022.1| unnamed protein product [Mus musculus]
Length = 191
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 102/201 (50%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY
Sbjct: 7 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 57
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C+R + +S +Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 58 PCKPRENCLR--------YYSSDYYYVGGFYGGCNEALMELELVKHGPMAVAFEVHDDFL 109
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G +YWI+ N W +WG GYF+I
Sbjct: 110 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 169
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 170 RRGTDECAIESIAVAAIPIPK 190
>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
Length = 470
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 102/206 (49%), Gaps = 32/206 (15%)
Query: 8 RDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPY 66
R LS P +S Q + ++C + GC+GG+P + A +Y +G+V E PY
Sbjct: 269 RSQLSQKPILSPQQV-------VSCSNY--SQGCEGGFPYLIAGKYVSDYGIVEESDLPY 319
Query: 67 FDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGP 126
TG P C K Q + ++++ + + + + E+ GP
Sbjct: 320 ---TGSDSP----------CTLK--DSQQKYYTAEYHYVGGFYGGCNEAYMKLELVLGGP 364
Query: 127 VEVSFTVYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQW 179
+ V+F VY+DF HY+SGVY H + HAV L+G+GT GE YWI+ N W
Sbjct: 365 LSVAFEVYDDFMHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSW 424
Query: 180 NRSWGADGYFKIKRGSNECGIEEDVV 205
SWG GYF+I+RG++EC IE V
Sbjct: 425 GESWGEKGYFRIRRGTDECAIESIAV 450
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 91/176 (51%), Gaps = 20/176 (11%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE-------CDPY-FDSTGCSHP 75
+S D+L+CCG CG GC GG I AW++ + +GV T C PY F G
Sbjct: 27 ISDTDILSCCGRFCGYGCRGGANIRAWKHVMRNGVCTGGPCGYKYGCRPYAFHPCGVHKD 86
Query: 76 GC------EPAYPTPKCVRKCVKK--NQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPV 127
+Y TP+C + C + + ++Y+ SAY + +D + IM EI + GPV
Sbjct: 87 QVYYGECPRKSYDTPECRKICQRGCIQLQYGKDRYYAASAYFVKNDTKAIMREIMRGGPV 146
Query: 128 EVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGED----YWILANQW 179
++ Y DF YK GVY+H G+ GGH++K++GWG YW++AN W
Sbjct: 147 HGAYDTYTDFRLYKGGVYEHTAGERTGGHSIKIMGWGNYKHPNGTVIPYWLVANSW 202
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 101/199 (50%), Gaps = 15/199 (7%)
Query: 21 NLSLSVNDLLACCGFLCGD-GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEP 79
N LS +L++C G + G Y + W Y +HG+V+ Y + GC P
Sbjct: 140 NQLLSTEELISCSGIKEDEFGSVNDYYV--WEYLKNHGLVS--GGKYNTNNGCQPSKIPP 195
Query: 80 AYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTV 133
P C ++C N + N H I + + + EDI E+ GPV ++F V
Sbjct: 196 IGNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNH-YDIEYEDIQREVQNYGPVSMAFKV 254
Query: 134 YE-DFAHYKSGVYKHITG-DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
++ DF YKSGVY+ T + + KLIGWG ++G DYW+L N W WG +G FKI
Sbjct: 255 FDNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGV-ENGVDYWLLVNFWGYEWGQNGLFKI 313
Query: 192 KRGSNECGIEEDVVAGLPS 210
KRG++EC IE V AG P
Sbjct: 314 KRGTDECNIETFVHAGEPQ 332
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/198 (35%), Positives = 99/198 (50%), Gaps = 13/198 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N LS +L++C G + D W Y +HG+V+ Y + GC P
Sbjct: 140 NQLLSTEELISCSG-IKEDEFGSVNDDYVWEYLKNHGLVS--GGKYNTNNGCQPSKIPPI 196
Query: 81 YPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
P C ++C N + N H I + + + EDI E+ GPV ++F V+
Sbjct: 197 GNLPTGLYENTCEKRCYGNNTINYNQDHVKIKNH-YDIEYEDIQREVQNYGPVSMAFRVF 255
Query: 135 E-DFAHYKSGVYKHITG-DVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIK 192
+ DF YKSGVY+ T + + KLIGWG ++G DYW+L N W WG +G FKIK
Sbjct: 256 DNDFFLYKSGVYEKTTNSEFIQWQYAKLIGWGV-ENGVDYWLLVNSWGYEWGQNGLFKIK 314
Query: 193 RGSNECGIEEDVVAGLPS 210
RG++EC IE V AG P
Sbjct: 315 RGTDECNIETFVHAGEPQ 332
>gi|185135783|ref|NP_001117966.1| prepro-cathepsin C precursor [Oncorhynchus mykiss]
gi|51038277|gb|AAT94060.1| prepro-cathepsin C [Oncorhynchus mykiss]
Length = 457
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 93/196 (47%), Gaps = 24/196 (12%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPT 83
S +++C + GCDGG+P +Y G+V E C PY G P P
Sbjct: 278 FSPQQVVSCSQY--SQGCDGGFPYLIGKYVQDFGIVEESCYPY---AGTDSPCDVPD--- 329
Query: 84 PKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSG 143
C+R S + + + +M E+ KNGP+ V+F VY DF HYK G
Sbjct: 330 -GCLRH--------YTSDYSYVGGFYGGCSESAMMLELVKNGPMGVAFEVYPDFMHYKEG 380
Query: 144 VYKHI------TGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSN 196
+Y H + HAV L+G+G G+ +W++ N W WG +G+FK++RGS+
Sbjct: 381 IYHHTGLHDSYNPFELTNHAVLLVGYGQCHVTGQKFWVVKNSWGTKWGEEGFFKVRRGSD 440
Query: 197 ECGIEEDVVAGLPSSK 212
EC IE VA P K
Sbjct: 441 ECAIESIAVAAKPIPK 456
>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
Length = 575
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 102/201 (50%), Gaps = 31/201 (15%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A ++ G+V E C PY TG P
Sbjct: 391 QTPILSPQEVVSCSQY--AQGCEGGFPYLVAGKHAQDFGLVEEACFPY---TGTDAP--- 442
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 443 -----------CTMKEGCRRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 491
Query: 137 FAHYKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGY 188
F HY G+Y H TG + HAV L+G+GT S G YWI+ N W WG DGY
Sbjct: 492 FLHYHRGIYHH-TGLTDPFNPFELTNHAVLLVGYGTDSATGIQYWIVKNSWGTGWGEDGY 550
Query: 189 FKIKRGSNECGIEEDVVAGLP 209
F+I+RG++EC IE VA P
Sbjct: 551 FRIRRGTDECAIESIAVAATP 571
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
pisum]
Length = 169
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 84/155 (54%), Gaps = 11/155 (7%)
Query: 63 CDPYF------DSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKH-YSISAYRINSDPE 115
C+PY + G S +P +C R C L N H ++ Y +
Sbjct: 15 CEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG-- 72
Query: 116 DIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHI-TGDVMGGHAVKLIGWGTSDDGEDYWI 174
I ++ GP+E SF VY+DF YKSGVY+ +GGHAVKLIGWG ++G YW+
Sbjct: 73 SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNATKLGGHAVKLIGWGV-EEGIPYWL 131
Query: 175 LANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
+ N W+ WG +G FKI+RG++ECGI+ AG+P
Sbjct: 132 MVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVP 166
>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
Length = 362
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 85/168 (50%), Gaps = 14/168 (8%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
C GGY +W + + G + C PY G + + C +C K
Sbjct: 205 ACQGGYLKYSWTFLENTGTPLDSCIPYASGRG--------TFSSGTCPTQC--KIASMSM 254
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
SK+ + + I S +I I G V+ FTVY D YKSGVYKHI V+GGHAV
Sbjct: 255 SKYKAKNTVYI-SGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHIENTVLGGHAVA 313
Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
LIG+G + G +YW+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 314 LIGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 358
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 108/221 (48%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LLAC GC GG AW + GVV++ C
Sbjct: 90 AAVASDRVSIHSLGHMTPVLSPQNLLACDTH-HQQGCRGGRLDGAWWFLRRRGVVSDHCY 148
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
P+ D G + P + + R+ + N N+ Y ++ YR+ S+ ++I
Sbjct: 149 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEI 208
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF YK G+Y H + G H+VK+ GWG T
Sbjct: 209 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 268
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 269 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 309
>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
Length = 363
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 89/164 (54%), Gaps = 16/164 (9%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC+GG P++A+ + + G V C Y C KC +N +
Sbjct: 208 GCNGGEPVNAFNFLHNTGTVLTSCVEYTAGDDAVVKFCPQ-----KCDDGSAVENIV--- 259
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
+ S + S + ++A +GPV +F V +DF +YKSGVY+H G +GGHAV+
Sbjct: 260 ----ATSGAKSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVE 311
Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
++G+G +D G DYW + N W WG DGYF+I RG +ECGIE++
Sbjct: 312 IVGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEQE 355
>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
Length = 325
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 82/167 (49%), Gaps = 14/167 (8%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C GGY +W + + G + C PY G G P +C + ++
Sbjct: 169 CQGGYLKYSWTFLENTGTPLDTCIPYASGRGTFSSGTCPT----QCKIASMSMSK----- 219
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
Y R + +I I G V+ FTVY D YKSGVYKH+ V+GGHAV L
Sbjct: 220 --YKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVAL 277
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
IG+G + G +YW+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 278 IGFGV-EGGSNYWLAANSWGANWGMSGYFKIAQG--EGGIENQVYAG 321
>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 82/167 (49%), Gaps = 14/167 (8%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C GGY +W + + G + C PY G + + C +C + +
Sbjct: 69 CQGGYLKYSWTFLENTGTPLDTCIPYASGRG--------TFSSGTCPTQCKIASM---SM 117
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
Y R + +I I G V+ FTVY D YKSGVYKH+ V+GGHAV L
Sbjct: 118 SKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVAL 177
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
IG+G + G +YW+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 178 IGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 221
>gi|363729389|ref|XP_417207.2| PREDICTED: dipeptidyl peptidase 1 [Gallus gallus]
Length = 460
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/207 (35%), Positives = 101/207 (48%), Gaps = 37/207 (17%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q S +++C + GCDGG+P + A +Y GVV E+C PY
Sbjct: 276 QKPVFSPQQVVSCSQY--SQGCDGGFPYLIAGKYVQDFGVVEEDCFPY------------ 321
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI-----NSDPEDIMA-EIYKNGPVEVSFT 132
A TP C+ K R+ HY S Y + E +M E+ +GP+ V+F
Sbjct: 322 TAKDTP-CLFK--------RSCYHYYTSEYHYVGGFYGACNEALMKLELVLSGPMAVAFE 372
Query: 133 VYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGA 185
VY DF YK G+Y H + HAV L+G+G + GE +WI+ N W SWG
Sbjct: 373 VYNDFMFYKEGIYHHTGLKDEFNPFELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGE 432
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSK 212
DGYF+I+RG++EC IE VA P K
Sbjct: 433 DGYFRIRRGTDECAIESIAVAATPIPK 459
>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 86/169 (50%), Gaps = 14/169 (8%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C GGY +W + + G + C PY G + + C +C K S
Sbjct: 154 CQGGYLKYSWTFLENTGTPLDTCIPYASGGG--------TFSSGTCPTQC--KIASMSMS 203
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
K+ + + I S +I I G V+ FTVY D YKSGVYKH+ V+GGHAV L
Sbjct: 204 KYKAKNTVYI-SGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHLVSTVLGGHAVAL 262
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
IG+G + G +YW+ AN W +WG GYFKI +G E GIE V AG P
Sbjct: 263 IGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAGEP 308
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 109/221 (49%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG+ AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSLGHMTPILSPQNLLSCNTHH-QQGCRGGHLDGAWWFLRRRGVVSDHCY 293
Query: 65 PYF----DSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
P+ D G P + T + R+ N N+ Y ++ AYR+ S+ +I
Sbjct: 294 PFLGRERDKAGPVPPCMMHSRATGRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEI 353
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF YK G+Y H ++ G H+VK+ GWG T
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETWP 413
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 414 DGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 101/190 (53%), Gaps = 20/190 (10%)
Query: 24 LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYP- 82
S +LL CC C C GGY AW Y+++ G+V+ Y S GC P + ++
Sbjct: 130 FSPENLLTCCED-CRLECVGGYTAKAWDYYINEGIVSG--GDYNSSEGC-QPYSKASFQY 185
Query: 83 --TPKCVRKC--VKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
KCV+ C K + + + KHY S Y + ++ I EI NGPV +F V+ED
Sbjct: 186 AVASKCVKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFNVFEDII 245
Query: 139 HYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWG-ADGYFKIKRGSNE 197
+YKSG+ V ++ WGT ++G YW++AN W WG G+ KIKRG+NE
Sbjct: 246 YYKSGIQL---------SNVSILRWGT-EEGVPYWLIANSWGTWWGDLGGFIKIKRGTNE 295
Query: 198 CGIEEDVVAG 207
C IE+++ AG
Sbjct: 296 CAIEQEMAAG 305
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 78/227 (34%), Positives = 106/227 (46%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCQGGRLDGAWWFLRRRGVVSDHCY 188
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
P+ H E A P P+C+ + R N + AYR+
Sbjct: 189 PF-----SGHERNE-AGPAPRCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLG 242
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGD--------VMGGHAVKLIGW 163
S+ +DIM E+ +NGPV+ V+EDF Y+SG+Y H G H+VK+ GW
Sbjct: 243 SNEKDIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITGW 302
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 303 GEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFVLG 349
>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
Length = 458
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 100/203 (49%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS +++C + GCDGG+P + A +Y G+V E PY G P
Sbjct: 274 QKPILSPQQVVSCSNY--SQGCDGGFPYLIAGKYLNDFGIVEESDFPYI---GSDSP--- 325
Query: 79 PAYPTPKCVRKCVKKN--QLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K+ Q + ++++ + + + + E+ GP+ V+F VY+D
Sbjct: 326 -----------CTLKDSYQRYYTAEYHYVGGFYGGCNEAYMKLELVLGGPLSVAFEVYDD 374
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSDD-GEDYWILANQWNRSWGADGYF 189
F HY+SGVY H + HAV L+G+GT GE YWI+ N W SWG G+F
Sbjct: 375 FIHYRSGVYHHTGLQDKFNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSWGESWGEKGFF 434
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RGS+EC IE V+ P K
Sbjct: 435 RIRRGSDECAIESIAVSANPIIK 457
>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
Length = 360
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 88/164 (53%), Gaps = 16/164 (9%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC+GG P++A+ + + G V C Y C KC +N +
Sbjct: 205 GCNGGEPVNAFNFLHNTGTVLASCVGYTAGDDAVVKFCPQ-----KCDDGSAVENVV--- 256
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
+ S + S + ++A +GPV +F V +DF +YKSGVY+H G +GGHAV+
Sbjct: 257 ----ATSGSKSGSAIDVLLA----HGPVVATFNVAQDFMYYKSGVYQHRWGLWLGGHAVE 308
Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
+IG+G +D G DYW + N W WG DGYF+I RG +ECGIE +
Sbjct: 309 IIGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGGDECGIEHE 352
>gi|432892467|ref|XP_004075795.1| PREDICTED: dipeptidyl peptidase 1-like [Oryzias latipes]
Length = 453
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 96/203 (47%), Gaps = 30/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHP-GC 77
Q S +++C + GCDGG+P +Y G+V E C PY + C P C
Sbjct: 270 QTPVFSPQQVVSCSEY--SQGCDGGFPYLIGKYSQDFGIVEESCFPYIAKDSPCGVPQNC 327
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
AY +++ + + +M E+ +GP+ V+F VY DF
Sbjct: 328 GRAY-----------------TAEYKYVGGFYGGCSEMAMMKELVHHGPMAVAFEVYPDF 370
Query: 138 AHYKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYF 189
HY G+Y H TG + HAV L+G+G GE YWI+ N W SWG +G+F
Sbjct: 371 MHYAGGIYHH-TGLADPFNPFELTNHAVLLVGYGRCHKTGEKYWIVKNSWGTSWGENGFF 429
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RGS+EC IE VA P K
Sbjct: 430 RIRRGSDECSIESIAVAATPIPK 452
>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
Length = 487
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 105/217 (48%), Gaps = 15/217 (6%)
Query: 19 LQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPGC 77
L +LS LL+C L GC GG+ SAW + + G+VTEEC P+ +T C+
Sbjct: 268 LMRDALSPKHLLSCNNDL-QRGCQGGHLTSAWNWVMTFGLVTEECYPWDGRATDCAVSNQ 326
Query: 78 EPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDF 137
+ K + L R Y ++ E IM EI G V+ V ++F
Sbjct: 327 RSNNNLIVTCPRSAKTSPLRRVGLMYRVAT------EEGIMYEIMNWGSVQAMMKVSKEF 380
Query: 138 AHYKSGVYKHITGDV---MGGHAVKLIGWGTSDDG---EDYWILANQWNRSWGADGYFKI 191
Y+SGVYK D+ G H V+++GWG YWI++N W WG GYF+I
Sbjct: 381 FMYESGVYKCSKLDLGSKTGYHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSKNLVKEITSADMFEDAS 228
+G+NEC IE+ VVA +P N I+ E+AS
Sbjct: 441 LKGTNECQIEDFVVAAMPDIDNFCN-ISDQSFRENAS 476
>gi|26340150|dbj|BAC33738.1| unnamed protein product [Mus musculus]
Length = 462
Score = 107 bits (266), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 101/201 (50%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GCDGG+P + A +Y GVV E C PY
Sbjct: 278 QTPILSPQEVVSCSPY--AQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDS------- 328
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P P C R + +S +Y + + + + E+ K+GP+ V+F V++DF
Sbjct: 329 PCKPRENCHR--------YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKI 191
HY SG+Y H + HAV L+G+G G +YWI+ N W +WG GYF+I
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
+RG++EC IE VA +P K
Sbjct: 441 RRGTDECAIESIAVAAIPIPK 461
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 183 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCY 241
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P+ A PTP+C+ R+ + Q+ N + AYR+
Sbjct: 242 PFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLG 295
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H G H+VK+ GW
Sbjct: 296 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 355
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 356 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 402
>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
Length = 257
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 87/179 (48%), Gaps = 15/179 (8%)
Query: 30 LACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRK 89
L C GC+GG P AW Y HG+ T C PY G CV+
Sbjct: 87 LVSCDIFGNQGCNGGIPQLAWEYMELHGIPTYGCFPYTSGNGTDG----------SCVKN 136
Query: 90 CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHIT 149
N+ + + ++ + + E I +I K GP++ + VY DF Y SGVY
Sbjct: 137 SCVDNEQYTLYRAKPLT-LKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTP 195
Query: 150 G-DVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G ++GGHA+K++GWG ++YWI+AN W SWG DG+F I ++CGI D A
Sbjct: 196 GSSLLGGHAIKIVGWGFDQASNQNYWIVANSWGPSWGIDGFFWIAF--DQCGINSDACA 252
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 234 AAVASDRVSIHSLGHMTPILSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDNCY 292
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P+ A PTP+C+ R+ + Q+ N + AYR+
Sbjct: 293 PFSGREQ------NEASPTPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLG 346
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
SD ++IM E+ +NGPV+ V+EDF Y+ G+Y H G H+VK+ GW
Sbjct: 347 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 406
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W WG G+F+I RG+NEC IE V+
Sbjct: 407 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLG 453
>gi|325180819|emb|CCA15230.1| cathepsinlike cysteine protease putative [Albugo laibachii Nc14]
Length = 660
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 99/206 (48%), Gaps = 8/206 (3%)
Query: 5 RTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
+ +R+ SP L+ + L+ LL C GC GG P+SA+RY +G+ E C
Sbjct: 94 QRSRNRKEKSPVDVLREVVLAPQVLLNC--DTADGGCHGGDPLSAFRYIHENGIPDESCQ 151
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLW--RNSKHYSISAYRINSDPEDIMAEIY 122
Y ++TG H P C C W ++ + Y +S + + AEIY
Sbjct: 152 RY-EATG--HDTGNQCRPQDVC-ENCAPSRGCWAQKSYEKYYVSEFGTVRGEHQMKAEIY 207
Query: 123 KNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRS 182
G + + V + F +Y+ GV+ T V HA+ ++GWG DG YW++ N W
Sbjct: 208 ARGSIVCTVDVTDAFLNYEGGVFDDKTHAVSMDHAISVVGWGEMKDGTKYWVVRNSWGSF 267
Query: 183 WGADGYFKIKRGSNECGIEEDVVAGL 208
WG DG+F+I RG N GIE + G+
Sbjct: 268 WGEDGWFRIVRGVNNLGIESECTFGV 293
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 86/191 (45%), Gaps = 8/191 (4%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY-FDSTGCSHPG-CEP 79
++LS L+ C G G C GG P + Y HG+ + C Y + C+ CE
Sbjct: 444 IALSPQVLINCHG---GGSCAGGNPGLVYEYAHRHGIPDQTCQAYQAQNLNCNEFAICET 500
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
+ T + + + K Y +S Y S + + AEI+K GP+ ++F
Sbjct: 501 CWSTNTSFTP--GRCEAIKKFKKYYVSEYGKVSGVDRMKAEIFKRGPIGCGIHATKNFVA 558
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGE-DYWILANQWNRSWGADGYFKIKRGSNEC 198
Y G+Y + H + + GWG +D + +YWI N W WG G+F+IK +
Sbjct: 559 YTGGIYSESVIWPIPNHEISVAGWGFDEDTQTEYWIGRNSWGTYWGEHGWFRIKMHHSNL 618
Query: 199 GIEEDVVAGLP 209
GIE D G+P
Sbjct: 619 GIESDCDWGVP 629
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 107/227 (47%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 222 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQRGCHGGRLDGAWWFLRRRGVVSDHCY 280
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWR-------------NSKHYSISAYRIN 111
P+ + A P P+C+ + R N + AYR+
Sbjct: 281 PFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHAHANDIYQVTPAYRLG 334
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S+ ++IM E+ +NGPV+ V+EDF Y+SG+Y H + G H+VK+ GW
Sbjct: 335 SNEKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGW 394
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 395 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 441
>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 145
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 84/141 (59%), Gaps = 4/141 (2%)
Query: 70 TGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSK-HYSISAYRINSDPEDIMAEIYKNGPVE 128
+ +P + TP+C +C + R K ++ + YRI M EIY+NGP+
Sbjct: 7 SAVENPCSNKTFFTPECKVQCYNPDYGTRYVKDNHKGTQYRIPG--YTAMKEIYENGPIT 64
Query: 129 VSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGY 188
SF +Y+DF +Y+SGVY +G + AVK++GWG ++G YW+ AN +N WG +G+
Sbjct: 65 ASFYMYQDFVNYQSGVYAFNSGKYVTTQAVKILGWG-EENGTPYWLAANSFNTYWGDNGF 123
Query: 189 FKIKRGSNECGIEEDVVAGLP 209
KI RG+NEC IEE + AGLP
Sbjct: 124 VKILRGANECYIEEFMYAGLP 144
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 95/203 (46%), Gaps = 26/203 (12%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
VT D L + L LS ++ AC F GC GG P SAW + G+ T
Sbjct: 11 FGVTEAFNDRLCIKSDGAFTEL-LSAGEMNACTLFF---GCGGGDPYSAWSWVHDKGIAT 66
Query: 61 -------------EECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNSKHYSISA 107
+ C PY D C+H + YP KC + + +H+ + +
Sbjct: 67 GGDYVAKDDMTKDDGCWPY-DFPPCAHHINDTKYP--KCPKVSCSGDD-----RHFMLES 118
Query: 108 YRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTSD 167
+ D I +GPV SFTVYEDF Y+SGVYKH +G +GGHAVK+IGWG
Sbjct: 119 SPYHYSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGEK- 177
Query: 168 DGEDYWILANQWNRSWGADGYFK 190
G+ YW+ N WN WG G F+
Sbjct: 178 SGQAYWLAVNSWNEDWGDHGLFR 200
>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
Length = 310
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 85/167 (50%), Gaps = 14/167 (8%)
Query: 41 CDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRNS 100
C GGY +W + + G + C PY G + + C +C K S
Sbjct: 154 CQGGYLKYSWTFLENTGTPLDTCIPYASGRG--------TFSSGTCPTQC--KIASMSMS 203
Query: 101 KHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVKL 160
K+ + + I S +I I G V+ FTVY D YKSGVYKH+ V+GGHAV L
Sbjct: 204 KYKAKNTVYI-SGINNIKTAIMTYGSVQAGFTVYRDLTGYKSGVYKHVVSTVLGGHAVAL 262
Query: 161 IGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAG 207
IG+G + G +YW+ AN W +WG GYFKI +G E GIE V AG
Sbjct: 263 IGFGV-EGGSNYWLAANSWGPNWGMSGYFKIAQG--EGGIENQVYAG 306
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/197 (36%), Positives = 96/197 (48%), Gaps = 16/197 (8%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGY--PISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
N LS +L++C G + GY + W YF HG+V+ Y + GC
Sbjct: 136 NQLLSTEELISCSGI---KEREDGYVNRVLVWEYFKTHGLVS--GGKYNTNEGCQPSKVP 190
Query: 79 PAYPTPK------CVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFT 132
Y + CV C K+ + N H +S + +DI E+ GPV V F
Sbjct: 191 TVYNSQTKIYKRTCVEYCYGKDTINYNHDHVKVSNHYF-IRIKDIQKEVQTYGPVSVFFD 249
Query: 133 VYEDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKI 191
+++D YKSGVY K H KLIGWG ++G DYW+L N W WG +G FKI
Sbjct: 250 LHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGV-ENGVDYWLLVNSWGYEWGQNGLFKI 308
Query: 192 KRGSNECGIEEDVVAGL 208
KRG++EC +E V AGL
Sbjct: 309 KRGTDECSVESHVYAGL 325
>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 296
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 84/164 (51%), Gaps = 16/164 (9%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC+GG P A+ + G V C Y C PK ++
Sbjct: 141 GCNGGEPTKAFDFLHSTGTVLTSCVDYTAGADNVVKFC------PKTCDDGSAVENVFAA 194
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
S S SA + + +GPV +F V +DF +YKSGVY+H G +GGHAV+
Sbjct: 195 SGSKSGSAIDV----------LLSHGPVVATFNVAQDFMYYKSGVYQHRWGVWLGGHAVE 244
Query: 160 LIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
++G+G +D G DYW + N W WG DGYF+I RGS+ECGIE++
Sbjct: 245 VVGYGVTDSGLDYWTVRNSWGPDWGEDGYFRIVRGSDECGIEQE 288
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 73/199 (36%), Positives = 93/199 (46%), Gaps = 12/199 (6%)
Query: 21 NLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPA 80
N LS +L++C G +G S W Y HGVV+ Y + GC P
Sbjct: 115 NKLLSTEELISCSGIKENNGSVPS-ERSIWEYLKSHGVVS--GGKYNSNDGCQPFKFPPI 171
Query: 81 YPTPKCVRK------CVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVY 134
PK + K C + + N H + Y DI E+ GPV V F V
Sbjct: 172 ANIPKHLHKHTCDDHCYGNSTINYNHDHVRVRNY-YTIRTRDIQKEVQTYGPVVVRFMVC 230
Query: 135 EDFAHYKSGVY-KHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKR 193
+DF YKSGVY K + KLIGWG ++G DYW++ N W WG G FKIK
Sbjct: 231 DDFFLYKSGVYAKSDKAKGIRTQYAKLIGWGV-ENGVDYWLVINSWGHEWGQKGLFKIKS 289
Query: 194 GSNECGIEEDVVAGLPSSK 212
G+N+CG+E V AGLP K
Sbjct: 290 GTNQCGVESFVYAGLPEIK 308
>gi|327269233|ref|XP_003219399.1| PREDICTED: dipeptidyl peptidase 1-like [Anolis carolinensis]
Length = 467
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 98/202 (48%), Gaps = 27/202 (13%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q +LS +++C + GCDGG+P + A +Y GVV E+C PY +
Sbjct: 283 QTPTLSPQKVVSCSQY--SQGCDGGFPYLIAGKYAQDFGVVEEDCFPYTATD-------S 333
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
P T C A + E+ K+GP+ V+F VY DF
Sbjct: 334 PCNFTHSCYHYYATNYYYVGGFYGGCNEAL--------MKLELVKHGPMAVAFEVYSDFM 385
Query: 139 HYKSGVYKHITGDV-------MGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFK 190
HY+ G+Y H TG + + HAV L+G+GT + GE +WI+ N W +WG GYF+
Sbjct: 386 HYRGGIYHH-TGLMDPFNPFELTNHAVLLVGYGTDPETGEPFWIVKNSWGPAWGEQGYFR 444
Query: 191 IKRGSNECGIEEDVVAGLPSSK 212
I+RG++EC IE VA P K
Sbjct: 445 IRRGTDECAIESIAVASTPIPK 466
>gi|449269572|gb|EMC80333.1| Dipeptidyl-peptidase 1 [Columba livia]
Length = 412
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 102/203 (50%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q S +++C + GCDGG+P + A +Y GVV E+C PY T P
Sbjct: 228 QKPIFSPQQVVSCSQY--SQGCDGGFPYLIAGKYVQDFGVVEEDCFPY---TAQDSP--- 279
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C+ K + S+++ + + + + E+ +GP+ V+F VY D
Sbjct: 280 -----------CLFKRSCYHYYTSEYHYVGGFYGGCNEALMKLELVLHGPMAVAFEVYND 328
Query: 137 FAHYKSGVYKH--ITGDV----MGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGADGYF 189
F HYK G+Y H + D + HAV L+G+GT GE +WI+ N W WG +GYF
Sbjct: 329 FIHYKEGIYHHTGLRDDFNPFELTNHAVLLVGYGTDPQSGEKFWIVKNSWGILWGENGYF 388
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE V+ P +K
Sbjct: 389 RIRRGTDECAIESIAVSATPIAK 411
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 66/163 (40%), Positives = 92/163 (56%), Gaps = 18/163 (11%)
Query: 6 TNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEE--- 62
++R + S+ VS++ +S DLLACC CG GC+GGYP +AW ++ G+V+
Sbjct: 13 SDRLCIHSNGKVSVE---ISSEDLLACCD-SCGMGCNGGYPSAAWDFWTDVGLVSGGLYD 68
Query: 63 ----CDPY------FDSTGCSHPGCEPAYPTPKCVRKCVKK-NQLWRNSKHYSISAYRIN 111
C PY G P TP+C+ +C ++ KHY S+Y +
Sbjct: 69 SHVGCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVP 128
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMG 154
SD E I +EIYKNGPVE +FTVYEDF YK+GVY+H+TG +G
Sbjct: 129 SDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGVYQHMTGSAVG 171
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 109/227 (48%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDKHN-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293
Query: 65 PYFDSTGCSHPGCEPAYPTPKCVRKCV-----KKNQLWRNSKH-------YSIS-AYRIN 111
P+ A P P+C+ K+ + R H Y ++ AYR+
Sbjct: 294 PFSGQER------NEAGPEPRCMMHSRAMGRGKRQAIARCPNHHVHANDIYQVTPAYRLG 347
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S+ ++IM E+ +NGPV+ V+EDF Y+ G+Y H + G H+VK+ GW
Sbjct: 348 SNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGKPERYRRHGTHSVKITGW 407
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGTNECDIESFVLG 454
>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
Length = 462
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 25/201 (12%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q LS ++++C + GC+GG+P + A +Y G V E C PY TG P C
Sbjct: 278 QTPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGFVEESCFPY---TGTDAP-C- 330
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFA 138
K C++ + S+++ + + + + E+ ++GP+ V+F V +DF
Sbjct: 331 ------KMKEDCMR----YYTSEYHYVGGFYGGCNEALMKLELVQHGPMAVAFEVCDDFM 380
Query: 139 HYKSGVYKH------ITGDVMGGHAVKLIGWGT-SDDGEDYWILANQWNRSWGADGYFKI 191
HY G+Y H + HAV L+G+GT S +G DYWI+ N W SWG GYF+I
Sbjct: 381 HYHKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSANGMDYWIVKNSWGTSWGEKGYFRI 440
Query: 192 KRGSNECGIEEDVVAGLPSSK 212
RG++EC IE +A P K
Sbjct: 441 LRGTDECAIESIAMAATPIPK 461
>gi|326914532|ref|XP_003203579.1| PREDICTED: dipeptidyl peptidase 1-like [Meleagris gallopavo]
Length = 420
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 99/207 (47%), Gaps = 37/207 (17%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q S +++C + GCDGG+P + A +Y GVV E+C PY T P
Sbjct: 236 QKPVFSPQQVVSCSQY--SQGCDGGFPYLIAGKYVQDFGVVEEDCFPY---TAQDSP--- 287
Query: 79 PAYPTPKCVRKCVKKNQLWRNSKHYSISAYRI-----NSDPEDIMA-EIYKNGPVEVSFT 132
C+ K R+ HY S Y + E +M E+ +GP+ V+F
Sbjct: 288 -------CLFK--------RSCYHYYTSEYHYVGGFYGACNEALMKLELVLSGPMAVAFE 332
Query: 133 VYEDFAHYKSGVYKH------ITGDVMGGHAVKLIGWGTS-DDGEDYWILANQWNRSWGA 185
VY DF YK G+Y H + HAV L+G+G GE +WI+ N W SWG
Sbjct: 333 VYNDFMFYKEGIYHHTGLKDNFNPFELTNHAVLLVGYGKDPKSGEKFWIVKNSWGTSWGE 392
Query: 186 DGYFKIKRGSNECGIEEDVVAGLPSSK 212
DGYF+I+RG++EC IE VA P K
Sbjct: 393 DGYFRIRRGTDECAIESIAVAATPIPK 419
>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 315
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 87/171 (50%), Gaps = 22/171 (12%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC GG + G+ T+ C PY D E A+ P C CV + + R
Sbjct: 146 GCTGGTMEDVGDFLRDTGIATDTCVPYVD---------EDAHWEP-CPVSCVDGSPI-RT 194
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGGHAVK 159
+ + R + + E +M I NGP+ S +YEDF +Y+SG+Y I G G HA++
Sbjct: 195 VQ--LMDFVRYDGNLEAMMEAIAMNGPIHASMMIYEDFMYYQSGIYHFIYGSGCGMHAIE 252
Query: 160 LIGWGTSDDGE---------DYWILANQWNRSWGADGYFKIKRGSNECGIE 201
L+G+GT G+ DYWI N W WG +GYF+I RG+NECGIE
Sbjct: 253 LVGYGTDISGDSEAGEEVRVDYWIARNSWGEDWGENGYFRIVRGNNECGIE 303
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 106/221 (47%), Gaps = 19/221 (8%)
Query: 1 MSVTRTNRDALSSSPYVSLQNLSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVT 60
S + D LS L+++ LS L++C GC+GG+ AW G V+
Sbjct: 242 FSTSTVAADRLSIHSGGELKDM-LSAQYLISCTTDHHQKGCEGGHVDRAWWQLRRVGTVS 300
Query: 61 EECDPYFDSTGCSHPG--CEPAYPTPKCVRKCVKKNQLWRNSKHYSISA-YRINSDPEDI 117
++C PY S + PG Y PK +C + SK Y S YRI + +I
Sbjct: 301 KDCYPY-TSGDTNDPGKCLMSKYKLPKKNIECPVGQGI--TSKLYQASPPYRIAAKEREI 357
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG---------HAVKLIGWGTSDD 168
M EI NGPV+ V +DF Y+ GVYKH H+V++IGWGT
Sbjct: 358 MNEIILNGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHLGKEAYHSVRIIGWGTDYT 417
Query: 169 GED---YWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G+D YW+ AN W R WG G+F+I RGS+E IE VV
Sbjct: 418 GDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVVG 458
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 188
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
P+ D G + P + + R+ + N N+ Y ++ YR+ S+ ++I
Sbjct: 189 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVNNNDIYQVTPVYRLGSNDKEI 248
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF YK G+Y H + G H+VK+ GWG T
Sbjct: 249 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 308
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 309 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349
>gi|431838501|gb|ELK00433.1| Dipeptidyl-peptidase 1 [Pteropus alecto]
Length = 460
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 103/203 (50%), Gaps = 29/203 (14%)
Query: 20 QNLSLSVNDLLACCGFLCGDGCDGGYP-ISAWRYFVHHGVVTEECDPYFDSTGCSHPGCE 78
Q+ LS ++++C + GC+GG+P + A +Y G+V E C PY TG P
Sbjct: 276 QSPILSPQEVVSCSQY--AQGCEGGFPYLIAGKYAQDFGLVEETCFPY---TGTDSP--- 327
Query: 79 PAYPTPKCVRKCVKKNQLWR--NSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYED 136
C K +R +S+++ + + + + E+ +GP+ V+F VY+D
Sbjct: 328 -----------CKLKENCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD 376
Query: 137 FAHYKSGVYKH------ITGDVMGGHAVKLIGWGTSD-DGEDYWILANQWNRSWGADGYF 189
F HY G+Y H + HAV L+G+GT G +YW + N W SWG +GYF
Sbjct: 377 FLHYHKGIYHHTGLKDPFNPFELTNHAVLLVGYGTDPASGLNYWTVKNSWGTSWGENGYF 436
Query: 190 KIKRGSNECGIEEDVVAGLPSSK 212
+I+RG++EC IE +A P K
Sbjct: 437 RIRRGTDECAIESIAMAATPIPK 459
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 83/165 (50%), Gaps = 15/165 (9%)
Query: 40 GCDGGYPISAWRYFVHHGVVTEECDPYFDSTGCSHPGCEPAYPTPKCVRKCVKKNQLWRN 99
GC GG + YF +GVVTE+C+ Y A C C
Sbjct: 100 GCGGGRLDTPLAYFRDNGVVTEKCESY------------KATQASSCSNTCDDGTSFSNT 147
Query: 100 SKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVY-KHITGDVMGGHAV 158
+K++S YR++S E A+IY NGP+ F +Y D +YKSGVY K + HA
Sbjct: 148 TKYHSKDCYRLSS-IEQAKADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAG 206
Query: 159 KLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEED 203
++IGWG +DG YW+ AN W WG G FKI+ G+NE G E +
Sbjct: 207 RVIGWGV-EDGVQYWLAANSWGTGWGQQGLFKIRSGTNEVGFEAN 250
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 109/227 (48%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHNQQGCRGGRLDGAWWFLRRRGVVSDHCY 293
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK---NQLWRNSKHYSISAYRIN 111
P+ + A P P+C+ R+ + + + N + AYR+
Sbjct: 294 PFVGREQ------DEAGPAPRCMMHSRAMGRGKRQATARCPSSHVHANDIYQVTPAYRLG 347
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
++ ++IM E+ +NGPV+ V+EDF Y+ G+Y H + G H+VK+ GW
Sbjct: 348 TNEKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGW 407
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFVLG 454
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 108/221 (48%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 204 AAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCY 262
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
P+ D G + P + + R+ N N+ Y ++ AYR+ S+ +I
Sbjct: 263 PFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEI 322
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF YK G+Y H ++ G H+VK+ GWG T
Sbjct: 323 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRP 382
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 383 DGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 423
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 87/174 (50%), Gaps = 13/174 (7%)
Query: 39 DGCDGGYPISAWRYFVHHGVVTEECDPYF-DSTGCSHPGCEPAYPTPKCVRKCVKKNQLW 97
+GC GG+P++A++Y HGV E C Y + C+ R C + +
Sbjct: 112 NGCQGGHPLTAFKYMHDHGVPEEGCMRYMAKNMECT---------DINICRDCDSEKGCF 162
Query: 98 --RNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDVMGG 155
+N Y + Y + +++M EIY GP+ S V +D YK G+Y+ TG
Sbjct: 163 AVKNYTKYYVDEYGSVAGEKNMMKEIYARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLD 222
Query: 156 HAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVAGLP 209
HA+ ++GWG +DG+ YWI N W WG G+F+I RG N GIE D +P
Sbjct: 223 HAISVVGWG-EEDGQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVP 275
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 78/175 (44%), Gaps = 13/175 (7%)
Query: 22 LSLSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECDPY--FDSTGCSHPGCEP 79
+ LS +++ C CDGG + Y + G+ + C Y D C
Sbjct: 381 VELSAQEVINCSN---AGTCDGGSDADVFEYAFNEGIPDQTCQVYEAIDKECNDMARCMD 437
Query: 80 AYPTPKCVRKCVKKNQLWRNSKHYSISAYRINSDPEDIMAEIYKNGPVEVSFTVYEDFAH 139
P C ++ K Y +S Y +I AEI+ GPV S V E+F
Sbjct: 438 CPPGEDCYPV--------KDYKRYKVSEYGEVKGEMEIKAEIFARGPVSCSMIVTEEFLA 489
Query: 140 YKSGVYKHITGDVMGGHAVKLIGWGTSDDGEDYWILANQWNRSWGADGYFKIKRG 194
Y+ G++ G ++G HAV++ GWG ++DG YWI N W WG G+F++ G
Sbjct: 490 YQGGIFVDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSWGPYWGEHGWFRMIVG 544
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/226 (33%), Positives = 109/226 (48%), Gaps = 35/226 (15%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293
Query: 65 PYF----DSTGCSHP--------GCEPAYPTPKCVRKCVKKNQLWRNSKHYSISAYRINS 112
P+ D G + P G T +C V N +++ + AYR+ S
Sbjct: 294 PFSGHEQDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVHANDIYQVT-----PAYRLGS 348
Query: 113 DPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG 164
+ ++IM E+ +NGPV+ V+EDF Y+ G+Y H + G H+VK+ GWG
Sbjct: 349 NEKEIMKELLENGPVQALMEVHEDFFLYQGGIYSHTPVSLERPERYRRHGTHSVKITGWG 408
Query: 165 --TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
T DG YW AN W +WG G+F+I RG+NEC IE V+
Sbjct: 409 EETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTNECDIESFVLG 454
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 108/221 (48%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 235 AAVASDRVSIHSLGHMTPVLSPQNLLSCNTHH-QQGCRGGRLDGAWWFLRRRGVVSDHCY 293
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
P+ D G + P + + R+ N N+ Y ++ AYR+ S+ +I
Sbjct: 294 PFSGRERDKAGPAPPCMMHSRAMGRGKRQATAHCPNGHVNNNNIYQVTPAYRLGSNDTEI 353
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF YK G+Y H ++ G H+VK+ GWG T
Sbjct: 354 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVNLGRPERYRRHGTHSVKITGWGEETRP 413
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 414 DGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 454
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/227 (34%), Positives = 109/227 (48%), Gaps = 37/227 (16%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 188
Query: 65 PYFDSTGCSHPGCEPAYPTPKCV----------RKCVKK--NQLWRNSKHYSIS-AYRIN 111
P+ S + A PTP C+ R+ N N+ Y ++ YR+
Sbjct: 189 PF------SGRERDEAGPTPPCMMHSRAMGRGKRQATASCPNSHVNNNDIYQVTPVYRLG 242
Query: 112 SDPEDIMAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGW 163
S+ ++IM E+ +NGPV+ V+EDF YK G+Y H + G H+VK+ GW
Sbjct: 243 SNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGW 302
Query: 164 G--TSDDGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
G T DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 303 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 107/221 (48%), Gaps = 25/221 (11%)
Query: 10 ALSSSPYVSLQNLS-----LSVNDLLACCGFLCGDGCDGGYPISAWRYFVHHGVVTEECD 64
A +S VS+ +L LS +LL+C GC GG AW + GVV++ C
Sbjct: 130 AAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQ-QQGCRGGRLDGAWWFLRRRGVVSDHCY 188
Query: 65 PY----FDSTGCSHPGCEPAYPTPKCVRKCVKK--NQLWRNSKHYSIS-AYRINSDPEDI 117
P+ D G + P + + R+ N N+ Y ++ YR+ S+ ++I
Sbjct: 189 PFSGRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQVTPVYRLGSNDKEI 248
Query: 118 MAEIYKNGPVEVSFTVYEDFAHYKSGVYKHITGDV--------MGGHAVKLIGWG--TSD 167
M E+ +NGPV+ V+EDF YK G+Y H + G H+VK+ GWG T
Sbjct: 249 MKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLP 308
Query: 168 DGE--DYWILANQWNRSWGADGYFKIKRGSNECGIEEDVVA 206
DG YW AN W +WG G+F+I RG NEC IE V+
Sbjct: 309 DGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFVLG 349
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.135 0.439
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,172,401,015
Number of Sequences: 23463169
Number of extensions: 191520933
Number of successful extensions: 385834
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4779
Number of HSP's successfully gapped in prelim test: 1867
Number of HSP's that attempted gapping in prelim test: 372648
Number of HSP's gapped (non-prelim): 7581
length of query: 229
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 91
effective length of database: 9,121,278,045
effective search space: 830036302095
effective search space used: 830036302095
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)